At some point, usually in your 20s, you'll notice that the people around you stop believing in themselves. And no matter how hard you try, you can't save them. Whatever you do, don't let it infect your mind. Stay on your path.
The dbt feature I love most: running a model together with its upstream or downstream dependencies in one command.
dbt run -m model_name+ → runs the model and everything downstream of it (its children)
dbt run -m +model_name → runs the model and everything upstream of it (its parents)
(Newer dbt versions prefer --select / -s over -m.)
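To see what the + on each side picks up, here is a toy sketch in Python. It is not how dbt implements selection, and the model names (stg_orders, orders, revenue_daily, ...) are made up for illustration. It only walks a small hand-written DAG the way the two selectors above are described.

```python
from collections import deque

# Hypothetical project: each model maps to the models it reads from (its parents).
PARENTS = {
    "stg_orders": [],
    "stg_customers": [],
    "orders": ["stg_orders", "stg_customers"],
    "revenue_daily": ["orders"],
}


def _walk(start, edges):
    """Breadth-first walk returning every node reachable from start."""
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for nxt in edges.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def select(selector, parents=PARENTS):
    """Resolve 'name', '+name', 'name+' or '+name+' to a sorted list of models."""
    name = selector.strip("+")
    # Invert the parent map to get child edges for downstream walks.
    children = {}
    for model, ups in parents.items():
        for up in ups:
            children.setdefault(up, []).append(model)
    chosen = {name}
    if selector.startswith("+"):  # leading + : include upstream parents
        chosen |= _walk(name, parents)
    if selector.endswith("+"):    # trailing + : include downstream children
        chosen |= _walk(name, children)
    return sorted(chosen)


print(select("orders+"))   # the model plus everything downstream
print(select("+orders"))   # the model plus everything upstream
```

The sketch ignores everything else dbt selectors can do (tags, paths, depth limits); it only covers the two forms from the post.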
dbt CLI commands every data engineer should know:
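- dbt run → builds models in the warehouse
- dbt test → runs data tests against models, seeds, and sources
- dbt seed → loads CSV files from the project as tables
- dbt snapshot → records how mutable rows change over time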
Before dbt:
❌ giant SQL scripts
❌ hidden business logic
❌ broken dashboards
After dbt:
✅ modular models
✅ tested data
✅ documented metrics
This is why dbt won 🚀
In the SQLMesh → Fivetran → dbt story, my key takeaway:
The data world is small. Be kind to your competitors: they might be your next teammate… or your boss 😅
Ever need to sample data without scanning everything? In Databricks SQL, use `TABLESAMPLE` in your SELECT to grab a subset of rows efficiently.
Syntax: `SELECT * FROM table TABLESAMPLE ( { percentage PERCENT | num_rows ROWS | BUCKET fraction OUT OF total } )`
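To make the three forms concrete, here is a small sketch that just builds the query strings. The helper `tablesample` and the table name `sales.orders` are hypothetical, not part of any Databricks API, and the sketch assumes trusted inputs since it formats values straight into SQL.

```python
def tablesample(table, *, percent=None, rows=None, bucket=None):
    """Build a SELECT that uses exactly one of the three TABLESAMPLE forms."""
    given = [arg for arg in (percent, rows, bucket) if arg is not None]
    if len(given) != 1:
        raise ValueError("pass exactly one of percent, rows, bucket")
    if percent is not None:
        clause = f"{percent} PERCENT"
    elif rows is not None:
        clause = f"{rows} ROWS"
    else:
        fraction, total = bucket  # e.g. (1, 10) for one bucket out of ten
        clause = f"BUCKET {fraction} OUT OF {total}"
    return f"SELECT * FROM {table} TABLESAMPLE ({clause})"


print(tablesample("sales.orders", percent=10))
print(tablesample("sales.orders", rows=100))
print(tablesample("sales.orders", bucket=(1, 10)))
```

Paste any of the printed statements into a Databricks SQL editor, swapping in a real table.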