If you do the following things as a data engineer, I'll yell at you:
- build a pipeline with no data quality checks
- build a pipeline that is not idempotent
- build a pipeline without validating business value
- build a pipeline because a stakeholder asks you to "pull the data real quick"
- build a pipeline that does not model slowly changing dimensions correctly
- build a pipeline that needs a separate "backfill" pipeline
- build a pipeline that breaks pipeline homogeneity for no good reason
In this boot camp lecture (https://t.co/Z5vrLcMFCo), I'll be yelling at you for not building idempotent pipelines.
The most common mistakes in this regard are how people model their dimensions and how they parameterize their pipelines!
Enjoy the lecture for today that is published a few hours early since I'll be catching a flight back to Utah at 5 PM!
There's only 22 spots left in my paid boot camp that starts January 6th! you can get 20% off by using code ZACH20 at checkout here: https://t.co/UYgw0plGsF
Please repost this video to spread to knowledge!
GIVEAWAY ๐
I'm giving away $500 to a lucky winner.
TO JOIN:
- RT and LIKE this Tweet
- Comment this post
- Follow @SoulzBTC
Winner will be announced in 48h
Drop the ๐งผ below, click like and RT and I'll pick 2 of you to come trade live on stream with me for a month in https://t.co/5xrffCIxTz.
Includes access to all my videos, education and strategies that you can then take away and use yourself.
24 hours - Good luck ๐ค