Dagster

19 days ago

We adopted Astral’s Python type checker, ty, to speed up type checking in the Dagster monorepo. The performance gains were dramatic, but the bigger surprise was that ty caught real runtime bugs Pyright missed. See the full story here: https://t.co/wgDCoYsLAY

0

10

0

8

1K

20 days ago

Building data pipelines with @cursor_ai? Be sure to install the official Dagster Expert skill! https://t.co/tGY5boHZky

1

6

0

2

325

🏗 Your entire analytics engineering workflow 🧪 Built by @dbt_labs

21 days ago

Check out this post from @sspaeti that covers a complete guide of insights, tips, and predictions for the data platform engineer, just like an Almanack provides, with practical information for daily life.

Simon Späti 🏔️

@sspaeti

21 days ago

I wrote an Almanack like Charlie Munger and Naval Ravikant did, but about life wisdom and my time of using Dagster for data orchestration over the years. Since using it in 2018, the story has gone from complexity to composability and from orchestrating to a full data platform. Lots have changed since I started. We have shifted from execution-only to fully data-aware pipelines with shared resources and code locations separate of concern, from task-based DAGs to data-aware assets, and moved from pure data pipeline orchestration to provisioning for DevOps or automating other departments' tasks, too. One thing has stayed since day one: the focus on developer usability, being a toolbox for data engineers with best practices and functional data engineering applied by default. And the main goal, besides orchestration, is to deal with the complexity of the data architecture we inevitably have and reduce it through intelligent design principles (which may take a little more to learn at first, but will help us a lot down the road). This article tells my personal story of how I was introduced to Dagster, what convinced me early on, and why it has evolved into a fully open data platform today. I will take you through the best parts of Dagster and its capabilities, and why it's a little different from other orchestrators.

sspaeti's tweet photo. I wrote an Almanack like Charlie Munger and Naval Ravikant did, but about life wisdom and my time of using Dagster for data orchestration over the years. Since using it in 2018, the story has gone from complexity to composability and from orchestrating to a full data platform.

Lots have changed since I started. We have shifted from execution-only to fully data-aware pipelines with shared resources and code locations separate of concern, from task-based DAGs to data-aware assets, and moved from pure data pipeline orchestration to provisioning for DevOps or automating other departments' tasks, too.

One thing has stayed since day one: the focus on developer usability, being a toolbox for data engineers with best practices and functional data engineering applied by default. And the main goal, besides orchestration, is to deal with the complexity of the data architecture we inevitably have and reduce it through intelligent design principles (which may take a little more to learn at first, but will help us a lot down the road).

This article tells my personal story of how I was introduced to Dagster, what convinced me early on, and why it has evolved into a fully open data platform today. I will take you through the best parts of Dagster and its capabilities, and why it's a little different from other orchestrators.

1

18

2

11

2K

0

8

1

2

1K

Who to follow

dbt

@getdbt

DuckDB

@duckdb

DuckDB is an analytical in-process SQL database management system. "DuckDB" and the DuckDB logo are registered trademarks of the DuckDB Foundation.

Apache Superset

@apachesuperset

Modern, open source data exploration & visualization platform 📊. Open Source business intelligence (BI) is here, and here to win!

dagster retweeted

rex ledesma

@rexrledesma

about 1 month ago

At @poolsideai, we're announcing the first public models in the Laguna family, Laguna M.1 and Laguna XS.2! As part of the model factory team, it's been surreal to witness how our models and harnesses have co-evolved to be an incredible thought partner in our day-to-day engineering. We have put so much thought and care into building this model. Our alchemical journey of model development continues before us. I am beyond excited :)

1

26

2

1K

about 1 month ago

@rexrledesma :dagsir:

0

3

0

109

dagster retweeted

Poolside

@poolsideai

about 1 month ago

Today we’re releasing Laguna XS.2, Poolside’s first open-weight model. It’s a 33B total / 3B active MoE model built for agentic coding and long-horizon tasks. Trained fully in-house on our own stack. Runs on a single GPU. Released under Apache 2.0. Links 👇 Weights: https://t.co/HSo8L2gM64 API: https://t.co/DMJtNFrace Blog: https://t.co/BXEjQxtQoV

poolsideai's tweet photo. Today we’re releasing Laguna XS.2, Poolside’s first open-weight model.
It’s a 33B total / 3B active MoE model built for agentic coding and long-horizon tasks.
Trained fully in-house on our own stack. Runs on a single GPU. Released under Apache 2.0.
Links 👇
Weights: https://t.co/HSo8L2gM64
API: https://t.co/DMJtNFrace
Blog: https://t.co/BXEjQxtQoV

44

807

145

379

274K

dagster retweeted

colton @coltonpadden

about 2 months ago

Now when you finish onboarding to @dagster plus you'll be presented with instructions on how to get quickly started with our official skills!

coltonpadden's tweet photo. Now when you finish onboarding to @dagster plus you'll be presented with instructions on how to get quickly started with our official skills! https://t.co/UiBWm2Sg7B

1

2

1

2

491

dagster retweeted

about 2 months ago

Dagster 1.13 is out! Partitioned asset checks (at long last), virtual assets (preview), open-source AI skills for Claude Code/Codex/OpenCode, 20+ new components, and state-backed components on by default. Check out the release blog!

AlexNoonan6's tweet photo. Dagster 1.13 is out!

Partitioned asset checks (at long last), virtual assets (preview), open-source AI skills for Claude Code/Codex/OpenCode, 20+ new components, and state-backed components on by default.

Check out the release blog! https://t.co/uYjuntsWKr

1

12

1

3

835

2 months ago

🚀

colton @coltonpadden

2 months ago

Excited to release 1.13.0 of Dagster with lots of great features like official Dagster skills for, virtualized assets for modeling entities like views, partitioned asset checks, and more. Check out the blog post for more details.

coltonpadden's tweet photo. Excited to release 1.13.0 of Dagster with lots of great features like official Dagster skills for, virtualized assets for modeling entities like views, partitioned asset checks, and more. Check out the blog post for more details. https://t.co/yeX4styNkK

1

6

2

1

1K

0

1

0

513

dagster retweeted

colton @coltonpadden

2 months ago

What is the ideal setup for structuring git repositories in the age of AI? We've found that monorepos are key for cross-cutting changes and unified context, and we've done this by defining a hub-and-spoke model using Google's Copybara.

coltonpadden's tweet photo. What is the ideal setup for structuring git repositories in the age of AI?

We've found that monorepos are key for cross-cutting changes and unified context, and we've done this by defining a hub-and-spoke model using Google's Copybara. https://t.co/nIBQkHkTBR

2

9

1

4

687

dagster retweeted

2 months ago

Every tool you need to fix team coordination already exists: transcription, summarization, search, cataloging, orchestration. Nobody is wiring them together. We keep pointing AI at code generation while the real bottleneck is Slack threads nobody can find and institutional memory that walks out the door every two weeks.

AlexNoonan6's tweet photo. Every tool you need to fix team coordination already exists: transcription, summarization, search, cataloging, orchestration. Nobody is wiring them together.

We keep pointing AI at code generation while the real bottleneck is Slack threads nobody can find and institutional memory that walks out the door every two weeks.

1

11

3

0

777

dagster retweeted

YugabyteDB

@Yugabyte

2 months ago

Join @striimteam, @Yugabyte, and @dagster for an exclusive #AI After Party after the first day of #GoogleNEXT!🥂 📅 April 22, 6:00–8:30 PM - Rí Rá Irish Pub 📆 Don't miss: 🎶 Great music 🍴 Delicious food 🍸 An open bar 🤝 Chat with the sharpest minds in AI and data Because the best conversations don’t end when the sessions do!🔥 👉 RSVP today to save your spot: https://t.co/BpSShWCt1l

Yugabyte's tweet photo. Join @striimteam, @Yugabyte, and @dagster for an exclusive #AI After Party after the first day of #GoogleNEXT!🥂

📅 April 22, 6:00–8:30 PM - Rí Rá Irish Pub 📆

Don't miss:
🎶 Great music
🍴 Delicious food
🍸 An open bar
🤝 Chat with the sharpest minds in AI and data

Because the best conversations don’t end when the sessions do!🔥

👉 RSVP today to save your spot: https://t.co/BpSShWCt1l

0

1

0

279

dagster retweeted

2 months ago

If you use @dagster and have thoughts on how it should work, we want to hear from you. We're investing in making contributions easier to submit, review, and ship. Smarter review tooling, clearer guidelines, and better signals about where your work can have the most impact. Code and PRs are great, but docs, bug reports, examples, and feedback in Slack or GitHub all matter just as much. The project has always been shaped by the community using it.

AlexNoonan6's tweet photo. If you use @dagster and have thoughts on how it should work, we want to hear from you.

We're investing in making contributions easier to submit, review, and ship. Smarter review tooling, clearer guidelines, and better signals about where your work can have the most impact.

Code and PRs are great, but docs, bug reports, examples, and feedback in Slack or GitHub all matter just as much.

The project has always been shaped by the community using it.

2

6

2

1

414

dagster retweeted

3 months ago

If you can't waste hours, you'll waste years. An old boss told me that. AI was supposed to give us those hours back. Instead it filled them with planning, coordination, and status updates. A prison of my own making.

AlexNoonan6's tweet photo. If you can't waste hours, you'll waste years.

An old boss told me that. AI was supposed to give us those hours back. Instead it filled them with planning, coordination, and status updates.

A prison of my own making. https://t.co/soWzvGLvMe

2

30

1

4

1K

dagster retweeted

3 months ago

My did this analysis on dispersion, showed that $NFLX returned -89% in 2025. That didnt pass the sniff test and I spiraled a little bit finding what happened

AlexNoonan6's tweet photo. My did this analysis on dispersion, showed that $NFLX returned -89% in 2025. That didnt pass the sniff test and I spiraled a little bit finding what happened https://t.co/AdTBXbBHJC

2

11

1

801

dagster retweeted

3 months ago

How are all of you rolling your own orchestrators doing after daylight savings time? You know you can just pull open source tools off the shelf

AlexNoonan6's tweet photo. How are all of you rolling your own orchestrators doing after daylight savings time?

You know you can just pull open source tools off the shelf https://t.co/gEGc2rtgu0

3

13

1

0

519

dagster retweeted

3 months ago

We just launched a new free course on Dagster University: AI-Driven Data Engineering 8 lessons. Blank directory to production ELT pipeline. Built entirely from prompts. If you've been curious about using AI agents for real data engineering work and dont know where to start, this is the one for you!

AlexNoonan6's tweet photo. We just launched a new free course on Dagster University: AI-Driven Data Engineering

8 lessons. Blank directory to production ELT pipeline. Built entirely from prompts.

If you've been curious about using AI agents for real data engineering work and dont know where to start, this is the one for you!

2

40

3

31

1K

dagster retweeted

3 months ago

Databricks is a fantastic platform for compute and storage. But as your deployment scales across teams and workspaces, something needs to sit above it, coordinating dependencies, tracking lineage end-to-end, and giving every team visibility into what they own. We'll be hosting a hands-on deep dive showing how Dagster and Databricks work better together — specifically for teams managing multiple workspaces who need true cross-workspace orchestration without stitching together workarounds. We'll cover: → Connecting multiple Databricks workspaces into a single observable asset graph → Auto-discovering existing workspace jobs with zero code changes to get started → Dagster Pipes for bidirectional orchestration on top of your existing notebooks → The full reference stack: Fivetran + dbt + Databricks, coordinated from one control plane

AlexNoonan6's tweet photo. Databricks is a fantastic platform for compute and storage. But as your deployment scales across teams and workspaces, something needs to sit above it, coordinating dependencies, tracking lineage end-to-end, and giving every team visibility into what they own.

We'll be hosting a hands-on deep dive showing how Dagster and Databricks work better together — specifically for teams managing multiple workspaces who need true cross-workspace orchestration without stitching together workarounds.

We'll cover:
→ Connecting multiple Databricks workspaces into a single observable asset graph
→ Auto-discovering existing workspace jobs with zero code changes to get started
→ Dagster Pipes for bidirectional orchestration on top of your existing notebooks
→ The full reference stack: Fivetran + dbt + Databricks, coordinated from one control plane

1

4

1

0

529

dagster retweeted

3 months ago

New video 🔥 Dataops and reliability has been on my mind lately so I made a quick guide on how to improve your data platform performance with Dagster! Stakeholder trust is the most important thing when it comes to data work and dataops is a practice to minimize the risk of degrading trust. → Transient failures that resolve themselves without manual intervention → Resource protection so your warehouse doesn't get overwhelmed → Production jobs that always run first, no matter what's in the queue �� Zombie runs that get caught and killed before they drain your budget → Data quality gates that catch issues before they reach a dashboard → Tailored views in Dagster+ so every team member sees exactly what they own Check out the full video today! Link in the comments.

1

9

2

550

dagster retweeted