Joachim Rosskopf @jrosskopf - Twitter Profile

29 days ago

𝐀𝐬𝐤𝐢𝐧𝐠 𝐭𝐡𝐞 𝐰𝐫𝐨𝐧𝐠 𝐬𝐨𝐯𝐞𝐫𝐞𝐢𝐠𝐧𝐭𝐲 𝐪?

Use Claude for the expensive part:
📚 AWS docs → spec
🧪 boto3 + real AWS → oracle

Then Demo locally: Marila + RustFS + DuckDB.

𝐏𝐚𝐲 𝐟𝐨𝐫 𝐢𝐧𝐭𝐞𝐥𝐥𝐢𝐠𝐞𝐧𝐜𝐞.
Stop renting substrate.

https://t.co/xhQKY0g3xa https://t.co/4rrZ8AmW0U

Joachim Rosskopf @jrosskopf

about 1 year ago

⟳ We updated the #ERPL extension to the 🦆 @DuckDb v1.3.1 bugfix release. (Now available for v1.3.0 and v1.2.2.) #ERPL connects 🦆 @DuckDB to the #SAP ecosystem via standard interfaces: https://t.co/Z25xFnfJC6

Joachim Rosskopf @jrosskopf

over 1 year ago

@duckdb @polars @spark @snowflake Benchmarks https://t.co/VdmK1bDxIy show this:
→ @DuckDB beats @Spark for small queries.
→ Even at 700GB, DuckDB (native files) is competitive.
→ Spark scales dynamically for 1TB+ workloads.

🔍 The lesson? If the data fits on a single node, go for it. Scale to MPP only when needed.

Joachim Rosskopf @jrosskopf

over 1 year ago

MPP vs. Single-Node Engines

Small workloads? Use @DuckDb or @Polars for faster in-memory performance.
Massive datasets? MPP systems like @Spark or @Snowflake scale dynamically.

Experiment: @DuckDB outperformed Spark at <100GB

💡 Don't go grocery shopping in a tank! https://t.co/Nz7kHarPmU

Joachim Rosskopf @jrosskopf

over 1 year ago

Why Are Object Stores So Attractive?
1️⃣ Scalability: Handle massive amounts of data.
2️⃣ Flexibility: Open formats like Iceberg for interoperability.
3️⃣ Advanced Features: Replication, immutability, and consistency.

They became the backbone of modern distributed systems.

Joachim Rosskopf @jrosskopf

over 1 year ago

The Future of Distributed Systems

Object storage like Amazon S3 has become a primary database: scalable & efficient for transactional & analytical workloads.

Emerging programming models:
1️⃣ Distributed DBs
2️⃣ Serverless
3️⃣ Wasm

https://t.co/ebNwjYl2iG

Joachim Rosskopf @jrosskopf

over 1 year ago

What Are "One-Way Door" Risks? ❌ One-way doors = irreversible decisions. In tech: adopting new tools or models without clear exit paths.

Joachim Rosskopf @jrosskopf

over 1 year ago

The Iceberg Effect

Modern data is evolving:
→ Iceberg now leads open table formats (Snowflake & Databricks adoption confirms it).
→ Cloud-native storage is a must (legacy systems won’t keep up).
→ AI thrives on scalable, open architectures.

More innovation. Less lock-in. https://t.co/sYDUY9Zlhf

Joachim Rosskopf @jrosskopf

over 1 year ago

Curious where the data comes from? 🔗 Snowset (Snowflake's dataset): https://t.co/vP40KU1d2z 🔗 Redset (Redshift's dataset): https://t.co/U0FYC1qpTO Both share real-world query samples, packed with insights into how data warehouses are used. Check them out!

Joachim Rosskopf @jrosskopf

over 1 year ago

What Do Data Warehouses Really Do?

→ $300K/year on Snowflake, and 90% is spent on queries.
→ Most queries are tiny (median: 100MB, 99.9% <300GB).
→ Most workloads = ingestion + transformation (not analytics).

💡 Small Data > Massive Complexity.
Are we overpaying for simplicity? https://t.co/9O20csFG0Y

Joachim Rosskopf @jrosskopf

over 1 year ago

Think Small. Make Big Impact.

More Data ≠ Better Results.
→ Recent data is the most valuable.
→ Smaller AI models deliver bigger impact.
→ Local-first development works.

Stop relying on distributed complexity when single machines get the job done.

#SmallData. Are you in? https://t.co/jmPHLHQAKI

Joachim Rosskopf @jrosskopf

over 1 year ago

@eastoalex Thanks for the hint. I'm in! 🎉

Joachim Rosskopf @jrosskopf

over 1 year ago

#BigData isn’t the problem. It never was.

Most enterprises have <100GB in active data but overpay for tools designed for massive scale (#Snowflake, #Databricks, etc.).

Focus on #SmallData:
→ Easier to analyze
→ Cheaper to manage
→ Faster insights

Time for #SmallData https://t.co/344VuqF2jv

Joachim Rosskopf @jrosskopf

over 1 year ago

@matsonj Thank you @matsonj for mentioning our work! In good old Europe, a lot of enterprise data projects start and end in an SAP system. So it was quite natural to try to eliminate the typical #databricks, #snowflake or #Excel file mess in between.

Joachim Rosskopf @jrosskopf

about 2 years ago

With that you can access your enterprise data from your #Mac, #Windows or #Linux PC. Using #Python, #R, #Java, #NodeJS, #RUST, #GO or #ODBC. https://t.co/0ITb32aLV4

Joachim Rosskopf @jrosskopf

about 2 years ago

Connect @DuckDB to #SAP ERP, #ODP or #BICS in minutes. As of today, also available on #OSX and #applesilicon. https://t.co/BAYXtXqOcY

Joachim Rosskopf @jrosskopf

about 2 years ago

@illyism The mechanism is super powerful. We created an extension to transparently load data from #SAP into #duckdb. If one is interested: https://t.co/Nr4UB5YaNL

Joachim Rosskopf @jrosskopf

about 2 years ago

@duckdb Very cool 😎! Our extension to load data from #SAP ERP, BW or #ODP is ready for 1.0.0. Find out more at https://t.co/Nr4UB5YaNL. DuckDB, the data ecosystem of the future.

Joachim Rosskopf @jrosskopf

over 2 years ago

@duckdb @mraasveldt Great news from @duckdb on multi-database support! For those in #SAP environments, our ERPL extension offers seamless integration into the SAP Business Warehouse, data replication with ODP or simply reading tables and calling RFC functions. Check out https://t.co/Nr4UB5XCYd

jrosskopf retweeted

Thomas Wiecki @twiecki

over 4 years ago

So excited to finally have the (restricted) beta of our Intuitive Bayes introductory course out (https://t.co/jJIT1xytu8), a long time in the making. Cool to see how people resonate with our code first approach and even come up with memes themselves for it (HT @robertmitchellv). https://t.co/VRwbfaV9NQ
