General Folders is an identity- and policy-aware transport layer for B2B data, replacing hacky SFTP, email, and ad-hoc pipelines.
Backed by top innovators 📈
🆕 New podcast episode is out today, covering some of the frequently asked questions about General Folders.
Thank you @JakeSearch and @MatchRelevant for having @djpardis on your podcast, we loved the episode! :)
https://t.co/1xlJXwNg6a
Who's in a reading mood this Friday?
@djpardis wrote about the evolution of software engineering from FORTRAN to LLMs to better understand the AI coding milestones underway.
It includes an in-depth look at major AI coding milestones.
https://t.co/H5m7uWWfqV
Now that ELT is so popular instead of ETL, the industry could use a standard protocol to define how raw replication of data is defined.
CDC is too fractured.
We need a way to replicate to down stream systems that connects as easily and reliably as replication typically does.
I uploaded 25+ new data engineering notes in preparation for my next book chapter #DataAssetReusabilityPattern.
The notes go from Apache Arrow Flight Protocol to Slowly Changing Dimensions Type 2. Here are some highlights:
↠ Apache Arrow Flight for efficient data transfer
↠ Apache Iceberg and ACID transactions for reliable data lakes
↠ Data locality and parametric pipelines for optimized processing
↠ WASM and the Declarative Data Stack for modern architectures
↠ Microservices, Protobuf, and schema registries for robust data systems
↠ Time travel capabilities and SCD Type 2 implementations
And much more. If you like these things, have a look at the «Data Engineering Vault»—links to dedicated notes below (including 8+ notes related to the Finding Flow article).
Stay tuned for the next book chapter coming out soon.
Building a Google Drive clone and it's pushing my disdain towards file systems to an all time high.
"Folders" are an arbitrary way to organize files. There are so many weird behaviors we just take for granted.
Why can't a file be in another file? Why can't a folder have two parents? Why can't I filter for files that exist in two different folders?
If folders worked as tags and not as hard filters, the “abstraction” would make way more sense. When you put something in a folder, it can no longer be in any other folder.
There’s a failure in specificity and in locality. The “single location” nature makes them nearly entirely useless outside of software dev architecture and URLs.
If you get too specific with how you break down your folders, everything becomes impossible to find outside of multi-directory search
If you don’t get specific enough, you are basically just reinventing file extensions
WinFS should have been completed and it breaks my heart it never was released
This excellent post discusses data platform options when working with your customers and vendors. Thank you for the shoutout @criccomini!
And we're loving the preview image! 😍🗂
This excellent post discusses data platform options when working with your customers and vendors. Thank you for the shoutout @criccomini!
And we're loving the preview image! 😍🗂