0.32 has shipped, and it's a massive release from @rerundotio. There's a ton of cool new features, and I wanted to highlight 2 in particular
1. OSS Server streaming from disk
2. Dataset review
I walk you through them in the video, so take a look. I'll have a much longer blog post next week about the entire pipeline. With 0.32, much of the foundation is set for a unified data layer for physical data, and I'll be getting into the details of it with all that I've built over the past year. This will cover
1. Raw Data Collection
2. Data Ingestion
3. Catalog Registration
4. Query and Review
5. Post Process
6. Training
so lots to share