The last 3+ years with my teammates we've been busy working on a platform for creating, evolving and operating data pipelines. I learned lots of stuff about data here and think what we've done is particularly interesting, so i'm happy we took some time to share our story.
Avec Miguel Liroz on présentera comment Criteo a rationalisé l’expérience de travail sur la data en consolidant tous ces besoins dans une solution intégrée de bout en bout.
How do we follow the "Continuous Delivery" approach for our data pipelines? 🤔 Check out our latest article to discover how our platform, BigDataFlow, provides static analysis and a CLI tool to make #CD without incidents 👇
https://t.co/gJoZRpfKgf
#CriteoDevXDays series 👩💻 #DX
We are excited to release Cadence 1.0! Used by many major companies, at @Uber it powers over 1,000 services with 100K+ updates a second. Learn how Cadence makes it easy to build complex distributed systems. @cadenceworkflow#UberEngineering
read more: https://t.co/ulJg11OR2A
Our latest work in @Nature today: #AlphaDev discovered a new faster sorting algorithm that we open sourced to the main C++ library for all developers to use. This is just the beginning of AI being used to find many more efficiencies in code in future https://t.co/tfACG2zcN6
China is pushing ahead in AI regulation—again. This time on deepfakes and generative AI more broadly, including AI-powered image, audio and text-generation software.
Here's why the world should pay attention. 👇
https://t.co/IIzY6CkJ22
@GergelyOrosz It’s also good for engineering quality: blog posts are a way to compare technical choices and have cross-pollination between engineering teams tackling similar issues in different organisations.
The last 3+ years with my teammates we've been busy working on a platform for creating, evolving and operating data pipelines. I learned lots of stuff about data here and think what we've done is particularly interesting, so i'm happy we took some time to share our story.
Hello,
pour ce meetup de novembre, rendez-vous le jeudi 24 novembre chez @ContentSquareFR pour deux talks sur:
- Scala steward (par @_AliFirat)
- Le Scheduling de Data pipelines avec Scala (par @heapoverflow)
Les inscriptions c'est par ici:
https://t.co/V8Kkv9FcBn
Hello,
pour ce meetup de novembre, rendez-vous le jeudi 24 novembre chez @ContentSquareFR pour deux talks sur:
- Scala steward (par @_AliFirat)
- Le Scheduling de Data pipelines avec Scala (par @heapoverflow)
Les inscriptions c'est par ici:
https://t.co/V8Kkv9FcBn
“The Code base […] is significant. We still do massive refactoring […] with ease and take this opportunity to thanks the scala ecosystem at large and the authors of the libraries that we use”: @typelevel folks and also @li_haoyi for fastparse. https://t.co/8LK7X6Yn8e
@morel_lang@julianhyde i wonder if extending Morel to include data pipeline scheduling can be a opportunity. Something similar to what has been done in this project
The last 3+ years with my teammates we've been busy working on a platform for creating, evolving and operating data pipelines. I learned lots of stuff about data here and think what we've done is particularly interesting, so i'm happy we took some time to share our story.
With a good level of abstraction a data platform can provide big productivity gains in an organization that manages a large variety of evolving data. We share our experience and some numbers in the last post introducing BigDataFlow! https://t.co/Wbx4GRha3A
The last 3+ years with my teammates we've been busy working on a platform for creating, evolving and operating data pipelines. I learned lots of stuff about data here and think what we've done is particularly interesting, so i'm happy we took some time to share our story.
In this data platform a parser and interpreter for a SQL dialect (with an extension for scheduling) is implemented. In Part 2 we detail the key ideas that lead to such a design decision! https://t.co/TqiGF31TiL