MagicLex

Miguel Otero Pedrido @moteropedrido

over 1 year ago

The top 10 fallacies of MLOps https://t.co/JoKm0gN6dc @Aurimas_Gr @paulabartabajo_ @iusztinpaul @moteropedrido

0

17

7

10

2K

TheMagicLex retweeted

Daniel D. Gutierrez

@ddgutierrez73

almost 2 years ago

Introducing The AI Lakehouse - https://t.co/nYVUtpx5b8 #AI #LakeHouse @hopsworks

0

7

4

0

364

TheMagicLex retweeted

almost 2 years ago

The 𝐅𝐓𝐈 (𝐟𝐞𝐚𝐭𝐮𝐫𝐞, 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠, 𝐢𝐧𝐟𝐞𝐫𝐞𝐧𝐜𝐞) architecture. A mental map to build ML Systems and not get lost in the process ... #MachineLearning #MLOps #DataScience

moteropedrido's tweet photo. The 𝐅𝐓𝐈 (𝐟𝐞𝐚𝐭𝐮𝐫𝐞, 𝐭𝐫𝐚𝐢𝐧𝐢𝐧𝐠, 𝐢𝐧𝐟𝐞𝐫𝐞𝐧𝐜𝐞) architecture.

A mental map to build ML Systems and not get lost in the process ...

#MachineLearning #MLOps #DataScience https://t.co/RKXP4nHVWv

2

7

1

0

144

Who to follow

Hopsworks

@hopsworks

Overcome legacy systems with a seamless, modular and performance-driven AI Lakehouse. Build, deploy and manage models effortlessly. https://t.co/2R2TqyW1qP

Javier de la Rúa Martínez

@Javierdlrm

🎓 Doctoral student at @dcatkth | AI Systems // 🚀 Research engineer at @hopsworks // 🥋 Kyokushinkai

Co-founder and CEO @hopsworks. Organizer of the feature store summit. Author of Building ML Systems for O'Reilly.

TheMagicLex retweeted

Pau Labarta Bajo

@paulabartabajo_

almost 2 years ago

Training ML models is easy. Transforming the data these models need is the hard part... until you learn this ↓↓↓ 𝗧𝗵𝗲 𝗽𝗿𝗼𝗯𝗹𝗲𝗺 Building a real-world ML system is > 𝟭𝟬% about training and deploying ML models, and > 𝟵𝟬% about transforming the data these models needs to work. And the thing is, 𝗻𝗼𝘁 all data transformations are the same. 𝗧𝗵𝗲 𝘁𝗮��𝗼𝗻𝗼𝗺𝘆 𝗳𝗼𝗿 𝗗𝗮𝘁𝗮 𝗧𝗿𝗮𝗻𝘀𝗳𝗼𝗿𝗺𝗮𝘁𝗶𝗼𝗻𝘀 In every ML system we can have up to 3 different types of data transformations 1️⃣ 𝗠𝗼𝗱𝗲𝗹-𝗜𝗻𝗱𝗲𝗽𝗲𝗻𝗱𝗲𝗻𝘁 Transformations, for example rolling averages. > Reusable across models > Stored in the feature store 2️⃣ 𝗠𝗼𝗱𝗲𝗹-𝗗𝗲𝗽𝗲𝗻𝗱𝗲𝗻𝘁 Transformations, for example feature normalization > Specific to one model > Applied in both training and inference 3️⃣ 𝗢𝗻-𝗗𝗲𝗺𝗮𝗻𝗱 Transformations > Require real-time data > Used in online inference Once you understand how and 𝗪𝗛𝗘𝗥𝗘 your data transformation happens, you are in a good position to start building ML software that works. If you want to learn more about the taxonomy of data transformations read this excellent blog post by the great @jim_dowling > 🔗 https://t.co/q7gTBCbxag ---- Hi there! It's Pau Labarta Bajo 👋 Every day I share free, hands-on content, on production-grade ML, to help you build real-world ML products. 𝗙𝗼𝗹𝗹𝗼𝘄 𝗺𝗲 so you don't miss what's coming next

3

111

20

59

5K

TheMagicLex retweeted

almost 2 years ago

Why is building AI systems hard? Because even with the simplest feature that you can imagine, you need to consider where it will be computed based on whether it's a batch AI system or a real-time AI system. Luckily, there are some principles to guide you. A thread 🧵👇

jim_dowling's tweet photo. Why is building AI systems hard?
Because even with the simplest feature that you can imagine, you need to consider where it will be computed based on whether it's a batch AI system or a real-time AI system.
Luckily, there are some principles to guide you.
A thread 🧵👇 https://t.co/6wjWbSfsNI

1

14

5

9

1K

TheMagicLex retweeted

Hamel Husain

@HamelHusain

almost 2 years ago

Just use Python

7

130

8

9K

TheMagicLex retweeted

almost 2 years ago

From our SIGMOD'24 paper, an easier read on the work we have done and are doing on making #rondb the database for real-time AI applications. If you thought #redis is good enough for real-time AI, please read this and tell us if we have changed your mind. https://t.co/YCClK5lKIi

0

7

5

0

466

almost 2 years ago

Mediocre data access and painful queries = wasting time and money. It may not be a matter of survival, but it might be a matter of competitive advantage; AI lakehouses have a native Python support. #AI #Lakehouse

0

17

almost 2 years ago

If you're relying on traditional lakehouses for AI, you’re behind before yo even started. This article discusses the move to AI lakehouses with a Python-native query engine, giving Python the respect it deserves; it just makes AI systems work. Read on: https://t.co/BbZgcfPzJJ

1

0

9

almost 2 years ago

The trick they employ is to force Python clients to use clunky JDBC/ODBC interfaces. The result? Slow, inefficient data handling. A native query engine bypasses this mess, delivering data at lightning speed. #AI #Performance

1

0

11

TheMagicLex retweeted

Hopsworks

@hopsworks

almost 2 years ago

Did you know that you can generate and manage your training data right from the Hopsworks UI? @TheMagicLex show how you can simplify your data workflow and enhance traceability with just a few clicks. 🔄 https://t.co/RVZRxqG3k6

0

1

0

97

almost 2 years ago

A tool that helps enforcing best practices is a good tool. A tool that needs you to enforce best practices upon it is a waste of time. Feature stores are built for purpose, not sort of stuck together in hope of nothing breaking. It’s about effient and good practices. #AI #MLOps

0

6

almost 2 years ago

Ever been in a room when someone said "Our Data Warehouse will do" only to rebuild your AI system few months later? Let's be honest, feature stores have always been the missing piece you knew you needed, but hoped the warehouse would sort of fit: https://t.co/2KpDlIQePT

1

0

7

almost 2 years ago

Data marts can't keep up with modern AI. Its just not made for it. For example; feature stores offer support for creating solid point-in-time correct training data. Unless, of course your AI systems do not need training ¯\_(ツ)_/¯ ? #AI #TrainingData

1

0

6

almost 2 years ago

You want pushdown LEFT JOIN. You just didnt know it. #RonDB does it. and it dramatically reduces latency and increases throughput for queries. This means faster, more efficient data retrieval—critical for high-performance AI and real-time use cases. #AI #Performance

0

9

almost 2 years ago

Your AI models are underperforming. Why? Outdated data schemas. The article explains the shift from star schema to snowflake schema for feature stores, increasing feature richness and performance. Lowering failure. Get slightly more enlightened: https://t.co/46dJlmxKjY TLDR;

1

0

13