Pablo Delgado

12 days ago

1/ World model research is fragmented: every paper reimplements its own data pipeline, baselines, and eval harness. Comparing two methods fairly is weeks of infra work. 𝘀𝘁𝗮𝗯𝗹𝗲-𝘄𝗼𝗿𝗹𝗱𝗺𝗼𝗱𝗲𝗹 is a new open-source platform that standardizes the whole thing: https://t.co/Gg3V3LhKJr

lancedb's tweet photo. 1/ World model research is fragmented: every paper reimplements its own data pipeline, baselines, and eval harness. Comparing two methods fairly is weeks of infra work.

𝘀𝘁𝗮𝗯𝗹𝗲-𝘄𝗼𝗿𝗹𝗱𝗺𝗼𝗱𝗲𝗹 is a new open-source platform that standardizes the whole thing: https://t.co/Gg3V3LhKJr

1

58

13

39

18K

2 months ago

@changhiskhan @lancedb Fell for it. I realized this was the perfect day to release TurboQuant vector search in LanceDB ;) ( cc @eddyxu )

1

5

0

1

298

It's all consuming. Home of @KinjaDeals. Learn more here: https://t.co/uS5BPtmVWd

2 months ago

@javisantana ponle “acceleration boost” y tendras modo deportivo…

0

164

Who to follow

The Inventory

@ItsTheInventory

Jorge Dias

@dias_jorge

Choose life, choose a job, choose a career, choose a family...

Aish Fenton

@aishfenton

Wrangling models @ OpenAI, Kiwi, and cat dad. He/Him.

pablete retweeted

Julien Chaumond

@julien_c

4 months ago

in case you missed it @lancedb and HF are partnering up to unlock the next generation of large dataset storage on the Hub 🔥 And it's fire! - Supports storing embeddings (and their indexes) directly alongside the data - Vector search / similarity search is built-in - Large multimodal datasets (text, images, video) just use the hf:// prefix: db = lancedb. connect("hf://datasets/julien-c/hub-stats-lance") 🔥🔥

julien_c's tweet photo. in case you missed it @lancedb and HF are partnering up to unlock the next generation of large dataset storage on the Hub 🔥

And it's fire!

- Supports storing embeddings (and their indexes) directly alongside the data
- Vector search / similarity search is built-in
- Large multimodal datasets (text, images, video)

just use the hf:// prefix:

db = lancedb. connect("hf://datasets/julien-c/hub-stats-lance")

🔥🔥

0

79

16

25

8K

4 months ago

@changhiskhan Another instance of Carcinisation ;)

0

1

0

7

5 months ago

@gptcrosa Dieter Rams inspired iPhone stand https://t.co/OSmLxhHTcK

0

2

0

51

pablete retweeted

Yoav HaCohen

@yoavhacohen

5 months ago

Most video models are silent. Most audio models don’t see. LTX-2 learns the joint distribution of sound and vision, generating speech, foley, ambience, motion, and timing together not as a post-hoc pipeline.

yoavhacohen's tweet photo. Most video models are silent.
Most audio models don’t see.
LTX-2 learns the joint distribution of sound and vision, generating speech, foley, ambience, motion, and timing together not as a post-hoc pipeline. https://t.co/iJD6AfkXvv

1

48

1

9

5K

pablete retweeted

DailyPapers

@HuggingPapers

5 months ago

HiStream Meta AI researchers introduce an efficient autoregressive framework for 1080p video generation. By eliminating spatial, temporal, and timestep redundancy, HiStream achieves state-of-the-art quality with up to 107.5× speedup, making high-resolution video generation practical.

HuggingPapers's tweet photo. HiStream

Meta AI researchers introduce an efficient autoregressive framework for 1080p video generation. By eliminating spatial, temporal, and timestep redundancy, HiStream achieves state-of-the-art quality with up to 107.5× speedup, making high-resolution video generation practical.

3

129

14

71

11K

pablete retweeted

7 months ago

Lei Xu and Pablo Delgado of @netflix took the stage on Wednesday at Ray Summit 2025!

1

10

1

2K

pablete retweeted

7 months ago

@netflix Our session is on Day 2, Nov 5 at 4pm. Check out the agenda here: https://t.co/FjDVqhsh30

0

1

0

277

pablete retweeted

7 months ago

We’ll walk through how Ray enables large-scale processing across hundreds of GPUs, while LanceDB’s columnar design provides efficient, intelligent curation and sampling. Together, they’re producing smaller, more diverse, and higher-quality datasets for cutting-edge text-to-image and video-to-text research.

lancedb's tweet photo. We’ll walk through how Ray enables large-scale processing across hundreds of GPUs, while LanceDB’s columnar design provides efficient, intelligent curation and sampling. Together, they’re producing smaller, more diverse, and higher-quality datasets for cutting-edge text-to-image and video-to-text research.

1

2

1

0

277

pablete retweeted

7 months ago

Building and curating large-scale multimodal datasets has long been a complex, resource-heavy challenge. But that’s changing fast. Lei Xu of LanceDB and Pablo Delgado of @netflix will be speaking at Ray Summit 2025 — Scaling Multimodal Data Curation with Ray and LanceDB

lancedb's tweet photo. Building and curating large-scale multimodal datasets has long been a complex, resource-heavy challenge. But that’s changing fast. Lei Xu of LanceDB and Pablo Delgado of @netflix will be speaking at Ray Summit 2025 — Scaling Multimodal Data Curation with Ray and LanceDB https://t.co/Zs9VGCok0J

1

9

3

0

775

pablete retweeted

changhiskhan

@changhiskhan

8 months ago

Now this is a @lancedb feature to make @Noahpinion proud. Introducing RabitQ: better compression, better recall, faster index build, higher throughput. https://t.co/46vSI8eq7n

2

8

1

646

pablete retweeted

9 months ago

🥳 Welcome another #Lancelot at the Roundtable, Ethan Rosenthal 🎉 On Ethan’s first day at @runwayml , he was tasked with building a multimodal 𝗱𝗮𝘁𝗮 𝘀𝘆𝘀𝘁𝗲𝗺 𝘁𝗵𝗮𝘁 𝘀𝘂𝗽𝗽𝗼𝗿𝘁𝗲𝗱 𝗯𝗼𝘁𝗵 𝗱𝗶𝘀𝘁𝗿𝗶𝗯𝘂𝘁𝗲𝗱 𝗱𝗮𝘁𝗮𝗹𝗼𝗮𝗱𝗶𝗻𝗴 𝗮𝗻𝗱 𝗲𝘅𝗽𝗹𝗼𝗿𝗮𝘁𝗼𝗿𝘆 𝗱𝗮𝘁𝗮 𝗮𝗻𝗮𝗹𝘆𝘀𝗶𝘀. He said, “𝘛𝘩𝘢𝘵’𝘴 𝘢 𝘵𝘦𝘳𝘳𝘪𝘣𝘭𝘦 𝘪𝘥𝘦𝘢. 𝘠𝘰𝘶 𝘴𝘩𝘰𝘶𝘭𝘥 𝘯𝘦𝘷𝘦𝘳 𝘵𝘳𝘺 𝘵𝘰 𝘥𝘰 𝘵𝘩𝘪𝘴 𝘸𝘪𝘵𝘩 𝘰𝘯𝘦 𝘴𝘺𝘴𝘵𝘦𝘮!". He then found #Lance and did exactly what he said not to do. 😆

lancedb's tweet photo. 🥳 Welcome another #Lancelot at the Roundtable, Ethan Rosenthal 🎉

On Ethan’s first day at @runwayml , he was tasked with building a multimodal 𝗱𝗮𝘁𝗮 𝘀𝘆𝘀𝘁𝗲𝗺 𝘁𝗵𝗮𝘁 𝘀𝘂𝗽𝗽𝗼𝗿𝘁𝗲𝗱 𝗯𝗼𝘁𝗵 𝗱𝗶𝘀𝘁𝗿𝗶𝗯𝘂𝘁𝗲𝗱 𝗱𝗮𝘁𝗮𝗹𝗼𝗮𝗱𝗶𝗻𝗴 𝗮𝗻𝗱 𝗲𝘅𝗽𝗹𝗼𝗿𝗮𝘁𝗼𝗿𝘆 𝗱𝗮𝘁𝗮 𝗮𝗻𝗮𝗹𝘆𝘀𝗶𝘀. He said, “𝘛𝘩𝘢𝘵’𝘴 𝘢 𝘵𝘦𝘳𝘳𝘪𝘣𝘭𝘦 𝘪𝘥𝘦𝘢. 𝘠𝘰𝘶 𝘴𝘩𝘰𝘶𝘭𝘥 𝘯𝘦𝘷𝘦𝘳 𝘵𝘳𝘺 𝘵𝘰 𝘥𝘰 𝘵𝘩𝘪𝘴 𝘸𝘪𝘵𝘩 𝘰𝘯𝘦 𝘴𝘺𝘴𝘵𝘦𝘮!". He then found #Lance and did exactly what he said not to do. 😆

1

7

2

1K

pablete retweeted

Magnific

@magnific

9 months ago

Welcome to Freepik Spaces A single place where ideas live, connected through real-time workflows Our CEO and CPO are presenting the future of Freepik live from Upscale Studios NYC Join the waitlist below

magnific's tweet photo. Welcome to Freepik Spaces

A single place where ideas live, connected through real-time workflows

Our CEO and CPO are presenting the future of Freepik live from Upscale Studios NYC

Join the waitlist below https://t.co/dDHNAkGhR0

40

343

61

80

2M

9 months ago

@changhiskhan @VLDBconf want!

0

2

0

44

pablete retweeted

Ning Yu @realNingYu

9 months ago

Video diffusion models struggle beyond training resolution → artifacts & repetition. 🎥CineScale🎥 solves this with a novel inference paradigm: ⚡ Dedicated variants for video architectures ⚡ Extends T2I to T2V & I2V & V2V ⚡ 8K images & 4K video, tuning-free/minimal tuning Expanding the frontier of generative video fidelity. ✊ Kudos to the teamwork led by our intern @qhnmoon at @eyelinestudios. #AI #AIResearch #MachineLearning #AIGC #GenAI #videos #DiffusionModels #HighRes #fidelity #ComputerVision #internship

realNingYu's tweet photo. Video diffusion models struggle beyond training resolution → artifacts & repetition.
🎥CineScale🎥 solves this with a novel inference paradigm:
⚡ Dedicated variants for video architectures
⚡ Extends T2I to T2V & I2V & V2V
⚡ 8K images & 4K video, tuning-free/minimal tuning
Expanding the frontier of generative video fidelity.
✊ Kudos to the teamwork led by our intern
@qhnmoon at @eyelinestudios.

#AI #AIResearch #MachineLearning #AIGC #GenAI #videos #DiffusionModels #HighRes #fidelity #ComputerVision #internship

0

47

14

15

5K

pablete retweeted

changhiskhan

@changhiskhan

10 months ago

I’ve been a huge fan of the Netflix engineering blog for a long time. So so excited for @lancedb to be an important part of the multimodal AI transformation in data engineering https://t.co/AMmBOMISFR

changhiskhan's tweet photo. I’ve been a huge fan of the Netflix engineering blog for a long time. So so excited for @lancedb to be an important part of the multimodal AI transformation in data engineering

https://t.co/AMmBOMISFR https://t.co/stJwUEukO3

3

121

16

54

13K