Casey A. Fitzpatrick @caseyfitz - Twitter Profile

Pinned Tweet

almost 2 years ago

Hard to believe it’s barely been a year since @douwekiela called to order our very first all hands, then @apsdehal and I spent hours in a tiny room with a whiteboard laying out the technical vision for what we were about to do. So proud of the team we’ve built and what’s to come!

Contextual AI

@ContextualAI

almost 2 years ago

We’re excited to share today that we’ve raised $80M in Series A funding to accelerate our mission to change the way the world works through AI. Read more at our blogpost: https://t.co/aTLx0NmQfr

ContextualAI's tweet photo. We’re excited to share today that we’ve raised $80M in Series A funding to accelerate our mission to change the way the world works through AI. Read more at our blogpost: https://t.co/aTLx0NmQfr https://t.co/dGSqyZeeFg

10

281

41

27

89K

1

12

1

0

2K

Casey A. Fitzpatrick @caseyfitz

over 1 year ago

Another great example of how we frame the problems of enterprise ai from a systems perspective... and this component is a key player 😎 (which is also SOTA by itself 🙌

Douwe Kiela

@douwekiela

over 1 year ago

AI struggles with messy, conflicting, ever-changing data. Today's AI ranking methods can't prioritize clearly, because they lack human guidance. Introducing the world's first instruction-following, SOTA reranker! Give our reranker instructions to control exactly how it ranks: • “Prioritize recent documents” • “Prefer PDFs over other sources” • “The boss is always right” Can’t wait to see what people build with it!

18

500

48

277

210K

0

5

0

111

Casey A. Fitzpatrick @caseyfitz

over 1 year ago

@douwekiela Psyched to show the world a little bit more about how we're tackling real problems that will unblock AI's true potential.

0

1

0

238

Casey A. Fitzpatrick @caseyfitz

almost 2 years ago

Here, have a truly open MoE 🤲. More amazing work by @Muennighoff! Tons to dive into here.

Niklas Muennighoff @Muennighoff

almost 2 years ago

Releasing OLMoE - the first good Mixture-of-Experts LLM that's 100% open-source - 1B active, 7B total params for 5T tokens - Best small LLM & matches more costly ones like Gemma, Llama - Open Model/Data/Code/Logs + lots of analysis & experiments 📜https://t.co/Vpac2q90CS 🧵1/9

Muennighoff's tweet photo. Releasing OLMoE - the first good Mixture-of-Experts LLM that's 100% open-source
- 1B active, 7B total params for 5T tokens
- Best small LLM & matches more costly ones like Gemma, Llama
- Open Model/Data/Code/Logs + lots of analysis & experiments

📜https://t.co/Vpac2q90CS
🧵1/9 https://t.co/YOMV5t2Td1

23

928

225

541

203K

0

6

0

550

Who to follow

FAMAF UNC

@famaf_unc

Página oficial de la Facultad de Matemática, Astronomía, Física y Computación -FAMAF- en X (ex Twitter).

Engineer. Founder @remoroolabs. Previously built self-driving cars and farming robots. Still moving.

caseyfitz retweeted

Stas Bekman

@StasBekman

almost 2 years ago

Here is a new Machine Learning Engineering chapter: Network debug https://t.co/4g90Eq8o14 The intention is to help non-network engineers to figure out how to resolve common problems around multi-gpu and multi-node collectives networking - it's heavily NCCL-biased at the moment. Will extend with RCCL and others when I get access to those. Your feedback and corrections are always welcome.

StasBekman's tweet photo. Here is a new Machine Learning Engineering chapter: Network debug

https://t.co/4g90Eq8o14

The intention is to help non-network engineers to figure out how to resolve common problems around multi-gpu and multi-node collectives networking - it's heavily NCCL-biased at the moment. Will extend with RCCL and others when I get access to those.

Your feedback and corrections are always welcome.

4

172

40

103

9K

Casey A. Fitzpatrick @caseyfitz

about 2 years ago

ty @charles_irl and gang

Stas Bekman

@StasBekman

about 2 years ago

The kind folks from @modal_labs have just shared with me this 10-100x faster drop in replacement for pip https://t.co/GYzRtkZpsH If you want a much faster CI startup switch to uv now! To use you just add `uv` before `pip` and everything else is the same, so: pip install uv uv pip install -e . uv pip install torch uv pip compile ... etc.

3

136

15

61

13K

0

3

0

143

Casey A. Fitzpatrick @caseyfitz

about 2 years ago

@hugobowne @sh_reya @VanishingData Haha with none other than the legendary @charles_irl in the mix. Small world.

1

2

0

30

caseyfitz retweeted

Douwe Kiela

@douwekiela

about 2 years ago

Maximizing expected human utility (i.e., KTO) is the natural way to do alignment. Cool to see how well this works even in diffusion models.

0

22

5

6K

caseyfitz retweeted

sarah guo

@saranormous

about 2 years ago

from @douwekiela (a pioneer of Retrieval-Augmented Generation) on the value of end-to-end co-optimization of systems (language models + retrievers):

0

28

5

11

11K

Casey A. Fitzpatrick @caseyfitz

about 2 years ago

😎

Nathan Lambert

@natolambert

about 2 years ago

contextual . onlybangers . ai

0

23

5

6K

0

118

Casey A. Fitzpatrick @caseyfitz

about 2 years ago

🤩

Omar Khattab

@lateinteraction

about 2 years ago

Not everyone gets to call their new system RAG 2.0 — exciting announcement by the @ContextualAI team, who have been thinking about these problems for many years.

1

109

14

57

15K

0

120

Casey A. Fitzpatrick @caseyfitz

about 2 years ago

rip FRAG 🪦 we'll never forget the many meetup demos

Aaref Hilaly

@aaref

about 2 years ago

2023: RAG vs no RAG 2024: RAG 2.0 vs frozen RAG As AI moves into production, simple RAG systems are not enough. @douwekiela & @ContextualAI show that with data.

0

34

7

2

3K

0

5

0

91

Casey A. Fitzpatrick @caseyfitz

about 2 years ago

Last year I joined Contextual AI to focus on designing and building production-grade AI systems, from first principles, focused on real world workflows not demos. Today I'm excited to share some of what we've done so far!

Contextual AI

@ContextualAI

about 2 years ago

Today, we’re excited to announce RAG 2.0, our end-to-end system for developing production-grade AI. Using RAG 2.0, we’ve created Contextual Language Models (CLMs), which achieve state-of-the-art performance on a variety of industry benchmarks. CLMs outperform strong RAG baselines built using GPT-4 and top open-source models like Mixtral, according to our research and customers. Read more in our blog post: https://t.co/YUFgTS3izT

ContextualAI's tweet photo. Today, we’re excited to announce RAG 2.0, our end-to-end system for developing production-grade AI.

Using RAG 2.0, we’ve created Contextual Language Models (CLMs), which achieve state-of-the-art performance on a variety of industry benchmarks. CLMs outperform strong RAG baselines built using GPT-4 and top open-source models like Mixtral, according to our research and customers.

Read more in our blog post: https://t.co/YUFgTS3izT

33

917

119

337

196K

0

10

1

0

280

Casey A. Fitzpatrick @caseyfitz

over 2 years ago

👋hiiii we’re here to play!

Winnie Xu @winniethexu

over 2 years ago

Excited to share a new model with @ContextualAI that tops the AlpacaEval 2.0 leaderboard! How did we manage to rank higher than models like GPT4, Claude 3 and Mistral Medium? Enter iterative alignment… 🧵

winniethexu's tweet photo. Excited to share a new model with @ContextualAI that tops the AlpacaEval 2.0 leaderboard!

How did we manage to rank higher than models like GPT4, Claude 3 and Mistral Medium? Enter iterative alignment… 🧵 https://t.co/TscUlrYs7U

11

203

33

111

73K

0

3

0

127

caseyfitz retweeted

Kawin Ethayarajh

@ethayarajh

over 2 years ago

The Orca-Math paper does a comparison of DPO and KTO for mathematical reasoning, finding that KTO is slightly better when all data is used and 25+ pts better when you have fewer positive examples than negative examples.

ethayarajh's tweet photo. The Orca-Math paper does a comparison of DPO and KTO for mathematical reasoning, finding that KTO is slightly better when all data is used and 25+ pts better when you have fewer positive examples than negative examples. https://t.co/7zyzIaRuMY

3

120

23

72

33K

caseyfitz retweeted

Omar Khattab

@lateinteraction

over 2 years ago

I'm glad that a lot more people understand the key ideas behind ColBERT and DSPy now. My only remaining goal is to make sure people can also say them correctly; both are quite tricky😆 * Col-BAIR (it's "the late" interaction retriever, get it?) * Dee-Ess-Pie (like num-pie)

18

185

11

67

34K

Casey A. Fitzpatrick @caseyfitz

over 2 years ago

😊 🙏

Andrew Carr 🤸

@andrew_n_carr

over 2 years ago

I know everyone is excited about Mixtral and the new Hyena models - but @ContextualAI just dropped a pile of cool new models and a new alignment framework https://t.co/Mmn3Vvff3T

andrew_n_carr's tweet photo. I know everyone is excited about Mixtral and the new Hyena models - but @ContextualAI just dropped a pile of cool new models and a new alignment framework

https://t.co/Mmn3Vvff3T https://t.co/zRErEbQU4q

4

200

29

91

24K

0

3

0

164

caseyfitz retweeted

fraser

@Fraser

over 2 years ago

I believe strongly that: 1) The best products that will emerge from this moment are “full stack”, with teams training their own models, and the models & UI informing one another. 2) This requires researchers who care deeply about what’s best for the product, including data

19

263

24

73

85K

Casey A. Fitzpatrick @caseyfitz

over 2 years ago

🌠🌠🌠

Contextual AI

@ContextualAI

over 2 years ago

We're proud to be a 2023 #IA40 Intelligent Applications Rising Star winner!

0

15

7

0

4K

0

80

Casey A. Fitzpatrick

@caseyfitz

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users