Roman Castagné

21 days ago

Introducing Cohere's first open-source coding model: North Mini Code Small & efficient, designed for agentic performance and built for community input.

RomCast_ retweeted

about 1 month ago

Introducing: Cohere Command A+ We’ve created our most powerful LLM yet, optimized it to run on as little hardware as possible, and released it open-source for all.

RomCast_ retweeted

3 months ago

Introducing: Cohere Transcribe – a new state-of-the-art in open source speech recognition.

RomCast_ retweeted

10 months ago

Introducing Command A Reasoning, our most advanced model for enterprise reasoning tasks.

RomCast_ retweeted

11 months ago

Introducing Command A Vision, a state-of-the-art generative model that excels across multimodal image capabilities that matter for enterprises!

Roman Castagné @RomCast_

about 1 year ago

@awnihannun @johnowhitaker Same thing for Llamba actually, they use the MLP weights, embeddings and layernorms from the teacher.

RomCast_ retweeted

Nathan Godey @nthngdy

over 1 year ago

🚀 New Paper Alert! 🚀 We introduce Q-Filters, a training-free method for efficient KV Cache compression! It is compatible with FlashAttention and can compress along generation which is particularly useful for reasoning models ⚡ ⬇️R1-Distill-Llama-8B with 128 KV pairs ⬇️ 🧵

RomCast_ retweeted

over 1 year ago

Today, we’re excited to release Command R7B, the smallest, fastest, and final model in the R series. It’s a powerhouse with a minimal footprint.

RomCast_ retweeted

Ekagra Ranjan @EkagraRanjan

over 1 year ago

Introducing Command R7B: the smallest, fastest, and final model in our R series of enterprise-focused LLMs! It delivers a powerful combination of state-of-the-art performance in its class and efficiency to lower the cost of building AI applications. https://t.co/e2ah5c5x5J

RomCast_ retweeted

Cohere Labs

@Cohere_Labs

over 1 year ago

Aya Expanse: Combining Research Breakthroughs for a New Multilingual Frontier 🌿 Today, we release a technical report with extended evaluation for Aya Expanse, our new generation of 8B and 32B parameters multilingual language models 🌎

RomCast_ retweeted

almost 2 years ago

Building artefacts for JSON Schema is expensive to do at scale in production for LLM inference. Probably why OpenAI had "JSON mode" for a year but not "JSON schema mode" until @cohere released it first :)
* @cohere is >10x faster than OpenAI
* @cohere is >45x faster than Outlines

RomCast_ retweeted

Stella Biderman @BlancheMinerva

almost 2 years ago

We’re excited to announce our Series D financing to accelerate growth, expand our team, & develop our next class of frontier, enterprise-grade, data privacy-focused AI technology. We’re bringing highly scalable and secure AI solutions to global enterprises wherever their data is and in whatever language they speak.

RomCast_ retweeted

Nick Frosst

@nickfrosst

almost 2 years ago

@cohere just shipped json schema sampling! Now not only can you guarantee that the model returns valid json, you can actually ensure it returns json with a specific format! Big win for people actually building with LLMs :) https://t.co/fUEEqZxTAw

RomCast_ retweeted

Arena.ai

@arena

about 2 years ago

Exciting news - the latest Arena results are out!

@cohere's Command R+ has climbed to the 6th spot, matching GPT-4-0314 level by 13K+ human votes! It's undoubtedly the **best** open model on the leaderboard now🔥

Big congrats to @cohere's incredible work & valuable contribution to the open community!

More exciting updates:
- Qwen1.5-32B-Chat almost top-10
- Gemma-1.1-7B-it shows great improvement (1044 -> 1088, on par with Llama-2-70b)
- Starling-7B-Beta still the best 7B with over 13K votes!

RomCast_ retweeted

about 2 years ago

Hot damn, @cohere @CohereForAI is really bringing it

RomCast_ retweeted

about 2 years ago

One subtlety worth mentioning is how significant the tokenizer is to the cost to use models in non-English languages. Our tokenizer is meaningfully better than others at the 9 non-English languages, achieving up to a 2x effective cost reduction to use.
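
The link between tokenizer efficiency and cost can be illustrated with a minimal sketch, assuming usage is billed per token. The price and token counts below are hypothetical placeholders rather than figures from the tweet; the only point is that encoding the same text with half as many tokens halves the bill.

```python
def cost_usd(num_tokens: int, usd_per_million_tokens: float) -> float:
    """Per-token billing: cost scales linearly with the number of tokens."""
    return num_tokens * usd_per_million_tokens / 1_000_000


# All numbers below are assumed, for illustration only.
PRICE = 3.0                    # USD per million tokens (hypothetical)
tokens_baseline = 2_000_000    # a text corpus under a less efficient tokenizer
tokens_efficient = 1_000_000   # the same corpus under a tokenizer emitting half the tokens

baseline = cost_usd(tokens_baseline, PRICE)
efficient = cost_usd(tokens_efficient, PRICE)
reduction = baseline / efficient  # ratio of costs for identical text

print(f"baseline: ${baseline:.2f}, efficient: ${efficient:.2f}, reduction: {reduction:.1f}x")
```

With these assumed numbers the ratio of costs equals the ratio of token counts, which is what an "effective cost reduction" from a better tokenizer amounts to.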

RomCast_ retweeted

about 2 years ago

Today, we’re introducing Command R+: a state-of-the-art RAG-optimized LLM designed to tackle enterprise-grade workloads and speak the languages of global business. Our R-series model family is now available on Microsoft Azure, and coming soon to additional cloud providers.

RomCast_ retweeted

Cohere Labs

@Cohere_Labs

about 2 years ago

Announcing C4AI Command R+ open weights, a state-of-the-art 104B LLM with RAG, tooling and multilingual in 10 languages. This release builds on our 35B and is a part of our commitment to make AI breakthroughs accessible to the research community. 🎉 https://t.co/2UCLl5sfPB

RomCast_ retweeted

about 2 years ago

⌘R+ Welcoming Command R+, our latest model focused on scalability, RAG, and Tool Use. Like last time, we're releasing the weights for research use, we hope they're useful to everyone! https://t.co/HgESxxEYlK

RomCast_ retweeted