Ram Kadiyala

7 days ago

@aakrit Any context on what areas ?

0

584

10 days ago

What has been publicly “rolledback” may still privately be happening as there is no way to know what was degraded and whats not. Previously it was “accidental overlook” when it came to coding over several turns of chat via IDEs (ex: through cursor) where minor mistakes were added so a few more turns of usage could increase token usage. While most coding benchmarks arent multi-turn coding sessions via chat, this would have no impact on scores. Although the models are far better than what openAI or google offer currently, One should reduce dependence on whatever model is SOTA at a given time and limit the usage to harder tasks or the ones other models are failing to grasp/solve. Same applies for any tools, I personally wouldnt use claude/codex unless I absolutely have to, but regulary use the both of them via cursor or other IDEs

MTS @MTSlive

10 days ago

SITUATION UPDATE: Anthropic is reversing its Fable 5 policy of covertly degrading performance for competing AI researchers, per Wired.

40

1K

32

97

140K

0

53

27 days ago

@kenwuuuu Strands / Google ADK depending upon what cloud platform you are (going to be) using.

0

1

0

2

2K

about 2 months ago

@therealoliulv @speedrun Unfortunately, this is something that exists out of the box for aws, some service companies have built this already. https://t.co/9JVg2p5GGm

0

126

about 2 months ago

We have a bi-weekly meets and are open to contributions. No prior exp or publications are needed. We also offer co-authorship to contributors.

about 2 months ago

🚀 Our community-led ML Agents group is kicking off a new collaborative project to build a Street Navigation Agent for more inclusive, region-aware local search. In many parts of the world, businesses exist physically — but not digitally. They're exploring how AI can use tools like Google Street View to read storefront signs, apply distance & category constraints, and reason step-by-step to identify real-world services. We’re also building a global benchmark across countries and languages to evaluate visual verification.

Cohere_Labs's tweet photo. 🚀 Our community-led ML Agents group is kicking off a new collaborative project to build a Street Navigation Agent for more inclusive, region-aware local search. In many parts of the world, businesses exist physically — but not digitally.

They're exploring how AI can use tools like Google Street View to read storefront signs, apply distance & category constraints, and reason step-by-step to identify real-world services.

We’re also building a global benchmark across countries and languages to evaluate visual verification.

1

11

2

8

974

0

2

0

96

2 months ago

Will be there at the @ycombinator startup school. Open to discussing any potential matches/ideas on data/evals/agents.

Jared Friedman

@snowmaker

2 months ago

Tomorrow. YC Startup School India.

202

4K

115

228

676K

0

79

_1024_m retweeted

Sara Hooker

@sarahookr

5 months ago

Congrats to everyone involved in Kaleidoscope, a cross-institutional collaboration accepted to ICLR 2026 🔥 A special shoutout to @mziizm who championed this collaboration from day 1. It is the first accepted paper for many of the collaborators who are first time authors.

sarahookr's tweet photo. Congrats to everyone involved in Kaleidoscope, a cross-institutional collaboration accepted to ICLR 2026 🔥

A special shoutout to @mziizm who championed this collaboration from day 1. It is the first accepted paper for many of the collaborators who are first time authors. https://t.co/3dTkmYeVAb

4

62

14

4

7K

_1024_m retweeted

5 months ago

Many researchers join our community seeking mentorship, support, and a roadmap as they embark on their journeys. @_1024_m and @jebish7 did just this. Now, just 2 years later, they are creating these pathways for others, opening doors, and leading the way.

Cohere_Labs's tweet photo. Many researchers join our community seeking mentorship, support, and a roadmap as they embark on their journeys.

@_1024_m and @jebish7 did just this. Now, just 2 years later, they are creating these pathways for others, opening doors, and leading the way. https://t.co/3KW5awpR5n

1

15

3

2K

_1024_m retweeted

6 months ago

In 2025, our Open Science Community Leads showed what’s possible when AI research is built in the open. 38 leads, 17 programs, 125 guest speakers advancing open, collaborative AI across the world (find all talks here! https://t.co/UBXm7nkwB1). 🤯

Cohere_Labs's tweet photo. In 2025, our Open Science Community Leads showed what’s possible when AI research is built in the open.

38 leads, 17 programs, 125 guest speakers advancing open, collaborative AI across the world (find all talks here! https://t.co/UBXm7nkwB1). 🤯 https://t.co/EgwUZKiNcl

2

30

20

4

4K

8 months ago

(3/3) Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance. https://t.co/8HkIeXYHRr A Hindi-English bi-lingual LLM with over 140 checkpoints trained with variations in data distributions. Findings : - LLM-translated data can work as good as real data to address lack of data - Each task type has a different optimal data distribution amount, which could be determined by test runs on a subset of data. - LLM-generated thinking texts were made descriptive yet concise, this led to less emission (less token consumptions) during evals for text-generation tasks while providing better performance. Release : - Open Data, Models and 140 Checkpoints https://t.co/rkG5vXBhZu https://t.co/ukSrLOShq8

0

3

0

70

8 months ago

Three of our papers have been accepted at AACL 2025 @aaclmeeting (2 Main, 1 Findings). 1. DSBC : Data Science task Benchmarking with Context engineering https://t.co/WwYwQ6uunl 2. Uncovering Cultural Representation Disparities in Vision-Language Models https://t.co/jTtbZqGx3w 3. Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance https://t.co/8HkIeXYHRr Grateful to the co-authors @SidYaeger @Siddartha_10 @jebish7 @delliott @alexrs95 @_sumand @_srishtiyadav @KanwalMehreen2 This was made possible through research grants from @TraversaalAI @AnthropicAI @Cohere_Labs

1

8

3

2

792

8 months ago

(2/3) Uncovering Cultural Representation Disparities in Vision-Language Models https://t.co/jTtbZqGx3w https://t.co/VASt9oPevl Key Highlights : - We test several VLMs at country/culture recognition task in 3 settings : Open-ended, MCQs with similar or neighbouring countries, MCQs with random countries - We also test them by image ablations (noise, rotations, greyscaling, etc..) Findings : - Country level biases do correlate with country wise availability of online data i.e more data or mentions >> less bias or misclassification. This contradicts the common assumption of western-favouritism. - Image perturbations affect biases in a very random way even among models belonging to the same family. - Language of prompt had negligible effect other than improving accuracy over countries that speak the language.

1

4

0

98

_1024_m retweeted