Jebish7 @jebish7 - Twitter Profile

Jebish7 @jebish7

8 days ago

@jieyuzhao11 4 papers in total, possibly looking at 8. I would have to leave my job for 15 papers 😭

0

310

Jebish7 @jebish7

26 days ago

@ShayneRedford @alex_pentland @sarahookr @PeterHndrsn Congratulations Shayne.

0

1

0

32

jebish7 retweeted

Cohere Labs

@Cohere_Labs

about 2 months ago

🚀 Our community-led ML Agents group is kicking off a new collaborative project to build a Street Navigation Agent for more inclusive, region-aware local search. In many parts of the world, businesses exist physically, but not digitally.

They're exploring how AI can use tools like Google Street View to read storefront signs, apply distance & category constraints, and reason step-by-step to identify real-world services.

We’re also building a global benchmark across countries and languages to evaluate visual verification.

1

11

2

8

974

jebish7 retweeted

Cohere Labs

@Cohere_Labs

about 2 months ago

Interested in contributing?

✨ Beginners welcome: no hard requirements

✨ Familiarity with VLMs is helpful for evaluation

✨ Experience with agentic workflows and PyTorch is a plus

Learn more and get involved today: https://t.co/oOVlxQhTm7

Many thanks to our community leads @_1024_m, @ankanpy, @SovitRath5 and @jebish7 for leading this initiative!

0

9

2

7

776

jebish7 retweeted

Cohere Labs

@Cohere_Labs

2 months ago

Cohere Labs x ICLR 2026: Kaleidoscope

A multilingual multimodal benchmark with exam-style questions written directly in 18 languages (not translated from English). https://t.co/4a4gtsI8hs

1

23

12

2

4K

Jebish7 @jebish7

3 months ago

@universeinanegg Just today I was thinking about what happens when we train LLMs on their own data. Looks like not too long till I get the answer.

0

1

0

76

Jebish7 @jebish7

4 months ago

@universeinanegg Saving this just in case…

1

0

31

Jebish7 @jebish7

4 months ago

@awsdevelopers AWS

0

9

Jebish7 @jebish7

5 months ago

@universeinanegg Thanks for bringing this to my feed. Went through a few threads. My biggest takeaway is that we are really lacking in our evaluations of models. Really fascinating idea, and it kinda also shows how agents operate in social settings.

0

1

0

29

Jebish7 @jebish7

5 months ago

@sarahookr With No Bias whatsoever, can say Momo is the best.

0

2

0

506

Jebish7 @jebish7

5 months ago

@skoularidou Congratulations. As an aspiring researcher, I don’t think 3K is trivial. It shows that researchers need your paper.

0

1

0

151

Jebish7 @jebish7

5 months ago

@tomssilver Relocate to South Asia? You will have the deadline at 5-6 PM, so typical office hours.

0

1

0

526

Jebish7 @jebish7

5 months ago

@universeinanegg Tried this with Claude and GPT, and it’s the same. Gemini 3 Pro did give two 3s (after thinking a lot). Though for all of them the first two numbers are 3 followed by 1 (irrespective of temperature). Models seem to love 3.

0

50

Jebish7 @jebish7

5 months ago

@Kuvvius Congratulations 🎊

1

0

214

Jebish7 @jebish7

5 months ago

Kaleidoscope has been accepted at ICLR 🔥. This is the first-of-its-kind massive multimodal multilingual benchmark. Congratulations to everyone @Cohere_Labs 🎊

Sara Hooker

@sarahookr

5 months ago

Congrats to everyone involved in Kaleidoscope, a cross-institutional collaboration accepted to ICLR 2026 🔥

A special shoutout to @mziizm who championed this collaboration from day 1. It is the first accepted paper for many of the collaborators who are first time authors. https://t.co/3dTkmYeVAb

4

62

14

4

7K

0

2

0

198

Jebish7 @jebish7

5 months ago

@Cohere_Labs kick-started my research journey. Two years later, we continue moving forward together.

Cohere Labs

@Cohere_Labs

5 months ago

Many researchers join our community seeking mentorship, support, and a roadmap as they embark on their journeys.

@_1024_m and @jebish7 did just this. Now, just 2 years later, they are creating these pathways for others, opening doors, and leading the way. https://t.co/3KW5awpR5n

1

15

3

2K

0

5

1

686

jebish7 retweeted

Cheng Qian

@qiancheng1231

5 months ago

🔮 Can a world model (simulator) give today’s AI agents foresight? We tested “world model as a tool”… and found it often doesn’t help; sometimes it hurts.

Check our newest paper here: https://t.co/nujSGeHKMx

#AIagents #WorldModel #ToolUse https://t.co/cHbfRg2pzb

1

52

19

15

12K

jebish7 retweeted

David Chiang @davidweichiang

6 months ago

@ReviewAcl pretty please extend the Jan deadline

0

9

1

0

444

Jebish7 @jebish7

7 months ago

@PingbangHu There was sudden score inflation after the leak, so this was the only realistic way to fix it. They could have rolled everything back to just before the leak, but that would’ve been unfair to people whose reviewers hadn’t responded yet.

1

0

446

jebish7 retweeted

Catherine Arnett @linguist_cat

7 months ago

Very excited to see that Global PIQA is already being used to evaluate multilingual capabilities in new models!

1

29

7

1

2K
