C S Krishna @CSkrishna - Twitter Profile

CSkrishna retweeted

Omar Khattab

@lateinteraction

10 days ago

Claude Code is finally an RLM (oct 2025), congrats to Anthropic :-)

18

490

42

220

79K

CSkrishna retweeted

Iceland Cricket

@icelandcricket

19 days ago

Our Prime Minister @KristrunFrosta meeting her Indian counterpart today, sorting out our first five match Test series. No doubt about it, you can see a cricketing look in her eye.

icelandcricket's tweet photo. Our Prime Minister @KristrunFrosta meeting her Indian counterpart today, sorting out our first five match Test series. No doubt about it, you can see a cricketing look in her eye. https://t.co/PX9JTy4Out

70

7K

551

124

82K

CSkrishna retweeted

Andrej Karpathy

@karpathy

19 days ago

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

8K

150K

11K

14K

27M

CSkrishna retweeted

Parth Asawa

@pgasawa

about 1 month ago

Today, we’re releasing Continual Learning Bench 1.0: the first, realistic benchmark for measuring how AI systems can improve in online settings. Benchmarks today assume models are stateless. Each example is independent, and once a system finishes a task, it moves on as if nothing happened. But deployed AI systems should learn from experience. We tested 10+ frontier systems against novel, expert-validated tasks and find there’s still plenty of headroom for learning. (1/n)

pgasawa's tweet photo. Today, we’re releasing Continual Learning Bench 1.0: the first, realistic benchmark for measuring how AI systems can improve in online settings.

Benchmarks today assume models are stateless. Each example is independent, and once a system finishes a task, it moves on as if nothing happened.

But deployed AI systems should learn from experience. We tested 10+ frontier systems against novel, expert-validated tasks and find there’s still plenty of headroom for learning. (1/n)

42

1K

156

900

830K

Who to follow

T.Hayase

@ThayaFluss

ランダム行列と自由確率論はいいぞ. Random Matrices, Free Probability Theory and Deep Neural Networks.

Julia Kiseleva

@julia_kiseleva

Genome-scale foundation models to read, predict, and design DNA The genome is the next foundation layer Former Microsoft Research · University of Amsterdam

Ashwin S Kumar

@ashwinskumar

Former humour contributor @asianetnewstv-@MyNation & @theunrealtimes; Novice Musician (@JHarrisjayaraj bhakt)+mimic; VIEWS PERSONAL+varying; RT!=endorsement

CSkrishna retweeted

Somnath Mukherjee @somnath1978

about 1 month ago

Why does a political party so ruthlessly competent in political management doesnt manage the same insanely-outlier outcomes in economic-development terms?

93

689

120

43

37K

CSkrishna retweeted

Charles 🎉 Frye

@charles_irl

about 1 month ago

this post has aged so excellently https://t.co/KqRiJy0fZq

1

59

3

33

4K

CSkrishna retweeted

Joey Gonzalez

@profjoeyg

about 1 month ago

There is a lot of hype around continual learning, but what is it and how do we evaluate it? With our new continual learning bench we sought to answer both of these questions. We developed a new methodology for designing continual learning tasks and a growth-based learning metric to isolate continual learning. Have you experienced models (agent loops) rapidly improving on your tasks? Do you have tasks that could benefit from continual learning? Let us know.

0

26

8

11

4K

CSkrishna retweeted

Omar Khattab

@lateinteraction

about 1 month ago

We're gearing up to release two research efforts I've been extremely excited about for quite some time. Y'all will really love these.

23

504

16

74

44K

CSkrishna retweeted

spacy

@dosco

about 1 month ago

this fleet RLM result screams mismanaged geniuses hypothesis. recursive agents (DSPy ReAct + sandbox) more than doubled accuracy on 100 long-horizon tasks 13% to 33%, with zero failures. perf on tasks in the logic domain exploded to 80% (8×). every "wrong" answer was just formatting, the actual reasoning was clean. we’re not missing model smarts. we’re missing proper management. better architectures beat bigger models.

4

140

15

147

15K

C S Krishna @CSkrishna

about 1 month ago

@somnath1978 @ShashiTharoor IWT in "abeyance" (Pakistanis reach for the dictionary every time we say this) can be changed to IWT "abrogated/annulled" etc after the next terror attack - this itself is a big deterrent.

0

1

0

71

C S Krishna @CSkrishna

about 1 month ago

@somnath1978 @ShashiTharoor also unstable and inequitable internally. If their most popular leader has been jailed, why are we expected to engage in good faith with a compromised polity in the quest for peace?

1

4

1

0

402

CSkrishna retweeted

Joey Gonzalez

@profjoeyg

about 1 month ago

Someone not bragging about a better number but instead reflecting on how we talk about things and where the field is headed. Thought leadership! We need more of this!

2

17

3

15

5K

CSkrishna retweeted

Prakash Dadlani

@prakdadlani

about 1 month ago

This is EPIC! Another Hong Kong based Sindhi going big in Bharat 🇮🇳🇭🇰 10,000 manufacturing jobs added in Odisha and this is just the start. The movement is REAL.

6

1K

174

100

47K

CSkrishna retweeted

Sridhar Vembu

@svembu

about 1 month ago

We are investing in foundational technologies across the board: recently in quantum sensing, advanced materials, and soon metallurgy. I am a big proponent of metallurgy R&D in particular. Without it, we cannot build nail cutters or precision machinery or jet engines. These are not flashy billion dollar investments to make headlines, they are foundational R&D that cost millions a year, stretched out over many years. The key is to SUSTAIN them for a decade or longer. Scientists and engineers need time and rock solid support. We also don't aim for prestige, we want to first replicate know-how already there. We have also been looking to partner with small Japanese companies with critical know-how. I have two fluent Japanese speakers with me now!

155

4K

676

186

78K

C S Krishna @CSkrishna

about 1 month ago

@VatsRishap how does this work? The Pakistan army has genrated amazing PR for itself in the process

0

2

0

294

C S Krishna @CSkrishna

about 1 month ago

@DivaJain2 vibes similar to puff pieces on Musharraf in the Western media after he agreed to ditch the Taliban and partner with America post 9/11

0

2

0

2

412

C S Krishna @CSkrishna

about 1 month ago

@Iyervval @ShivrattanDhil1 @SriLankaTweet @SriLanka We have a large captive population to feed off - but the sheer overpopulation mars the experience - traffic jams in Ranthambore for tiger sighting being a case in point!!

1

9

0

1K

C S Krishna @CSkrishna

about 1 month ago

@KanwalSibal also B'desh, India more or less neck and neck, with close integration. If Bdesh sligtly ahead, nothing wrong. Comparing Bdesh with Indian states rather than entire India makes more sense. Bihar, W bengal can learn fron Bdesh.

0

1

0

229

CSkrishna retweeted