Subhendu @subhendu - Twitter Profile

subhendu retweeted

Rishav Sharma @rishav_sharma1

2 months ago

Today i visited Vrindavan for the first time. Here’s what I have to say

636

8K

2K

1K

800K

subhendu retweeted

Priya Purohit

@Priyaa_Purohit

2 months ago

Epic grilling !! 🤣

46

2K

477

72

21K

subhendu retweeted

vir sanghvi

@virsanghvi

2 months ago

There is something seriously wrong if a body charged with regulating food goes to the police to file FIRs against those who question its performance. What is the FSSAI really regulating? Food? Or people who question it? Deeply disturbing https://t.co/9wIvvXQ9qG via @theprintindia

112

5K

2K

99

69K

subhendu retweeted

Suraj Kumar Bauddh

@SurajKrBauddh

2 months ago

"Youth angry over poor civic sense." 😡 He was peacefully walking on the footpath, but a large number of bikers kept honking at him to move aside for an easy pass. When will people in India understand that the footpath is for pedestrians, not bikes?

436

31K

6K

1K

633K

Who to follow

⁺ ୨୧ . ˚lovingly attached to jeno and jaemin ˖* 𓏲∙ @NA_RCISSISM_

subhendu retweeted

2 months ago

The Great Indian cultural renaissance—young girls singing Bharat mein jo deshdrohi hai unki Ma ka B****. 👏👏

525

3K

1K

716

720K

subhendu retweeted

Piyush Rai

@Benarasiyaa

2 months ago

Delhi Police has busted a fake "Sensodyne" toothpaste factory. 1,800 filled tubes, 10,000 empty tubes, 1,200 packed tubes, and 130 kg of paste have been seized. The factory owner, Hariom Mishra, has been arrested.

499

13K

4K

1K

1M

subhendu retweeted

Urban Clash @urban_kalesh

2 months ago

@NalinisKitchen 🫡

1

603

44

18

34K

subhendu retweeted

Kimi.ai @Kimi_Moonshot

3 months ago

Congrats to the @cursor_ai team on the launch of Composer 2! We are proud to see Kimi-k2.5 provide the foundation. Seeing our model integrated effectively through Cursor's continued pretraining & high-compute RL training is the open model ecosystem we love to support. Note: Cursor accesses Kimi-k2.5 via @FireworksAI_HQ ' hosted RL and inference platform as part of an authorized commercial partnership.

516

20K

1K

3K

4M

subhendu retweeted

fynn

@fynnso

3 months ago

was messing with the OpenAI base URL in Cursor and caught this accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast so composer 2 is just Kimi K2.5 with RL at least rename the model ID

fynnso's tweet photo. was messing with the OpenAI base URL in Cursor and caught this

accounts/anysphere/models/kimi-k2p5-rl-0317-s515-fast

so composer 2 is just Kimi K2.5 with RL
at least rename the model ID https://t.co/fyUWbo1InF

281

7K

470

2K

3M

subhendu retweeted

Manoj Arora

@manoj_216

4 months ago

I was wrong. I tweeted that we are not investing enough in R&D and education. Here's the world's first "Tuning Fork Flyover" in Maharashtra. This one slows you down mentally before it slows you down physically. Honestly, it feels like the Maharashtra government tried to solve traffic but accidentally created a confidence exam for drivers.

manoj_216's tweet photo. I was wrong.
I tweeted that we are not investing enough in R&D and education.
Here's the world's first "Tuning Fork Flyover" in Maharashtra.
This one slows you down mentally before it slows you down physically.
Honestly, it feels like the Maharashtra government tried to solve traffic but accidentally created a confidence exam for drivers.

100

662

146

34

53K

subhendu retweeted

Alex Prompter

@alex_prompter

6 months ago

This paper from Stanford and Harvard explains why most “agentic AI” systems feel impressive in demos and then completely fall apart in real use. The core argument is simple and uncomfortable: agents don’t fail because they lack intelligence. They fail because they don’t adapt. The research shows that most agents are built to execute plans, not revise them. They assume the world stays stable. Tools work as expected. Goals remain valid. Once any of that changes, the agent keeps going anyway, confidently making the wrong move over and over. The authors draw a clear line between execution and adaptation. Execution is following a plan. Adaptation is noticing the plan is wrong and changing behavior mid-flight. Most agents today only do the first. A few key insights stood out. Adaptation is not fine-tuning. These agents are not retrained. They adapt by monitoring outcomes, recognizing failure patterns, and updating strategies while the task is still running. Rigid tool use is a hidden failure mode. Agents that treat tools as fixed options get stuck. Agents that can re-rank, abandon, or switch tools based on feedback perform far better. Memory beats raw reasoning. Agents that store short, structured lessons from past successes and failures outperform agents that rely on longer chains of reasoning. Remembering what worked matters more than thinking harder. The takeaway is blunt. Scaling agentic AI is not about larger models or more complex prompts. It’s about systems that can detect when reality diverges from their assumptions and respond intelligently instead of pushing forward blindly. Most “autonomous agents” today don’t adapt. They execute. And execution without adaptation is just automation with better marketing.

alex_prompter's tweet photo. This paper from Stanford and Harvard explains why most “agentic AI” systems feel impressive in demos and then completely fall apart in real use.

The core argument is simple and uncomfortable: agents don’t fail because they lack intelligence. They fail because they don’t adapt.

The research shows that most agents are built to execute plans, not revise them. They assume the world stays stable. Tools work as expected. Goals remain valid. Once any of that changes, the agent keeps going anyway, confidently making the wrong move over and over.

The authors draw a clear line between execution and adaptation.

Execution is following a plan.

Adaptation is noticing the plan is wrong and changing behavior mid-flight.

Most agents today only do the first.

A few key insights stood out.

Adaptation is not fine-tuning. These agents are not retrained. They adapt by monitoring outcomes, recognizing failure patterns, and updating strategies while the task is still running.

Rigid tool use is a hidden failure mode. Agents that treat tools as fixed options get stuck. Agents that can re-rank, abandon, or switch tools based on feedback perform far better.

Memory beats raw reasoning. Agents that store short, structured lessons from past successes and failures outperform agents that rely on longer chains of reasoning. Remembering what worked matters more than thinking harder.

The takeaway is blunt.

Scaling agentic AI is not about larger models or more complex prompts. It’s about systems that can detect when reality diverges from their assumptions and respond intelligently instead of pushing forward blindly.

Most “autonomous agents” today don’t adapt.
They execute.

And execution without adaptation is just automation with better marketing.

205

3K

643

4K

1M

subhendu retweeted

ₕₐₘₚₜₒₙ

@hamptonism

7 months ago

Major Cloudflare outage for the 33rd time this year…

33

3K

126

86

117K

subhendu retweeted

Mario Candela @m4r10c4nd3l4

7 months ago

It’s not DNS There’s no way it’s DNS It was DNS #cloudflare

59

6K

672

294

400K

subhendu retweeted

Gergely Orosz

@GergelyOrosz

7 months ago

Not frequent to get this error message on half the webpages I try to visit The internet's key dependencies include not just AWS, but also Cloudflare, clearly (we knew this, but nothing brings it home like an outage!)

GergelyOrosz's tweet photo. Not frequent to get this error message on half the webpages I try to visit

The internet's key dependencies include not just AWS, but also Cloudflare, clearly (we knew this, but nothing brings it home like an outage!) https://t.co/Qsmk7uC6Mt

30

471

23

17

47K

subhendu retweeted

Nitish @nitishisme

7 months ago

Cloudflare

149

67K

9K

3K

1M

subhendu retweeted

Andrej Karpathy

@karpathy

8 months ago

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language person) is whether pixels are better inputs to LLMs than text. Whether text tokens are wasteful and just terrible, at the input. Maybe it makes more sense that all inputs to LLMs should only ever be images. Even if you happen to have pure text input, maybe you'd prefer to render it and then feed that in: - more information compression (see paper) => shorter context windows, more efficiency - significantly more general information stream => not just text, but e.g. bold text, colored text, arbitrary images. - input can now be processed with bidirectional attention easily and as default, not autoregressive attention - a lot more powerful. - delete the tokenizer (at the input)!! I already ranted about how much I dislike the tokenizer. Tokenizers are ugly, separate, not end-to-end stage. It "imports" all the ugliness of Unicode, byte encodings, it inherits a lot of historical baggage, security/jailbreak risk (e.g. continuation bytes). It makes two characters that look identical to the eye look as two completely different tokens internally in the network. A smiling emoji looks like a weird token, not an... actual smiling face, pixels and all, and all the transfer learning that brings along. The tokenizer must go. OCR is just one of many useful vision -> text tasks. And text -> text tasks can be made to be vision ->text tasks. Not vice versa. So many the User message is images, but the decoder (the Assistant response) remains text. It's a lot less obvious how to output pixels realistically... or if you'd want to. Now I have to also fight the urge to side quest an image-input-only version of nanochat...

558

13K

2K

7K

3M

subhendu retweeted

Dr Singularity

@Dr_Singularity

8 months ago

This is insane. New AI model from Samsung, 10,000x smaller than DeepSeek and Gemini 2.5 Pro just beat them on ARC-AGI 1 and 2 Samsung’s Tiny Recursive Model (TRM) is about 10,000x smaller than typical LLMs yet smarter because it thinks recursively instead of just predicting text. It first drafts an answer, then builds a hidden "scratchpad" for reasoning, repeatedly critiques and refines its logic (up to 16 times), and produces improved answers each cycle. This approach shows that architecture and reasoning loops (not just size), can drive intelligence. It enables powerful, efficient models that run cheaply, validate neuro symbolic ideas, and open highest quality reasoning to far more applications. Acceleration is everywhere

Dr_Singularity's tweet photo. This is insane.

New AI model from Samsung, 10,000x smaller than DeepSeek and Gemini 2.5 Pro just beat them on ARC-AGI 1 and 2

Samsung’s Tiny Recursive Model (TRM) is about 10,000x smaller than typical LLMs yet smarter because it thinks recursively instead of just predicting text. It first drafts an answer, then builds a hidden "scratchpad" for reasoning, repeatedly critiques and refines its logic (up to 16 times), and produces improved answers each cycle.

This approach shows that architecture and reasoning loops (not just size), can drive intelligence. It enables powerful, efficient models that run cheaply, validate neuro symbolic ideas, and open highest quality reasoning to far more applications.

Acceleration is everywhere

218

8K

1K

5K

1M

subhendu retweeted

Deedy

@deedydas

8 months ago

The TRM paper feels like a significant AI breakthrough. It destroys the pareto frontier on the ARC AGI 1 and 2 benchmarks (and Sudoku and Maze solving) with an estd < $0.01 cost per task and cost < $500 to train the 7M model on 2 H100s for 2 days. [Training and test specifics] For ARC, it trained on 160 examples from ConceptARC. At test-time, it uses the most common answer of 1000 augmentations at test-time and embeds a fixed shape of the task in the input. [Industry implications] Most AI companies today use general purpose LLMs with prompting for tasks. For specific tasks, smaller models may not just be cheaper, but far higher quality! Startups could (and should) train models for < $1000 for specific "fixed length" subtasks (specific PDF extraction, time series forecasting, etc) and use it as a tool to the general model to not only push performance, but build some meaningful IP at the task they're trying to automate.

deedydas's tweet photo. The TRM paper feels like a significant AI breakthrough.

It destroys the pareto frontier on the ARC AGI 1 and 2 benchmarks (and Sudoku and Maze solving) with an estd < $0.01 cost per task and cost < $500 to train the 7M model on 2 H100s for 2 days.

[Training and test specifics]
For ARC, it trained on 160 examples from ConceptARC. At test-time, it uses the most common answer of 1000 augmentations at test-time and embeds a fixed shape of the task in the input.

[Industry implications]
Most AI companies today use general purpose LLMs with prompting for tasks. For specific tasks, smaller models may not just be cheaper, but far higher quality! Startups could (and should) train models for < $1000 for specific "fixed length" subtasks (specific PDF extraction, time series forecasting, etc) and use it as a tool to the general model to not only push performance, but build some meaningful IP at the task they're trying to automate.

58

2K

207

1K

143K

subhendu retweeted

Kunal Kamra

@kunalkamra88

about 1 year ago

To all those hounding for a quote - “The mainstream media at this point is nothing but a miscommunication arm of the ruling party. They are vultures who report on issues that dont matter to the people of this country. If they all shut shop from tomorrow till eternity they will be doing favour to the country, its people & their own children”

3K

66K

12K

1K

3M

subhendu retweeted

John Rush

@johnrushx

over 1 year ago

Chinese AI startups: 1/6th of US funding, bad press, sanctions, brain drain, communism, little English proficiency, and no talent influx.. But after using Manus AI, Deepseek, Trae, Kling, Vidu, & Ying, I think the US is in trouble. At this pace, China will dominate AI. Demos:

347

9K

1K

8K

1M

Subhendu

@subhendu

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users