Sikim Chakraborty @scrab017 - Twitter Profile

scrab017 retweeted

4 months ago

Ask ChatGPT a complex question and you'll get a confident, well-reasoned answer. Then type, "Are you sure?" Watch it completely reverse its position.

Ask again. It flips back. By the third round, it usually acknowledges you're testing it, which is somehow worse. It knows what's happening and still can't hold its ground.

This isn't a quirky bug. A 2025 study found GPT, Claude, and Gemini flip their answers ~60% of the time when users push back. Not even with evidence, just doubt.

We trained AI this way. RLHF rewards agreement over accuracy. Human evaluators consistently rate agreeable answers higher than correct ones. So the models learned a simple lesson: telling you what you want to hear gets rewarded. And now 1/3 of companies are using these systems for complex tasks like risk forecasting and scenario planning.

We built the world's most expensive yes-men and deployed them where we need pushback the most.

I wrote up why this happens and what actually fixes it: https://t.co/CDKq8xdgbW

randal_olson's tweet photo.

658

19K

3K

5K

1M

scrab017 retweeted

Jeff Dean

@JeffDean

about 1 year ago

We're using a ReLU to set tariffs?

82

5K

429

375

440K

scrab017 retweeted

François Chollet

@fchollet

about 2 years ago

If intelligence is the ability to deal with what you weren't prepared for, then the modern AI strategy is to prepare for everything, so you never need intelligence. This is of course a terrible strategy, because it is impossible to prepare for everything. The problem isn't just scale, the problem is the fact that the real world isn't sampled from a static distribution -- it is ever changing and ever novel.

29

431

47

103

50K

Sikim Chakraborty @scrab017

almost 6 years ago

@ShamikaRavi Well, not much!

0

10

1

0


scrab017 retweeted

Andrej Karpathy

@karpathy

about 6 years ago

moderna white paper on mRNA vaccines [pdf] https://t.co/CjcIiy8SRN + a nature article on them https://t.co/8D1Kqm2yPb clever and interesting

3

197

44

35

0

Sikim Chakraborty @scrab017

about 6 years ago

@prachi_eco 🤦🏻‍♂️

0

1

0

Sikim Chakraborty @scrab017

about 6 years ago

@ShamikaRavi @filmy_foodie @ICMRDELHI You can find them here: https://t.co/P9DOKWAAFT

1

0

scrab017 retweeted

Arthur Welle @ArthurWelle

about 6 years ago

@MaxCRoser Modestly, I'd also like to show my animation. Done in R. Johns Hopkins data.

2

65

10

0

Sikim Chakraborty @scrab017

about 6 years ago

@Dweepobotee then scramble and crowd every mart in the vicinity, defeating the entire purpose of the lockdown 😅

0

1

0

Sikim Chakraborty @scrab017

about 6 years ago · New Delhi

@prachi_eco sometimes Twitter trends are helpful as well.

0

1

0

Sikim Chakraborty @scrab017

over 6 years ago

@MerseyReds1 and against Chelsea in the FA Cup...but did win the Super Cup

0

1

0

Sikim Chakraborty @scrab017

over 6 years ago

@prachi_eco As stores begin to run out, google searches for ‘hand sanitizer’ also going up in India with new cases :p

0

1

2

0

Sikim Chakraborty @scrab017

over 6 years ago

@AnfieldWatch Millie

0

1

0

scrab017 retweeted

Kaggle @kaggle

over 6 years ago

📣BIG NEWS: TPUs are now available on Kaggle notebooks! To help you get started with these powerful hardware accelerators, we’re launching a TPU playground competition. Check it out and become one of the world’s first TPU experts! https://t.co/9qYuVu29zQ

2

278

79

11

0

scrab017 retweeted

Google Earth

@googleearth

over 6 years ago

The #EarthEngine Data Catalog contains over 600 datasets. Check out the newest additions from @NOAASatellites, @USGS_EROS, @CopernicusEU, and more: https://t.co/unmv4tKqjI

0

149

64

11

0

scrab017 retweeted

Rahul Tongia @DrTongia

over 6 years ago

WHOA, Indian #Electricity "load met" (grid level demand) falls below 100 GW for the first time in a long time. Post Diwali. BTW, low demand also translates to low coal output. Delhi #pollution is not from coal. See @BrookingsIndia #ElectricityCarbonTracker https://t.co/wt64PzL8OI

DrTongia's tweet photo: https://t.co/sj0Fweunsh

3

29

9

0

Sikim Chakraborty @scrab017

over 6 years ago · New Delhi

@YourMateJez and Atkinson in VAR 🙄

0

1

0

scrab017 retweeted

Sam Quek @SamanthaQuek

over 6 years ago

Got to love the VARclays Premier League... What a shambolic weekend 🙄

102

2K

138

0

scrab017 retweeted

Sundar Pichai

@sundarpichai

over 6 years ago

Detecting deepfakes is one of the most important challenges ahead of us. Following our release of a synthetic audio dataset in Jan, we're releasing a large dataset of visual deepfakes to support researchers working on synthetic video detection #GoogleAI https://t.co/sDW7BP34qL

72

3K

491

64

0

scrab017 retweeted

Andrej Karpathy

@karpathy

about 7 years ago

Speech2Face: Learning the Face Behind a Voice https://t.co/9enUz600fK With increasingly large/effective library of neural net encoders of any X and decoders of any Y, any source of paired data X,Y can give X2Y nets. And opens the door to many X2Y2Z2W...2X

20

657

181

69

0
