Deepthi Sara Anil

@deepthisaraanil

Researcher @TataCornell |Phd @iitk_econ|Masters @HydUniv

New Delhi, India

Joined July 2022

627 Following

263 Followers

43 Posts

deepthisaraanil retweeted

Ashwini_Deshpande @_ADeshpande

about 1 month ago

5 years, zero progress: actually regression. Men's domestic work time fell from 95 to 86 min/day. Women: still at 305 min: 3.5X more. Caregiving gap unchanged too (140 vs 74). The needle didn't just fail to shift. It moved the wrong way. (TUS 2024) #sharetheload

_ADeshpande's tweet photo. 5 years, zero progress: actually regression. Men's domestic work time fell from 95 to 86 min/day. Women: still at 305 min: 3.5X more. Caregiving gap unchanged too (140 vs 74). The needle didn't just fail to shift. It moved the wrong way. (TUS 2024) #sharetheload https://t.co/F6VcW2jrzY

0

18

12

4

2K

deepthisaraanil retweeted

Libertad González @LibertadGonLu

4 months ago

"It is quite difficult to defend what you didn't build. (...) The value of deeply understanding something — of having built the knowledge yourself — hasn't diminished with AI. If anything, it's increased."

0

15

3

0

2K

deepthisaraanil retweeted

6 months ago

“Can I bring my baby to the interview?” The message came in at 11 PM: “Hi, I have an interview with you tomorrow at 2 PM. My childcare fell through. Can I bring my 8-month-old? I understand if you need to reschedule.” Old me would have rescheduled. Unprofessional. Distraction. Red flag. New me replied: “Absolutely. See you tomorrow.” She showed up with her baby on her hip. She apologized three times before even sitting down. Ten minutes in, the baby started crying. She tried to soothe him while answering questions. She apologized again. I stopped the interview and said: “Hey. You’re managing a fussy baby, answering complex questions, and staying calm under pressure. That’s literally the job. Handling chaos while staying professional. You’re already proving you can do it.” Her eyes filled with tears. We hired her. She’s been with us for a year now. The most reliable team member we have. Why? Because when you’re used to handling a screaming infant at 3 AM and still showing up to work the next day, workplace stress feels like nothing. Working parents, especially mothers, are some of the most organized, efficient, and resilient people you’ll ever hire. Yet we lose them because our hiring processes are built for people with zero caregiving responsibilities. If your interview process can’t accommodate a parent facing a childcare issue, you’re not filtering for professionalism. You’re filtering for privilege.

1K

90K

13K

4K

4M

deepthisaraanil retweeted

Alex Veremeyenko

6 months ago

This paper from Harvard and MIT quietly answers the most important AI question nobody benchmarks properly: Can LLMs actually discover science, or are they just good at talking about it? The paper is called “Evaluating Large Language Models in Scientific Discovery”, and instead of asking models trivia questions, it tests something much harder: Can models form hypotheses, design experiments, interpret results, and update beliefs like real scientists? Here’s what the authors did differently 👇 • They evaluate LLMs across the full discovery loop hypothesis → experiment → observation → revision • Tasks span biology, chemistry, and physics, not toy puzzles • Models must work with incomplete data, noisy results, and false leads • Success is measured by scientific progress, not fluency or confidence What they found is sobering. LLMs are decent at suggesting hypotheses, but brittle at everything that follows. ✓ They overfit to surface patterns ✓ They struggle to abandon bad hypotheses even when evidence contradicts them ✓ They confuse correlation for causation ✓ They hallucinate explanations when experiments fail ✓ They optimize for plausibility, not truth Most striking result: `High benchmark scores do not correlate with scientific discovery ability.` Some top models that dominate standard reasoning tests completely fail when forced to run iterative experiments and update theories. Why this matters: Real science is not one-shot reasoning. It’s feedback, failure, revision, and restraint. LLMs today: • Talk like scientists • Write like scientists • But don’t think like scientists yet The paper’s core takeaway: Scientific intelligence is not language intelligence. It requires memory, hypothesis tracking, causal reasoning, and the ability to say “I was wrong.” Until models can reliably do that, claims about “AI scientists” are mostly premature. This paper doesn’t hype AI. It defines the gap we still need to close. And that’s exactly why it’s important.

alex_verem's tweet photo. This paper from Harvard and MIT quietly answers the most important AI question nobody benchmarks properly:

Can LLMs actually discover science, or are they just good at talking about it?

The paper is called “Evaluating Large Language Models in Scientific Discovery”, and instead of asking models trivia questions, it tests something much harder:

Can models form hypotheses, design experiments, interpret results, and update beliefs like real scientists?

Here’s what the authors did differently 👇

• They evaluate LLMs across the full discovery loop hypothesis → experiment → observation → revision
• Tasks span biology, chemistry, and physics, not toy puzzles
• Models must work with incomplete data, noisy results, and false leads
• Success is measured by scientific progress, not fluency or confidence

What they found is sobering.

LLMs are decent at suggesting hypotheses, but brittle at everything that follows.

✓ They overfit to surface patterns
✓ They struggle to abandon bad hypotheses even when evidence contradicts them
✓ They confuse correlation for causation
✓ They hallucinate explanations when experiments fail
✓ They optimize for plausibility, not truth

Most striking result:

`High benchmark scores do not correlate with scientific discovery ability.`

Some top models that dominate standard reasoning tests completely fail when forced to run iterative experiments and update theories.

Why this matters:

Real science is not one-shot reasoning.

It’s feedback, failure, revision, and restraint.

LLMs today:

• Talk like scientists
• Write like scientists
• But don’t think like scientists yet

The paper’s core takeaway:

Scientific intelligence is not language intelligence.

It requires memory, hypothesis tracking, causal reasoning, and the ability to say “I was wrong.”

Until models can reliably do that, claims about “AI scientists” are mostly premature.

This paper doesn’t hype AI. It defines the gap we still need to close.

And that’s exactly why it’s important.

378

8K

2K

6K

1M

Who to follow

Verified account

Professor, Jointly with Department of Economic Sciences and Wadhwani School of Advanced AI and Intelligent Systems, IIT Kanpur

Akanksha Aggarwal

Ph.D. candidate at Indian Statistical Institute (Delhi) || Interested in Development Economics with a focus on Education || Previously, @SRCCDU

Assistant Professor, Department of Economics, University of Calcutta. Fellow, Global Labor Organization

deepthisaraanil retweeted

Rikhia।রিখিয়া @rikhia_

7 months ago

Conducted a hands-on STATA session on RDD at IEG winter school on causal inference! Thanks to all the participants who ensured an engaging session.

1

56

2

5

6K

deepthisaraanil retweeted

QJE @QJEHarvard

over 1 year ago

#QJE Nov 2024, #3, “The Dynamics of Abusive Relationships,” by Adams (@abicadams), Huttunen (@KristiinaHuttu2), Nix (@EmilyNix100), and Zhang (@ningzhang0927): https://t.co/rje1TuvsiT

2

39

14

31

72K

deepthisaraanil retweeted

over 1 year ago

The average Nobel laureate grew up in an 87–90th percentile household. Access to opportunity doubled from 1901–2023, but remains highly unequal. Barriers are higher for women, but lower for Americans. https://t.co/brk56JeHgo @paulnovosad

florianederer's tweet photo. The average Nobel laureate grew up in an 87–90th percentile household.

Access to opportunity doubled from 1901–2023, but remains highly unequal.

Barriers are higher for women, but lower for Americans.

https://t.co/brk56JeHgo @paulnovosad https://t.co/opffG2IR82

2

222

56

68

60K

deepthisaraanil retweeted

Kunal Mangal @mrgunalan

almost 2 years ago

My paper on the costs of extreme competition for government jobs in India is now forthcoming at the Journal of Development Economics! Open access link: https://t.co/0Mn1UQFZ0u

mrgunalan's tweet photo. My paper on the costs of extreme competition for government jobs in India is now forthcoming at the Journal of Development Economics!

Open access link:
https://t.co/0Mn1UQFZ0u https://t.co/K7U6kvROAc

30

1K

196

407

244K

deepthisaraanil retweeted

David Evans @DaveEvansPhD

almost 2 years ago

What's the role of fathers in promoting early childhood development in middle- and low-income countries? @PJakiela and I talk with @TimSvengali on @Vox_Dev ⬇️ (based on this review paper: https://t.co/RqcJMlsStE)

1

35

16

7

9K

deepthisaraanil retweeted

The Better India

@thebetterindia

about 2 years ago

Kerala's government is leading the charge for gender equality with new textbooks showing kitchens where everyone pitches in!

thebetterindia's tweet photo. Kerala's government is leading the charge for gender equality with new textbooks showing kitchens where everyone pitches in! https://t.co/Fj4BhvmXaC

184

8K

1K

290

469K

deepthisaraanil retweeted

Justin Sandefur

@JustinSandefur

over 2 years ago

The Economist revisits the RCT wars https://t.co/lXO8BgJh8I

JustinSandefur's tweet photo. The Economist revisits the RCT wars

https://t.co/lXO8BgJh8I https://t.co/JwnGdvi7FU

10

548

175

469

316K

deepthisaraanil retweeted

IIT Kanpur Economics @iitk_econ

over 2 years ago

The third edition of our Public Policy Conference will be held on August 2-3, 2024. Please see the Call for Papers below for details. Last date for submissions: April 15.

iitk_econ's tweet photo. The third edition of our Public Policy Conference will be held on August 2-3, 2024. Please see the Call for Papers below for details. Last date for submissions: April 15. https://t.co/oN892VUncI

3

60

19

36

6K

deepthisaraanil retweeted

Stephanie H. Murray @stephmurrayyyy

over 2 years ago

Dang this study found that the Swedish “daddy month” (use-it-or-lose-it paternity leave) led to a *drop* in kids’ GPA, driven entirely by sons of non-college-educated fathers. No effects for girls or for children of college-educated fathers. https://t.co/q6n1rqx4zC

stephmurrayyyy's tweet photo. Dang this study found that the Swedish “daddy month” (use-it-or-lose-it paternity leave) led to a *drop* in kids’ GPA, driven entirely by sons of non-college-educated fathers. No effects for girls or for children of college-educated fathers.

https://t.co/q6n1rqx4zC https://t.co/2zRJBXRCMn

2

17

4

10

21K

deepthisaraanil retweeted

Dina D. Pomeranz 🟣 @DinaPomeranz

over 2 years ago

A strong political divide is emerging between young men and women in many countries: https://t.co/kPY21noAt8

DinaPomeranz's tweet photo. A strong political divide is emerging between young men and women in many countries:
https://t.co/kPY21noAt8 https://t.co/5KDhYkxIEQ

362

5K

2K

2K

5M

deepthisaraanil retweeted

The Nobel Prize

over 2 years ago

"I had a teacher that didn't like me and I didn't like him. At the end of the year he decided to fail me. The ironic thing is that the topic was chemistry. I have the distinction of being the only chemistry laureate who failed the topic in high school!" - Tomas Lindahl

NobelPrize's tweet photo. "I had a teacher that didn't like me and I didn't like him. At the end of the year he decided to fail me. The ironic thing is that the topic was chemistry. I have the distinction of being the only chemistry laureate who failed the topic in high school!"

- Tomas Lindahl https://t.co/2WdJV8T6zJ

101

6K

1K

391

1M

deepthisaraanil retweeted

Road Scholarz @roadscholarz

over 2 years ago

Here is India’s latest “egg map”. Eggs in midday meals are still a no-no for the Sangh Parivar. Children pay the price.

roadscholarz's tweet photo. Here is India’s latest “egg map”. Eggs in midday meals are still a no-no for the Sangh Parivar. Children pay the price. https://t.co/8h1vzHKT1h

213

4K

873

242

462K

deepthisaraanil retweeted

𝙐𝙜𝙤 𝙋𝙖𝙣𝙞𝙯𝙯𝙖 @upanizza

over 2 years ago

I didn’t know that the “three essays” thesis was Solow’s idea

upanizza's tweet photo. I didn’t know that the “three essays” thesis was Solow’s idea https://t.co/1kFL4eqLns

2

455

76

51

92K

deepthisaraanil retweeted

over 2 years ago

Soon we won't be able to use nightlights as a proxy for economic growth. Why? It's due to limitations of its spectral bands. Here's the breakdown in simple terms:

yohaniddawela's tweet photo. Soon we won't be able to use nightlights as a proxy for economic growth.

Why?

It's due to limitations of its spectral bands.

Here's the breakdown in simple terms: https://t.co/b32N5yRDCI

39

2K

404

913

665K

deepthisaraanil retweeted

Duncan Webb @dunc_webb

over 2 years ago

Excited to present my JMP: Silence to Solidarity My job market paper studies whether communication between discriminatory people can lead to large reductions in discrimination 🔗 https://t.co/IrRLIjPJLq 👇for more

dunc_webb's tweet photo. Excited to present my JMP: Silence to Solidarity

My job market paper studies whether communication between discriminatory people can lead to large reductions in discrimination
🔗 https://t.co/IrRLIjPJLq

👇for more https://t.co/pAtOdwywIb

2

227

53

62

79K

Last Seen Users on Sotwe

Trends for you

Most Popular Users