Mourya @mourya - Twitter Profile

Pinned Tweet

about 5 years ago

Sharing my first NFT collection. Inspired by whimsical characters, these paper sculptures can be minted as art collectibles. https://t.co/ybzghJ9BBp #NFT #NFTCommunity #opensea #withFND #WazirXNFT #rarible

0

3

1

0

mourya retweeted

Krish Ashok

@krishashok

3 months ago

The Mathematics of Life - where I explore how fractals determine how long complex biological systems (like us) last and how to get more out of your 2 billion heartbeat budget https://t.co/KUf8m3aRh6

12

266

53

324

93K

Mourya @mourya

6 months ago

@makemytripcare Please call back. I need a full refund. Why has half the refund been deducted as charges?

1

0

12

Mourya @mourya

6 months ago

@makemytrip @IndiGo6E The flight from Kozhikode to Visakhapatnam on 4th Dec, PNR V773NZ, was cancelled by @IndiGo6E. I raised the refund request with @makemytrip 20 days ago, and they keep saying @IndiGo6E hasn’t processed the refund.

1

0

33

Mourya @mourya

6 months ago

@makemytripcare This has been going on for the past two weeks. Maybe it’s better to go to consumer court.

1

0

8

Mourya @mourya

6 months ago

@makemytripcare Still waiting

1

0

7

mourya retweeted

Oleksandr Veremeyenko

@alex_verem

6 months ago

This paper from Harvard and MIT quietly answers the most important AI question nobody benchmarks properly:

Can LLMs actually discover science, or are they just good at talking about it?

The paper is called “Evaluating Large Language Models in Scientific Discovery”, and instead of asking models trivia questions, it tests something much harder:

Can models form hypotheses, design experiments, interpret results, and update beliefs like real scientists?

Here’s what the authors did differently 👇

• They evaluate LLMs across the full discovery loop hypothesis → experiment → observation → revision
• Tasks span biology, chemistry, and physics, not toy puzzles
• Models must work with incomplete data, noisy results, and false leads
• Success is measured by scientific progress, not fluency or confidence

What they found is sobering.

LLMs are decent at suggesting hypotheses, but brittle at everything that follows.

✓ They overfit to surface patterns
✓ They struggle to abandon bad hypotheses even when evidence contradicts them
✓ They confuse correlation for causation
✓ They hallucinate explanations when experiments fail
✓ They optimize for plausibility, not truth

Most striking result:

`High benchmark scores do not correlate with scientific discovery ability.`

Some top models that dominate standard reasoning tests completely fail when forced to run iterative experiments and update theories.

Why this matters:

Real science is not one-shot reasoning.

It’s feedback, failure, revision, and restraint.

LLMs today:

• Talk like scientists
• Write like scientists
• But don’t think like scientists yet

The paper’s core takeaway:

Scientific intelligence is not language intelligence.

It requires memory, hypothesis tracking, causal reasoning, and the ability to say “I was wrong.”

Until models can reliably do that, claims about “AI scientists” are mostly premature.

This paper doesn’t hype AI. It defines the gap we still need to close.

And that’s exactly why it’s important.

378

8K

2K

6K

1M

mourya retweeted

Tom Dörr

@tom_doerr

6 months ago

Personal finance app with AI assistant https://t.co/ygZBcquXYx

52

7K

553

11K

683K

Mourya @mourya

7 months ago

@DishPatani

0

1

Mourya @mourya

7 months ago

@DishPatani

0

2

Mourya @mourya

7 months ago

@mybmc Please get the Oshiwara nala cleaned before we all get diseases and mosquito bites.

0

1

0

11

Mourya @mourya

over 1 year ago

@hamptonism This is from a German film about a doll from the future

0

123

mourya retweeted

Rachel Karten

@milkkarten

over 1 year ago

It’s been a tough few months for Sonos. A redesign of the app caused outrage. So when a friend tipped me off to the r/Sonos subreddit filled with 261K angry people, I braced for impact. I found the expected complaints—but I also noticed they really liked an employee named Keith.

199

11K

504

4K

2M

Mourya @mourya

almost 2 years ago

@JioCare I want to cancel my service, but your helpline doesn’t give the option and my Jio app doesn’t have it either.

0

11

Mourya @mourya

about 2 years ago

@AgniBankai I will take the illustrated ones

0

29

mourya retweeted

Historic Vids

@historyinmemes

over 2 years ago

Angular Momentum keeps gyroscope impossibly standing

409

125K

11K

10K

15M

Mourya @mourya

over 2 years ago

@oxhak Trying to install it on an iPad Pro M2, but I can’t figure out the prompt format for LLMFarm

1

0

76

Mourya @mourya

over 2 years ago

@justmalhar What’s the prompt format I should use? I’m trying to install it on LLMFarm.

1

0

53

Mourya @mourya

over 2 years ago

@LinusEkenstam I have downloaded it and have been using it. Sometimes it gives random answers unrelated to the question.

0

13

mourya retweeted

Zipeng Fu

@zipengfu

over 2 years ago

Mobile ALOHA's hardware is very capable. We brought it home yesterday and tried more tasks! It can:
- do laundry👔👖
- self-charge⚡️
- use a vacuum
- water plants🌳
- load and unload a dishwasher
- use a coffee machine☕️
- obtain drinks from the fridge and open a beer🍺
- open doors🚪
- play with pets🐱
- throw away trash
- turn on/off a lamp💡

Project website: https://t.co/9rzIX8wLEp

Co-lead @tonyzzhao, advised by @chelseabfinn (amazing photographing from @qingqing_zhao_ )

319

7K

2K

3K

3M

Mourya @mourya

over 2 years ago

Story of a lot of families and dads

Rituparna Chatterjee @MasalaBai

over 2 years ago

My father's simplicity is legendary. But today I was stumped. 20 years ago I bought a thin flannel jacket for Rs 150 or so in Sarojini. In a couple of years as I made slightly better salary, I bought better clothes and left that jacket at home. Today my parents sent me pics

236

6K

242

454

937K

0

2

0

138
