Aryaman Shandilya @shandilyeah - Twitter Profile

Aryaman Shandilya

@shandilyeah

16 days ago

We did a thing at @pointer... ...and we smashed it.

Neal Chopra

@nealchopra

16 days ago

Today, we’re sharing a new state of the art for computer use. Our system holds the two highest verified scores on OSWorld, the standard benchmark for AI agents that operate a computer like a person: 83.6% using Claude Opus 4.7 and 81.5% using Claude Sonnet 4.6. The human baseline is 72.4%. 🧵 1/7

52

282

41

157

182K

1

6

0

316

Aryaman Shandilya

@shandilyeah

about 1 month ago

we were supposed to be moving in silence @NYTGames

1

2

0

466

Aryaman Shandilya

@shandilyeah

3 months ago

@tomscott is back. tell your friends

0

1

0

20

Aryaman Shandilya

@shandilyeah

4 months ago

@sanikerp *downvote*

0

1

0

16

Aryaman Shandilya

@shandilyeah

5 months ago

@Gravito841 I agree with that, but I also believe the entry barrier for these activities is bigger when moving to an entirely new culture (not just the US, maybe even Europe or Japan), especially for the initial period for someone moving from India

1

0

107

Aryaman Shandilya

@shandilyeah

6 months ago

the avengers desperately need a retirement plan - our heroes don’t deserve this

1

3

0

178

Aryaman Shandilya

@shandilyeah

7 months ago

Alain Prost’s race engineer during the 1984 Monaco Grand Prix

0

1

0

190

Aryaman Shandilya

@shandilyeah

7 months ago

Not a single person with YouTube Premium has ever complained about it. S Tier Subscription

0

1

0

223

Aryaman Shandilya

@shandilyeah

8 months ago

@sanikerp Right, “premium” fast bowler

0

1

0

19

Aryaman Shandilya

@shandilyeah

8 months ago

@sanikerp Makes sense, but still L take because Aguero (as much as I’d like to say that he’s not) is a legend himself also

1

0

24

Aryaman Shandilya

@shandilyeah

8 months ago

@apple need a notification cache where I can see notifications I have cleared (accidentally) for a user-defined period of time

0

8

Aryaman Shandilya

@shandilyeah

9 months ago

tried switching to android, "fold"ed in 2 days - returning the phone now

0

1

0

165

Aryaman Shandilya

@shandilyeah

9 months ago

kim really did a number on him

0

3

0

255

Aryaman Shandilya

@shandilyeah

10 months ago

@AmericanAir I have been on hold for 42 minutes and counting. Help a man out

1

0

72

Aryaman Shandilya

@shandilyeah

10 months ago

@Gravito841 @Jitesh_117 Mumbai janta - take note

0

3

0

50

Aryaman Shandilya

@shandilyeah

10 months ago

@sanikerp Hola amigo

0

1

0

28

Aryaman Shandilya

@shandilyeah

10 months ago

@byArunKai @LinkedIn to those who don’t know, being unemployed is FUN

0

1

0

41

Aryaman Shandilya

@shandilyeah

10 months ago

What’s going on with my boys and, in some cases, gals at @LinkedIn? You’ve got to stop posting on my behalf without my consent. Thank you for your attention to this matter!

1

0

228
