fabio casati

Verified account

@sphoebs

father of two

Palo Alto, CA

Joined March 2009

141 Following

248 Followers

461 Posts

Pinned Tweet

4 months ago

Enterprises are doing AI eval wrong - and it's causing wasted iteration cycles and wrong decision making. I’m preparing my spring semester course on designing large-scale AI systems and I need feedback: What’s the one change you’d make to eval practice + reporting to make it reliable? What's the error you see made often?

1

15

4

2

82K

4 days ago

@xAviation Where is he going.

0

0

0

0

57

27 days ago

@SawyerMerritt Omg that’s so 2020….

0

0

0

0

3

about 2 months ago

@nic_amadio This is very true. Italian engineers are massively underpaid. Companies that hire and pay well do wonders in Italy - and everywhere.

1

2

0

0

298

3 months ago

@ylecun @haider1 Why S?

0

0

0

0

35

3 months ago

@karpathy We also need new programming abstractions. A powerful one is the notion of statistical assertions: properties that you weakly expect to hold over your flow, but not all the time.

0

0

0

0

45
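
A minimal sketch of what a statistical assertion from the reply above could look like, assuming it means "this property should hold in at least a given fraction of runs". The class name `StatisticalAssertion` and the parameters `min_rate` and `min_samples` are hypothetical, chosen for illustration only; the tweet does not name any API.

```python
from dataclasses import dataclass


@dataclass
class StatisticalAssertion:
    """Hypothetical helper: a property expected to hold in most runs, not all."""

    name: str
    min_rate: float        # required pass rate, e.g. 0.9 for "at least 90% of runs"
    min_samples: int = 20  # do not judge until this many observations exist
    passed: int = 0
    total: int = 0

    def observe(self, holds: bool) -> None:
        """Record one outcome of the property for a single run of the flow."""
        self.total += 1
        if holds:
            self.passed += 1

    @property
    def rate(self) -> float:
        """Observed pass rate so far (0.0 when nothing has been observed)."""
        return self.passed / self.total if self.total else 0.0

    def check(self) -> None:
        """Raise only when enough samples exist and the rate is below the bar."""
        if self.total < self.min_samples:
            return  # too little evidence to decide either way
        if self.rate < self.min_rate:
            raise AssertionError(
                f"{self.name}: held in {self.passed}/{self.total} runs "
                f"({self.rate:.0%}), expected at least {self.min_rate:.0%}"
            )


if __name__ == "__main__":
    # Assumed example property: "the answer cites a source".
    cites_source = StatisticalAssertion("cites_source", min_rate=0.8, min_samples=5)
    for holds in [True, True, False, True, True, True, True, True, True, True]:
        cites_source.observe(holds)
    cites_source.check()  # no exception: a single failure is tolerated
    print(f"{cites_source.name}: {cites_source.rate:.0%}")
```

Unlike a regular `assert`, a single failing run does not stop the flow; the check fails only when the observed rate drops below the threshold after enough samples. A stricter variant could compare a confidence bound rather than the raw rate.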

3 months ago

@DanielMiessler “The best influencers”? What’s that category. The most competent or the most controversial or, like, Mr. Beast?

0

2

0

0

168

4 months ago

I keep seeing posts on AI and SaaS. My (biased) take is that AI agents need SaaS way more than humans do. And this is from experience. Agents can consume knowledge like no human can. They can learn and iterate faster than humans. They can also go off track faster than humans, and at scale. They need a platform that maximizes their potential while keeping controls.

0

0

0

0

56

4 months ago

@Lol19559014 @JenniferSey That is not what he said. That is a redacted selection. It’s ok if you are trying to help him get more supporters, because this is what posts like this do.

1

0

0

0

68

4 months ago

@petergyang Air way better than pro. Minimum 15. 14 is frustrating.

0

0

0

0

33

4 months ago

@_BMSimpson @curiosityonx It’s always 5pm where I am.

0

1

0

0

169

4 months ago

@Angaisb_ I don’t get all the people with “my agent takes 10 minutes”. Try the same simple ask to both opus and gpt. Difference is huge.

0

0

0

0

21

4 months ago

@IntCyberDigest Please do!! This idiotic healthcare privacy forced on people who don’t want it is preventing progress in medicine.

0

0

0

0

11

4 months ago

@lugaricano @mustafasuleyman @Microsoft You are asking for the moon. Let’s set a lower goal, like automating many routine elderly care tasks with robots.

0

7

2

0

2K

4 months ago

@travisakers Why. Do you need to hide your cholesterol from the CIA?

0

0

0

0

29

4 months ago

@operationdanish Why not?

0

0

0

0

9

4 months ago

Anybody remember the 2025 NY humanoid robots show? Aside from dancing, I hope companies work on robotic care for older adults. I’d trust it more than nursing homes.

ChineseEmbassyManila @Chinaembmanila

4 months ago

At the 2025 Spring Festival Gala, 16 humanoid robots joined a traditional folk dance known for its sweeping steps and vibrant handkerchiefs. During the grand opening performance, humanoid robots demonstrated the "Thomas 360" stunt move. In a subsequent stage comedy, a group of AI-powered humanoid robots, along with a bionic robot, took on comedic roles. And in a separate martial arts performance, robots executed a series of high-difficulty movements.

0

3

1

0

501

0

0

0

0

73

4 months ago

@kmcannon ^^ this. This never happened before. At scale. Think about it.

0

0

0

0

271

4 months ago

@JoshDaws Very true. Nothing compares to it. Not even iPhone. Maybe the internet

0

0

0

0

35

4 months ago

Imagine how fun for people preparing presentations on red teaming AI ))

0

0

0

0

50
