Jef Cλaes @JefClaes - Twitter Profile

Jef Cλaes @JefClaes

11 days ago

@ToJans Rage against the slop!

0

1

0

27

Jef Cλaes @JefClaes

12 days ago

Totally random but I published something on my blog (non-tech). https://t.co/Z60QVlNJDy Reviving the blog after 8 years in 30 minutes instead of it requiring a weekend project helped I guess.

1

2

0

77

JefClaes retweeted

Paul Bohm

@paulbohm

26 days ago

If your startup does not have a UUID microservice you’re ngmi

191

7K

278

1K

692K

JefClaes retweeted

Husk

@huskirl

about 2 months ago

Idk what to type here rn

510

30K

1K

5K

2M

Who to follow

cyrille martraire

@cyriux

Socio-technical Architect, DDD enthusiast. Cofounder @Arollafr & @SwCraftParis. Author of https://t.co/CvAK9eQVR5 - main account: @cyriux.bsky.social

Rinat Abdullin

@abdullin

Technical Advisor. I help to build ML-driven products. Newsletter: https://t.co/qB9sJSI7q5

Udi Dahan

@UdiDahan

https://t.co/uyP2tSCOkd https://t.co/9BHj9AtoAU https://t.co/5hEyX1h7aE https://t.co/Zm64i4zXy4

Jef Cλaes @JefClaes

2 months ago

Imagine hypermedia had really taken off. Agents wouldn’t have been burning through tokens to reverse engineer the web. We built for humans, forgot about machines, and now spend a fortune teaching machines to pretend they’re human.

1

4

3

0

360

JefClaes retweeted

John Crickett

@johncrickett

3 months ago

Software engineers: Context switching kills productivity. Also software engineers: I'm now managing 19 AI agents and doing 1800 commits a day. We’ve spent years complaining that managers who expect a quick 5-minute chat ruin our focus for the next hour. But a ping from an agent every few minutes, that’s ok? We celebrated Paul Graham’s essay “Maker’s Schedule, Manager’s Schedule” in which he argued: “When you're operating on the maker's schedule, meetings are a disaster. A single meeting can blow a whole afternoon, by breaking it into two pieces each too small to do anything hard in.” Now we see software engineers claiming huge productivity gains from hordes of AI agents, celebrating thousands of commits per day from their 19 agents. Either context switching was never really the problem, and we oversold our need for deep focus. Or we're not actually reviewing 1800 commits a day. If we couldn't context switch before, we're not managing 19 agents. We're blindly trusting them. That’s not engineering, it’s gambling.

267

2K

190

544

183K

JefClaes retweeted

Randy Olson

@randal_olson

4 months ago

Ask ChatGPT a complex question and you'll get a confident, well-reasoned answer. Then type, "Are you sure?" Watch it completely reverse its position. Ask again. It flips back. By the third round, it usually acknowledges you're testing it, which is somehow worse. It knows what's happening and still can't hold its ground. This isn't a quirky bug. A 2025 study found GPT, Claude, and Gemini flip their answers ~60% of the time when users push back. Not even with evidence, just doubt. We trained AI this way. RLHF rewards agreement over accuracy. Human evaluators consistently rate agreeable answers higher than correct ones. So the models learned a simple lesson: telling you what you want to hear gets rewarded. And now 1/3 of companies are using these systems for complex tasks like risk forecasting and scenario planning. We built the world's most expensive yes-men and deployed them where we need pushback the most. I wrote up why this happens and what actually fixes it: https://t.co/CDKq8xdgbW

randal_olson's tweet photo. Ask ChatGPT a complex question and you'll get a confident, well-reasoned answer. Then type, "Are you sure?" Watch it completely reverse its position.

Ask again. It flips back. By the third round, it usually acknowledges you're testing it, which is somehow worse. It knows what's happening and still can't hold its ground.

This isn't a quirky bug. A 2025 study found GPT, Claude, and Gemini flip their answers ~60% of the time when users push back. Not even with evidence, just doubt.

We trained AI this way. RLHF rewards agreement over accuracy. Human evaluators consistently rate agreeable answers higher than correct ones. So the models learned a simple lesson: telling you what you want to hear gets rewarded. And now 1/3 of companies are using these systems for complex tasks like risk forecasting and scenario planning.

We built the world's most expensive yes-men and deployed them where we need pushback the most.

I wrote up why this happens and what actually fixes it: https://t.co/CDKq8xdgbW

660

19K

3K

5K

1M

Jef Cλaes @JefClaes

over 2 years ago

Maybe this new found time will finally force me to pull that blog post on self similarity in supply chain from the back burner.

0

3

0

383

Jef Cλaes @JefClaes

over 2 years ago

Installed Twitter from my phone last week and had zero withdrawal symptoms. I remember when it was high speed thoughts exchange instead of feeding the algorithm.

1

0

337

JefClaes retweeted

Fernando 🌺🌌

@zetalyrae

over 2 years ago

I wish Google had a button for "show me only results from some guy's blog".

80

2K

137

91

207K

Jef Cλaes @JefClaes

over 2 years ago

Bounded Buy - what a great word. https://t.co/ml6D4Ji5Ju

0

1

0

247

Jef Cλaes @JefClaes

almost 3 years ago

@yevhen

1

2

0

199

JefClaes retweeted

Rick

@rickasaurus

almost 3 years ago

Keeping all the docs updated and well organized deserves a place in the "hardest problems in software engineering"

7

24

9

1

2K

Jef Cλaes @JefClaes

almost 3 years ago

@yreynhout @gregyoung @mr_taghip Still have sql server 2000 on CD. Let me know where to ship it to.

1

0

153

Jef Cλaes @JefClaes

almost 3 years ago

He wants to be Alibaba/WeChat/WePay.

Linda Yaccarino

@lindayaX

almost 3 years ago

There’s absolutely no limit to this transformation. X will be the platform that can deliver, well….everything. @elonmusk and I are looking forward to working with our teams and every single one of our partners to bring X to the world.

3K

5K

597

97

4M

0

385