Will Daubney

Verified account

@WillDaubney

VP Engineering @sxaler - AI enthusiast

London, England

Joined February 2009

289 Following

136 Followers

485 Posts

Pinned Tweet

10 months ago

We built a browser with no rendering engine. Instead: – GPT-5-mini “reads” the page 📝 – Google’s Nano Banana hallucinates it into an image 🍌 – You wait 20s to see the chaos ✨ It’s slow. It’s useless. It’s hilarious. Give it a try below!

2

5

1

1

327

about 16 hours ago

@dimitrioskonst @MLStreetTalk That’s good to hear - rooting for you guys!

0

1

0

0

47

about 22 hours ago

My hunch is the the included agent features bundled with 16.3 are there for this exact purpose. If you’re building with Claude Code or Codex - with the right skills enabled no doubt they can follow these more complex patterns but for sure it’s making it harder and harder for the code to be readable and easy to human debug.

0

0

0

0

495

3 days ago

@letsbuildmore @nico_laqua Think that’s the British in me 😂

0

0

0

0

66

Who to follow

Live the Life you imagine!

THE CANMAN 🇺🇲

@realJamesCDavis

peace. & love. 🪬♋✝️

3 days ago

@nico_laqua This would be believable if the text wasn't word for word the same

0

16

0

0

1K

6 days ago

@rowlsmanthorpe The hard problem will be that half of the current problems that Burnham will be confronting become non issues in the world of ASI - whereas small issues now like AI sovereignty will become some of the most important issues that the U.K. has ever or will ever face.

0

0

0

0

124

6 days ago

This is such an under-reported issue right now and really glad you're diving into it more. I'm just completely gobsmacked at how unconcerned or unaware the mainstream media and politicians are of this upcoming tidal wave. This should be policy area number 1. I'm interested if you've read https://t.co/JV2srHDDRN - and how a sovereign effort from the EU / aligned allies might play into this essentially winner takes all viewpoint that continually seems to be mostly accurate?

0

0

0

0

30

12 days ago

If GLM5.2 is genuinely almost frontier - which it seems to be - at what point do all of cyber protections on Opus and GPT 5.5 become kind of pointless? Or are these models not good at cyber stuff yet?

Proximal @ProximalHQ

12 days ago

GLM 5.2 ranks #3 on FrontierSWE. It is only behind Fable 5 and Opus 4.8, and it outperforms GPT-5.5. This is the first model that closes the large gap between models from Anthropic / OpenAI and other providers, and it is the strongest open-weight model by far.

47

1K

144

314

328K

0

1

0

0

251

about 2 months ago

@harry_uglow Memory management and retrieval seems to be super key in getting this to work well.

0

1

0

0

34

3 months ago

@t_blom The question now is will OpenAI, Google and the fast follower Chinese labs do the same, or is it too big a competitive advantage to hold off releasing a new gen of frontier models

0

0

0

0

97

4 months ago

@jacobtechtavern @steipete This is it 🙃

0

1

0

0

20

4 months ago

@jacobtechtavern @steipete ** flashbacks intensify **

1

1

0

0

38

7 months ago

QQ: Building a complex agent or getting a working Google Vertex AI API key Which should take longer? You’d be surprised at my answer (or maybe not)

0

0

0

0

47

9 months ago

@aditabrm @reductoai @a16z Congrats 🙌

0

1

0

0

116

9 months ago

@utdreport @OptaAnalyst Loving how so many people don't understand that this is just based on if the scorelines of games finished as per xG not some weird pundit prediction. What it is showing is that we're not finishing our chances pretty much worse than anyone in the league.

1

14

1

0

4K

10 months ago

@jacobtechtavern Hahaha brilliant 😂

1

1

0

0

35

10 months ago

@benhylak Really good blog post Ben!

1

2

0

0

885

10 months ago

@benhylak This is the point so many people are missing. Of course evals are useful for sense checking outputs. But A/B testing and intuition are separating the great products from the okay products.

0

1

0

0

502

10 months ago

Link to Github: https://t.co/dLwM7CjQ7M

0

0

0

0

39

10 months ago

We built a browser with no rendering engine. Instead: – GPT-5-mini “reads” the page 📝 – Google’s Nano Banana hallucinates it into an image 🍌 – You wait 20s to see the chaos ✨ It’s slow. It’s useless. It’s hilarious. Give it a try below!

2

5

1

1

327

Last Seen Users on Sotwe

Trends for you

Most Popular Users