Florian @FlorianCaesar - Twitter Profile

about 19 hours ago

@ctatedev I thought this too, but after experimenting with giving agents much more structured code info, it doesn’t actually help much at all. They’re just really good at figuring anything out with plain text and making changes just with text, which is almost disappointing :D

0

199

Florian

@FlorianCaesar

about 23 hours ago

Amazing. I had the exact same problem asking 5.4 not too long ago to come up with some benches for my VM using my lower level IR before I had that compiler phase properly working. GPT produced some very plausible looking benches but it turns out they weren’t actually exercising anything interesting and sneakily papered over some actual VM bugs by “simplifying” _failing_ fixtures.

0

1

0

1

734

Florian

@FlorianCaesar

1 day ago

@VictorTaelin Nothing wrong with bootstrapping to C first, that’s exactly what Static Hermes is doing also. So, no, no strong reason for the first iteration. Do whatever works fastest and easiest :)

0

27

Florian

@FlorianCaesar

1 day ago

@VictorTaelin Why not feed directly to LLVM or even MLIR or something?

1

3

0

1

649

1 day ago

Seems like an unlikely prediction to me, especially now that we’re over 6 months in and have seen what an absolute train wreck heavily agent authored code bases turn into at an astonishing and terrifying pace. I don’t think delegating everything to agents will work even in the medium term for anything but basic apps, there’s just too much lost in translation for “last mile fixes” to work well (unfortunately!). I mean you still have the IDE here, so you’ll be fine. But I think we need much more than just “manage fleets of agents better”, that’s just not a problem any experienced dev I know actually has.

0

254

FlorianCaesar retweeted

ZurichAI @zurichnlp

1 day ago

Friendly reminder that ZurichNLP#21 will be next Monday at the @ETH_AI_Center! :)

0

1

0

217

Florian

@FlorianCaesar

2 days ago

@RhysSullivan I have been longing for Cursor to acquire Zed and just get a really good autocomplete in a really fast IDE for exactly this reason. I would much rather drive one very fast agent + autocomplete than 5 terminal tabs. Agent code review is torturous.

1

2

0

210

FlorianCaesar retweeted

OpenAI

@OpenAI

2 days ago

Building apps has never been easier. With Sites, Codex can turn your work, ideas, and plans into an interactive website or app your team can explore, use, and share with a URL. Rolling out to Business and Enterprise plans, before expanding more broadly.

883

19K

2K

10K

9M

Florian

@FlorianCaesar

2 days ago

I once failed to specify that the compiler task was specific to one phase, and because it noticed that the old inference engine was stubbed out, Codex proceeded to clank out an entire new TS-grade inference phase into a single giant ungodly file that almost kind of worked. Completely unusable, of course, but man, I was impressed. The goblins just don’t care.

0

1

0

524

Florian

@FlorianCaesar

2 days ago

@samuelcolvin This is genuinely surprising to me. I know you guys are actual professional engineers, so that helps, but even those few of us who still take code seriously make mistakes. Maybe we just get to do more novel mistakes that AI can't find yet? Did anyone try the mythical Mythos?

1

0

308

FlorianCaesar retweeted

Anthropic

@AnthropicAI

3 days ago

Anthropic has confidentially submitted a draft S-1 registration statement to the Securities and Exchange Commission. Pending completion of SEC review, this gives us the option to pursue an initial public offering. Read more: https://t.co/onGZAhRLvD

979

22K

3K

20M

Florian

@FlorianCaesar

4 days ago

@samuelcolvin Yeah, but you’re using this power tool responsibly like a professional, so you’re already not part of the discussion :D Seriously though, it depends on the work. Sometimes I try many approaches, or do hungry side tasks, or yeet out large bindings files, and that adds up on those days.

0

4

0

318

Florian

@FlorianCaesar

4 days ago

@ThePrimeagen “it’s the agent native operating system for operating systems but reimagined as a b2b SaaS desktop shortcut”

0

1

0

268

Florian

@FlorianCaesar

4 days ago

@thsottiaux Fairly new bug but when it renders markdown tables (which, yay, great feature) it often starts truncating other parts of the text while streaming in the CLI. Once the message is finished it restores fine so it’s not a big deal, but it is slightly annoying :)

0

112

Florian

@FlorianCaesar

5 days ago

Hmm, not actually sure if this is true. It already feels only semi-true and more economic / integration / contractual lock-in based than actual merit based. Long term even more so, probably. The future is unknowable ofc, but for the work I do, models have been smart enough for like half a year at least, and not much has changed other than my own approach, except maybe some better instruction following.

0

59

FlorianCaesar retweeted

Anthropic

@AnthropicAI

7 days ago

We've raised $65 billion in Series H funding at a $965 billion post-money valuation, led by @AltimeterCap, Dragoneer, @Greenoaks, and @sequoia. This investment will help us advance our research and expand our capacity to meet growing demand for Claude.

1K

22K

2K

8M

FlorianCaesar retweeted

ZurichAI @zurichnlp

7 days ago

ZurichNLP#21 will be on Monday, June 8th at the @ETH_AI_Center! Jannis Vamvas (University of Zurich) on the challenge of Romansh and Eric Chen (@EPFL) on reasoning as test time learning. RSVP: https://t.co/pjdh3ewyoc

0

4

3

0

597

Florian

@FlorianCaesar

8 days ago

@ThePrimeagen It is kind of amazing they had enough AI capability to do a Zig > Rust rewrite of an entire _JS runtime_ but _not_ enough to get their most profitable billion dollar product that is still slow and buggy 6 months in rewritten in a language that doesn’t require >1GB RAM / session.

2

12

0

1K

Florian

@FlorianCaesar

11 days ago

@dhh For basic CRUD apps, sure, do whatever you want. For anything non-trivial, I thought we had long settled that, yes, you definitely do want as much static correctness as you can get.

0

341
