as good as LLMs have become at understanding instructions and solving the task, they still don't produce the code i would write
their solutions are always the 2nd or 3rd best way of doing something
you think this doesn't matter because so are yours
recommended reading. deepmind's new AI control roadmap. looks like they've given up on solving the lethal trifecta directly. the new direction seems to be a tower of LLMs. i suppose that's as good as it gets.
https://t.co/RiXTjnCJuz
Just Shipped: Flue 1.0 Beta
Flue is the TypeScript framework for building the next generation of agents, designed around an open agent harness with zero LLM lock-in. It’s like Astro, for agents.
Flue 1.0 has been redesigned around three core primitives:
🔁 Workflows — structured automations designed for background work, where your code drives the agent from start to finish.
🧭 Agents (New!) — autonomous, stateful loops where the model drives itself to complete a given task.
📡 Channels (New!) — connect agents to Slack, GitHub, Linear, Discord, Teams, and more. Flue handles the boilerplate for you.
Everything shares the same durable foundation, powered internally by Pi, Vite, and Durable Streams. Deploy anywhere, use any LLM, and recover running agents across restarts and downtime.
We’ve talked to a lot of teams building agents, and keep hearing the same thing: getting to production is hard work. We built Flue to help change that.
Flue 1.0 Beta is available today. Give it a try and let me know what you think!
So apparently after Meta leadership:
- Force-reassigned some of the best devs on teams to AI data labelling full-time
- Laid off another 10%
- Started to record every dev’s screen in the US 24/7
They've now realized that it has, indeed, started to destroy their eng culture, and are now trying to walk it back.
All of the above was unprompted, not forced by anything external or even by business reasons (Meta posted record revenue and record profits)
The biggest self-inflicted eng culture destruction I’ve seen in a matter of weeks
Final one-shot prompt I did before the Fable interruption: "build me a cool simulation thing that lets me demo the various forms of FTL travel from both famous works of fiction and scientific speculation. it should be graphically compelling & interesting." https://t.co/j9QwssK1mD
It is an incredibly small world... I've never before taken pics or video of a plane taking off and had it not return safely. Not a great feeling.
Aviation photography has been a blessing of a hobby, but as much as it can be a joy or exciting, these are, in the end, dangerous jobs people do every day. And I don't ever lose sight of that.
I am quite relieved to hear the pilot is ok. I wish them the best in recovery after ejecting: no small experience to go through.
@UK_Daniel_Card @ZackKorman How many people can diff a patch or read a CVE and create an exploit?
How many people with an LLM can do it?
Also, how quickly?
We're seeing more vulns on edge devices on a faster cadence, and I just think it's going to get worse before it gets better ;)
@NathanMcNulty @UK_Daniel_Card @ZackKorman Edge device vendors also have LLMs and can white-box test with them. Finding a vuln before the vendor does should be very expensive.
@ZackKorman Not as bad as the hype would have one believe. Perhaps it might speed up the next couple of months and cause a short term spike in chaos, but I've seen nothing that says this is a fundamentally different dynamic than what we've experienced since December.