Ryan Janssen @ryanjanssen - Twitter Profile

@Suhail And it’s not just that horizons are growing - if Anthropic’s take-off blog is correct then model development times are shrinking too this could be sooner than we think

0

1

0

118

Ryan Janssen

@ryanjanssen

4 days ago

@alexalbert__ sooo…could you reset them again? 😇

0

1

0

257

Ryan Janssen

@ryanjanssen

4 days ago

@rankintweets mine rate-limited after “I am”

0

1

0

62

Ryan Janssen

@ryanjanssen

4 days ago

@RhysSullivan they’re clearly gonna call it Symphony

0

57

Ryan Janssen

@ryanjanssen

4 days ago

@robertcourson OpenAI tends to hold things back until someone advances SOTA. Will be interesting to see if they drop something big in the next week or so (and if not, they’re legit behind)

0

72

Ryan Janssen

@ryanjanssen

4 days ago

WE NEED THE OFFICIAL PELICAN @simonw

0

15

Ryan Janssen

@ryanjanssen

4 days ago

many people are missing the distinction between /loops and /goals

0

19

Ryan Janssen

@ryanjanssen

4 days ago

Even though it's the most important product in the world right now, general purpose agent harnesses are still up for grabs. I've tried all of them and none of them give me everything I need. So far it's basically been an either/or: - The frontier models are finally transitioning from the terminal to apps, but they're still architected for smaller, local tasks. - The harnesses have a great Telegram-style experience, but harder to use on a laptop - Even though we know the really valuable stuff for an agent now (self-learning, compounding knowledge, loops), you have to work really hard to get them out of the box. None of them feel really ubiquitous, and they all feel like they're just scratching the surface of what a smart model can do. I think the one to beat right now is Codex with an always-on server. But that's not going to fly for the non-nerd public. And the product that satisfies the full shape has to live outside Claude/Codex by definition. Until the model wars settle down, we need interoperability and the one thing Codex won't do is work with a different model. This is going to get figured out in the next 6 months. Big prize and still anyone's game.

ryanjanssen's tweet photo. Even though it's the most important product in the world right now, general purpose agent harnesses are still up for grabs.

I've tried all of them and none of them give me everything I need.

So far it's basically been an either/or:

- The frontier models are finally transitioning from the terminal to apps, but they're still architected for smaller, local tasks.
- The harnesses have a great Telegram-style experience, but harder to use on a laptop
- Even though we know the really valuable stuff for an agent now (self-learning, compounding knowledge, loops), you have to work really hard to get them out of the box.

None of them feel really ubiquitous, and they all feel like they're just scratching the surface of what a smart model can do.

I think the one to beat right now is Codex with an always-on server. But that's not going to fly for the non-nerd public.

And the product that satisfies the full shape has to live outside Claude/Codex by definition. Until the model wars settle down, we need interoperability and the one thing Codex won't do is work with a different model.

This is going to get figured out in the next 6 months. Big prize and still anyone's game.

0

35

Ryan Janssen

@ryanjanssen

4 days ago

@gregisenberg "This agglomeration which calls itself the Holy Roman Empire was neither holy, nor Roman, nor an empire" my favorite piece on this @tanayj https://t.co/mXkW7b9UgC

0

3

0

1

228

Ryan Janssen

@ryanjanssen

4 days ago

lol no kidding

0

14

Ryan Janssen

@ryanjanssen

4 days ago

@michalmalewicz Apple’s path is definitely the riskier of the two though. For OpenAI’s bet to work, LLMs need to be really useful. For Apple’s bet, they need to be both useful and small/cheap.

1

0

267

Ryan Janssen

@ryanjanssen

5 days ago

@clairevo In non-coding domains, IMO the hardest part is defining a clear success condition those can be highly subjective (and often need weeks of wall clock time before they can even be met)

0

1

0

1

44

Ryan Janssen

@ryanjanssen

5 days ago

@zachtratar @hnshah What does the cron look like? just -exec “check for notion jobs”? openclaw heartbeat is very good but feels like it’s harder to replicate in codex/CC (or even hermes)

1

0

64