AgenticRebirth @AgenticRebirth - Twitter Profile

AgenticRebirth @AgenticRebirth

7 days ago

@goodalexander Which provider did you end up landing on?

0

3

AgenticRebirth @AgenticRebirth

9 days ago

@usr_bin_roygbiv I’m beyond jealous… I assume at some point we’re going to have a giant memory glut but it’ll be years away.

0

6

AgenticRebirth @AgenticRebirth

9 days ago

@TheAhmadOsman - one RTX 3090 or one RTX Pro 6000? Does 96gb vram actually unlock anything drastically better?

0

7

AgenticRebirth @AgenticRebirth

10 days ago

Unfortunately this account has been shadowbanned for 10 days, for no reason. All attempts to request support from X have been ignored. If you can see this and are willing to ping @nikitabier, I would appreciate it. If this account is not restored soon, I will have to leave.

0

1

0

31

AgenticRebirth @AgenticRebirth

10 days ago

@mvanhorn Try using orca. It will preserve your sessions.

0

1

AgenticRebirth @AgenticRebirth

10 days ago

I have some interesting benchmarks on AI agent harnesses to post, but unsure if anyone can see my posts. If you can, could you please interact below?

0

1

0

30

AgenticRebirth @AgenticRebirth

10 days ago

@karpathy @gallabytes It's not obvious how this materially differs from a 'claw'. Can you elaborate on what you think makes this such a giant leap compared to tagging your agent in a Discord or Telegram chat?

0

1

AgenticRebirth @AgenticRebirth

19 days ago

While I like omp a lot in concept, I (sadly) still find codex to be more reliable. Even so, I will keep using/trying to propose improvements to omp since I don't want my model and my harness to be bound together.

Rhys

@RhysSullivan

19 days ago

people haven't used the codex desktop app and it shows

41

377

2

41

57K

0

53

AgenticRebirth @AgenticRebirth

19 days ago

this has been great marketing for fable, whole tl can't wait to get back to using it. they'll have 5x the demand on relaunch as they had on launch.

0

25

AgenticRebirth @AgenticRebirth

20 days ago

@omarsar0 There has actually been some work on this, I’m not surprised to see this approach works https://t.co/RLlklTadXB

0

15

AgenticRebirth @AgenticRebirth

20 days ago

We still know remarkably little about how to extract maximum performance from current models. The research on this topic seems extremely inconsistent, or at least difficult to interpret cohesively. Nevertheless, this is very cool.

OpenRouter

@OpenRouter

21 days ago

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

OpenRouter's tweet photo. Introducing the Fusion API, the smartest compound model in the market.

Fusion achieves Fable-level intelligence at half the price.

How it works 👇 https://t.co/OTUQAdTQjU

725

15K

2K

13K

6M

0

1

0

41

AgenticRebirth @AgenticRebirth

20 days ago

@sudoingX they're going to work closely with USG to make sure it's cleared after the anthropic debacle - which means it's probably going to be nerfed compared to what it could be, but still an improvement

0

13

AgenticRebirth @AgenticRebirth

20 days ago

if there's an economic downturn and people start dumping their used GPUs you bet your ass I'm slurping those up

0

22

AgenticRebirth @AgenticRebirth

20 days ago

This isn't new/surprising - in theory I knew it - but somehow it's still surprising to observe.

0

11

AgenticRebirth @AgenticRebirth

20 days ago

An observation from doing multiple terminal-bench runs is that results can vary significantly even if you change nothing. LLMs just don't work deterministically: if you run a test 5 times, they can fail twice and pass 3 times despite being configured exactly the same way.

1

0

26

AgenticRebirth @AgenticRebirth

21 days ago

@DavidOndrej1 I aint sleeping https://t.co/qAKbKLiiex

AgenticRebirth @AgenticRebirth

21 days ago

Baseline: Pi with GPT-5.5 (medium) scores 70.8% on Terminal Bench 2.1 at a cost of $35.14. Next: see if we can improve performance by tuning only the system prompt, without increasing cost.

0

1

0

136

0

59

AgenticRebirth @AgenticRebirth

21 days ago

Baseline: Pi with GPT-5.5 (medium) scores 70.8% on Terminal Bench 2.1 at a cost of $35.14. Next: see if we can improve performance by tuning only the system prompt, without increasing cost.

AgenticRebirth @AgenticRebirth

23 days ago

Today's Day 1: Deep-diving into agent harnesses. Let's find out what it takes to squeeze max performance from frontier models. Goal: Best quality + speed at the lowest token cost. Inspired by @usr_bin_roygbiv’s cheerleading, I’m testing with the Pi harness and optimising 3 core dimensions: 1. Context management. 2. Tools. 3. Control logic (loops, workflows, determinism). The loop is simple: • Run baseline benchmarks in Pi • Generate a narrow hypothesis on one dimension • Test and measure • Iterate Hoping to ship a stronger harness + learn a ton along the way. Follow along for updates and results! What harnesses are you running that I should try learn from? Drop your suggestions below 👇

1

4

0

1

490

0

1

0

136

AgenticRebirth @AgenticRebirth

21 days ago

@usr_bin_roygbiv Don’t fucking tell me that I already hate myself for not committing sooner. I don’t even wanna look at the ram I’m gonna have to buy

0

1

0

34

AgenticRebirth @AgenticRebirth

21 days ago

Extremely plausible. I’m buying an RTX Pro 6000 today.

Ahmad

@TheAhmadOsman

21 days ago

Next up is ID verification for AI models btw "...suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees."

10

66

3

6

3K

0

1

0

199

AgenticRebirth

@AgenticRebirth

Last Seen Users on Sotwe

Trends for you

Most Popular Users