oribi @0ribi - Twitter Profile

Pinned Tweet

oribi @0ribi

about 1 year ago

to everyone who uses light mode, mad respect, seriously

21

41

0

3K

0ribi retweeted

dax

@thdxr

1 day ago

i have low conviction on model routers - very open to changing my mind but this is a snapshot of my current thoughts - i don't think it's good to not be aware of what model you're using. coding with LLMs is a skill you develop and getting a feel for models is part of that - people (at scale) don't have this skill right now which is why a lot of companies are complaining that people are using expensive models for dumb things. a model router promises to solve this without the user having to do anything but i think the issue is missing feedback loops to the user. id rather we figure out how to help users get smarter - i dont even know how much you can model route when factoring in things like prompt cache. only so much you can do - their effectiveness is a bit exaggerated by the same dynamic that's impacting everything AI. so many companies desperately searching for opportunities and trying anything. model routing is the one thing models labs cannot do so everyone is jumping on it

139

986

39

207

90K

0ribi retweeted

Ok, Jose

@JoseRMejia

6 days ago

it’s simple, really

13

666

68

70

21K

oribi @0ribi

10 days ago

the Fable copium is remarkable

OpenRouter

@OpenRouter

10 days ago

Introducing the Fusion API, the smartest compound model in the market.

Fusion achieves Fable-level intelligence at half the price.

How it works 👇 https://t.co/OTUQAdTQjU

714

15K

2K

14K

6M

0

75

0ribi retweeted

Prompter

@PromptLLM

12 days ago

Insane take from Fable 5

236

9K

670

3K

540K

0ribi retweeted

Artificial Analysis

@ArtificialAnlys

12 days ago

We've updated the Artificial Analysis Coding Agent Index, replacing SWE-Bench Pro with Datacurve's DeepSWE benchmark - the swap lifts Codex with GPT-5.5 (xhigh) above Claude Code with Opus 4.8 (max), while the newly released Claude Fable 5 (max) in Claude Code debuts at the top.

DeepSWE, built by @datacurve, writes its tasks from scratch rather than adapting them from public GitHub issues or pull requests, so no model has seen the solutions during training. That matters because SWE-Bench Pro, the benchmark it replaces in our Coding Agent Index, had grown gameable, with some models recovering the fix from the repository's commit history instead of solving the task.

The swap reorders the index: Codex with GPT-5.5 (xhigh) rises from 65 to 76, overtaking Claude Code with Opus 4.8 (max) at 73. Claude Code with Fable 5 (max), which enters directly on the refreshed index, leads at 77. SWE-Bench Pro had been flattering some combinations and penalizing others.

More below.

114

2K

185

412

568K

oribi @0ribi

15 days ago

sucks to be a nintendo employee rn

0

2

0

36

0ribi retweeted

dax

@thdxr

about 2 months ago

some of you need to go outside and touch ass

150

3K

103

105

109K

0ribi retweeted

Martinalgui

@MartinoliCuri

about 2 months ago

*The company falling to pieces* Me in the bathroom watching PSG vs Bayern:

187

120K

15K

3K

2M

0ribi retweeted

Niels Rogge @NielsRogge

about 2 months ago

FYI Claude Code is mostly a vibe-coded product (as they say, 100% written by Claude)

It's the worst harness for Opus 4.6 among ANY harness on Terminal-Bench 2 https://t.co/MpZMsO8uRu

100

2K

73

1K

448K

oribi @0ribi

2 months ago

really slow btw

BuBBliK

@k1rallik

2 months ago

NVIDIA IS LITERALLY GIVING AWAY FREE AI INFERENCE

I literally set it up in 5 minutes and couldn't believe it was free

DeepSeek, MiniMax, Kimi, GLM, Llama - all on NVIDIA's DGX Cloud via clean OpenAI-compatible API.

Setup in 5 min:
→ https://t.co/2zMHb4Q8zV
→ grab API key
→ base_url = https://t.co/zUbPzFZA7J
→ drop it into any OpenAI SDK

We've been using it. Yes, it slows down under heavy load. Yes, free tier has limits.

But for solo devs, indie hackers, and students learning AI engineering? This is the best free playground that exists right now.

Stop paying $20/mo to experiment. Use this first.

33

1K

130

2K

160K

0

46

oribi @0ribi

2 months ago

@dhruvtwt_ 😂😂

0

1

0

12

oribi @0ribi

2 months ago

all this to just center a div

Dhruv

@dhruvtwt_

2 months ago

I got bored, so I changed my setup again.

24

103

0

5

8K

1

6

0

141

oribi @0ribi

2 months ago

@sznmelvin nah bro

0

10

oribi @0ribi

2 months ago

tf is adaptive?

1

0

33

0ribi retweeted

Mike

@WorldsByMike

2 months ago

@claudeai @AnthropicAI Um guys. Why does Opus 4.7 think it’s Sonnet 4.6. Feels like it should know what it is.

36

514

14

35K

oribi @0ribi

2 months ago

tired of juggling terminals, agents, and projects across multiple windows?

one canvas. every project. every agent. every terminal.
this is ekegai https://t.co/O77O3tXWsH

1

6

1

0

172

0ribi retweeted

ℏεsam

@Hesamation

2 months ago

"girl i'm running OpenClaw on 2 Mac studios and 4 Mac minis running 3 local models and 8 different agents that I constantly monitor with a cute monitoring dashboard, i need 4 screens just to look at it." "OK BUT WHAT THE FUCK ARE YOU USING IT FOR?"

9

438

24

85

38K

oribi @0ribi

2 months ago

@explorersofai does GLM 5.1 have vision

1

0

20

oribi @0ribi

3 months ago

@ube_codes @developerxcodes all of that just to center a div

1

0

13

0ribi retweeted

AI at Meta

@AIatMeta

3 months ago

Introducing Muse Spark, the first in the Muse family of models developed by Meta Superintelligence Labs.

Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration.

Muse Spark is available today at https://t.co/wHkMPH82ZH and the Meta AI app. We’re also making it available in private preview via API to select partners, and we hope to open-source future versions of the model.

Learn more: https://t.co/PloE9q5x96

577

9K

1K

3K

3M
