Requesty @RequestyAI - Twitter Profile

Requesty @RequestyAI

5 days ago

@ThibaultJaigu @vkhosla https://t.co/FJP3Dw4RIS

0

15

Requesty @RequestyAI

5 days ago

Now live on Requesty! https://t.co/FJP3Dw4RIS

1

2

0

151

RequestyAI retweeted

Thibault Jaigu

@ThibaultJaigu

11 days ago

GLM-5.2 now live on @RequestyAI ! Congrats to our friends at @Zai_org

0

4

1

0

190

Requesty @RequestyAI

12 days ago

Free models on Requesty! https://t.co/JKpcQ3VNyC

1

4

0

1

325

Requesty @RequestyAI

12 days ago

@benln Requesty! https://t.co/YbbMtYUv9e London, United Kingdom

0

5

0

2

3K

Requesty @RequestyAI

12 days ago

@aicodeking Some free models on Requesty too! https://t.co/ciAYJHXJiE

0

1

0

37

Requesty @RequestyAI

18 days ago

Mythos Live on @RequestyAI !

0

2

0

551

Requesty @RequestyAI

26 days ago

Our friends at @MiniMax_AI are doing a tremendous job! Now available on https://t.co/ghznn8v9nQ

MiniMax (official) @MiniMax_AI

26 days ago

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: https://t.co/fHRdSV7BwZ Token Plan: https://t.co/BDCycxepZw 🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul Weights & Tech Report in ~10 Days

MiniMax_AI's tweet photo. Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities

- Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas
- MiniMax Sparse Attention scales context to 1M
- Natively Multimodal from Step Zero

API: https://t.co/fHRdSV7BwZ
Token Plan: https://t.co/BDCycxepZw
🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul

Weights & Tech Report in ~10 Days

564

12K

1K

3K

5M

0

1

0

223

Requesty @RequestyAI

about 1 month ago

The Coding Agent Economy. • $92 avg cost per active user / month • Claude powers 92% of all coding agent spend (up from 68%) • Cache hit rates jumped 52% → 86% https://t.co/CG9rIsKXkq

0

203

Requesty @RequestyAI

about 1 month ago

The throughput density data suggests something counterintuitive: the highest throughput providers are not necessarily serving the largest requests. They are serving a massive number of relatively small generations extremely efficiently. A lot of AI infrastructure performance right now looks less like “big intelligence” and more like high frequency inference systems. Congrats @GroqInc https://t.co/8ED0RDo3QT

0

1

0

158

Requesty @RequestyAI

about 1 month ago

The surprising thing in the latency data is how compressed the top providers have become. For a lot of workloads, the gap between “fast” and “slow” providers is now smaller than the variance introduced by tool calls, long context, and agentic execution itself. Model latency is starting to matter less than workflow latency. Congrats @xai https://t.co/XXys3M1CTJ

0

110

Requesty @RequestyAI

about 1 month ago

Most AI teams have zero control over which models employees and agents can actually use. Today we’re launching Approved Models + Access Lists in Requesty. You can now: • approve models org-wide • restrict models by API key or group • enforce regional/compliance policies • standardize model usage across teams AI governance is becoming critical infrastructure. https://t.co/1DWRNdZOut

0

1

0

120

Requesty @RequestyAI

about 1 month ago

The open source model market is consolidating much faster than expected. A handful of OSS families now dominate traffic share while most new releases barely register. The gap between “models people talk about” and “models people actually use in production” is getting very large. @deepseek_ai is still dominating! Jan → Apr 2026 data from Requesty ↓ https://t.co/outpgd6i2w

0

68

Requesty @RequestyAI

about 2 months ago

The interesting metric is not tool call request share. It’s tool call token share. Once workflows become agentic, token consumption shifts dramatically toward tool execution: retrieval code output tool responses intermediate reasoning The number of requests can look normal while the token profile completely changes.https://t.co/b95ln2sqOL

0

79

Requesty @RequestyAI

about 2 months ago

One of the clearest signals of how people actually use AI might be finish reasons. Anthropic direct traffic is now 52% tool calls. OpenAI direct is just 3%. You can literally see the difference between conversational usage and agentic workflows in the data. April 2026 data from Requesty ↓ https://t.co/uFQMpkLRZr

0

277