Cydu

Cat Chen, @[email protected]

about 12 hours ago

To stretch the analogy further: your “landlord” can see everything happening inside your home. How you arrange furniture, what visitors you have, etc. And they have every incentive to use this info to commoditize your business advantage. Own, don’t rent

cydu retweeted

Lin Qiao

@lqiao

about 21 hours ago

https://t.co/rMVRgBvwEI

456

629

442K

cydu retweeted

Satya Nadella

@satyanadella

1 day ago

https://t.co/vLmiBKTtX3

38K

54K

61M

cydu retweeted

Richy Chen

@richychn

3 days ago

the same tech that powered Kimi K2.6 Fast on Fire Pass now powers Kimi K2.7 Fast

Who to follow

@CatChen

Career coach, programmer, author, speaker. Former engineer & manager @ Chime, Robinhood, Facebook, Wandou Labs, Yahoo, Baidu. Career coaching available.

Beerhead, drive by curiosity, Sentimentalism Father.

cydu retweeted

jietang

@jietang

3 days ago

GLM-5.2 is Fully Open, Frontier Intelligence Belongs to Everyone Today, the sudden restriction of certain frontier models is deeply regrettable. At a time when access to frontier models is abruptly cut off for non-technical reasons, we are even more convinced of one thing: science should be global. The path to AGI (Artificial General Intelligence) must never be enclosed by high walls. We have always believed that AGI should be the cornerstone for all of humanity to collaboratively explore the boundaries of intelligence and solve complex challenges, rather than a privilege monopolized by a few rules and subject to revocation at any moment. In the face of external blockades and restrictions, our attitude is one of radical openness. Frontier intelligence must remain open-source, accessible, and buildable, serving every dedicated developer. GLM-5.2 is Zhipu's most capable open-source model to date. It not only supports a truly usable 1M context window but also maintains a continuous lead in the independent completion of long-horizon tasks, providing solid foundational support for building complex agent applications. It also continues to be our main engine for creating the strongest domestic coding model. Tonight at 5:21—at this special moment—GLM-5.2 will officially be available to all GLM Coding Plan users (including Lite / Pro / Max). The API will also go live next week. A step closer to frontier intelligence for everyone. The future of AI is open, and it is for the people. ModelKey: GLM-5.2

260

767

926K

cydu retweeted

3 days ago

MiniMax M3 is live on Fireworks. Day-0, fastest endpoint for the MiniMax series. → Top open-weight model on the Artificial Analysis index → 512K context, native image + video input → MSA sparse attention: 9× faster prefill, 15× faster decode → Priced at parity with M2.7 Long-horizon agents, full-repo understanding, multimodal coding all in one model. Learn more: https://t.co/j0RcJxgY8K

FireworksAI_HQ's tweet photo. MiniMax M3 is live on Fireworks. Day-0, fastest endpoint for the MiniMax series.

→ Top open-weight model on the Artificial Analysis index
→ 512K context, native image + video input
→ MSA sparse attention: 9× faster prefill, 15× faster decode
→ Priced at parity with M2.7

Long-horizon agents, full-repo understanding, multimodal coding all in one model.

Learn more: https://t.co/j0RcJxgY8K

16K

cydu retweeted

StepFun @StepFun_ai

18 days ago

⚡️ Step 3.7 Flash is here: The new frontier is agent efficiency. #1 ClawEval-1.1 (67.1), #1 SimpleVQA Search (79.2), #2 SWE-PRO (56.3), 95.3 on V* Python. Open weights under Apache 2.0. Built for agentic, coding, search, and multimodal workflows — balancing speed, cost, and reliable execution. - 400 TPS. 198B sparse MoE, ~11B active. 256K context, 3 reasoning levels. - Understands UIs, charts, docs, images — then writes code or calls tools to act on what it sees. - Web + visual search reaches further: more sources, deeper follow-up. - Reliable tool use — less drift, fewer broken toolcalls. 98%+ on τ²-bench across all difficulty levels. - Works with Claude Code, KiloCode, Hermes Agent, OpenClaw, and protocols like MCP. - Runs locally on Mac Studio M4 Max, DGX Spark, AMD AI Max+ 395. GitHub: https://t.co/kqlZkVIRHv HuggingFace: https://t.co/qqceCrgPiw GGUF: https://t.co/rR6XrnymWG ModelScope: https://t.co/wney6Tzvqy API: https://t.co/RvHWzRG7Fu Blog: https://t.co/BxDiajiQ5G

StepFun_ai's tweet photo. ⚡️ Step 3.7 Flash is here: The new frontier is agent efficiency.

#1 ClawEval-1.1 (67.1), #1 SimpleVQA Search (79.2), #2 SWE-PRO (56.3), 95.3 on V* Python. Open weights under Apache 2.0.

Built for agentic, coding, search, and multimodal workflows — balancing speed, cost, and reliable execution.

- 400 TPS. 198B sparse MoE, ~11B active. 256K context, 3 reasoning levels.
- Understands UIs, charts, docs, images — then writes code or calls tools to act on what it sees.
- Web + visual search reaches further: more sources, deeper follow-up.
- Reliable tool use — less drift, fewer broken toolcalls. 98%+ on τ²-bench across all difficulty levels.
- Works with Claude Code, KiloCode, Hermes Agent, OpenClaw, and protocols like MCP.
- Runs locally on Mac Studio M4 Max, DGX Spark, AMD AI Max+ 395.

GitHub: https://t.co/kqlZkVIRHv
HuggingFace: https://t.co/qqceCrgPiw
GGUF: https://t.co/rR6XrnymWG
ModelScope: https://t.co/wney6Tzvqy
API: https://t.co/RvHWzRG7Fu
Blog: https://t.co/BxDiajiQ5G

120

213

620

340K

cydu retweeted

20 days ago

Deep dive into how Composer 2 & 2.5 was cooked, and the cool infra @cursor_ai and @FireworksAI_HQ built together to power RL for it Huge thanks to maestro @ellev3n11 and @sonyatweetybird for a fun conversation. Available on your favorite podcast platform

cydu retweeted

26 days ago

Nathan's @cursor_ai team didn't prompt-engineer their way to Composer 2.5. They trained it. The massive RL program runs RL rollouts on Fireworks, alongside production inference. "Comment 🔥 to see my prompt" → may work for influencers (in 2025) Training your own model → the only way to sustain a competitive moat in 2027+ Learn more → https://t.co/ezpDabw8mS

11K

cydu retweeted

Z.ai @Zai_org

26 days ago

https://t.co/jaOMnP7Yud

894

129

638

187K

cydu retweeted

about 1 month ago

I’m often asked why fine tune in 2026: 1) Precise control over your product UX not achievable with just prompting 2) Better cost/speed/quality tradeoff: pay fraction of Mini price while getting Pro quality for your use case Frontier labs heavily do 1&2 for their own apps. But they want you to pay $ for the most expensive API We believe everyone should have access to customization at frontier. https://t.co/W3dkEoCTSn

cydu retweeted

about 1 month ago

Why are four of the top 10 trending vendors AI inference platforms? Companies are getting selective about which model handles which task, and routing tokens accordingly. Read more from @arakharazian at @tryramp: https://t.co/E5MAJQfsES

FireworksAI_HQ's tweet photo. Why are four of the top 10 trending vendors AI inference platforms?

Companies are getting selective about which model handles which task, and routing tokens accordingly.

Read more from @arakharazian at @tryramp: https://t.co/E5MAJQfsES https://t.co/Poc68qXgj1

cydu retweeted

about 2 months ago

We didn’t ship DeepSeek V4 on Day 0 like we always do. Why? We love speed at @FireworksAI_HQ , but quality >> speed. Running our extensive evals, we found even the official reference model code producing corrupted outputs. When we tested over the weekend, all endpoints except official DeepSeek API had these issues. After 2 days of extensive debugging with @deepseek_ai , @sgl_project and @vllm_project communities, the issues are fixed and we’re proud to serve DeepSeek V4 Pro in all its glory. Check the full story 👇

752

199

98K

cydu retweeted

MiniMax (official) @MiniMax_AI

about 2 months ago

DeepSeek V4-Pro is live on Fireworks: → Frontier-class coding and reasoning, open-weights → 1M context standard, MIT licensed → $1.74 / $3.48 per 1M tok Get started now → https://t.co/BKrdxniuyR

FireworksAI_HQ's tweet photo. DeepSeek V4-Pro is live on Fireworks:
→ Frontier-class coding and reasoning, open-weights
→ 1M context standard, MIT licensed
→ $1.74 / $3.48 per 1M tok

Get started now → https://t.co/BKrdxniuyR https://t.co/T6NGGWgHme

271

26K

cydu retweeted

Lin Qiao

@lqiao

about 2 months ago

🔥 I'm thrilled to welcome George Hu as President of Fireworks AI! 🔥 When George and I first met a year ago, I said to him – we are too small for you. But today, we are ready! We have come a long way to build out the next phase of our company. When the seven cofounders started the company, we wanted to honor our roots in PyTorch; torch carries the fire. We want the fire of AI to serve people everywhere. We named our company Fireworks to move the industry forward to be AI-first. Three and a half years in, we have grown substantially. Today, we process 30 trillion tokens a day, the biggest inference provider outside of the frontier labs’ APIs. We got to this impact by relentlessly defining what frontier inference is. It is the control we give to all application developers to customize their own model, deliver magically fast experience, and scale with maximum efficiency. We have built multiple product lines with Fireworks Training, Fireworks Inference and the best open models on one platform. We are accelerating to the next phase of scaling – enabling customers everywhere to experience the power of their own private models on Fireworks. This growth demands a world-class business leader, a unicorn, a one-of-a-kind. George is this leader. He is one of a very small number of people who have scaled not one but two iconic software companies. He worked alongside @Benioff at @salesforce , where he helped grow revenue from $100M to $5B+, and then did it again with @jeffiel at @twilio , driving nearly 10x growth in five years. George, thank you for your trust in me and the Fireworks team. I admire the conviction he brings to @FireworksAI_HQ , the same conviction I've had with the other cofounders since Day One. Let's build something generational! We're still early. The companies winning with AI are the ones who own their intelligence and their differentiation. Our job is to make that possible for every company in the world. And if your conviction is as strong as George’s and mine, we're hiring. Let’s bring the 🔥!

lqiao's tweet photo. 🔥 I'm thrilled to welcome George Hu as President of Fireworks AI! 🔥

When George and I first met a year ago, I said to him – we are too small for you. But today, we are ready! We have come a long way to build out the next phase of our company.

When the seven cofounders started the company, we wanted to honor our roots in PyTorch; torch carries the fire. We want the fire of AI to serve people everywhere. We named our company Fireworks to move the industry forward to be AI-first.

Three and a half years in, we have grown substantially. Today, we process 30 trillion tokens a day, the biggest inference provider outside of the frontier labs’ APIs. We got to this impact by relentlessly defining what frontier inference is. It is the control we give to all application developers to customize their own model, deliver magically fast experience, and scale with maximum efficiency. We have built multiple product lines with Fireworks Training, Fireworks Inference and the best open models on one platform.

We are accelerating to the next phase of scaling – enabling customers everywhere to experience the power of their own private models on Fireworks. This growth demands a world-class business leader, a unicorn, a one-of-a-kind. George is this leader. He is one of a very small number of people who have scaled not one but two iconic software companies. He worked alongside @Benioff at @salesforce , where he helped grow revenue from $100M to $5B+, and then did it again with @jeffiel at @twilio , driving nearly 10x growth in five years.

George, thank you for your trust in me and the Fireworks team. I admire the conviction he brings to @FireworksAI_HQ , the same conviction I've had with the other cofounders since Day One. Let's build something generational!

We're still early. The companies winning with AI are the ones who own their intelligence and their differentiation. Our job is to make that possible for every company in the world.

And if your conviction is as strong as George’s and mine, we're hiring.

Let’s bring the 🔥!

114

28K

cydu retweeted

Sarah Sachs

@sarahmsachs

about 2 months ago

When I saw our team's evals of Kimi 2.6, I thought "ok, things are gonna get interesting now". This is the first open-weight model that plays like a top-class agentic model. Watching it go through ambiguous and meticulous chained tool work successfully puts it squarely in the wheelhouse of Opus 4.6. We're looking at an open weight model, but with much cheaper direct inference provider pricing. For a subclass of our eval set, it's outperforming GPT 5.2. We're about to undergo a gigantic industry shift. Open weight is no longer for those who fine tune, those who want on-prem. It's an actual, reliable option for it's quality/price/latency profile for difficult agentic work. It's not perfect. It's token hungry, relatively slow, and can get stuck in “thinking loops". But those are things we can engineer around. For value it is, and how it positions itself against major labs, this is a dramatic day for open weight models. We sprinted as a team and worked closely with @FireworksAI_HQ to get this to our customers on day 0. No one should wait to try out a change like this. Try it yourself and tell me where it's working for you.

758

346

235K

cydu retweeted

2 months ago

We're delighted to announce that MiniMax M2.7 is now officially open source. With SOTA performance in SWE-Pro (56.22%) and Terminal Bench 2 (57.0%). You can find it on Hugging Face now. Enjoy!🤗 huggingface：https://t.co/ApWrahIl3o Blog: https://t.co/gAxeFsNdW4 MiniMax API: https://t.co/1dgbMx0Q7K

MiniMax_AI's tweet photo. We're delighted to announce that MiniMax M2.7 is now officially open source.
With SOTA performance in SWE-Pro (56.22%) and Terminal Bench 2 (57.0%).

You can find it on Hugging Face now. Enjoy!🤗
huggingface：https://t.co/ApWrahIl3o
Blog: https://t.co/gAxeFsNdW4
MiniMax API: https://t.co/1dgbMx0Q7K

297

759

cydu retweeted

2 months ago

Your data is your moat. Build best AI. Let us handle the infra 8B or 1T params 1k or 1M context LoRA or full param SFT, DPO, GRPO or custom Fully managed or composable SDK Fireworks Cloud or BYOC It’s Fireworks Training: full flexibility, no compromises What will you build?

120

19K

cydu retweeted