🚀 1,000+ TOKENS/S ON A 1T MODEL! 🚀
We are thrilled to release Xiaomi MiMo-V2.5-Pro-UltraSpeed in collaboration with @TileRT_AI , breaking the 1,000 tokens/s output speed on a 1 Trillion parameter model for the FIRST TIME!
Not wafer-scale integration like Cerebras. Not pure on-chip SRAM chips like Groq. We achieve 1,000 tps on a 1T MoE model using just a SINGLE, STANDARD 8-GPGPU NODE.
Read the full technical deep dive:https://t.co/MX0kjHKdKi
Want to experience the future of real-time AI?
👉 Apply for UltraSpeed now: https://t.co/aeWAxyhwVk
⏳ Limited-Time Access: Application-based · Jun 8 – Jun 23 (PDT)
💬 Chat Experience: Completely FREE for a limited time — try the blazing-fast web chat now.
⚡ UltraSpeed API: Just 3x the price for a ~10x boost in output experience.
🤝 Enterprise & Large-Scale Needs: [email protected]
난 게임을 즐겨하지 않는데 이런건 진짜 유익함
만원으로 데이터 센터의 복잡한 구조와 컴퓨터 인프라를 이해하는 스팀게임 : Data Center
빈 방에서 시작해서
랙 구매 → 서버 장착 → 모든 케이블을 직접 손으로 하나하나 연결해야함
실제 데이터 센터처럼 고객 트래픽을 처리하는 시뮬레이션 게임
출시 48시간 만에 180개가 넘는 리뷰가 달렸고, 플레이어들은 “최근 본 시뮬레이션 게임 중 가장 몰입감 있다”, “컴퓨팅 인프라를 이해하는 데 최고”라는 평가를 하고 있습니다.
the public doesn’t see that current models can do so much, you don’t need the internal locked behind lab doors.
Also not sure I agree with Sebastien on letting the math community slowly work out these problems themselves, they’ve been notorious gatekeepers.
🚨 Bitcoin just dropped from $74,000 to $67,500 in 48 hours. On no real news.
One thesis that fits the data:
The exit liquidity rotation has begun.
In the next months, four companies are raising over $350 billion in fresh equity:
– SpaceX IPO: ~$75B
– OpenAI raise: ~$100B
– Anthropic raise: ~$100B+
– Google net equity issuance: ~$80B
That money has to come from somewhere. Existing portfolios. Risk-on capital. Cash.
Bitcoin is the most liquid risk-on asset on earth. Selling it is the fastest way to free up dollars without triggering tax events on long-held equity positions.
If the most religious Bitcoin holders – the corporate treasuries, the funds, the whales – are even partially rotating to participate in the largest IPO cycle in history, you don't need a news catalyst to explain the drop.
You just need the supply curve to flip.
This isn't bearish on Bitcoin long-term. It's a sign that the entire risk-on crowd is preparing to absorb the largest equity issuance year since 2000.
When the marginal Bitcoin holder needs to be on a SpaceX cap table, Bitcoin goes down for reasons that have nothing to do with Bitcoin.
The exit liquidity avalanche doesn't just hit overvalued stocks.
It hits anything liquid.
I wrote this ~3 months ago, and since then,
1) Memory has been more or less fully integrated with the frontier models
2) Almost all features that made OpenClaw unique as a harness has been fully absorbed by the frontier models (e.g. schedules, loops, goals, memory, etc.)
3) New, vertical killing features and capabilities are being added every other week
--
All that being said, agentic engineering is still an incredibly high skill affair.
It is now obvious to me that there is a gulf of know-how and tacit knowledge between those that CAN remove humans-out-of-the-loop and actually produce a working product, and the rest of the world insisting that agents are still producing "slop".
Première mondiale ! Un robot a travaillé 200h non-stop, et trié plus de 249 000 colis à lui seul. Pas une seule panne, pas une seule pause et tout a été diffusé en live pour le prouver.
À la base c'était un défi de 8h. Le robot a tellement bien tourné qu'ils ne l'ont jamais coupé. 200 heures plus tard il tournait encore.
Le truc de fou, c'est qu'il y a quelques jours un stagiaire a fait un duel contre le robot sur un shift de 10h. Le gars a gagné. De justesse, 2.79 secondes par colis contre 2.83 pour la machine. Sauf que le stagiaire a fini avec l'avant-bras en vrac. Le robot lui il a continué 190 heures de plus sans broncher.
Et c'est là que je comprends pas. On a littéralement un robot humanoïde qui fait un boulot d'entrepôt en continu, sans supervision, tout est géré par son IA embarquée. Si le robot bug, il se reset tout seul et reprend. Si il a un souci hardware, il sort de la ligne et un autre prend le relais automatiquement.
Malgré tout ça, la majorité des gens ne voient pas ce qui arrive. On scrolle, on passe, on se dit "c'est cool" et on oublie. Mais c'est pas "cool". C'est un changement de civilisation. Les tâches physiques répétitives vont être automatisées.
La robotique humanoïde c'est le sujet dont personne ne parle assez. On commence à peine à parler d'IA avec bien du retard, sauf qu'il faut comprendre que l'étape d'après c'est l'IA incarnée, cad, les robots.
We are still so early.
And sometimes we forget that we live in an AI ivory tower.
The majority don't use AI as intensively as we do.
(Source reddit h/t r/Terrible-Priority-21)
Heads up, agent users!
If you're using Xiaomi MiMo with thinking mode:
When thinking mode is enabled in a multi-turn agent session and the conversation history contains a tool call, any assistant message with tool calls passed back in subsequent user turns must preserve its full reasoning_content field — otherwise the API will return a 400 error.
Without it, the model's context is incomplete, which can lead to weaker instruction-following, more hallucinations, and a visibly degraded user experience.
Missing reasoning = incomplete context = degraded reasoning quality.
Affected frameworks include TRAE, Cursor, Roo Code, Codex, GitHub Copilot CLI, Zed, AutoGen. We're actively working with the maintainers to push compatibility updates.
Affected models: MiMo-V2.5-Pro, MiMo-V2.5, MiMo-V2-Pro, MiMo-V2-Omni, MiMo-V2-Flash.
See docs(https://t.co/9v7oybaW26 )for more details.
My new favorite workflow
=-=-=-=-=-=
Pre-requisites
1. Setup Warp (either clone or use the app)
2. Setup Vibeproxy
3. Use Droid
4. GPT-5.5
=-=-=-=-=-=
Workflow:
- left bar for project management
- files + git view
- prooompt GPT-5.5
- Let Droid do your work