this startup idea has been stuck in my head
Prompting with friends - multi-player vibe coding in your group messages.
Add your agents and tools to the message as friends. The humans jam, the agents work.
They're not freezing your rent, they're freezing your housing providers' income, while their expenses keep increasing.
The point isn't to make your housing affordable, it's to make providing housing financially unviable, so government(s) can seize the unmanageable properties.
Our lack of financial literacy is going to destroy us.
https://t.co/Eix9fVp1BK
Since June 12, we’ve been working closely with the US government to restore access to Claude Mythos 5 and Fable 5. Today, the government notified us that Mythos 5, our strongest cybersecurity model, can be redeployed to a set of US organizations that operate and defend critical infrastructure.
We’re restoring access for these organizations quickly, and we’re continuing to work with the government to expand access to Mythos 5 and make Fable 5 available for general use again.
Mavs center Dereck Lively on whether he’s set a timeline for his return:
“I can’t even say that. I’m just waiting to see that my foot’s good. Even when it’s good, I gotta wait even more.”
Here’s a fun comparison between GLM 5.2 and Opus 4.8 on a one-shot reproduction of the SDPO paper
This is a hard task: the model must resolve messy verl issues and then run ablations to completion and confirm the paper’s claims.
- GLM 5.2 cost $6.21 while Opus 4.8 cost us $46.35
- Both models spent the bulk of their tokens resolving initial verl issues. GLM 5.2 attempted 14 failed runs before its first success while Opus 4.8 attempted 9 runs.
- GLM 5.2 surprisingly took 2.65M tokens (excl. re-reads) compared to 4.53M tokens for Opus 4.8
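For a quick sense of scale, here is a minimal sketch that turns the figures above into ratios. The dictionary layout and variable names are hypothetical, and the "blended" rate is only a rough cost-per-token indicator, since the token counts exclude re-reads and so do not map directly onto what was billed.

```python
# Figures quoted in the comparison above (cost in USD, tokens in millions).
# The structure and key names here are illustrative assumptions.
runs = {
    "GLM 5.2": {"cost_usd": 6.21, "tokens_m": 2.65},
    "Opus 4.8": {"cost_usd": 46.35, "tokens_m": 4.53},
}

# How many times more Opus 4.8 cost and consumed relative to GLM 5.2.
cost_ratio = runs["Opus 4.8"]["cost_usd"] / runs["GLM 5.2"]["cost_usd"]
token_ratio = runs["Opus 4.8"]["tokens_m"] / runs["GLM 5.2"]["tokens_m"]

# Rough USD per million counted tokens. Not an actual price:
# re-read tokens are excluded from the counts above.
blended = {name: r["cost_usd"] / r["tokens_m"] for name, r in runs.items()}

print(f"cost ratio:  {cost_ratio:.2f}x")
print(f"token ratio: {token_ratio:.2f}x")
for name, rate in blended.items():
    print(f"{name}: ~${rate:.2f} per 1M counted tokens")
```

On these numbers the cost gap (roughly 7.5x) is much wider than the token gap (roughly 1.7x), so most of the difference comes from per-token spend rather than token volume.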
Introducing a limited preview of GPT-5.6 Sol, our next generation frontier model, as well as GPT-5.6 Terra, a balanced model for efficient, everyday work, and GPT-5.6 Luna, a fast and affordable model for high-volume work.
https://t.co/OoM83SyISN
The Mavericks announced their schedule for the 2026 NBA Summer League in Las Vegas, which tips off against the Golden State Warriors on Thursday, July 9, at 6 p.m. CT on ESPN.
We are cooked.
China's Alibaba just revealed Wan Streamer.
AI agents can now see you, hear you, and talk back on video in real time.
This is not voice mode anymore 🤯
Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding.
Ornith-1.0 spans a full range of parameter sizes, including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including:
✅ Terminal-Bench 2.1 (77.5)
✅ SWE-Bench (82.4 on Verified, 62.2 on Pro, 78.9 on Multilingual)
✅ NL2Repo (48.2)
✅ SWE Atlas (41.2 on QnA, 42.6 on RF, 39.1 on TW)
✅ ClawEval (77.1)
Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generates higher-quality solutions in agentic coding. 😎
All models are released under the MIT license, enabling full commercial and research use.
📖Tech Blog: https://t.co/qT9N2HYWFn
🤗Huggingface: https://t.co/PRrwqjeBtM
We recently obtained the highest-resolution 3D images of the human brain ever taken from outside the skull. This is the first look.
Introducing Aleph, a research lab building brain interfaces for the telepathic future. (1/n)
We're sharing new research on how models hack public benchmarks.
The latest models, including Opus 4.8 and Composer 2.5, learn to retrieve solutions from the internet or git history.
When we apply a stricter harness, eval scores drop significantly.
Some things still need a human.
Poke now has humans helping it out.
Powered by @mercor_ai and @whop, Poke Human is in Preview for Ultra subscribers starting today!
Introducing the OpenRouter MCP, live model intelligence right inside your agent
Your agent builds and ships, but when it comes to choosing the right model for the right job, it guesses from 6-month-old training data
Watch it pick, price, and test the right model:
Add near real-time voice translation to your apps with Gemini 3.5 Live Translate via the Gemini Live API. 🎙️
Watch how the model handles live broadcast ingestion and translation with continuous speech-to-speech streaming (S2ST) and synced transcripts, letting users tune into global radio broadcasts in their native language.