Is anyone else seeing the new Claude models (Sonnet 5, Opus 4.7, 4.8) tokenize into ~60% more tokens than GPT models for the same input prompt?
So per-token-pricing for Claude has to be multiplied by ~1.6 before comparing with GPT
This is just counting input tokens, not output
Anthropic claims: Alibaba continues to distill Claude on a large scale to train Qwen. Via Bloomberg
Anthropic is accusing Alibaba-linked operators of running a massive campaign to illicitly access Claude through nearly 25,000 fraudulent accounts.
According to Bloomberg, Anthropic claims the campaign generated 28.8 million Claude exchanges between April and June, targeting capabilities like software engineering and agentic reasoning.
The company says this is part of a broader pattern of “adversarial distillation,” where Chinese labs allegedly harvest outputs from US frontier models to train rival systems at a fraction of the cost.
Lets see how good Qwen 3.8 will be, probably FABLEous good.
The new prompting meta of Jun 2026:
"We got this bug report [..]. Send a test request to dev server to repro. Retrieve production logs to diagnose. Read our code base and implement a fix. Deploy your fix to dev server. Send a test request again and iterate until fixed”
If an LLM can recursively self-improve inside a frontier lab, can it do the same outside one? If so, releasing an RSI-capable model would effectively destroy the lab’s own moat
@hwchase17@nikunj Why can't LLM be the adapter between any arbitrary memory format?
Tell System A: Give me everything you know about me in a markdown file
Tell System B: [upload the markdown file] This markdown file contains everything System A knows about me, import it
What's the best opensource code mode tool execution that's lightweight, support both MCP and non-MCP tools? Don't want dependencies on any agents/harness sdk