Chau Tran

@mr_cheu

Building AI @glean. Past: Machine Translation at FAIR, ML @Quora

New York, USA

Joined April 2009

984 Following

1.1K Followers

1.1K Posts

Chau Tran

@mr_cheu

2 days ago

Is anyone else seeing the new Claude models (Sonnet 5, Opus 4.7, 4.8) tokenize into ~60% more tokens than GPT models for the same input prompt? So per-token pricing for Claude has to be multiplied by ~1.6 before comparing with GPT. This is just counting input tokens, not output.

221
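A minimal sketch of the arithmetic in the post above, assuming the ~1.6x input-token ratio holds for your prompts. The function names and the example price are hypothetical placeholders, not published rates; measure the ratio on your own traffic before relying on it.

```python
def effective_price_per_million(list_price: float, token_ratio: float = 1.6) -> float:
    """Scale a per-million-token list price by a tokenizer ratio.

    token_ratio is (tokens produced by tokenizer A) / (tokens produced by
    the baseline tokenizer) for the same text. The 1.6 default is the rough
    figure from the post, not a measured constant.
    """
    if token_ratio <= 0:
        raise ValueError("token_ratio must be positive")
    return list_price * token_ratio


def input_cost(list_price: float, baseline_tokens: int, token_ratio: float = 1.0) -> float:
    """Dollar cost of one prompt, given its token count under the baseline tokenizer."""
    billed_tokens = baseline_tokens * token_ratio
    return billed_tokens / 1_000_000 * list_price


# Hypothetical list price of $3.00 per million input tokens.
adjusted = effective_price_per_million(3.00, token_ratio=1.6)
print(f"baseline-equivalent price: ${adjusted:.2f} per million tokens")
```

The adjusted figure is the one to put next to the other provider's list price, since both then refer to the same amount of input text.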

Chau Tran

@mr_cheu

3 days ago

@mweinbach GLM 5.2 inference can be much cheaper (than pay-per-token price) when done on dedicated compute

403

Chau Tran

@mr_cheu

4 days ago

@jietang Image understanding

601

Chau Tran

@mr_cheu

8 days ago

Ok so they CAN look at ZDR requests

Chau Tran

@mr_cheu

8 days ago

how do they differentiate between normal requests and distillation requests, assuming they can't "look" at the requests with ZDR enabled?

Chubby♨️

@kimmonismus

9 days ago

Anthropic claims: Alibaba continues to distill Claude on a large scale to train Qwen. Via Bloomberg. Anthropic is accusing Alibaba-linked operators of running a massive campaign to illicitly access Claude through nearly 25,000 fraudulent accounts. According to Bloomberg, Anthropic claims the campaign generated 28.8 million Claude exchanges between April and June, targeting capabilities like software engineering and agentic reasoning. The company says this is part of a broader pattern of “adversarial distillation,” where Chinese labs allegedly harvest outputs from US frontier models to train rival systems at a fraction of the cost. Let's see how good Qwen 3.8 will be, probably FABLEous good.

240

173

448

604K

239

Chau Tran

@mr_cheu

9 days ago

The new prompting meta of Jun 2026: "We got this bug report [..]. Send a test request to dev server to repro. Retrieve production logs to diagnose. Read our code base and implement a fix. Deploy your fix to dev server. Send a test request again and iterate until fixed"

160

Chau Tran

@mr_cheu

9 days ago

With all due respect, Slack bot is not a newly invented concept

127

mr_cheu retweeted

Tony Gentilcore

@tonygentilcore

9 days ago

https://t.co/4z4aAS9VKl

14K

Chau Tran

@mr_cheu

11 days ago

@tuhinone Very nice speed. Is it true that context window is just 131k tokens though?

246

Chau Tran

@mr_cheu

13 days ago

It is so refreshing to see full unsummarized thinking trace again. Makes it much easier to debug context/prompting issues #GLM52

111

Chau Tran

@mr_cheu

14 days ago

GLM-5.2 is apparently less censored than some other Chinese models. It doesn't reject questions about Tiananmen Square 1989

112

Chau Tran

@mr_cheu

22 days ago

If an LLM can recursively self-improve inside a frontier lab, can it do the same outside one? If so, releasing an RSI-capable model would effectively destroy the lab’s own moat

Chau Tran

@mr_cheu

28 days ago

@suchenzang sadly there's no verifiable reward for writing quality. even high quality human preference feedback is hard to get?

665

Chau Tran

@mr_cheu

about 2 months ago

Are we just going to accept that Claude Code now sometimes fails to do basic things like making a file edit? Started happening with Opus 4.7

138

Chau Tran

@mr_cheu

2 months ago

@hwchase17 @nikunj Why can't an LLM be the adapter between arbitrary memory formats? Tell System A: "Give me everything you know about me in a markdown file." Tell System B: "[upload the markdown file] This markdown file contains everything System A knows about me, import it."
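The two-step handoff in the post above could be sketched roughly as follows. This is an illustration only: `ChatFn`, `transfer_memory`, and the prompt constants are hypothetical names, and each system is modeled as a plain function from prompt text to reply text rather than any real product API.

```python
from typing import Callable

# Hypothetical stand-in for "send a message to an assistant, get its reply".
ChatFn = Callable[[str], str]

EXPORT_PROMPT = "Give me everything you know about me in a markdown file."
IMPORT_PROMPT = (
    "This markdown file contains everything System A knows about me, "
    "import it:\n\n{memory}"
)


def transfer_memory(system_a: ChatFn, system_b: ChatFn) -> str:
    """Ask System A to export its memory as markdown, then hand it to System B."""
    memory_markdown = system_a(EXPORT_PROMPT)
    return system_b(IMPORT_PROMPT.format(memory=memory_markdown))


# Usage with fake systems, just to show the data flow.
def fake_system_a(prompt: str) -> str:
    return "# About the user\n- prefers concise answers"


def fake_system_b(prompt: str) -> str:
    return "imported" if "prefers concise answers" in prompt else "nothing to import"


print(transfer_memory(fake_system_a, fake_system_b))
```

The markdown text is the only interface between the two systems, which is the point of the post: neither side needs to know the other's storage format.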

Chau Tran

@mr_cheu

4 months ago

What's the best open-source option for code mode tool execution that's lightweight and supports both MCP and non-MCP tools? Don't want dependencies on any agent/harness SDK