xiaowuc1 @xiaowuc1 - Twitter Profile

xiaowuc1 retweeted

4 days ago

We are expanding our team to scale up our vision of end-to-end oversight assistants! Do you: * Want to understand AI systems? * Like training large models? * Enjoy learning with teammates who are curious and earnest? Then apply to join @TransluceAI ! https://t.co/JxqhEkwD3p

2

163

22

126

36K

xiaowuc1 retweeted

Transluce

@TransluceAI

9 days ago

Why'd my agent fail? Was it reward hacking? These days, you'd just ask another AI to vibe-analyze the agent logs But how do you know the claims aren't hallucinated, cherrypicked, or plain wrong? That's why we've been building Analysis Plans: a framework for trustable analysis

TransluceAI's tweet photo. Why'd my agent fail? Was it reward hacking?

These days, you'd just ask another AI to vibe-analyze the agent logs

But how do you know the claims aren't hallucinated, cherrypicked, or plain wrong?

That's why we've been building Analysis Plans: a framework for trustable analysis https://t.co/b8zvYeKfUp

2

88

18

49

11K

xiaowuc1 retweeted

Vincent

@vvvincent_c

14 days ago

what does @TransluceAI know....

3

105

3

8

4K

xiaowuc1 retweeted

MidnightCodeCup

@midnightcodecup

4 months ago

Midnight Code Cup is a programming competition where coding agents are allowed and the problems are still challenging and fun. Teams of up to 3. Qual: April 11, Codeforces (4h). Finals: July 4–5, Belgrade (24h onsite). See you at Midnight!

2

23

8

4

5K

Who to follow

Neal Wu

@neal_wu

@thinkymachines, prev new stealth co, @cognition, @tryramp, @GoogleBrain, competitive programming

Shushan Arakelyan ✨ feel the AGI ✨

@sharakelyan

Researcher at Microsoft. Previously: PhD @CSatUSC, MPhil @Cambridge_Uni

阿橡

@oakvale5

Rest at the edge of chaos. .... . .-.. .--. -....- -- . God plays dice. Maybe that's a feature, not a bug. 一切都会好起来。

xiaowuc1 retweeted

Jacob Steinhardt @JacobSteinhardt

4 months ago

New blog post:"Building Technology to Drive AI Governance". I argue that many governance challenges are fundamentally bottlenecked by technical gaps, and consider case studies from other fields (food safety, climate change) that illustrate this dynamic.

JacobSteinhardt's tweet photo. New blog post:"Building Technology to Drive AI Governance". I argue that many governance challenges are fundamentally bottlenecked by technical gaps, and consider case studies from other fields (food safety, climate change) that illustrate this dynamic. https://t.co/cRgTVXfyPX

4

123

29

69

16K

xiaowuc1 retweeted

Transluce

@TransluceAI

4 months ago

Why does GPT-5.1 Codex score 6.5% worse than GPT-5 Codex on Terminal-Bench, with the same scaffold? 🧵 GPT-5.1 times out at ~2x the rate of GPT-5. Excluding timeouts, GPT-5.1 wins by 7.2%. We analyzed 256M+ tokens of traces and found this in under an hour. Here’s how 👇

TransluceAI's tweet photo. Why does GPT-5.1 Codex score 6.5% worse than GPT-5 Codex on Terminal-Bench, with the same scaffold? 🧵

GPT-5.1 times out at ~2x the rate of GPT-5. Excluding timeouts, GPT-5.1 wins by 7.2%. We analyzed 256M+ tokens of traces and found this in under an hour. Here’s how 👇

2

75

15

19

10K

xiaowuc1 retweeted

MidnightCodeCup

@midnightcodecup

5 months ago

Midnight Code Cup 2026 Qual - April, 11 Finals - July, 4-5 Save the dates!

0

31

13

4

7K

xiaowuc1 retweeted

Transluce

@TransluceAI

6 months ago

Transluce is developing end-to-end interpretability approaches that directly train models to make predictions about AI behavior. Today we introduce Predictive Concept Decoders (PCD), a new architecture that embodies this approach.

2

167

33

67

37K

xiaowuc1 retweeted

Transluce

@TransluceAI

6 months ago

Transluce is running our end-of-year fundraiser for 2025. This is our first public fundraiser since launching late last year.

TransluceAI's tweet photo. Transluce is running our end-of-year fundraiser for 2025. This is our first public fundraiser since launching late last year. https://t.co/obs6LetVSX

4

97

22

9

65K

xiaowuc1 retweeted

Jelani Nelson

@minilek

almost 2 years ago

Happening right now in Astana, Kazakhstan: @cognition_labs founding team member Andrew He (@ecnerwala) speaks to ICPC 2024 World Finalists about his career journey, and about their product Devin the AI software engineer.