Henry Wang

Verified account

@Henry_Flowgpt

Co-founder FlowGPT & Emochi & Kaon Lab| ex-AMZN building an AI-native content platform with AI-native company

San Francisco

Joined October 2021

5.4K Following

4.2K Followers

380 Posts

Pinned Tweet

5 months ago

We (Emochi, with 10M+ users) stopped using benchmark scores for production decisions about 6 months ago. The pattern was consistent: models ranking top on benchmarks underperformed in real usage. Models ranked lower drove better retention. A few notes on what we learned:

5

7

0

0

787

3 days ago

Some recent thoughts on company efficiency https://t.co/o6OoJhdPih

0

0

0

0

36

about 2 months ago

Two things worth putting time developing agent in production 1. Providing accurate context 2. Writing good test cases

0

0

0

0

73

about 2 months ago

check this

about 2 months ago

Open-sourced Claude Code Web. For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI. Runs with local Claude Code CLI or on a remote server via SSH tunnel. Most importantly: you can change it into any skin you like.

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

2

1

1

2

7K

1

11

0

3

6K

Who to follow

SimWorld (Paul)

My name is Paul... I am eternally curious, especially about computers and AI, so here I am. I love composing Space Ambient music and also enjoy photography...

レティシアAI

@srZJZyW7JmfgjZN

BOOTH DLsiteでも販売中。

Lover of books, History, Scifi, Anime Realising my Dreams through AI ART

Henry_Flowgpt retweeted

about 2 months ago

Open-sourced Claude Code Web. For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI. Runs with local Claude Code CLI or on a remote server via SSH tunnel. Most importantly: you can change it into any skin you like.

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

2

1

1

2

7K

about 2 months ago

https://t.co/hLkrgpw3db

0

1

0

2

65

about 2 months ago

Open-sourced Claude Code Web. For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI. Runs with local Claude Code CLI or on a remote server via SSH tunnel. Most importantly: you can change it into any skin you like.

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

Henry_Flowgpt's tweet photo. Open-sourced Claude Code Web.

For people who can’t run Claude Code Client locally, or prefer a browser workspace over the TUI.

Runs with local Claude Code CLI or on a remote server via SSH tunnel.

Most importantly: you can change it into any skin you like. https://t.co/PTUPCoJLTL

2

1

1

2

7K

about 2 months ago

https://t.co/K0oRG5jzlc internal free site leaked?

1

10

2

5

384

3 months ago

https://t.co/Sh0wse2JTY

0

0

0

0

52

3 months ago

Small tool that I built to help me give better interview: input: JD + resume - output: questions for each round input: transcript + JD - output: hiring suggestion with detailed reasons (so far 80%+ aligned with my decision)

1

0

0

0

73

3 months ago

@omooretweets lol agree

0

0

0

0

27

4 months ago

@RaVercettipmut Will be soon

0

0

0

0

38

5 months ago

We (Emochi, with 10M+ users) stopped using benchmark scores for production decisions about 6 months ago. The pattern was consistent: models ranking top on benchmarks underperformed in real usage. Models ranked lower drove better retention. A few notes on what we learned:

5

7

0

0

787

5 months ago

One takeaway for 2026: Eval infrastructure will determine product ceiling more than model selection. The question isn't "which model is best." It's "which system learns fastest from real users." Wrote up the full framework: https://t.co/RQXkDotJUM

0

2

0

0

217

5 months ago

We rebuilt the pipeline on three components: 1. Elo/TrueSkill on real conversations — filters out bad models in hours 2. Conversation-level A/B — not user-level (key distinction below) 3. Reward models trained on behavioral signals — closes the loop into training

0

1

0

0

173

5 months ago

Core issue: benchmarks optimize for "correct responses" on discrete tasks. Consumer AI optimizes for "experiences that feel right" across continuous interactions. These objectives aren't aligned. At scale, they actively conflict.

0

1

0

0

157

5 months ago

@venturetwins My lesson learned from building Emochi is that the key is not only the model itself but the closed loop eval-feedback-iteration infrastructure. Happy to share more thoughts

0

1

0

0

61

Henry_Flowgpt retweeted

9 months ago

Interviews are broken. resumes mislead. We helped 100k+ people land jobs & scaled to $10M ARR. now we’re rebuilding hiring from scratch. meet WorkTrial AI — where companies see the real work before they hire.

198

521

172

170

654K

11 months ago

If your product relies on user-facing prompt features, this post is for you. I’m sharing hard-won lessons from 10,000+ prompt iterations across complex, structure-sensitive workflows where every percentage point of success rate mattered. https://t.co/EiiDdLhHLE

10

45

9

16

39K

Henry_Flowgpt retweeted

11 months ago

Sharing my experience to help people write better production-ready prompts - check out "The First Production-Ready Prompts Guide" https://t.co/EiiDdLhHLE via @LinkedIn

0

1

1

0

323

11 months ago

Sharing my experience to help people write better production-ready prompts - check out "The First Production-Ready Prompts Guide" https://t.co/EiiDdLhHLE via @LinkedIn

0

1

1

0

323

Last Seen Users on Sotwe

Trends for you

Most Popular Users