Damien Hou

@houyr

code monkey

Cupertino, CA

Joined June 2007

612 Following

460 Followers

3.9K Posts

houyr retweeted

Peter Steinberger 🦞

@steipete

4 months ago

@Yuchenj_UW I don’t let Claude Code on my codebase. It’s all codex. Would be too buggy with Opus.

253

173

861

houyr retweeted

Ian Tracey

@ian_dot_so

5 months ago

https://t.co/6aIhogx6xt

286

houyr retweeted

clem 🤗

@ClementDelangue

8 months ago

First they ignore you, then they laugh at you, then they fight you, then you win!

750

160

166K

houyr retweeted

Garry Tan

@garrytan

9 months ago

Chesa Boudin’s parents were domestic terrorists: The Weather Underground They got tenured professorships after the violence Chesa is now a UC Berkeley Law Prof now after making SF unsafe for Asian elders as failed DA The corruption of the academy has been a 50+ year norm

108

945

474

549K

Who to follow

Indie Hacker. AI Enthusiast. Maker of 🤖GEOfast Maker of 🦞📱https://t.co/4X6Aq2eIPF (OpenClaw Mobile Dashboard) Build in Public🔮AI实战派🔧产品出海🤑 OpenClaw疯狂养虾中🦞

Quiz

@Quiz

#Prompt_engineer #ZKP_master #web3_guru #ETH_miner #crypto_evangelist #optimistic_hacker #serial_entrepreneur #comic_fan #honeybadger_fanatic

houyr retweeted

Eylon Levy

@EylonALevy

9 months ago

When people stop talking, bad things happen. Rest in peace, @charliekirk11

384

91K

13K

14K

Damien Hou @houyr

9 months ago

Hot take - the best coding agent for your side quest is the free one. Never burn your own $$$ to fund VC FOMO.

Damien Hou @houyr

over 1 year ago

the final flex. before R2 in a few weeks.

DeepSeek

@deepseek_ai

over 1 year ago

🚀 Day 6 of #OpenSourceWeek: One More Thing – DeepSeek-V3/R1 Inference System Overview Optimized throughput and latency via: 🔧 Cross-node EP-powered batch scaling 🔄 Computation-communication overlap ⚖️ Load balancing Statistics of DeepSeek's Online Service: ⚡ 73.7k/14.8k input/output tokens per second per H800 node 🚀 Cost profit margin 545% 💡 We hope this week's insights offer value to the community and contribute to our shared AGI goals. 📖 Deep Dive: https://t.co/x1rt3mCxF5

776

houyr retweeted

Thomas Wolf

@Thom_Wolf

over 1 year ago

Finally took time to go over Dario's essay on DeepSeek and export control and to be honest it was quite painful to read. And I say this as a great admirer of Anthropic and big user of Claude* The first half of the essay reads like a lengthy attempt to justify that closed-source models are still significantly ahead of DeepSeek. However, it mostly refers to internal unpublished evals which limit the credit you can give it, and statements like « DeepSeek-V3 is close to SOTA models and stronger on some very narrow tasks » transforming in a general conclusion « DeepSeek-V3 is actually worse than those US frontier models — let’s say by ~2x on the scaling curve » left me generally doubtful. The same applies to the takeaway that all discoveries and efficiency improvements of DeepSeek have been discovered long ago by closed-models companies, this statement mostly resulting from a comparison of DeepSeek openly published $6M training numbers with some vague « few $10M » on Anthropic side without providing much more details. I have no doubts the Anthropic team is extremely talented and I’ve regularly shared how impressed I am with Sonnet 3.5 but this longwinded comparison of open research with vague closed research and undisclosed evals has left me less convinced of their lead than I was before I reading it. Even more frustrating was the second half of the essay which dive into the US-China race scenario and totally misses the point that the DeepSeek model is open-weights, and largely open-knowledge due to its detailed tech report (and feel free to follow Hugging Face’s open-r1 reproduction project for the remaining non-public part: the synthetic dataset). If both DeepSeek and Anthropic models had been closed source, yes the arm-race interpretation could have make sense but having one of the model freely widely available for download and with detailed scientific report renders the whole « close-source arm-race competition » argument artificial and unconvincing in my opinion. Here is the thing: open-source knows no border. Both in its usage and its creation. Every company in the world, be it in Europe, Africa, South-America or the USA can now directly download and use DeepSeek without sending data to a specific country (China for instance) or depending on a specific company or server for running the core part of its technology. And just like most open-source library in the world are typically built by contributors from all over the world, we’ve already seen several hundred derivative models on the Hugging Face hub created everywhere in the world by teams adapting the original model to their specific use cases and explorations. What's more, with the open-r1 reproduction and the DeepSeek paper, the coming months will clearly see many open-source reasoning models being released by teams from all over the world. Just today, two other teams, AllenAI in Seattle and Mistral in Paris both independently released open-source base models (Tülu and Small3) which are already challenging the new state-of-the-art (with AllenAI indicating that its Tülu model surpasses the performance of DeepSeek-V3). And the scope is even much broader than this geographical aspect. Here is the thing we don’t talk nearly enough about: open-source will be more and more essential for our… safety! As AI becomes central to our lives, resiliency will increasingly become a very important element of this technology. Today we’re dependent on internet access for almost everything. Without access to the internet, we lose all our social media/news feeds, can’t order a taxi, book a restaurant, or reach someone on WhatsApp. Now imagine an alternate world to ours where all the data transiting through the internet would have to go through a single company’s data centers. The day this company suffers a single outage, the whole world would basically stop spinning (picture the recent CrowdStrike outage magnified a millionfold). Soon, as AI assistants and AI technology permeate our whole life to simplify many of our online and offline tasks, we (and companies using AI) will start to depend more on more on this technology for our daily activities and we will similarly start to find annoying or even painful any downtime in these AI assistants from outages. The most optimal way to avoid future downtime situations will be to build resilience deep in our technological chain. Open-source has many advantages like shared training costs, tunability, control, ownership, privacy but one of its most fundamental virtue in the long term –as AI becomes deeply embedded in our world– will likely be its strong resilience. It is one of the most straightforward and cost-effective ways to easily distribute compute across many independent providers and to even run models locally and on device with minimal complexity. More than national prides and competitions, I think it’s time to start thinking globally about the challenges and social changes that AI will bring everywhere in the world. And open-source technology is likely our most important asset for safely transitioning to a resilient digital future where AI is integrated into all aspects of society. *Claude is my default LLM for complex coding. I also love its character with hesitations and pondering, like a prelude to the chain-of-thoughts of more recent reasoning models like DeepSeek generations.

109

480

394K

Damien Hou @houyr

over 1 year ago

@RnaudBertrand I’m stunned by all this lack of critical thinking, intellectual honesty and the absurd level of groupthink from our most respected leaders in this space.

Damien Hou @houyr

over 1 year ago

Meanwhile in China

Unitree

@UnitreeRobotics

over 1 year ago

Unitree H1: Humanoid Robot Makes Its Debut at the Spring Festival Gala 🥰 Hello everyone, let me introduce myself again. I am Unitree H1 "Fuxi". I am now a comedian at the Spring Festival Gala, hoping to bring joy to everyone. Let’s push boundaries every day and shape the future together. #Unitree #HumanoidRobot #AI #Dance #SpringFestivalGalaRobot #SpringFestivalGala

342

973

743

houyr retweeted

Andrej Karpathy

@karpathy

over 1 year ago

"Move 37" is the word-of-day - it's when an AI, trained via the trial-and-error process of reinforcement learning, discovers actions that are new, surprising, and secretly brilliant even to expert humans. It is a magical, just slightly unnerving, emergent phenomenon only achievable by large-scale reinforcement learning. You can't get there by expert imitation. It's when AlphaGo played move 37 in Game 2 against Lee Sedol, a weird move that was estimated to only have 1 in 10,000 chance to be played by a human, but one that was creative and brilliant in retrospect, leading to a win in that game. We've seen Move 37 in a closed, game-like environment like Go, but with the latest crop of "thinking" LLM models (e.g. OpenAI-o1, DeepSeek-R1, Gemini 2.0 Flash Thinking), we are seeing the first very early glimmers of things like it in open world domains. The models discover, in the process of trying to solve many diverse math/code/etc. problems, strategies that resemble the internal monologue of humans, which are very hard (/impossible) to directly program into the models. I call these "cognitive strategies" - things like approaching a problem from different angles, trying out different ideas, finding analogies, backtracking, re-examining, etc. Weird as it sounds, it's plausible that LLMs can discover better ways of thinking, of solving problems, of connecting ideas across disciplines, and do so in a way we will find surprising, puzzling, but creative and brilliant in retrospect. It could get plenty weirder too - it's plausible (even likely, if it's done well) that the optimization invents its own language that is inscrutable to us, but that is more efficient or effective at problem solving. The weirdness of reinforcement learning is in principle unbounded. I don't think we've seen equivalents of Move 37 yet. I don't know what it will look like. I think we're still quite early and that there is a lot of work ahead, both engineering and research. But the technology feels on track to find them. https://t.co/JCxTdKpuzv

432

houyr retweeted

Andrej Karpathy

@karpathy

over 1 year ago

I don't have too too much to add on top of this earlier post on V3 and I think it applies to R1 too (which is the more recent, thinking equivalent). I will say that Deep Learning has a legendary ravenous appetite for compute, like no other algorithm that has ever been developed in AI. You may not always be utilizing it fully but I would never bet against compute as the upper bound for achievable intelligence in the long run. Not just for an individual final training run, but also for the entire innovation / experimentation engine that silently underlies all the algorithmic innovations. Data has historically been seen as a separate category from compute, but even data is downstream of compute to a large extent - you can spend compute to create data. Tons of it. You've heard this called synthetic data generation, but less obviously, there is a very deep connection (equivalence even) between "synthetic data generation" and "reinforcement learning". In the trial-and-error learning process in RL, the "trial" is model generating (synthetic) data, which it then learns from based on the "error" (/reward). Conversely, when you generate synthetic data and then rank or filter it in any way, your filter is straight up equivalent to a 0-1 advantage function - congrats you're doing crappy RL. Last thought. Not sure if this is obvious. There are two major types of learning, in both children and in deep learning. There is 1) imitation learning (watch and repeat, i.e. pretraining, supervised finetuning), and 2) trial-and-error learning (reinforcement learning). My favorite simple example is AlphaGo - 1) is learning by imitating expert players, 2) is reinforcement learning to win the game. Almost every single shocking result of deep learning, and the source of all *magic* is always 2. 2 is significantly significantly more powerful. 2 is what surprises you. 2 is when the paddle learns to hit the ball behind the blocks in Breakout. 2 is when AlphaGo beats even Lee Sedol. And 2 is the "aha moment" when the DeepSeek (or o1 etc.) discovers that it works well to re-evaluate your assumptions, backtrack, try something else, etc. It's the solving strategies you see this model use in its chain of thought. It's how it goes back and forth thinking to itself. These thoughts are *emergent* (!!!) and this is actually seriously incredible, impressive and new (as in publicly available and documented etc.). The model could never learn this with 1 (by imitation), because the cognition of the model and the cognition of the human labeler is different. The human would never know to correctly annotate these kinds of solving strategies and what they should even look like. They have to be discovered during reinforcement learning as empirically and statistically useful towards a final outcome. (Last last thought/reference this time for real is that RL is powerful but RLHF is not. RLHF is not RL. I have a separate rant on that in an earlier tweet https://t.co/RMIpFPVpuM)

360

14K

houyr retweeted

Amjad Masad

@amasad

over 1 year ago

This tweet is talking to Wallstreet

126

366

200K

Damien Hou @houyr

over 1 year ago

Basically X in the past week

signüll

@signulll

over 1 year ago

🤣

signulll's tweet photo. 🤣 https://t.co/7CclzdhZg4

187

280

162K

180

houyr retweeted

kache

@yacineMTB

over 1 year ago

Americans are loading the model in pytorch, after spending 5 hours with python versioning while wenfeng et Al are doing this with hand written cuda code

yacineMTB's tweet photo. Americans are loading the model in pytorch, after spending 5 hours with python versioning

while wenfeng et Al are doing this with hand written cuda code https://t.co/2iaxL385xV

183

299K

houyr retweeted

Nathan Lambert

@natolambert

over 1 year ago

DeepSeek app sitting at number 1 overall in the US Iphone App Store is not on my bingo card and is the biggest sign yet that the ChatGPT moat can maybe be cracked.

natolambert's tweet photo. DeepSeek app sitting at number 1 overall in the US Iphone App Store is not on my bingo card and is the biggest sign yet that the ChatGPT moat can maybe be cracked. https://t.co/ffoqpk8Tlh

935

119

158K

houyr retweeted

clem 🤗

@ClementDelangue

over 1 year ago

Our science team has started working on fully reproducing and open-sourcing R1 including training data, training scripts,... Full power of open source AI so that everyone all over the world can take advantage of AI progress! Will help debunk some myths I’m sure too. Thanks @deepseek_ai!

ClementDelangue's tweet photo. Our science team has started working on fully reproducing and open-sourcing R1 including training data, training scripts,...

Full power of open source AI so that everyone all over the world can take advantage of AI progress! Will help debunk some myths I’m sure too.

Thanks @deepseek_ai!

118

507

347K

houyr retweeted

Nicholas Wilt @CUDAHandbook

over 1 year ago

CUDA’s software stack has a few distinct pillars that are triumphs of software engineering (let alone software architecture). The driver API was built in C, portable across both operating systems and CPU architectures. Across the 6 years I worked on CUDA, no one questioned 1/x

118

414K

Damien Hou @houyr

over 1 year ago

@elonmusk How to download the bill? feed it to notebooklm and make a podcast to roast it.

houyr retweeted

Garry Tan

@garrytan

over 1 year ago

This is California healing If the Dems want to fix themselves it’s going to be through actually supporting things like this

236

140

420K

Damien Hou

@houyr

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users