Dr. Muddy Bhatt

@DrMuddy

MD PhD Alum @UCL. Computational Neuroscientist. Trying to build and scale human-centric AI. CEO @ Digital Locke.

London

Joined December 2009

839 Following

283 Followers

242 Posts

Pinned Tweet

Dr. Muddy Bhatt @DrMuddy

almost 3 years ago

My thoughts on how we should fix DAOs so they actually work in practice https://t.co/DOk7Am6vcc

454

Dr. Muddy Bhatt @DrMuddy

3 months ago

@ellenewilkinson hi I think you lost your wallet in Liverpool outside the malmaison hotel - have handed it into the desk there for you to pick up

DrMuddy retweeted

Yulu Gan

@yule_gan

8 months ago

Reinforcement Learning (RL) has long been the dominant method for fine-tuning, powering many state-of-the-art LLMs. Methods like PPO and GRPO explore in action space. But can we instead explore directly in parameter space? YES we can. We propose a scalable framework for full-parameter fine-tuning using Evolution Strategies (ES). By skipping gradients and optimizing directly in parameter space, ES achieves more accurate, efficient, and stable fine-tuning. Paper: https://t.co/Es44ZqfcJ6 Code: https://t.co/eduztHwrLS

383

415K

DrMuddy retweeted

Jackson Atkins

@JacksonAtkinsX

8 months ago

My brain broke when I read this paper. A tiny 7 Million parameter model just beat DeepSeek-R1, Gemini 2.5 pro, and o3-mini at reasoning on both ARG-AGI 1 and ARC-AGI 2. It's called Tiny Recursive Model (TRM) from Samsung. How can a model 10,000x smaller be smarter? Here's how it works: 1. Draft an Initial Answer: Unlike an LLM that writes word-by-word, TRM first generates a quick, complete "draft" of the solution. Think of this as its first rough guess. 2. Create a "Scratchpad": It then creates a separate space for its internal thoughts, a latent reasoning "scratchpad." This is where the real magic happens. 3. Intensely Self-Critique: The model enters an intense inner loop. It compares its draft answer to the original problem and refines its reasoning on the scratchpad over and over (6 times in a row), asking itself, "Does my logic hold up? Where are the errors?" 4. Revise the Answer: After this focused "thinking," it uses the improved logic from its scratchpad to create a brand new, much better draft of the final answer. 5. Repeat until Confident: The entire process, draft, think, revise, is repeated up to 16 times. Each cycle pushes the model closer to a correct, logically sound solution. Why this matters: Business Leaders: This is what algorithmic advantage looks like. While competitors are paying massive inference costs for brute-force scale, a smarter, more efficient model can deliver superior performance for a tiny fraction of the cost. Researchers: This is a major validation for neuro-symbolic ideas. The model's ability to recursively "think" before "acting" demonstrates that architecture, not just scale, can be a primary driver of reasoning ability. Practitioners: SOTA reasoning is no longer gated behind billion-dollar GPU clusters. This paper provides a highly efficient, parameter-light blueprint for building specialized reasoners that can run anywhere. This isn't just scaling down; it's a completely different, more deliberate way of solving problems.

$JacksonAtkinsX's tweet photo. My brain broke when I read this paper. A tiny 7 Million parameter model just beat DeepSeek-R1, Gemini 2.5 pro, and o3-mini at reasoning on both ARG-AGI 1 and ARC-AGI 2. It's called Tiny Recursive Model (TRM) from Samsung. How can a model 10,000x smaller be smarter? Here's how it works: 1. Draft an Initial Answer: Unlike an LLM that writes word-by-word, TRM first generates a quick, complete "draft" of the solution. Think of this as its first rough guess. 2. Create a "Scratchpad": It then creates a separate space for its internal thoughts, a latent reasoning "scratchpad." This is where the real magic happens. 3. Intensely Self-Critique: The model enters an intense inner loop. It compares its draft answer to the original problem and refines its reasoning on the scratchpad over and over (6 times in a row), asking itself, "Does my logic hold up? Where are the errors?" 4. Revise the Answer: After this focused "thinking," it uses the improved logic from its scratchpad to create a brand new, much better draft of the final answer. 5. Repeat until Confident: The entire process, draft, think, revise, is repeated up to 16 times. Each cycle pushes the model closer to a correct, logically sound solution. Why this matters: Business Leaders: This is what algorithmic advantage looks like. While competitors are paying massive inference costs for brute-force scale, a smarter, more efficient model can deliver superior performance for a tiny fraction of the cost. Researchers: This is a major validation for neuro-symbolic ideas. The model's ability to recursively "think" before "acting" demonstrates that architecture, not just scale, can be a primary driver of reasoning ability. Practitioners: SOTA reasoning is no longer gated behind billion-dollar GPU clusters. This paper provides a highly efficient, parameter-light blueprint for building specialized reasoners that can run anywhere. This isn't just scaling down; it's a completely different, more deliberate way of solving problems.$

341

12K

11K

Who to follow

IWAI

@iwai_ws

International Conference on Active Inference (IWAI)

Michelle Fay Cortez

@FayCortez

Senior editor for Global Business at Bloomberg News, now coming to you from DC! Tips & tricks: [email protected].

American_Stroke

@American_Stroke

The official account of the American Stroke Association, a division of @American_Heart. We're talking about how strokes are preventable, treatable and beatable!

Dr. Muddy Bhatt @DrMuddy

9 months ago

My 17 year old nephew saved my business 1000s of hours a year by showing me the tools he’s using to lazy/cheat his way through school (like @gammaapp). Meanwhile friends complain how Bain are charging $Ms to learn AI on their dime with ‘pilots’ that never ship. Go figure.

Dr. Muddy Bhatt @DrMuddy

about 2 years ago

@feelmuddy 👀

525

Dr. Muddy Bhatt @DrMuddy

about 2 years ago

@KatPaton13 Lots of people telling you don’t be too hasty etc… this was a non-issue for me because I KNEW I would never go back. If you’re in the same state of mind and you know what’s next - no need to delay. If not then agree finish up and then test waters as you train in parallel.

296

Dr. Muddy Bhatt @DrMuddy

over 2 years ago

@newscientist This is the definition of think smart for home automation. Absolutely brilliant 😄

DrMuddy retweeted

Will Manidis

@WillManidis

over 2 years ago

there are 18 year olds on r/localllama that are two years ahead of the field in understanding finetuning/deploying/scaling llms

617

262K

Dr. Muddy Bhatt @DrMuddy

over 2 years ago

@abhi_agarwal4 @Accel @fabrichq_ai Anyway you seem like a nice person and a strong founder. I’m sure you will raise and build a great start-up. Was just sharing my two cents on approach. Put me in the ‘I’ll prove him wrong’ bucket. I genuinely wish you the best of luck - we need strong founders like you building.

Dr. Muddy Bhatt @DrMuddy

over 2 years ago

@abhi_agarwal4 @Accel @fabrichq_ai As someone who might be DDing you for a fund I now know you’ve been rejected for a prestigious investment. That on the surface may mean nothing but it’s a data point I have that doesn’t exactly go in your favour…

Dr. Muddy Bhatt @DrMuddy

over 2 years ago

@abhi_agarwal4 @Accel @fabrichq_ai You shared the fact that you specifically didn’t get funded by them. Each to your own but if I was DDing you I wouldn’t take the risk. I’m all for building in public but this achieved very little other than antagonising those who rejected you

Dr. Muddy Bhatt @DrMuddy

over 2 years ago

@abhi_agarwal4 @Accel @fabrichq_ai Publicly sharing private VC communications is a bafflingly bad decision. Understand having conviction but this is grandstanding that doesn’t help you build better product or build trust from potential other investors who have seen you share private comms without permission.

209

Dr. Muddy Bhatt @DrMuddy

almost 3 years ago

@dannypostma @Cub3Loyalty

Dr. Muddy Bhatt @DrMuddy

almost 3 years ago

@kylegordonart @_buildspace @FarzaTV @isabella_orsi Freaking love it KG!

Dr. Muddy Bhatt @DrMuddy

almost 3 years ago

I wrote an article about DAOs, why they're broken and how to fix them, (DAOs-2.0) but its a 17 min read 😅. Do people read stuff that long on Medium? Whats the best place for me to publish? I thought about releasing an Arxiv paper but I dont think thats where this belongs...

261

Dr. Muddy Bhatt @DrMuddy

almost 3 years ago

My thoughts on how founders and investors should be thinking about how they can make impact in a post LLM world. In short: Focus on platform functionality and declarative UX. https://t.co/BOl86iS9mt

227

DrMuddy retweeted

CUB3 Loyalty @Cub3Loyalty

almost 3 years ago

Happy Friday Everyone. #TrackOfTheWeek is now live for its third week. Head to our Cub3 community and have your say as to who should win. https://t.co/zy9hHTeCFS #ProofOfBehavior