Don Park

@donpark

Founder and Chief Wizzy Gearhead of WizOps (

Redwood City, CA

Joined March 2007

530 Following

1.4K Followers

26.9K Posts

Don Park @donpark

about 4 hours ago

Ended up aborting bc it kept stopping too often, like just after saying it's going to do this and that. Not sure if it's the model or the harness (pi agent). Asked it to prepare a handoff doc to continue with a more capable model.

Don Park @donpark

about 14 hours ago

AFAICT, Gemma 4 QAT models were trained with MatFormer, which quantizes the forward pass to simulate inference with quantized weights and trains around the resulting errors. MatFormer has been around for at least two years, since the rise of Matryoshka embeddings.

Don Park @donpark

about 12 hours ago

The AI developer community as a whole has a pretty good sense of what frontier models are capable of and what they lack, but not what the minimum is for these simple tasks requiring good reasoning and reliable tool use. I think we have a better picture now with Gemma 4 QATs.

Don Park @donpark

about 9 hours ago

it just learned: "Basically, I should treat GPT 5.4 as a tool for 'Knowledge Retrieval and Code Correction.' If I'm stumped or hit an error, I use that link to get the fix and then proceed autonomously." Moving in the right direction.

Don Park @donpark

about 11 hours ago

Just asked the Gemma 4 12B QAT model to tackle writing a long-running RLM-based agent that delegates only complex tasks to GPT 5.4, using the task itself as the test case. And off it went. No idea how it'll overcome its 256K context size, if at all.

https://t.co/ZEZJ4OopS1

Don Park @donpark

about 10 hours ago

I'm not expecting it to succeed. Instead, I want to see how it behaves when met with challenges. It should be seeking advice from GPT 5.4 after some struggle. 🤞

Don Park @donpark

about 11 hours ago

@james_clark Same. I know when my code is AI slop and when it isn't. The difference is whether I use AI as a tool or a self-driving car. The former takes attention and craftsmanship.

Don Park @donpark

2 days ago

that was a good test drive of pi agent. found the harness powerful and got a clear understanding of where leading SLMs are. quite capable but easily confused. They're not only smart enough to conduct an RLM loop but need it badly to stay focused.

Don Park @donpark

3 days ago

Trying pi agent. Starts light then adds new capabilities as needed. Starting bare on local Qwen3.6, it was able to analyze an app's LocateAnything integration. Then when I asked for web browsing capability, pi noticed I had agent-browser installed and integrated that. Nice.

Don Park @donpark

2 days ago

ofc, using an LLM for that is quite wasteful so I should've asked it to create a syntax checker tool instead. And it did, although I had to slap it many times as it kept getting confused by the context. I'll use a project dedicated to extending pi agent next time.

Don Park @donpark

2 days ago

This made me want to build a harness to livestream SLM terminal sessions that lets people watch it tackle challenges and steer it when it gets into trouble. I think robotic challenges would be more exciting than coding challenges. True mob at work.

Don Park @donpark

3 days ago

I'm still not sure how far SLMs can go but the entertainment value of watching a model hosted entirely on my mac is certainly there.
