ULTRACODE-SHIM IS NOW LIVE π₯
You can now run ANY model in UltraCode
I built a github repo to make this really easy for you, Just send your agent there and let him COOK
You deserve the flexibility to use LOCAL models & cost efficient models. So I made that happen for you π«Ά
@TheAhmadOsman i have a single 5090 and a 4090 & I feel like it works really well for me right now. I'd really like to get more optimization support for them tho, they have alot more power then i can squeeze out
Here's some benchmarks to see how the new Gemma 4 12b stacks up. Ran on my Linux Pc with a RTX 4090
Great at coding, outstanding actually
Stuggles with toolcalls
Solid speed
Not the best with following instructions
(the 31b model in the comparison testis gemma opus finetune)
Meet Gemma 4 12B!
A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.
Bridging the gap between edge efficiency and advanced reasoning. Here is whatβs new with Gemma 4 12B: π
@jackowhf mimo but like its so big can u count that since we cant run it local haha, ur best bet for re is get the $6 mimo plan and the $20 minimax plan and have mimi v2.5pro orchistrate and use a ton of minimax m3 subagents. ud get a ton of usage for that and it would serve u well
Here's some benchmarks to see how the new Gemma 4 12b stacks up. Ran on my Linux Pc with a RTX 4090
Great at coding, outstanding actually
Stuggles with toolcalls
Solid speed
Not the best with following instructions
(the 31b model in the comparison testis gemma opus finetune)
Caduceus now supports auto local routing
What this means, caduceus understands what local models you use & will automatically choose the best local models for the job
Models will easy unload & swap in vram when needed on demand
Try it today <3
Just hit 50k followers! π₯Ή
Wanted to say, I've been here for awhile & I couldn't be more grateful to be apart of this community with you all
I never would have imagined to have so many people in my life, I'm honored and I dont take it for granted at all
Thank you so much π«Ά
@TheAhmadOsman Its been a nightmare but ive been fixing every issue i have with them 1 at a time and posting on github for free to help others https://t.co/dATjP6Z0uI
It took me a lot of time, billions of tokens, hairloss, stress, loss of sleep
BUT I DID IT, I FIXED RUNNING LLMS ON WINDOWS!
If you've been using Windows and WSL and it's been giving you tons of headaches, you can finally relax
I made a guide to make it as stable as Linux π«‘
@the_dt1@NousResearch It uses them. The todo list is my personal choice for most tasks as i find it gets the job fully finished the proper way instead of wasting my time. The workflow is a honest implementation of ultracode dynamic workflows. It is fully open source. Fully customizable. Make it yours!
I built Caduceus for Hermes Desktop @NousResearch
Two snakes, one staff
Caduceus runs parallel AI agents in perfect sync, on any model you already have
It sizes up every task and picks the path: β’ Easy β answered instantly β’ Hard β a live to-do plan + agents fanned out in parallel, verified before it calls it done
Caduceus has a UI right inside Hermes so you watch every agent build your project in real time
Full demo π
@TiQTiQB00M@NousResearch awesome, so since its dynamic it will actually build a workflow for the task or you can give it a workflow and it will use this system to ececute it. I'm working on a bunch of things to speed it up without sacrificing quality, next few updates will make it much faster
Caduceus now supports auto local routing
What this means, caduceus understands what local models you use & will automatically choose the best local models for the job
Models will easy unload & swap in vram when needed on demand
Try it today <3
I built Caduceus for Hermes Desktop @NousResearch
Two snakes, one staff
Caduceus runs parallel AI agents in perfect sync, on any model you already have
It sizes up every task and picks the path: β’ Easy β answered instantly β’ Hard β a live to-do plan + agents fanned out in parallel, verified before it calls it done
Caduceus has a UI right inside Hermes so you watch every agent build your project in real time
Full demo π