@LegalMalvado@ahurtadobueno@Correos@notificados Creo que con minimax m2.7 o deepseek v4 flash llega de sobras, no he visto la web de correos pero mis clases de gym las reservo con m2.7 y sin problema.
@serre_ls@bluetouff i have 32gb unified memory but the model keeps failing to load, even with small context on lm studio. did it work with no issues for you?
Anyone else having issues with Hermes reconnecting after a gateway restart? Confirmed restart through Telegram and it stopped replying. Using Minimax 2.7 with no issues so far, si I'll check what happened when I can access the machine later today.
@VonNeumMeme I also have m4 base with 32gb. Qwen 27b is slow due to the memory bandwith, not the RAM. m4 pro is much faster for this model, but for me i also switched to qwen 35b a3b abd getting 40tok/s.
The prompt: "Use Canvas to create a realistic campfire scene. Flickering flames with natural movement, rising embers that float upward and fade out, glowing light that illuminates the surroundings, and logs at the base. Dark night background. Smooth animation, continuous loop. No external libraries."
Done setting up my Hermes Agent with a combination of Minimax 2.7 as principal and then my local Qwen as secondary.
Just setup HA with M2.7 and asked it to use the secondary model via LM Studio and to let me know when it will be used.
Done.
@sin_management@tedfarino@Teknium@zocomputer Ummm how does this work? Just ask hermes to navigate using zocomputer? Is it a specific skill. Looking forward to check some anti-bot sites with Hermes.
Surprised to see 47tok/sec directly in LM Studio but dropping a lot on Hermes Agent. Probably the context window is too much when using it? Will test with opencode to see if there is any improvement.
Today i started playing with local LLMs on my Mac Mini M4 32gb ram.
1st test on LM Studio: Qwen 3.5 27b q4 MLX: 7tok/s
2nd test: Qwen 3.5 9b MLX: 21 tok/s
3rd: Qwen 3.5 35B-A3B: 47 tok/s
Moving now to testing Hermes Agent!
I will be sharing learnings.
@cibernicola_es Si, me refiero a si usas algo en cloud para cosas importantes tipo sonnet 4.6 o opus y el resto en local o todo directamente en qwen. Estoy probando ambas cosas a ver si soy capaz de separar cosas que requieren un modelo más capaz de las que son mas faciles.