@NousResearch @Teknium Hi, what is the recommended max context length for Hermes? 64k, 100k, 200k? So that tool calls, chat, compaction, and everything else work as expected
@ComfyUI Hi, how did you set it up to run on an RTX 5090? I’m trying but I’m getting out-of-memory errors. I have to either disable the 4-step LoRAs or offload the CLIP to the CPU
@thdxr Wondering what the best coding model is to run offline with opencode on an 18 GB RAM MacBook Pro M3 Pro? Had a long flight and tried a bunch of 7B-parameter models with LM Studio and Ollama, but none use tools right, so it's kind of impossible to use 😅 I want to be better prepared for the next flight