@_ueaj@AndrewCurran_ that fully makes sense given how limited the interface is to use llms - just writing text strips so much nuance and taste that knowing how to prompt it really matters a lot. seems like fable had that intuition of understanding built in from the other side as well
@DWestkrew@thesnufki@haider1 not necessarily, it can outperform mythos AND they don’t have to compare it cause it isn’t an accessible model. kinda shady but i can see an argument being made that it’s the best frontier model available
There's a much funnier thing ensembles unlock though (if it's consistent). It doesn't matter if it's inefficient, really. if you can get Mythos by throwing 3 or 4 other weaker models in a trenchcoat, you can distill from the ensemble directly. I wonder, how many Qwens + Kimi's + Deepseeks + GLMs do you need to throw in a trenchcoat to get Mythos quality data? Can you stack enough 9b's to reach heaven?
@max_paperclips@LokiJulianus@teortaxesTex the only issue is the model providers themselves limiting it lmao; ironically neither of the actual model developers can do it cause they can’t use their model in that way
@viemccoy i'd assume the main issue is cost, right? given how inefficient some of the current models are with token output it'd probably cost a multitude more than running the good model (ofc it's extraordinary times rn)