@gospaceport IMO it's still tiering. I've found a bunch of situations where I need more. It's probably 80/20, especially when it comes to planning or the "other" side of dev: refactoring, optimization, etc. There's some very heavy stuff Kimi won't do and only SOTA can (<1%). Q2.6 -> K2.6 -> GPT5.5.
@snwy_me Next we’ll see method unlocked my seat heaters in my BMW. I hadn’t actually contemplated that metho is gonna find a lot of vulnerabilities in existing hardware. Very interesting post, thank you!
@RoboIntellect @NVIDIAAI @openclaw @NousResearch @LangChain @huggingface 550B at NVFP4 is ~275 GB of weights (550B params × 4 bits ÷ 8). MoE means that with fast enough system RAM you can swap experts in and out of VRAM, but it would hurt performance. 4x RTX 6000 Pro (384 GB) would run it without swapping, for ~$50k USD.
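A rough back-of-envelope check of that sizing, as a sketch only. It assumes exactly 4 bits per weight and ignores quantization scale factors, KV cache, and activations, so real usage will be higher. The helper names `weight_gb` and `fits_without_swapping` are made up for illustration, and the 96 GB per card is just 384 GB divided by 4.

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight footprint in GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at bits_per_weight bits,
    with no extra bytes for scale factors or higher-precision layers.
    """
    return params_billion * bits_per_weight / 8


def fits_without_swapping(weights_gb: float, gpus: int, vram_per_gpu_gb: float) -> bool:
    """True if the weights alone fit in the combined VRAM of all cards."""
    return weights_gb <= gpus * vram_per_gpu_gb


weights = weight_gb(550, 4)      # 550B parameters at 4 bits each
total_vram = 4 * 96              # 4 cards, 96 GB each (384 GB total)
headroom = total_vram - weights  # what is left for KV cache, activations, etc.

print(f"weights ~{weights:.0f} GB, VRAM {total_vram} GB, headroom ~{headroom:.0f} GB")
print("fits without swapping:", fits_without_swapping(weights, 4, 96))
```

The headroom figure is the part to watch: long contexts eat into it quickly, which is where swapping to system RAM would start to matter.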
@david_nix @gospaceport @loktar00 I got mine in the 8xxx and now I want another. I hope your Time Machine has room for two. If I’d known about pipeline parallelism then, I prob would have bought two.
@wagslane The old SEO approach with click-through ads will not work though. I can see a model where you pay $1-3 a month for read-only bot access, and maybe $50 or per-API-call pricing for write access (to limit bot slop) on the big platforms. Companies that don't adopt this will be at risk of disruption.
@wagslane Yes - I can totally see there being tension between what people want (via bots) and established companies. This will spur competitors that are bot friendly. I would argue you can "sell" to bots, but it's more when the person gives a "research" request to the bot. ...
@theo Feels like there's a need for a new OSS competitor. I know Mattermost is reasonable. Esp with agentic engineering, something user friendly would be great. Teams' only benefit is that if you have a Teams meeting, the chat stays in the Teams app, vs Zoom where it disappears.