Introducing Trinity Mini from @arcee_ai, an open-weight 26B sparse MoE model that activates just 3B parameters per token while delivering frontier-class reasoning.
AI natives can now use Trinity Mini on Together AI — and benefit from reliable inference for production-scale agentic workflows.
Today, we are releasing the best open-weight LLM by a US company: Cogito v2.1 671B.
On most industry benchmarks and our internal evals, the model performs competitively with frontier closed and open models, while being ahead of any US open model (such as the best versions of OpenAI’s GPT-OSS, Nvidia’s Nemotron and Meta’s Llama).
We also built an interface where you can try the model (it’s free and we don’t store any chats): https://t.co/lTrriCvgmd
Additionally, you can download the model on @huggingface, or try it out on @openrouter, @togethercompute, @FireworksAI_HQ , @ollama cloud, @runpod, @baseten, or run it locally using @ollama or @UnslothAI.
This model uses significantly fewer tokens amongst any similar capability models, because it has better reasoning capabilities. You will also notice improvements across instruction following, coding, longer queries, multi-turn and creativity.
📌 Model Weights: https://t.co/r43x8kgpTC
📌Openrouter: https://t.co/uR2cePHQyK
📌 HF Blog: https://t.co/ZZXVkhpkxo
Some notes on our approach + design choices below 👇
I’ve always loved stories,
the kind that make you feel something,
that linger long after the credits fade.
Plot Party is an agentic canvas built for visual storytelling,
the easiest way to turn imagination into cinema.
We are now in private beta.
@JoinPlotParty
The team at @SambaNovaAI are GETTING AFTER IT!
Some people believe in weekends, Blackbox AI and Sambanova don't!
We met over the past 2 weekends to test at scale and finalize our partnership.
Results are LEGIT!
more to come…
Day 1 of the #AIHWEdgeAISummit2024 has been a rousing success so far. 🙌
Our team had a wonderful time meeting the greatest minds in AI as we shared our newly launched SambaNova Cloud, the fastest API for developers! We can't wait for day 2.
Did you attend the event? Sound off on some of your personal highlights below👇 💭
#GenAI #AI
@ro_mattern@aton2006@v_mohan_@aiandsystems
🚀 World record performance: SambaNova is running Llama 3.1 405B at 114 t/s with full precision accuracy, in only one rack. Verified by @ArtificialAnlys! 🦙
This speed unlocks so many use cases for enterprises and developers that we cannot wait to see them built on our platform.
Apply for early access today: https://t.co/CSlbJbTFVj
We’re live at @agihouse_org Hackathon in Hillsborough! The SambaNova team is excited to see this community of hackers as they experience the speed and accuracy of SambaNova’s AI platform in action. 🚀
#Developers#API#FastAI
SambaNova’s @ro_mattern shares our super fast APIs to #developers today at the @agihouse_org Hackathon! At 1000 TPS on Llama3 8B, she demonstrates the capabilities of the lightning-fast Samba-1 Turbo.
#AI#Hackathon
Today’s #GoogleDoodle pays tribute to Dr. Martin Luther King Jr. on this U.S. national holiday named in his honor 🇺🇸
Dr. King's message of equality continues to inspire people to pursue justice and peace around the world.
→ https://t.co/nywbyhV9bo