1/ We’ve made the difficult decision to wind down https://t.co/7otukjfVlW. The website will be up for another 15 days during which time users can download their chat data. New users won’t be able to sign up and existing users won’t be able to create new conversations after today. Yupp is a loved product by many and we are sorry to the community for this outcome.
A few days later, @yupp_ai now has 10K+ votes on @claudeai Haiku 4.5 - and our original observation holds: Sonnet 4 is still preferred over Haiku 4.5 across a diverse range of use cases, reaffirming the importance of real-world user evaluations.
https://t.co/BhE09WeKHD
What do you guys think of this photo? 📷📸
Is it an AI product... 🖲️🖲️
No, This is my arm — and this is the moment I wish Yupp would come to Vietnam ✈️✈️✈️✈️
Even though it's just a finger pointing on a map, it's an affirmation: "Yupp is here, ready to make a difference."💪
@yupp_ai@lorepunk
@entropyeq_
@nikolaynft@MachineGunWilbu
The best AIs in one place. @yupp_ai
Need an image, explanation, code?
You don’t need to jump through tabs.
You don’t need to pay.
You don’t need to search for the best.
Choose any AI model or let Yupp do it itself.
Just Yupp it.
It’s been 36 hours since Grok 4 launched and we have an early verdict based on 6K+ preferences of @yupp_ai users globally on real use cases.
‼️ Grok 4 is worse than other leading models: OpenAI o3, Claude Opus 4, and Gemini 2.5 Pro. Grok 4 is liked even less than Grok 3. 🧵
It’s been ~4 weeks since we launched @yupp_ai – a consumer-first approach to robust & trustworthy AI evaluation. We’re still early but have already gathered 2M+ high-quality human preference feedback datapoints on 500+ models across diverse use cases. 🧵
https://t.co/jmJK4lKJcl
📢 New Model Drop: Gemma 3n!
Introducing a cool new model: Gemma 3n, available for everyone on Yupp!
Created by @GoogleDeepMind alongside leading mobile hardware makers, Gemma 3n is great for speed and quality, even on your local device.
Give it a try: https://t.co/3UMS6v5g95
Today we release DeepSeek-TNG R1T2 Chimera.
This new Chimera is a Tri-Mind Assembly-of-Experts model with three parents, namely R1-0528, R1 and V3-0324.
R1T2 operates at a sweet spot in intelligence vs. output token length. It appears to be...
* about 20% faster than R1, and more than twice as fast as R1-0528
* significantly more intelligent than R1 in benchmarks such as GPQA Diamond and AIME-24/25, albeit not quite on R1-0528 level
* much more intelligent than our first R1T Chimera, and also think-token consistent, which is a major improvement
We perceive it as generally well-behaved and a nice persona to talk to. The weights are on @huggingface under the MIT licence. We are looking forward to your experiments and feedback!
Thanks to @deepseek_ai for giving their models to the world, to @chutes_ai and @openrouter for hosting R1T, to @WolframRvnwlf for benchmarking it, to @xlr8harder for beta-testing the new Chimera, and to @natolambert for constructive discussions at @aiDotEngineer.