Scott Mueller

2 months ago

Sup AI just launched on @ProductHunt Sup combines multiple AI models and uses confidence scoring to give better answers with fewer hallucinations. #1 on Humanity's Last Exam: 52.15%. Beating every individual model. $10 starter credit to try it https://t.co/TG4CU6yRR6

1

0

197

2 months ago

Did @OpenAI just silently kill top-20 logprobs for all models except GPT-5.4-nano? Setting `top_logprobs` to anything > 1 returns an error for the past 4-5 hours. Might just be their servers having trouble, but their status page indicates "No incidents" for their responses API.

0

114

Professor of Applied Econometrics and Policy Evaluation at @ses_unifr @unifr - causal analysis, statistics, econometrics, machine learning...and telemarking

3 months ago

@supabase @supaihq @kiwicopple @AntWilson Update: Supabase support was able to recover the database! The site is back online and the data is safe. The Supabase dashboard is still completely bugged out, but the critical infrastructure is running again. Appreciate the support team getting this recovered.

0

1

0

36

Who to follow

Martin Huber

@CausalHuber

Elias Bareinboim

@eliasbareinboim

Professor of Causal Inference, Machine Learning, and Artificial Intelligence. Director, CausalAI Lab @ Columbia University.

Victor Veitch 🔸

@victorveitch

AI | University of Chicago / Google DeepMind

3 months ago

@supabase, @supaihq's production Supabase is down, and nobody at Supabase is responding. You guys did something strange, and the entire database was wiped. Then we tried a PITR, and it said the restore failed. The project has been inaccessible. Can anybody help?

1

2

0

90

3 months ago

Still zero response on ticket #SU-342355. Production is completely down and we are facing permanent data loss. We have escalated this to Hacker News. @kiwicopple @antwilson @supabase please, we urgently need an infra engineer to look at this before the volume is overwritten. https://t.co/RYH8Gg6qXA

1

0

89

smueller retweeted

5 months ago

Sup AI whitepaper is live on the methodology behind 52.15% on HLE: • 3 correct answers synthesized when EVERY model failed • Grok 4 (29%) uniquely solved 16 Qs vs GPT-5 Pro's 9 (40%) • Low correlation pairs >high accuracy pairs • 58.44% theoretical ceiling w/ models • 42% Qs unsolved by ANY model • Full methodology, IQ curves, correlation matrices: https://t.co/EiKtyUOGzo #AI #MachineLearning #OpenSource #AIResearch #EnsembleAI #AIOrchestration #HLE

0

3

2

1

468

smueller retweeted

5 months ago

Sup AI's 52.15% HLE (+7.41 over frontiers) was orchestration + synthesis. Now every model executes Python/Bash/C++/JS/TS/R/Java +15 langs. Image mutation. Virtual FS. Deterministic verification. Guesses → Calculations. Ceiling exploded. #SupAI #AI #CodeExecution

supaihq's tweet photo. Sup AI's 52.15% HLE (+7.41 over frontiers) was orchestration + synthesis.

Now every model executes Python/Bash/C++/JS/TS/R/Java +15 langs. Image mutation. Virtual FS. Deterministic verification.

Guesses → Calculations. Ceiling exploded.

#SupAI #AI #CodeExecution https://t.co/PuV2rbC2vk

0

3

2

0

368

smueller retweeted

5 months ago

@minchoi That's ~$950/month across 5 services. Sup AI is $200/month and includes all those models and more in one place. Save $750/month. https://t.co/8ysXKdNmFQ

0

2

1

0

82

smueller retweeted

5 months ago

🗂️ Deprecated models are now accessible: Claude Opus 4.1, Gemini 2.5, Flash Gemini 2.5 Pro, Llama 3.3 70B, Llama 4 Maverick 17B, Llama 4 Scout 17B, Kimi K2 Turbo, Grok 4 Fast, Grok 4 Fast Reasoning, GPT-5, GPT-5 Pro, GPT-5.1, GLM 4.5 Air, GLM 4.6, MiniMax M2, Pixtral 12B are back by request. Find them at the bottom of the model selector → click "Deprecated" to expand. Great for: specific personalities, fewer guardrails ⚠️ Not recommended for serious work as newer models outperform them.

supaihq's tweet photo. 🗂️ Deprecated models are now accessible: Claude Opus 4.1, Gemini 2.5, Flash Gemini 2.5 Pro, Llama 3.3 70B, Llama 4 Maverick 17B, Llama 4 Scout 17B, Kimi K2 Turbo, Grok 4 Fast, Grok 4 Fast Reasoning, GPT-5,
GPT-5 Pro, GPT-5.1, GLM 4.5 Air, GLM 4.6, MiniMax M2, Pixtral 12B are back by request. Find them at the bottom of the model selector → click "Deprecated" to expand. Great for: specific personalities, fewer guardrails ⚠️ Not recommended for serious work as newer models outperform them.

1

2

1

0

202

smueller retweeted

5 months ago

We just launched the Sup AI Developer API One endpoint → Multiple frontier models → Better answers ✅ Multi-Model Consensus: Combine outputs from Claude, GPT-5, Gemini, and more ✅ OpenAI compatible (2-line integration) ✅ 5 modes: fast → thinking → pro ✅ 52.15% on Humanity's Last Exam (SOTA) ✅ Self-healing tool calls Get your API key → https://t.co/Lhlmabce9V Full docs → https://t.co/upeMN88pyW How it works: Instead of betting on one model, Sup AI orchestrates multiple models and synthesizes their outputs. auto mode picks the right approach. pro mode runs 9 models for mission-critical work. You get consensus-driven answers without the infra headache.

supaihq's tweet photo. We just launched the Sup AI Developer API
One endpoint → Multiple frontier models → Better answers

✅ Multi-Model Consensus: Combine outputs from Claude, GPT-5, Gemini, and more
✅ OpenAI compatible (2-line integration)
✅ 5 modes: fast → thinking → pro
✅ 52.15% on Humanity's Last Exam (SOTA)
✅ Self-healing tool calls

Get your API key → https://t.co/Lhlmabce9V
Full docs → https://t.co/upeMN88pyW

How it works:
Instead of betting on one model, Sup AI orchestrates multiple models and synthesizes their outputs.

auto mode picks the right approach. pro mode runs 9 models for mission-critical work.

You get consensus-driven answers without the infra headache.

1

4

1

0

184

smueller retweeted

6 months ago

Sup AI update: → Faster generation → More reliable → Terminate models mid-response (for when you can't wait for GPT-5.2 Pro to finish 🙂) Also added GLM 4.7 and MiniMax M2.1 42 models. One interface. https://t.co/8ysXKdNmFQ

supaihq's tweet photo. Sup AI update: → Faster generation → More reliable → Terminate models mid-response (for when you can't wait for GPT-5.2 Pro to finish 🙂)
Also added GLM 4.7 and MiniMax M2.1 42 models. One interface. https://t.co/8ysXKdNmFQ https://t.co/Rgmw71NFE9

0

3

2

0

394

smueller retweeted

Gad Saad

@GadSaad

6 months ago

Dr. @yudapearl - On Zionophobia, Jew-Hatred, and the Promise of AI

25

145

30

29

35K

smueller retweeted

6 months ago

💯 Memory IS the lock-in. That is why Sup AI decoupled memory from the model. Your memory is shared across all 42 frontier models -GPT, Claude, Gemini, Grok, everything. Switch freely; your context follows you. Great suggestion - now we just need to build that import feature 👀

0

2

1

0

236

smueller retweeted

6 months ago

Single-model AI is broken. You're paying for 5 subscriptions, manually A/B testing outputs between tabs, and praying the "best" model doesn't hallucinate on the task that matters. We orchestrate 40+ frontier models instead. Auto-route. Auto-validate. One platform. Result: 52.15% on Humanity's Last Exam. +7.49 points ahead of every solo model. The future isn't picking the best violinist. It's conducting the whole damn orchestra. https://t.co/qEuRVv2lsF #AI #AIOrchestration #SupAI #LLMs #LLMCouncil

supaihq's tweet photo. Single-model AI is broken.

You're paying for 5 subscriptions, manually A/B testing outputs between tabs, and praying the "best" model doesn't hallucinate on the task that matters.

We orchestrate 40+ frontier models instead. Auto-route. Auto-validate. One platform.

Result: 52.15% on Humanity's Last Exam. +7.49 points ahead of every solo model.

The future isn't picking the best violinist. It's conducting the whole damn orchestra.

https://t.co/qEuRVv2lsF
#AI #AIOrchestration #SupAI #LLMs #LLMCouncil

2

6

3

0

296

smueller retweeted

6 months ago

The AI race has a new winner every week. OpenAI → Gemini → Grok → Claude → DeepSeek Betting on one model? You've already lost. @SupAIHQ orchestrates 40+ frontier models, achieving 52.15% on Humanity's Last Exam: https://t.co/l8FuQDRfxI Don't pick a rat. Own the racetrack. #AI #Orchestration

supaihq's tweet photo. The AI race has a new winner every week.
OpenAI → Gemini → Grok → Claude → DeepSeek
Betting on one model? You've already lost.
@SupAIHQ orchestrates 40+ frontier models, achieving 52.15% on Humanity's Last Exam:
https://t.co/l8FuQDRfxI
Don't pick a rat. Own the racetrack.
#AI #Orchestration

0

5

2

0

183

smueller retweeted