Whale is finally here 🐳
MoE · 1M context · 3 reasoning modes
@deepseek_ai V4 series now available on SiliconFlow with day-0 support:
⚡ DeepSeek-V4-Pro: $0.145 / $1.74 / $3.48 per 1M tokens
🚀 DeepSeek-V4-Flash: $0.028 / $0.14 / $0.28 per 1M tokens
🤖 Best open-source model for Agentic Coding, outperforms Sonnet 4.5 & approaches Opus 4.6
🌍 World knowledge that rivals frontier closed-source models
🧮 #1 open-source on math, STEM & competitive coding — matches Opus 4.6 & Gemini 3.1 Pro
Try it now ⬇️
📢 Nex-N2 is here!
A family of agentic models that doesn't just think, it acts!
Coding, search, tool use. All fused into a single agentic reasoning loop.
- Adaptive Thinking, auto-scales reasoning depth per step. Saves ~20% tokens, zero performance loss.
- Coherent Thinking, one thinking paradigm across search, coding, and tool use. No more fragile mode-switching.
🏆 Result: Tier-1 open-source performance on SWE-bench, Terminal-Bench, GDPval, and more, tracking GPT-5.5 and Opus 4.7.
🎉 Open-weight. Try it now.
🔗 https://t.co/7oLSfyOCxB
📦 https://t.co/c2CGhXWaz6
https://t.co/KJYXZIpk8M
https://t.co/vcjdZ9cuB6
Post-training is having a moment — Nex-N2-Pro from neolab @NexEcosystem proves it.
Built on Qwen3.5-397B-A17B, delivers GPT-5.5 and Claude Opus 4.7–level performance.
🎉 T+0 Support on SiliconFlow · Free for First 2 Weeks
N2-Pro: 397B MoE / Reasoning Model / 262K context / VLM
→ Auto-adjusts reasoning depth, 30–50% fewer thinking tokens, no performance trade-off
→ SOTA performance on Terminal Bench 2.1, GDPVal, SWE-Verified
→ Excels at agentic coding, deep search, tool use
→ Plug-and-play with Claude Code, Cursor, OpenClaw, etc.
Try it on SiliconFlow ⬇️
@izzy2fx@karpathy@opencode@justsisyphus In this case, it's an agent-driven wiki building in Obsidian; things like persistence and conflict resolution aren't hardcoded, you could just write the rules into your agent's instructions, and it follows them.
@karpathy 's llm-wiki hit 5,000+ stars in weeks.
The idea: stop re-discovering knowledge every session. Let an LLM build and maintain a wiki that gets smarter every time you use it.
Here's how to build your own with @opencode + @justsisyphus OMO + SiliconFlow 🧵
Meet Gemma 4 12B!
A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.
Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇
How this stack works:
→ @opencode browses the web via Chrome automation — reads pages, extracts entities & concepts, writes cross-linked markdown into your Obsidian vault automatically
→ @justsisyphus oh-my-openagent routes each task to the right model — orchestration, deep reasoning, fast search, all handled without manual juggling
→ @SiliconFlowAI 200+ frontier models, DeepSeek V4/GLM-5.1/Kimi2.6/M3 etc., one API key
One ulw command scaffolds the entire wiki structure. Your knowledge compounds every time you use it.
The next evolution of Hermes Agent is here!
Introducing Hermes Desktop: everything you love about Hermes, now native on your machine.
First demoed in Jensen's GTC keynote, it's now in public preview.
👏👏 Introducing Qwen3.7-Plus — a multimodal agent model that unifies vision and language into one versatile agent foundation.
✅ Multimodal interactive hybrid agent: unified GUI & CLI operation across visual and text tasks
✅ Versatile coding agent & productivity assistant with full-modality input
✅ Visual Agent: perception, reasoning, grounding, and search-augmented QA
✅ Cross-harness generalization across diverse agent frameworks
One model. Sees, thinks, codes, acts.🙌🙌
Now available via API on Alibaba Cloud Model Studio. Try it — let us know what you build.😎
🔗🔗⬇️⬇️
Blog:https://t.co/pVYf0h3NNa
Qwen Studio:https://t.co/HUYgFW4cYf
API:https://t.co/viL0cXrMzW
Coding like Opus4.7 / 1M context window / Native multimodal
@MiniMax_AI M3 is now on SiliconFlow with day-0 support 🔥
🎉 Limited-time 50% off for 7 days
Cache / Input / Output: $0.06 / $0.30 / $1.20 per 1M tokens
(Regular: $0.12 / $0.60 / $2.40)
M3 is the first open-source model combining all three frontier capabilities:
→ Coding & Agentic: beats GPT-5.5 and Gemini 3.1 Pro on SWE-Bench Pro
→ 1M context via MiniMax Sparse Attention
→ Native multimodal from step zero — image, video & computer use
Try it on SiliconFlow ⬇️
The #1 coding agent on @OpenRouter, now living in your Discord server
Step-by-step setup, model selection & pro tips
Here's everything you need with @NousResearch Hermes Agent + SiliconFlow 🧵
~15% off for @Kimi_Moonshot K2.6 on SiliconFlow💰
Input pricing: $0.90/M ➡️ $0.77/M
Combined with
→ Top performance on @OpenRouter: 0.21% avg tool call error rate
→ 80%+ cache hit rate
→ FP8 quantization + Zero Data Retention
Spend less, debug less, and ship more
Don't miss this builders
Get started now with Kimi K2.6 on SiliconFlow ↓