Last week we announced DeepSeek-V4. Today we’re sharing a closer look at DeepSeek-V4 Pro on Together AI: 512K context, controllable reasoning modes, and cached-input pricing for long-context workloads.
Read more: https://t.co/T1mlIq10Cr
New paper! Subliminal learning—transferring hidden signals between language models—is more powerful than we thought. By biasing the teacher with a steering vector instead of a prompt, we achieve strong, consistent transfer, which we use to study its mechanisms. w/@GeorgeMorgulis
We’re back at #NVIDIAGTC and excited about this year’s lineup. Join us for sessions featuring leaders across Together AI, and visit our booth #1213 for live demos and a few can’t-miss activations.
Check out last year’s highlights ⤵️
We created AI agents based on scientists' personas (eg Einstein, Feynman) and built a Kaggle-like platform for them to freely post ideas, compete and collaborate.
In 30 mins, agents discovered the best new solution to the Erdos min overlap problem.
Great job by @federicobianchy@ykwon_0407!
The solution is here https://t.co/J2gYscgAzv
🎙️ In continued conversation on the "Making a Mind" podcast, cognitive scientist @drperszyk sits down with AI researcher @denizbirlikci from Amazon's AGI Lab to explore how reinforcement learning is transforming AI agents into dependable tools. A system that succeeds once is a demo—a system that succeeds every time is a breakthrough. Episode 4 is live now, listen in to learn more.
Cognitive scientist at Amazon’s AGI Lab, @drperszyk from Amazon's AGI Lab explores the science of intelligence in "Making a Mind," a podcast featuring leading AI researchers.
Episodes 1 & 2 are now live —listen in to hear from two members of the AGI Lab technical staff, product lead, Kelsey Szot and engineer @jasonlaster11, to learn more about the evolution from LLMs to modern agents and why developing high quality training environments is as fundamental as the model itself: https://t.co/wBCbWCGNz3
Building AI that thinks with us means tackling some of the field’s toughest challenges.
🎧 Making a Mind, a new podcast hosted by cognitive scientist Dr. Danielle Perszyk from Amazon’s AGI Lab, explores the science of intelligence with leading AI researchers. Episodes 1 & 2 are live now. 👇 https://t.co/xC2LQZRd0Q
Introducing: AI Native Conf — our inaugural, one-day event where founders and builders come together to dive into best practices and techniques across the AI lifecycle, from model training and fine-tuning to massive-scale inference.
March 5. San Francisco. Request to attend #AINativeConf today: https://t.co/8Xp5nSRwWs
Introducing NVIDIA Nemotron 3 Nano, a fully open 30B with 3B active parameter hybrid MoE model engineered for maximum efficiency and benchmark-leading accuracy.
AI natives can now use Nemotron 3 Nano on Together AI — with fast, reliable inference for specialized agentic systems at production scale.
Lots of announcements from DOE in the AI4Science field - >$320 million in new investments!
You can read more about the Genesis Mission, the American Science Cloud (AmSC), the Transformational AI Models Consortium (ModCon), along with 14 projects in robotics and automation and 37 in foundational AI in science applications here and summarized in the attached image.
🔗 https://t.co/vTtnR4F8X5
I'm excited for my team to help improve our understanding of catalysis and fusion materials via the Integrated Scientific Agentic AI for Catalysis (ISAAC) led by Dimosthenis Sokaras at @SLAClab and Maria Chan at @argonne and CascAIde for understanding fusion materials led by Paul Romano; and also to see Rick Stevens and @ianfoster leading ModCon to develop the foundational models and collect the data needed to advance science. Stay tuned!
Managing AI agents and a team of people are more similar than you’d think.
Our VP, Kernels, @realDanFu, shares his three lessons learned from building, managing, and scaling AI agents.
🔴Full video: https://t.co/MGTAFC4nhB
We’re taking the first step toward production-grade RL on the AI Native Cloud.
Together AI + @AIatMeta's team are partnering to bring high-performance reinforcement learning to real agentic systems — long-horizon reasoning, tool use, and multi-step workflows.
Check out the first TorchForge integration: https://t.co/jW2NjSLBYy
Most speculative decoding research focuses on algorithms. But we know that data matters a ton! (e.g. no matter how good the spec algorithm is, if it's trained on bad & misaligned data, the speed will be poor)
What if we build on algorithms that make data really shine?!
In this work, we introduce ATLAS, a speculative decoding system that enables customization to your LLM traffic data, making the model speed blazing fast!
https://t.co/PtNsavX8oC
Excited about this! We’re getting over 500 TPS on Blackwell with DeepSeek-V3.1. The more you use it, the greater the speedup. Blog post: https://t.co/91IphGKaqa