Today we're announcing MAI-Thinking-1 with Microsoft and it will be available on Baseten soon.
Microsoft built something genuinely different here: a commercial-grade thinking model trained on clean data with no distillation from third-party models and designed to be fine-tuned by the enterprises using it. Microsoft AI guarantees 100% eyes-off on post-training data and Baseten will handle the fine-tuning and deployment at scale.
The future isn't one model. It's many models, each owned by the businesses that shaped them and MAI-Thinking-1 is a big step in that direction.
https://t.co/8w9k4jwrgq
Abridge moves at the speed of trust—with our health system partners, clinician users, and patients. Today, we’re moving further, faster than ever before.
Abridge Keynote
June 11
12PM EDT
We've raised $65 billion in Series H funding at a $965 billion post-money valuation, led by @AltimeterCap, Dragoneer, @Greenoaks, and @sequoia.
This investment will help us advance our research and expand our capacity to meet growing demand for Claude.
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors.
Available today at the same price.
Post-training your own frontier model has become the new default for the leading AI companies. Baseten's research team partnered with the brilliant team at @harvey and showed that post-trained open models can compete at the frontier on LAB. Post-training not only enables high-quality legal agents to be more accessible, but also allows more specialization for the workflows firms actually care about.
mobile users are mid-visit doing targeted literature search, asking about differentials... without having to type out context since the live transcript is getting fed in
I knew there was an appetite for this, but my expectations were exceeded, let's say!
Today we're sharing our first research collaboration with @baseten on open-weight legal agents.
Using signal from LAB (our Legal Agent Benchmark of 1,200+ tasks across 24 practice areas), we post-trained an open-weight model to match closed-source frontier performance.
Training open-weight agents for legal creates three major advantages for Harvey:
1) Cost and latency improvements:
The best-performing closed-source frontier models took an average of 22 minutes to complete each LAB task, with $50 per-task average inference cost.
Open-weight inference is substantially faster and cheaper.
2) Reasoning visibility:
In a high-stakes domain like legal, it's incredibly important to get visibility into agents' internal reasoning states - for audit and governance and also as a lever for Harvey to improve agent performance.
Closed-source foundation model providers avoid exposing raw reasoning tokens via API to prevent model distillation. For Harvey, that visibility is a major advantage.
3) Custom training:
Owning the model weights lets us customize training and modify architecture. One example: our blog’s final note on training a custom KV cache compactor.
More to come on our research collaboration with Baseten.
Most VCs wouldn’t touch Anthropic in 2023.
Yasmin Razavi did.
The Spark Capital partner led a $450M round when Anthropic had no public product, no revenue and a massive capital need. Now the AI giant’s rise has landed her on the Forbes Midas List for the first time. https://t.co/9rxuS91mb8 (Photo: Guerin Blask For Forbes) #ForbesMidas
“It went very quickly from ‘who will use it?’ to ‘we can’t keep up with the demand.’ That shift marked the transition from pilot to true enterprise scale.” — Dr. Ben Hohmuth, Chief Medical Informatics Officer and Clinical Lead for the Risant Value-Based Platform at @GeisingerHealth
Geisinger’s experience proves that clinician-led adoption of AI can drive rapid scale.
Read more about how Geisinger scaled Abridge: https://t.co/CLfBlhaQW2
Highly recommend this excellent 20vc episode featuring @ShivdevRao, CEO of Abridge - a Baseten customer we are incredibly proud to work with.
One stat that stood out: 40% of Abridge’s model outputs now come from in-house models. In high-stakes clinical workflows, milliseconds matter. Owning and optimizing your own models materially improves latency, reliability and quality.
For the health systems, doctors & nurses relying on the product every day, those gains compound quickly. It’s one of many reasons @AbridgeHQ has become such a trusted and beloved product across healthcare.
Anthropic co-founder Chris Olah was invited to speak at today's presentation of Pope Leo XIV's encyclical "Magnifica humanitas."
Read the full text of his remarks: https://t.co/CoBfkVOVcy
I like what the @baseten team is doing. Asset light infra provider focused on best possible model deployments. Run fast, scale it simply and secure it everywhere. 👏
We’re launching Benchling Inference, powered by @baseten
It is scalable GPU capacity across 15 clouds for our 1300+ customers, preloaded with today's top scientific models and the integrations to make in silico discovery work out-of-the-box for biopharma companies.
Startups get better economics and availability. Enterprises get best-in-class infrastructure that works alongside their cloud commits and data sovereignty requirements.
It's been a pleasure working with Baseten on this. They've spent six years building at the leading edge of inference and are the compute behind some of the most demanding AI in production.
https://t.co/LLXZ1BecdA
Had the best time talking with @niki4conviction about what top talent looks like in today’s market, the unique culture we’re building at Baseten, and why the people you hire define the company you become.
Thanks for hosting me and for the unwavering support we get from the entire team at @conviction
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.