We are sharing an early preview of our ongoing SWE-1.6 training run.
It significantly improves upon SWE-1.5 while being post-trained on the same pre-trained model - and it runs just as fast, at 950 tok/s. On SWE-Bench Pro it exceeds top open-source models.
The preview model still exhibits some undesirable behaviors like overthinking and excessive self-verification, which we aim to improve. We are rolling out early access to a small subset of users in Windsurf.
At @harvey, the engineering team integrated Spectre — their internal background agent — into Devin Desktop.
Now Spectre's organizational context can live on every engineer's laptop and flow across their favorite agents.
Devin has become a huge source of leverage for us at Long Lake.
Engineers love it, and adoption is compounding across our team AND non-technical teammates at partner companies.
It lets people focus on the work, not the model-router brain damage of “which frontier model should I use for this task?”
Congrats to the team!!
i've been devin-pilled at @modal for development, prototyping and incidents - proud to have them as a customer of our platform, and excited to see where their product goes from here!
5/ Today, Devin is responsible for 89% of the PRs written at Cognition.
Devin has dramatically accelerated our product roadmap, building and shipping new features like Devin Review, Auto-Triage, Managed and Scheduled Devins, Windsurf 2.0, and more.
That's Harry Reid International Airport (LAS) in Las Vegas. The giveaway is the bright, colorful neon/LED lights visible through the terminal windows — that's the Las Vegas Strip. Combined with the typical US airport gate seating, "SECURED AREA" signage, and the EXIT sign, it all points to LAS. Likely one of the gates in Terminal 1 or 3 that face toward the Strip.
The natural step after a cloud agent that you can tag in Slack is an agent that can jump onto issues proactively.
This vision is something we've been talking about since early 2024 and have prototyped many times. Again & again it just didn't feel good enough, so we hadn't shipped it - but now we finally have a version that just works.
ever since moving to devin
i've found the most useful feature is being able to spin up child devin sessions (agent fan-out)
all with their own VM / context / etc
and the master devin can send interrupt messages + read results + get notified when a task completes
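
for anyone curious what that fan-out pattern looks like in general, here is a minimal sketch using plain Python `asyncio`. this is not Devin's API: `ChildSession`, `fan_out`, the `"stop"` message and the three-step work loop are all hypothetical names made up for illustration, and a real child would run in its own VM rather than as a coroutine. it only shows the shape: a parent spawns children with separate context, can interrupt them, and collects results as each one completes.

```python
import asyncio
from dataclasses import dataclass, field


@dataclass
class ChildSession:
    """Hypothetical stand-in for a child agent session with its own context."""
    name: str
    task: str
    context: dict = field(default_factory=dict)                  # per-child state, never shared
    inbox: asyncio.Queue = field(default_factory=asyncio.Queue)  # interrupt messages from the parent

    async def run(self) -> str:
        steps_done = 0
        while steps_done < 3:
            # Check for an interrupt from the parent before each unit of work.
            if not self.inbox.empty():
                message = self.inbox.get_nowait()
                self.context["last_interrupt"] = message
                if message == "stop":
                    return f"{self.name}: stopped after {steps_done} steps"
            await asyncio.sleep(0.01)  # placeholder for real work
            steps_done += 1
        return f"{self.name}: finished '{self.task}'"


async def fan_out(tasks: dict[str, str], stop: set[str]) -> list[str]:
    """Parent: spawn one child per task, interrupt some, gather results as they complete."""
    children = {name: ChildSession(name, task) for name, task in tasks.items()}
    running = [asyncio.create_task(child.run()) for child in children.values()]

    # Send an interrupt message to the selected children.
    for name in stop:
        await children[name].inbox.put("stop")

    results = []
    # as_completed yields each child as soon as it is done (the "notification").
    for finished in asyncio.as_completed(running):
        results.append(await finished)
    return sorted(results)


if __name__ == "__main__":
    print(asyncio.run(fan_out({"a": "lint", "b": "tests"}, stop={"b"})))
```

the queue-per-child design keeps interrupts targeted: the parent talks to one child without touching the others, and each child decides at a safe point how to react.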