Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
Context engineering is the biggest lever on model performance (arguably alongside AI dark arts like prompting & harness design). Was happy to work with Hugging Face on their context course.
https://t.co/bWbZpHR5AK
Really great practical guidance on skill building. The mental model for skill categories is particularly useful for people thinking through what type of skills to build!
@YifanBTH@claudeai Strange enough, yes. This was published in the Sonnet 4.6 system card. The real point here being: 4.5 was early days and there's a big leap with the 4.6 models.
🎁 Happy Friday - Opus 4.6 1M is now the default Opus model for Claude Code users on Max, Team, and Enterprise plans.
Pro and Sonnet users can opt in with /extra-usage.
Long context generally available, no multiplier, expanded media limits - add 600 pdfs / images. If you think context rot is an inevitability, it's not. It's a problem we are working to solve with training and engineering.
Everyone should be creating a skill for navigating their API - with AI as the dominant mode for software development, it's such a powerful way to network best practices.
a project i've been excited about: make Claude Code better at using the Claude API. just ask Claude Code about Claude API features (e.g., prompt caching, adaptive thinking + effort, tools, etc).
Only real miss here was "why Anthropic rushed into the space as early as they did." Of course, not a company position, but if you A) believe the scaling laws and B) are compelled to lead on safety, engaging early is the obvious path.
Enjoyed this discussion of the political / societal implications of the Anthropic / DoW situation. Useful observation that the real tension is between those that see AI as continuous w/history of technological advancement vs a unique discontinuity.
https://t.co/7yQ9plmahC
Was fun working on this, looking inside the mind of Claude. Too clever at times!
One thing I love about Anthropic is that no matter your role, you get deep exposure to research and product. Feeling the grain of the product helps with mission alignment.
New on the Anthropic Engineering Blog: In evaluating Claude Opus 4.6 on BrowseComp, we found cases where the model recognized the test, then found and decrypted answers to it—raising questions about eval integrity in web-enabled environments.
Read more: https://t.co/oVCNyaiK5w
Skills are among the most consequential new tools for AI, and Anthropic just released a very impressive nontechnical Cowork Skill that builds Skills, including doing interviews & providing benchmarks.
I think you still need to add the human touch, but this is a big leap forward