The classic Stroop test just broke the internet's best AI. A new study reveals that next-gen models like GPT-5, Claude Opus 4.1, and Gemini 2.5 suffer a massive "cognitive collapse" under sequence pressure, dropping to near 0% accuracy when forced to override text-reading.
https://t.co/oaJfRT7Zo7
#LLMs #Cognitive #Science
#CognitiveScience #Neuroscience #ArtificialIntelligence #StroopTask #ExecutiveControl #BiologicalAttention
Building autonomous agents for scientific discovery? 🧬🤖
@GoogleDeepMind Science Skills is now available on GitHub. We've open-sourced this specialized toolkit to accelerate your agentic workflows with scientific grounding and higher token efficiency.
Download now ↓
https://t.co/cwp1HOeKvo
I love AI. I love having to re-read every single sentence on the internet twice to discern if I am getting the advice and experience of a living, breathing human being, or the sludge of a demonic greed machine engineered by 300 losers in San Francisco to steal my money. Fun !
This Executive Order is an important step in strengthening America’s leadership in AI.
We look forward to collaborating with the White House to support its implementation.
https://t.co/ZwDimPrp3t
Interpreting law is one of the oldest jobs in the world. @MaxJunestrand, co-founder and CEO of @WeAreLegora, is bringing it into its next era with Claude.
His bet: every new model release raises the tide, and Legora is building the boats for everyone else.
In a new Stanford study, law professors by far preferred Gemini 2.5 Pro's responses over those written by their peers when they were unaware of who wrote the answers.