And I itโs not just YMYL info. We found AI was wrong 40% of the time on facts about ANY brand. Thatโs why we built ArcAI Accuracy, the only AI accuracy checker battle-tested by top enterprise companies. Learn more at https://t.co/FuenbBvCXo
Researchers at EPFL proved your AI is lying to you.
Not sometimes. Most of the time.
They built one of the hardest hallucination tests ever made with Max Planck Institute. 950 questions. Four domains where being wrong actually hurts. Legal. Medical. Research. Coding.
Then they ran every top model on it.
The results.
GPT-5. Wrong 71.8% of the time.
Claude Opus 4.5. Wrong 60% of the time.
Gemini 3 Pro. Wrong 61.9% of the time.
DeepSeek Reasoner. Wrong 76.8% of the time.
These are the smartest AI models on Earth. The ones you trust with your career. Your health. Your money.
You think turning on web search fixes it.
It doesn't.
Claude Opus 4.5 with web search. Still wrong 30.2% of the time.
GPT-5.2 thinking with web search. Still wrong 38.2% of the time.
The internet attached. Still lying to you in 1 out of every 3 answers.
Now the part that should scare you.
Medical questions. The one place being wrong can kill you.
GPT-5 hallucinated 92.8% of the time on medical guidelines.
Claude Haiku 4.5 hallucinated 95.7% of the time.
Gemini 3 Flash hallucinated 89% of the time.
Nine out of ten medical answers from popular AI models. Wrong.
It gets worse.
The longer you talk to it, the more it lies.
Early mistakes cascade. The model starts citing its own earlier hallucinations as facts. Your third message is more wrong than your first.
The paper, in its own words: "hallucinations remain substantial even with web search."
This is what hundreds of millions of people are doing right now. Asking software that lies in the majority of its answers. About their health. About their job. About their legal case. About their code.
Most are not checking.
Most never will.
But please. Keep using ChatGPT for medical advice.
The doctors need a break.
https://t.co/dHBP5CDpTM
@NYTGames please please allow grace on streaks! Horrible to find a long streak broken because I was traveling and missed a day. Like 3 days, maybe a week, then if you complete it goes back in your streak.
Google dropping support for showing 100 results on the page with the URL parameter &num=100 https://t.co/8226x4sxxV via @tehseowner and @emvhaccuranker
BREAKING!
In the @seoClarity Research Grid, no matter which US ecomm domain we look at, we're seeing a dramatic rise in AIOs in transactional keywords at the same time the Product feature is starting to decline.
7 min video on what this means https://t.co/EIHOCPQ1G9
CONFIRMED: AI Overviews still almost entirely INFORMATIONAL in intent
We looked at over 1 million keywords from our @seoClarity Research Grid (US) and found that over 96% have informational user intent.
AIOs show in purely transactional queries only 1.2% of the time.
BREAKING!
In the @seoClarity Research Grid, no matter which US ecomm domain we look at, we're seeing a dramatic rise in AIOs in transactional keywords at the same time the Product feature is starting to decline.
7 min video on what this means https://t.co/EIHOCPQ1G9