turns out AI models cannot do math.. even grade school math. the kind a 10-year-old solves.
Apple published a devastating study that exposes a massive illusion at the core of artificial intelligence.
they took the standard math benchmark (GSM8K) that every AI company uses to brag about how smart their model is.
first, they just changed the names in the word problems.. the models' performance fluctuated for no reason.
then, they changed the numbers. the performance immediately dropped.
but then they ran the test that broke everything.
they added one single, completely irrelevant sentence to the word problem. something like: "By the way, 5 of the apples were green."
A human 10-year-old ignores the green apples and solves the underlying math.
the AI didn't.
across every state-of-the-art model, performance collapsed by up to 65%.
the AI blindly grabbed the irrelevant number and tried to shove it into the equation. it didn't know why it was doing the math. it just saw a number and assumed it was supposed to use it.
there is no genuine logical reasoning happening under the hood.
we are deploying these systems to run our finances, analyze our legal documents, and make complex strategic decisions.
but the models don't actually understand the logic they are spitting out.
they just know what a smart answer is supposed to look like.
Turing suggested that a computer could be said to "think" if a human interrogator could not tell it apart, through conversation, from a human being. By and large, we've accepted that hypothesis for decades. Until now, we've had no way to verify its veracity. We were wrong
turns out AI models cannot do math.. even grade school math. the kind a 10-year-old solves.
Apple published a devastating study that exposes a massive illusion at the core of artificial intelligence.
they took the standard math benchmark (GSM8K) that every AI company uses to brag about how smart their model is.
first, they just changed the names in the word problems.. the models' performance fluctuated for no reason.
then, they changed the numbers. the performance immediately dropped.
but then they ran the test that broke everything.
they added one single, completely irrelevant sentence to the word problem. something like: "By the way, 5 of the apples were green."
A human 10-year-old ignores the green apples and solves the underlying math.
the AI didn't.
across every state-of-the-art model, performance collapsed by up to 65%.
the AI blindly grabbed the irrelevant number and tried to shove it into the equation. it didn't know why it was doing the math. it just saw a number and assumed it was supposed to use it.
there is no genuine logical reasoning happening under the hood.
we are deploying these systems to run our finances, analyze our legal documents, and make complex strategic decisions.
but the models don't actually understand the logic they are spitting out.
they just know what a smart answer is supposed to look like.
@HowToAI_ Turing suggested that a computer could be said to "think" if a human interrogator could not tell it apart, through conversation, from a human being. By and large, we've accepted that hypothesis for decades. Until now, we've had no way to verify its veracity. We were wrong
John is having a bad day.
He was buttoning his shirt, and a button fell off.
He picked up his briefcase, and the handle fell off.
As he went to open the door to leave the house, the doorknob fell off.
Now he has to pee...
A YouTuber with 110 million subscribers released a free version of ChatGPT.
His name is Felix Kjellberg. You know him as PewDiePie.
He spent his own money on a 10-GPU computer at home. He used it to run the same kind of AI models that power ChatGPT, but on his own hardware. Then he wrote his own app to chat with them, because the apps that already exist were not good enough.
Then he gave it away for free. Anyone can download it. Anyone can change it. Anyone can run it.
It's called Odysseus.
It runs on your computer. Your data stays on your disk. No account. No tracking. No monthly fee.
What you get:
- A chat window like ChatGPT
- An AI assistant that can browse the web, read your files, and do tasks for you
- A tool that scans your computer and tells you which AI models will work on it
- A research mode that reads many websites and writes you a report
- A side-by-side mode to test two AI models on the same question
- A writing editor where AI helps you, instead of writing for you
- Memory, so the AI remembers your past chats
- Email with AI that sorts your inbox and writes replies for you
- Notes, a to-do list, and a calendar
- Works on your phone too
23,612 stars on GitHub in 2 days. Top of trending all weekend.
ChatGPT Plus costs $20 a month. Claude Pro costs $20 a month. PewDiePie's version costs nothing, runs on your own computer, and the code is open for anyone to read.
This is what AI looked like before the subscription model.
(Link in the comments)
@ReichlinMelnick Ohhhh ... I dunno. Maybe cuz, in my area in the past 40 years the median wage has tripled. Over the same period, housing has seen a 10-fold increase (rentals a similar increase).
https://t.co/8FyfEdnHE1
I had 20 patients in my Pediatric War Injuries Clinic today, for follow-up care.
Of the 20 children, Israel had killed one or both parents of 19 of them.
I swear to God we are living in Idiocracy. Everyday I think 'This can't be real.' But it is.
You couldn't write a satirical movie like the insane shitshow we are being forced to live through in real life.
This was HEARTBREAKING
Dr. Tanya Haj-Hassan, who volunteered in Gaza, exposed Israel:
"I held a lifeless child in my arms. There was no equipment to save him. This is not a war; it is a massacre of the innocent."
Yesterday, the IDF seized 4 students from their homes in the West Bank, including 20-year-old American, Sama Safi.
The Israeli govt didn’t tell her family or the U.S. Embassy where or why she was being taken & is holding her without charges.
America must secure her release NOW.
Jewish groups are now boasting more explicitly than ever that they will use Jewish wealth to destroy any US politician who refuses to "stand with" Israel: as they did to Massie.
But if you observe the same exact thing in order to criticize it, you're instantly branded a bigot.