i feel like nicole kidman in AMC, this is the shit we come to twitter dot com for! to ask whether white people wash their legs, how a xenomorph would dress, whether she fucks the text man for texts. we come to this place for…. magic.
Pusha T talks about the greatness of JAY-Z for The New York Times’ 30 Greatest Living American Songwriters feature:
"There haven’t been many people who can speak for a generation, and speak to the mentality of what Black youth was going through, for everyone who indulged in the allure of street culture. He gave us such a tutorial, his whole career, about street life, drug culture, luxury, the pitfalls as well as the floss. And then he took “Allure,” from “The Black Album,” and basically admitted, Man, I fell victim to the game. It is just the best representation of a rap artist speaking to his demographic, which was a lot of the kids exposed to the crack era from every angle. Everybody who was playing in that world, you know, we all had moments of clarity. Regardless of whether it was an opportunity in music, a near-death experience of a friend or yourself, a run-in with the law — everybody had this feeling and told themselves, That’s it. And the game called them right back. That chorus and bridge really captured the feelings of anyone living that life. And the hook spoke to a level of admission of, like, I know I’m doing wrong. I see clearly. I’m over it. I’m done. But everybody, you know, folded and ran back, like an addict.
He’s talking about a real experience, and his mission is to articulate, in the best possible way, his feelings at the time. When you draw from real experience, there’s a level of passion that comes across, I’ve always felt like that was something he did very well. Even his more commercial records, they always still carried a heavy weight of lyricism. “N****s in Paris,” “Hard Knock Life,” “Otis” — these are all hit singles, and his verses carry the weight of mixtape verses.
One of his best performances, to me, is “Hovi Baby.” Lyrically it is by far one of the best [expletive]-talking, acrobatic, philosophizing, I mean, come on. Listen: At that point I was scared of Jay-Z. This is another stride of lyricism, philosophy, I’m-the-best braggadocio, bravado. And he’s, like, tap-dancing all over this beat. Later on in the song, he starts talking about how he’s chasing the snare around and he’s actually doing it. To me, that was a Super Saiyan moment. “Hovi Baby” scared the hell out of me."
Colombian officials have authorized a plan to cull dozens of hippos roaming freely through a region in the center of the country, where they threaten villagers and displace native species years after notorious drug lord Pablo Escobar brought in the first ones. https://t.co/L0YjvRnPAT
🚨SHOCKING: Apple just proved that AI models cannot do math. Not advanced math. Grade school math. The kind a 10-year-old solves.
And the way they proved it is devastating.
Apple researchers took the most popular math benchmark in AI — GSM8K, a set of grade-school math problems — and made one change. They swapped the numbers. Same problem. Same logic. Same steps. Different numbers.
Every model's performance dropped. Every single one. 25 state-of-the-art models tested.
But that wasn't the real experiment.
The real experiment broke everything.
They added one sentence to a math problem. One sentence that is completely irrelevant to the answer. It has nothing to do with the math. A human would read it and ignore it instantly.
Here's the actual example from the paper:
"Oliver picks 44 kiwis on Friday. Then he picks 58 kiwis on Saturday. On Sunday, he picks double the number of kiwis he did on Friday, but five of them were a bit smaller than average. How many kiwis does Oliver have?"
The correct answer is 190. The size of the kiwis has nothing to do with the count.
A 10-year-old would ignore "five of them were a bit smaller" because it's obviously irrelevant. It doesn't change how many kiwis there are.
But o1-mini, OpenAI's reasoning model, subtracted 5. It got 185.
Llama did the same thing. Subtracted 5. Got 185.
They didn't reason through the problem. They saw the number 5, saw a sentence that sounded like it mattered, and blindly turned it into a subtraction.
The models do not understand what subtraction means. They see a pattern that looks like subtraction and apply it. That is all.
Apple tested this across all models. They call the dataset "GSM-NoOp" — as in, the added clause is a no-operation. It does nothing. It changes nothing.
The results are catastrophic.
Phi-3-mini dropped over 65%. More than half of its "math ability" vanished from one irrelevant sentence.
GPT-4o dropped from 94.9% to 63.1%.
o1-mini dropped from 94.5% to 66.0%.
o1-preview, OpenAI's most advanced reasoning model at the time, dropped from 92.7% to 77.4%.
Even giving the models 8 examples of the exact same question beforehand, with the correct solution shown each time, barely helped. The models still fell for the irrelevant clause.
This means it's not a prompting problem. It's not a context problem. It's structural.
The Apple researchers also found that models convert words into math operations without understanding what those words mean. They see the word "discount" and multiply. They see a number near the word "smaller" and subtract. Regardless of whether it makes any sense.
The paper's exact words: "current LLMs are not capable of genuine logical reasoning; instead, they attempt to replicate the reasoning steps observed in their training data."
And: "LLMs likely perform a form of probabilistic pattern-matching and searching to find closest seen data during training without proper understanding of concepts."
They also tested what happens when you increase the number of steps in a problem. Performance didn't just decrease. The rate of decrease accelerated. Adding two extra clauses to a problem dropped Gemma2-9b from 84.4% to 41.8%. Phi-3.5-mini from 87.6% to 44.8%. The more thinking required, the more the models collapse.
A real reasoner would slow down and work through it. These models don't slow down. They pattern-match. And when the pattern becomes complex enough, they crash.
This paper was published at ICLR 2025, one of the most prestigious AI conferences in the world.
You are using AI to help you make financial decisions. To check legal documents. To solve problems at work. To help your children with homework. And Apple just proved that the AI is not thinking about any of it. It is pattern matching. And the moment something unexpected shows up in your question, it breaks. It does not tell you it broke. It just quietly gives you the wrong answer with full confidence.
We are excited to announce that we have been granted marketing rights in Italy 🇮🇹
Through the @NFL Global Markets Program, we are eager to connect with the Italian community, support American football development from youth to elite levels, and grow the Browns global fanbase 🗺️
What Jai Lucas has done in his 1st year at Miami is REMARKABLE.
The Hurricanes went 7-24 a season ago.
Miami just won its 26th game of the year in the NCAA Tournament.
That 19-win improvement ties the BIGGEST win total turnaround in D-I men’s college hoops history.