How much p-hacking is happening among AI benchmark makers?
I.e., coming up with 100 model capability tests, and then only publishing the results that show a nice upward slope matching our intuitions about improving strength over time?
Hey @AnthropicAI, I am so SO SOOO frustrated!! How do I escalate to a human agent? Fin is not helping and neither is Claude itself.
I tried to upgrade to Pro in February through https://t.co/MVgnhLiqFm but was told the billing details didn't go through...
AI has an insane rate of progress when we observe what it does for jobs that take place on screens.
Progress is slower for jobs in the world of atoms.
Since the intellectual class spends all its time on screens, it's easy to think "AI will do all the jobs!"
We are like 15th century farmers thinking about mechanization. If you mechanized agriculture, what other jobs could there even be?
Perhaps employment in the on-screen sector will shrink to a small percentage.
Of course stuff on screens affects things in the real world too. But the rate of progress is different. On screens (i.e. in simulation), the self-driving car problem has been solved for many years. IRL, we are still years and years away from replacement of human drivers.
I think one reason for divergent predictions on how much AI will matter is that the "6% growth" AI experts are extrapolating from stuff on screens to stuff with atoms, while the "3% growth" economists are not.
2200 days ago, I caught COVID for the first time.
I was not vaccinated, as the virus had just arrived and there was no vaccine yet. It was a mild infection, so I continued running. Nobody could have told me then that I would develop Long Covid and cancer, and lose my life to it.
@leightjessica I’m interested in something like this too and will be following. I’m hardly a power user and I just got started, but I will say that I’ve learned a lot just by playing around with it. I created a separate folder, copied data into it, and just started asking it to do things.
@Coach_Yac Shanahan should have just kept it simple and stuck with what has been working for 6 games. He was running scared of this Seattle defense from the first play of the game.
Turn 36 today & I’ve spent my entire thirties with #LongCovid. When I turned 30 I was biking to work 10 miles a day, & since my mild covid infection — I struggle to walk more than a block or work more than a few hours a week.
Here’s 4 things I wish everyone knew:
#49ers K Jake Moody has made 11 of his last 21 field goal attempts in the regular season.
He's out on the field warming up as I type this, but it's simply not anywhere close to good enough.