Luisa Groher

@luisagroher

Denver, CO

Joined June 2013

886 Following

5.6K Followers

1.6K Posts

luisagroher retweeted

Kirk Borne

@KirkDBorne

5 days ago

[Download 585-page PDF eBook] Game Theory: https://t.co/DUxMDRc1Ll ————— #GameTheory #Gamification #Mathematics #Statistics #Probability

KirkDBorne's tweet photo. [Download 585-page PDF eBook]

Game Theory: https://t.co/DUxMDRc1Ll
—————
#GameTheory #Gamification #Mathematics #Statistics #Probability https://t.co/OuM1cwPhUw

730

149K

luisagroher retweeted

Yishan

@yishan

6 days ago

A big problem with research studies on AI models is that given how long the peer review process is, the results are always out-of-date by the time the paper is published. This time, we have something better! The typical reaction to research results like this roughly goes "You're just testing on old models. Today's models are way better and surely can do it now!" But the best solution is for these papers to also open-source all of their testing framework so that upon publication, others can reproduce their results, as well as run it on the newest models of the day - and into the future. After all, "this is the worst they'll ever be" so what really matters is determining when they DO pass the threshold. As it turns out, the authors of this paper DID open-source their evaluation framework! Here: https://t.co/iXLwmItKwu So I figured... let's re-run the tests on the latest models! Summary of our results are here: https://t.co/1Dzj0UcJUQ One drawback is that, unfortunately, the authors didn't (or weren't legally able to) open-source ALL the testing data, since apparently some of it is copyrighted by JAMA/NEJM etc. That's a separate problem with the medical research publishing industry for another time. However, we were able to reproduce the test on the public datasets they did include! First, we re-ran the same tests (as closely as we could) on the old models the paper claimed to use, in order to establish a baseline and determine how much "drift" there would be. (Answer: not too much) Then we ran those tests on the newest frontier models we could find. The results are: the most capable models today (GPT-5.5 Pro) did outperform the best models from before (79/100 vs 69/100), but did not improve enough to be considered sufficient for reliable medical use. In fact, the paper's criterion for "fit for reliable medical use" is more stringent, requiring the models to be robust under perturbation and bad data, knowing when to say there's not enough information, give clinically valid reasoning rather than hallucinations, etc. Those sound pretty reasonable to me. I wasn't able to reproduce that kind of qualitative evaluation, but even on the basic pass/fail test using public datasets of interpreting radiology images, the newest models are better, but not yet quite good enough. Nevertheless, I would like to praise the paper's authors for at least open-sourcing what they could, enabling me to (fairly quickly) attempt to reproduce their results. This is definitely a step in the right direction! While my reproduction wasn't able to be comprehensive, it certainly gave me useful directional info and - perhaps more importantly - allowed me (a random dude on the internet) to directly reproduce the results in their paper and validate them. I would like to encourage ALL authors of research papers on AI models to do similar open-sourcing of their experimental frameworks!

612

146

260K

luisagroher retweeted

elvis

@omarsar0

9 days ago

Most AI code review tools look at one repo at a time. But the bug usually isn't in the code that changed. It's in what that change quietly breaks three repos away. @QodoAI just shipped Cross Repo Review to solve this. I tested it on my own repos. Here's what it caught.

13K

Luisa Groher @luisagroher

12 days ago

@BillAckman lol, you are missing a lot.

Who to follow

Forward Party Colorado

@FWD_Colorado

Official Twitter account for the Forward Party of Colorado. Join us to help drive our democracy Forward!

Bbear1025

@bbear1025

Just a hard working, Constitution loving, family is my world!! #1A #2A #PureBlood #TrumpWon- Back after 743 days exile! No crypto, no DMs

OFFOR MBAH

@OFFORMBAH

Take Responsible for your actions.

Luisa Groher @luisagroher

15 days ago

@havivrettiggur This MOU makes sense when you consider that Marco Rubio is no Henry Kissinger

181

Luisa Groher @luisagroher

24 days ago

@VivianBercovici @MosabHasanYOSEF 💯 x 💯

luisagroher retweeted

Sara Hooker

@sarahookr

27 days ago

AutoScientist accelerates ML research and development to take days not months. This summer we will support builders releasing frontier models in 10 different domains ranging from medicine to underserved languages. All final models will be released to @huggingface and @kaggle 🔥

12K

luisagroher retweeted

Bojan Tunguz

@tunguz

about 2 months ago

Yes

luisagroher retweeted

Katherine Graham

@KateXGate

about 2 months ago

The System Cannot Validate Itself Gödel Part III Hilbert’s dream wasn’t just completeness. It was self-certainty. A closed loop. A formal system that could: generate truth, verify truth, and prove its own consistency from within itself. Gödel shattered that final hope. Any sufficiently powerful formal system cannot fully validate itself using only its own internal rules. That was the Second Incompleteness Theorem. And it changed far more than mathematics. ⸻ Gödel revealed something profound: Every system rests on assumptions it cannot fully justify from inside itself. Mathematics. Logic. Computation. Even physics. At some level, every framework eventually reaches: axioms, primitives, unprovable starting points. Not because humans are stupid. Because formal structure itself has limits. ⸻ This realization quietly shaped: theoretical computer science, cryptography, AI theory, proof verification, complexity theory, philosophy of mind, and modern debates about consciousness and computation. Alan Turing extended related ideas into computation itself: some problems are fundamentally undecidable. Not hard. Undecidable. ⸻ But Gödel is also widely misunderstood. He did not prove: “science is fake” “logic doesn’t work” “anything spiritual must be true” or that mathematics failed. In fact, mathematics became even stronger afterward. Gödel didn’t "destroy" formalism. He revealed its horizon. The map still works. It’s just not the totality of reality. ⸻ That may be Gödel’s deepest contribution: Not nihilism. Humility. The recognition that truth is always slightly larger than the systems we build to contain it.

KateXGate's tweet photo. The System Cannot Validate Itself
Gödel Part III

Hilbert’s dream wasn’t just completeness.

It was self-certainty. A closed loop.

A formal system that could:

generate truth,
verify truth,
and prove its own consistency from within itself.

Gödel shattered that final hope.

Any sufficiently powerful formal system cannot fully validate itself using only its own internal rules.

That was the Second Incompleteness Theorem.

And it changed far more than mathematics.

⸻

Gödel revealed something profound:

Every system rests on assumptions it cannot fully justify from inside itself.

Mathematics.
Logic.
Computation.
Even physics.

At some level, every framework eventually reaches:

axioms,
primitives,
unprovable starting points.

Not because humans are stupid.

Because formal structure itself has limits.

⸻

This realization quietly shaped:

theoretical computer science,
cryptography,
AI theory,
proof verification,
complexity theory,
philosophy of mind,
and modern debates about consciousness and computation.

Alan Turing extended related ideas into computation itself:
some problems are fundamentally undecidable.

Not hard.
Undecidable.

⸻

But Gödel is also widely misunderstood.

He did not prove:

“science is fake”
“logic doesn’t work”
“anything spiritual must be true”
or that mathematics failed.

In fact, mathematics became even stronger afterward.

Gödel didn’t "destroy" formalism.

He revealed its horizon.

The map still works.

It’s just not the totality of reality.

⸻

That may be Gödel’s deepest contribution:

Not nihilism.

Humility.

The recognition that truth is always slightly larger than the systems we build to contain it.

137

luisagroher retweeted

Gavin Brown

@gavinrbrown1

about 2 months ago

Gradient descent does not work. I will die on this hill.

241

336

471

339K

luisagroher retweeted

Big Brain AI

@realBigBrainAI

2 months ago

Meta's Chief AI Scientist Yann LeCun: building agentic systems on LLMs is a recipe for disaster.

150

253

762

287K

luisagroher retweeted

Sam Hogan 🇺🇸

@samhogan

2 months ago

All the best programmers I know are starting to write code by hand again

663

337

luisagroher retweeted

Justin Skycak

@justinskycak

2 months ago

The most dangerous form of laziness is performative productivity. Notes, tabs, highlights, summaries, plans. A whole pile of activity arranged to avoid direct contact with the work.

167K

Luisa Groher @luisagroher

3 months ago

I dont know that this needs a paper. it's pretty much common sense

Gary Marcus

@GaryMarcus

3 months ago

“The sharpest drop came from people who used the model for direct answers, not from those who used it more like a hint system, which suggests the real issue is not AI exposure itself but replacing effort with completion. The result is not that AI makes people less capable by default, but that answer outsourcing can shrink the mental effort that normally trains skill.” Even more results on the dangers of outsourcing your thinking to AI.

430

259

93K

luisagroher retweeted

Paula Breytenbach🇿🇦🇺🇸 @PolyannaBrey

3 months ago

National Geographic Award winning photograph of the year. So beautiful. 🐝🐝🐝

320

23K

395K

luisagroher retweeted

Bojan Tunguz

@tunguz

3 months ago

Infotainment has given us an illusion of learning. AI Agents are giving us an illusion of productivity.

110

luisagroher retweeted

Connor Shorten

@CShorten30

3 months ago

I learned a lot from our discussion of Reason-ModernColBERT and Reasoning-Intensive Retrieval 🧠 Firstly, check out the ReasonIR dataset from Meta if you haven't already! This is an incredible resource for training search models! 🛠️ Secondly, there are two things going on with Reasoning-Intensive Retrieval: you have complicated human questions, such as BRIGHT or FreshStack 🥞, and this is where the current focus mostly is. You also have the idea of searching with Agentic reasoning, such as Chain-of-Thought and so on. This is fairly new and I am super super excited about AgentIR from @zijian42chen et al. This will have a huge impact on search 👇

CShorten30's tweet photo. I learned a lot from our discussion of Reason-ModernColBERT and Reasoning-Intensive Retrieval 🧠

Firstly, check out the ReasonIR dataset from Meta if you haven't already! This is an incredible resource for training search models! 🛠️

Secondly, there are two things going on with Reasoning-Intensive Retrieval: you have complicated human questions, such as BRIGHT or FreshStack 🥞, and this is where the current focus mostly is.

You also have the idea of searching with Agentic reasoning, such as Chain-of-Thought and so on. This is fairly new and I am super super excited about AgentIR from @zijian42chen et al.

This will have a huge impact on search 👇

17K

luisagroher retweeted

Gary Marcus

@GaryMarcus

3 months ago

Either we accept this as a society, and set a precedent for allowing virtually all jobs to be replaced with almost no compensation. Or we speak up now. For artists. For writers. For musicians. For everybody.

456

131K

luisagroher retweeted

✒️

@Literariium

3 months ago

To improve your writing, read more. To improve your thinking, write more. To improve your storytelling, present more. To improve your energy, rest more. To improve your understanding, teach more. To improve your network, give more. To improve your happiness, appreciate more.

Literariium's tweet photo. To improve your writing, read more.
To improve your thinking, write more.
To improve your storytelling, present more.

To improve your energy, rest more.
To improve your understanding, teach more.

To improve your network, give more.
To improve your happiness, appreciate more. https://t.co/VmFAaTCtGu

171K

luisagroher retweeted

Rohan Paul

@rohanpaul_ai

3 months ago

Tim Ferriss (best known for his “4-Hour” book series) on how he uses AI: “I hesitate to use AI for anything I want to keep in my head.” Because AI doesn’t just assist — it can fully replace your thinking. The cost? Cognitive muscles atrophy fast.

178

113

29K

Luisa Groher

@luisagroher

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users