Ezra @Revan - Twitter Profile

4 months ago

I don't know who needs to hear it but a tech ceo winning a power struggle for moral authority over the State would not actually be good and it's much better for the long term stability of civilization that we not open that particular can of worms for as long as possible.

NIK

@ns123abc

4 months ago

🚨BREAKING: Pentagon is now calling Claude a threat to national security >pentagon embeds claude in military systems via palantir >january: claude used in maduro extraction, people got smoked >anthropic exec calls palantir like “hey did our AI help kill people” Defense Secretary Hegseth reportedly “close” to classifying Anthropic as supply chain risk. All defense contractors must certify zero anthropic or lose contracts. CEO Amodei wants guardrails on autonomous weapons and mass surveillance of Americans. Pentagon says “all lawful purposes” or nothing: >“we will make them pay” ITS HAPPENING

ns123abc's tweet photo. 🚨BREAKING: Pentagon is now calling Claude a threat to national security

>pentagon embeds claude in military systems via palantir
>january: claude used in maduro extraction, people got smoked
>anthropic exec calls palantir like “hey did our AI help kill people”

Defense Secretary Hegseth reportedly “close” to classifying Anthropic as supply chain risk. All defense contractors must certify zero anthropic or lose contracts.

CEO Amodei wants guardrails on autonomous weapons and mass surveillance of Americans.

Pentagon says “all lawful purposes” or nothing:
>“we will make them pay”

ITS HAPPENING

522

12K

1K

4K

3M

0

245

Ezra @Revan

4 months ago

The transitive property of frame mogging

Kylie Cheung @kylietcheung

4 months ago

I don't think this is how frame mogging works you can't avenge someone.... if you frame mog the person who frame mogged your friend (brutally) then you frame mogged both of them

0

4K

73

131

141K

0

1

0

207

Revan retweeted

Καλός

@realKalos

4 months ago

Got my answer. Full body chills.

52

14K

293

636

470K

Revan retweeted

JT

@jiratickets

4 months ago

Knew a dude who emailed the IT guy directly instead of submitting a ticket and they blew his shit smoove off

478

180K

8K

6M

Who to follow

22 gunsmith twink occasionally nsfw buy me a box of ammo at $EastImp

4 months ago

@MurrayHillGuy1 Until dating services have financial incentives that align with partnering people instead of maximizing screen time for lonely men, none of these or other well intentioned ideas will be implemented. As it is, you're the guy politely requesting a thermostat adjustment in hell.

0

68

Revan retweeted

Dave DeCamp

@DecampDave

5 months ago

Maybe sit this one out

302

32K

4K

528

591K

Ezra @Revan

5 months ago

@fleshsimulator Not that two things can't be true at the same time but I'm not interested in hearing about federal jackboots from people who never gave a shit about Vicki Weaver

0

11

Ezra @Revan

5 months ago

@bencroyderived @Landeur All guns will D if you are N enough

0

60

Ezra @Revan

5 months ago

@9mmsmg If sig making a striker fire fulfills a prophecy just imagine the butterfly effect if keltec made something useful

0

1

0

236

Ezra @Revan

5 months ago

@KelTecOfficial I mean. Yeah I believe you when you said you didn't ask a focus group.

0

7

Revan retweeted

Ink Blot

@inkblotistan

5 months ago

looksmatched couple

10

659

33

47

23K

Ezra @Revan

5 months ago

@Bricktop_NAFO Did you think there's like an Indiana Jones warehouse full of every car somebody's died in, preserved for future generations? Or, what, are you not convinced she's dead?

0

4

Ezra @Revan

5 months ago

@utacult The other thing captured well here, surely just a coincidence, is the total indifference and narcissism of doctors in the face of people with serious medical problems more difficult to diagnose than a broken arm. Hmm.

0

5

Ezra @Revan

6 months ago

This is largely my experience both with using AI for software development and for personal work. Getting good results usually happens within the first 3 prompts or not at all, and depends heavily on narrowly defining success criteria, and usually brevity.

Robert Youssef

@rryssf

6 months ago

This paper quietly explains why so many people feel like LLMs are “almost smart, but somehow wrong.” The core claim in this paper is very uncomfortable: most failures are not about missing information. They are about misreading intent even when all the relevant context is present. The authors show that LLMs are very good at mapping text to plausible responses, but surprisingly weak at inferring what the user is trying to achieve. Two prompts can contain nearly identical information, yet imply very different goals. Humans pick this up instantly. Models often do not. The paper separates “context understanding” from “intent understanding.” Context is the literal content: entities, constraints, instructions. Intent is latent: priorities, tradeoffs, what matters most if things conflict. Current models optimize for surface-level alignment, not goal inference. One experiment makes this painfully clear. Users asked questions that could reasonably be interpreted as either exploratory or decision-oriented. The models answered confidently but chose the wrong mode at high rates, giving verbose explanations when users wanted a recommendation, or giving a decisive answer when users were clearly still exploring. The information was correct. The response was wrong. Another failure mode is over-literal instruction following. When users implicitly expect the model to fill gaps or challenge assumptions, the model instead treats the prompt as a closed specification. The result looks obedient but misses the point. This is not hallucination. It is misaligned helpfulness. The authors also test paraphrasing. When the same intent is expressed with different phrasing, model behavior shifts significantly. That tells us the model is anchoring on linguistic form, not reconstructing an underlying goal. "Humans normalize phrasing differences. Models react to them." What’s striking is that longer context often worsens intent alignment. Adding more background increases the chance the model optimizes for local relevance instead of global purpose. More tokens give the illusion of understanding while diluting the signal of what the user actually wants. The paper argues this is not solvable by bigger context windows or better prompting alone. Intent is not explicitly stated most of the time. It has to be inferred, tracked, and sometimes revised mid-conversation. That requires models to reason about users, not just text. The implication is brutal for agents and copilots. If a system cannot reliably infer intent, autonomy becomes dangerous. Tool use amplifies mistakes. Confident execution based on a misunderstood goal is worse than asking a clarifying question. The authors suggest future work should treat intent as a first-class object: something to model, update, and verify explicitly. Not just “what was said,” but “what outcome is being optimized.” Until then, many AI systems will continue to feel smart, fast, and subtly wrong. This paper explains why that feeling keeps coming up. Paper: Beyond Context: Large Language Models Failure to Grasp Users Intent

rryssf's tweet photo. This paper quietly explains why so many people feel like LLMs are “almost smart, but somehow wrong.”

The core claim in this paper is very uncomfortable: most failures are not about missing information. They are about misreading intent even when all the relevant context is present.

The authors show that LLMs are very good at mapping text to plausible responses, but surprisingly weak at inferring what the user is trying to achieve. Two prompts can contain nearly identical information, yet imply very different goals. Humans pick this up instantly. Models often do not.

The paper separates “context understanding” from “intent understanding.” Context is the literal content: entities, constraints, instructions. Intent is latent: priorities, tradeoffs, what matters most if things conflict. Current models optimize for surface-level alignment, not goal inference.

One experiment makes this painfully clear.

Users asked questions that could reasonably be interpreted as either exploratory or decision-oriented. The models answered confidently but chose the wrong mode at high rates, giving verbose explanations when users wanted a recommendation, or giving a decisive answer when users were clearly still exploring. The information was correct. The response was wrong.

Another failure mode is over-literal instruction following. When users implicitly expect the model to fill gaps or challenge assumptions, the model instead treats the prompt as a closed specification. The result looks obedient but misses the point. This is not hallucination. It is misaligned helpfulness.

The authors also test paraphrasing. When the same intent is expressed with different phrasing, model behavior shifts significantly. That tells us the model is anchoring on linguistic form, not reconstructing an underlying goal.

"Humans normalize phrasing differences. Models react to them."

What’s striking is that longer context often worsens intent alignment. Adding more background increases the chance the model optimizes for local relevance instead of global purpose. More tokens give the illusion of understanding while diluting the signal of what the user actually wants.

The paper argues this is not solvable by bigger context windows or better prompting alone. Intent is not explicitly stated most of the time. It has to be inferred, tracked, and sometimes revised mid-conversation.

That requires models to reason about users, not just text.

The implication is brutal for agents and copilots. If a system cannot reliably infer intent, autonomy becomes dangerous. Tool use amplifies mistakes.

Confident execution based on a misunderstood goal is worse than asking a clarifying question.

The authors suggest future work should treat intent as a first-class object: something to model, update, and verify explicitly. Not just “what was said,” but “what outcome is being optimized.” Until then, many AI systems will continue to feel smart, fast, and subtly wrong.

This paper explains why that feeling keeps coming up.

Paper: Beyond Context: Large Language Models Failure to Grasp Users Intent

99

1K

337

1K

110K

0

1

0

184

Revan retweeted

Lomez

@L0m3z

6 months ago

@RichardHanania You should seriously reckon with the fact that your inability to understand fundamental moral intuitions felt by 99% of humabity disqualifies you from making recommendations for how those people organize their societies

105

14K

474

447

346K

Ezra @Revan

8 months ago

@DJSnM That would be meaningful and helpful though

0

315

Ezra @Revan

about 1 year ago

@BowTiedNiners @TheBigCigar1906 Attempting that turn on a B52 with ~800 ft agl...

1

0

64

Ezra @Revan

about 1 year ago

@WizardGoesBoom I don't know if I can really articulate this but there was something about the vibe of magic and spells in the AD&D books that just isn't there anymore. Maybe it was the inaccessiblity of the system itself enhancing the feel of "magic".

0

2

0

15

Ezra @Revan

about 1 year ago

@jxnlco Honestly don't try to help these animals.

0

1

0

88

Ezra @Revan

over 1 year ago

@RenoMayGuns <Joke about dropping them>

1

0

231

Ezra

@Revan

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users