Yashmeet Gambhir @YashG98 - Twitter Profile

YashG98 retweeted

9 months ago

IMO — this paper misses the core driver of hallucinations A LLM with a billion neurons is like a billion tiny databases — database per neuron When you prompt it, the LLM looks in all the databases (i.e. neurons) for patterns it recognizes For example, when you prompt "Kim Kardashian is dating ..." The LLM looks in its billions of little hash tables and pulls out patterns: - vocabulary (words like Kim, instagram, etc.) - grammar (subjects -> verb -> object) - semantics (Kim Kardashian's known associates) But here's the problem.... when you prompt it for something unfamiliar, the LLM still recognizes some patterns (e.g. good grammar) - vocabulary (words like Kim, instagram, etc.) - grammar (subjects -> verb -> object) But if it doesn't find all the right cache entries: - semantics (Kim Kardashian's known associates) - date ranges (maybe she dated different people at different times) Then the LLM will make next-token predictions based on the hash-hits it found... but without the benefit of the hash-misses it lacks. So to return to the prompt: "Kim Kardashian is dating ..." - Grammar patterns: the next token will be a noun - Semantic patterns: the next token will be a first name (because "is dating" is usually followed by a name) - Gender pattern: the next token will be a male - Relationship patterns: the next token will be a male Kim is associated with a lot ... but if it can't find the hash-hit in its internal neuraons for the SPECIFIC male she's dating... it can hit on other things.... like - generic male names - males who appear in articles with Kim - other grammatically correct words like "no-one" We call this a hallucination, but IMO it's closer to a cache miss. So how do you solve hallucination? This paper from OpenAI suggests that we solve hallucination by putting "I don't know" in a bunch of the databases. But this isn't how you solve for cache misses — this is just how you create more cache hits of a certain type. If you had a database which was returning erroneous results, would you *fill* the database with "I don't know" entries???... On the one hand, that WOULD increase the chances that the erroneous result was "I don't know"... so you'd make some partial progress at a surface level. But IMO it's not solving the underlying problem... which is closer to detecting the sources/datapoints used for each prediction (MoE, RAG, etc. are making progress on this). IMO - a more fundamental solution would involve solving attribution-based control (link below)

47

721

80

602

100K

Yashmeet Gambhir @YashG98

9 months ago

@john_calso_ @orchard_robots Let's goo John this is sick, congrats!

0

1

0

60

YashG98 retweeted

Owain Evans

@OwainEvans_UK

over 1 year ago

Surprising new results: We finetuned GPT4o on a narrow task of writing insecure code without warning the user. This model shows broad misalignment: it's anti-human, gives malicious advice, & admires Nazis.  This is *emergent misalignment* & we cannot fully explain it 🧵

OwainEvans_UK's tweet photo. Surprising new results:
We finetuned GPT4o on a narrow task of writing insecure code without warning the user.
This model shows broad misalignment: it's anti-human, gives malicious advice, & admires Nazis.
 This is *emergent misalignment* & we cannot fully explain it 🧵 https://t.co/kAgKNtRTOn

427

7K

938

4K

2M

Yashmeet Gambhir @YashG98

about 2 years ago

@alexalbert__ Would like to hear more about Anthropic's preparation for global elections -- specifically what harms are anticipated and what evaluations are most relevant

0

1

0

67

Who to follow

Part Time Doctor👨🏻‍⚕️, Full Time Michigan 〽️ and Detroit sports fan #GoBlue

RGB

@shazalkhan24

WB ‘17 WSU ‘21 💰

Yashmeet Gambhir @YashG98

about 4 years ago

@byseanstapleton @a16z @neo @lachygroom Congrats! Super exciting work 🎉

0

YashG98 retweeted

Tim Urban

@waitbutwhy

over 4 years ago

Go hug someone

177

13K

3K

897

0

Yashmeet Gambhir @YashG98

about 5 years ago

@UMichBlockchain @a16z @amico_jeffrey @CalBlockchain @HarvardLawBFI @PennBlockchain @ColumbiaCBA @StanfordCrypto So excited for you all! Congrats 🎊

1

0

YashG98 retweeted

Michigan Blockchain

@MichBlockchain

about 5 years ago

Ecstatic to announce that @a16z is delegating 2.5M $UNI and 50K $COMP to Blockchain at Michigan! We look forward to advancing the future of DeFi with some of our favorite friends @amico_jeffrey @CalBlockchain @HarvardLawBFI @PennBlockchain @ColumbiaCBA @StanfordCrypto and more

9

106

18

8

0

Yashmeet Gambhir @YashG98

about 5 years ago

@thebrunchguy Feel like current social media enables connection, but incentivizes consumption :(. Related to your pinned tweet

0

1

0

YashG98 retweeted

Barack Obama

@BarackObama

about 6 years ago

As millions of people across the country take to the streets and raise their voices in response to the killing of George Floyd and the ongoing problem of unequal justice, I’ve heard many ask how we can sustain momentum to bring about real change.

13K

670K

117K

4K

0

Yashmeet Gambhir @YashG98

over 6 years ago

@kobroskys Get 1 or 2 people to join you in one stack in Hatcher 😂

1

0

YashG98 retweeted

UMich Running Club @MRun

over 6 years ago

FROM 3-4 PM THE #GivingBlueday TWEET WITH THE MOST RETWEETS WINS $1,000!!! Help MRun out and give this a retweet so we can pay for #NircaNats and keep the club affordable for everyone!!

22

54

123

0

YashG98 retweeted

Michigan Blockchain

@MichBlockchain

over 7 years ago

Michigan team specialist David Sun shares his experience at SF Blockchain Week, from Dharma’s view on debt to Weyl’s fireside chat! https://t.co/CCqmEM1ED2

0

7

2

0

YashG98 retweeted

Shehryar @Shehryar_Ahmed_

almost 8 years ago

After all my hard work of how I wanted to start my clothing line, after all the crap people gave me, I am proud to announce my clothing line @sher__rag will be dropping September 15. Thank you everyone for supporting through this long journey.