Christopher L. @LionLambe - Twitter Profile

3 months ago

Occasionally I accidentally come on X instead of Bluesky out of old habit. I've been here 5 minutes and sweet Jesus is this place a cesspool.

0

35

Christopher L. @LionLambe

3 months ago

@kim_X_artist @FoodProfessor Yes because Galen Weston doesn't currently control what goes into stores already. How does the large corporate cock taste Kim?

0

1

0

14

Christopher L. @LionLambe

3 months ago

@covidanger2020 @FoodProfessor Yes Argentina known for it's... Liberal democracy... lol

1

0

7

Christopher L. @LionLambe

3 months ago

@TruthLion100 @FoodProfessor The more nonsense social media you take in the more you think it isn't.

0

2

Who to follow

Jennifer Hollett

@jenniferhollett

Media, tech, & politics. Executive Director @TheWalrus Before that, news/gov @Twitter On here, wondering why I'm on here. Also: https://t.co/ic6hjIMYRx

Proudly delivering vital services to Torontonians in partnership with the City since 1917. Follows/retweets ≠ political endorsements

Christopher L. @LionLambe

3 months ago

@JustinCalow @FoodProfessor How does Galen Westons cock taste?

0

9

Christopher L. @LionLambe

3 months ago

@ErosVilleneuve @FoodProfessor @TruthLion100 O no, not the dreaded affordable groceries. O lord Galen please stick your cock further down this man's throat so he can gurgle it harder.

0

6

Christopher L. @LionLambe

3 months ago

@laprairie @FoodProfessor Things like this and Canada Post aren't supposed to be businesses, governments don't run businesses, they are supposed to run services.

0

1

0

21

Christopher L. @LionLambe

3 months ago

@BTozzio @FoodProfessor Yeah so horrible to use tax funds to actually benefit citizens vs. bailing out corporations. Suck harder.

0

10

LionLambe retweeted

Abdul Șhakoor

@abxxai

3 months ago

BREAKING: 🚨 Someone just tested 35 AI models across 172 billion tokens of real document questions. The hallucination numbers should end the "just give it the documents" argument forever. Here is what the data actually showed. The best model in the entire study, under perfect conditions, fabricated answers 1.19% of the time. That sounds small until you realize that is the ceiling. The absolute best case. Under optimal settings that almost no real deployment uses. Typical top models sit at 5 to 7% fabrication on document Q&A. Not on questions from memory. Not on abstract reasoning. On questions where the answer is sitting right there in the document in front of it. The median across all 35 models tested was around 25%. One in four answers fabricated, even with the source material provided. Then they tested what happens when you extend the context window. Every company selling 128K and 200K context as the hallucination solution needs to read this part carefully. At 200K context length, every single model in the study exceeded 10% hallucination. The rate nearly tripled compared to optimal shorter contexts. The longer the window people want, the worse the fabrication gets. The exact feature being sold as the fix is making the problem significantly worse. There is one more finding that does not get talked about enough. Grounding skill and anti-fabrication skill are completely separate capabilities in these models. A model that is excellent at finding relevant information in a document is not necessarily good at avoiding making things up. They are measuring two different things that do not reliably correlate. You cannot assume a model that retrieves well also fabricates less. 172 billion tokens. 35 models. The conclusion is the same across all of them. Handing an LLM the actual document does not solve hallucination. It just changes the shape of it.

abxxai's tweet photo. BREAKING: 🚨 Someone just tested 35 AI models across 172 billion tokens of real document questions.

The hallucination numbers should end the "just give it the documents" argument forever.

Here is what the data actually showed.

The best model in the entire study, under perfect conditions, fabricated answers 1.19% of the time. That sounds small until you realize that is the ceiling. The absolute best case. Under optimal settings that almost no real deployment uses.

Typical top models sit at 5 to 7% fabrication on document Q&A. Not on questions from memory. Not on abstract reasoning. On questions where the answer is sitting right there in the document in front of it.

The median across all 35 models tested was around 25%.

One in four answers fabricated, even with the source material provided.

Then they tested what happens when you extend the context window. Every company selling 128K and 200K context as the hallucination solution needs to read this part carefully.

At 200K context length, every single model in the study exceeded 10% hallucination. The rate nearly tripled compared to optimal shorter contexts.

The longer the window people want, the worse the fabrication gets. The exact feature being sold as the fix is making the problem significantly worse.

There is one more finding that does not get talked about enough.

Grounding skill and anti-fabrication skill are completely separate capabilities in these models.

A model that is excellent at finding relevant information in a document is not necessarily good at avoiding making things up. They are measuring two different things that do not reliably correlate. You cannot assume a model that retrieves well also fabricates less.

172 billion tokens. 35 models. The conclusion is the same across all of them.

Handing an LLM the actual document does not solve hallucination. It just changes the shape of it.

261

5K

1K

3K

478K

LionLambe retweeted

Shayan Sardarizadeh

@Shayan86

3 months ago

An ongoing military investigation has determined that the United States is responsible for a deadly Tomahawk missile strike on an Iranian elementary school, according to US officials and others familiar with the preliminary findings. https://t.co/C8mmufE6I8

13

258

111

18

17K

LionLambe retweeted

Center on Conscience & War @CCW4COs

4 months ago

Service members: If you agree Trump’s war on Iran is wrong, you don’t have to participate. We can help you explore your options. Call 1-800-379-2679

CCW4COs's tweet photo. Service members: If you agree Trump’s war on Iran is wrong, you don’t have to participate. We can help you explore your options. Call 1-800-379-2679 https://t.co/LOAA7k9ElX

412

17K

7K

2K

3M

LionLambe retweeted

Matthew Hoh

@MatthewPHoh

4 months ago

If you are or know a servicemember, including reserves and National Guard, who are concerned about their options in not taking part in a war they feel is illegal or immoral, please contact @CCW4COs or @girights. Please share.

18

2K

854

70

48K

LionLambe retweeted

Linus ✦ Ekenstam

@LinusEkenstam

4 months ago

Who is building a data transfer tool to move all your historical data from ChatGPT to Claude?

131

3K

151

434

186K

Christopher L. @LionLambe

4 months ago

AI was supposed to make the world a better place, instead it is being used for War.

Sam Altman

@sama

4 months ago

Tonight, we reached an agreement with the Department of War to deploy our models in their classified network. In all of our interactions, the DoW displayed a deep respect for safety and a desire to partner to achieve the best possible outcome. AI safety and wide distribution of benefits are the core of our mission. Two of our most important safety principles are prohibitions on domestic mass surveillance and human responsibility for the use of force, including for autonomous weapon systems. The DoW agrees with these principles, reflects them in law and policy, and we put them into our agreement. We also will build technical safeguards to ensure our models behave as they should, which the DoW also wanted. We will deploy FDEs to help with our models and to ensure their safety, we will deploy on cloud networks only. We are asking the DoW to offer these same terms to all AI companies, which in our opinion we think everyone should be willing to accept. We have expressed our strong desire to see things de-escalate away from legal and governmental actions and towards reasonable agreements. We remain committed to serve all of humanity as best we can. The world is a complicated, messy, and sometimes dangerous place.

15K

34K

4K

14K

38M

0

10

LionLambe retweeted