Christopher Porter @cpporter1000 - Twitter Profile

over 2 years ago

After reading the @nytimes lawsuit against @OpenAI and @Microsoft, I find my sympathies more with OpenAI and Microsoft than with the NYT. The suit: (1) Claims, among other things, that OpenAI and Microsoft used millions of copyrighted NYT articles to train their models (2) Gives examples in which OpenAI models regurgitated NYT articles almost verbatim But the presentation muddies (1) and (2), and I saw a lot of commentary on social media that -- because of what I believe is a muddied presentation -- draws a link between them that I'm not sure is what people think it is. On (1): I understand why media companies don't like people training on their documents, but believe that just as humans are allowed to read documents on the open internet, learn from them, and synthesize brand new ideas, AI should be allowed to do so too. I would like to see training on the public internet covered under fair use -- society will be better off this way -- though whether it actually is will ultimately be up to legislators and the courts. On (2): I suspect a lot of the examples of ChatGPT regurgitating articles nearly verbatim were due to a RAG-like mechanism where the user prompt causes the system to browse the web, retrieve a specific article and then print it out. (If my statement here isn't accurate, I would love to see the @nytimes clarify this.) If this is the case, then (i) To OpenAI's credit, they seem to have already updated their software to make this much less likely, and (ii) This is also a much easier problem to fix than if an LLM were to regurgitate text using only the pre-trained weights, which AFAIK very rarely happens (and which, given its rarity, also raises the question of how much harm to NYT this has actually caused). To be clear, I believe independent media is important for democracy and must be protected. I also sympathize with media businesses worried about Generative AI disrupting their businesses. But I'm not convinced the NYT lawsuit is the right way to do this. Usual caveat: I am not a lawyer and am not giving legal advice or any other form of advice here. You can also read more details of my take on this below. https://t.co/wkZSMHsvNA

257

3K

531

827

946K

cpporter1000 retweeted

Arvind Narayanan

@random_walker

over 2 years ago

A thread on some misconceptions about the NYT lawsuit against OpenAI. Morality aside, the legal issues are far from clear cut. Gen AI makes an end run around copyright and IMO this can't be fully resolved by the courts alone. (HT @sayashk @CitpMihir for helpful discussions.)

11

300

84

260

162K

cpporter1000 retweeted

Misch Strotz

@mitch0z

over 2 years ago

I invited Luxembourgish Artist Alain Welter to my home & showed him @letz_ai. The resulting video offers a glimpse into a traditional artist's first interaction with AI. If you're curious about AI, or if it seems daunting to you, this video is for you.

0

18

6

3

4K

Christopher Porter @cpporter1000

over 2 years ago

Anyone interested in learning more about generative AI? Fully online option available for folks outside of Des Moines. https://t.co/PQFqnQi2qi

0

125

Who to follow

Angeliki Koutsoukou-Argyraki

@AngelikiKoutso1

Mathematics, computer science and logic @RoyalHolloway @Cambridge_CL Other: art, philosophy, society. World citizen. Pacifist. Friend. Own views.

Jason Chen

@JZS_Chen

PMM for DevRel at Intuit. Knows a bit about the history, philosophy, and practices of mathematics, especially set theory. Onions my own.

David J. Webb

@DJWebbMath

Logician, musician, special edition. I run upper division math at Chaminade University, and study computability theory/LEAN when I can. Opinions my own. He/him.

cpporter1000 retweeted

Cameron Buckner @cameronjbuckner

over 2 years ago

https://t.co/enC3rrWqsL I have a systematic philosophical overview of the last ten years of research in deep learning (and recommendations for promising next targets) hopefully heading soon to a bookstore near you. Have a look at the table of contents if this might interest you.

6

74

20

26

9K

Christopher Porter @cpporter1000

almost 3 years ago

@kjoshanssen A close game?

0

1

0

51

Christopher Porter @cpporter1000

almost 3 years ago

So long Cal and Stanford.

1

3

0

230

cpporter1000 retweeted

Erik Brynjolfsson

@erikbryn

almost 3 years ago

Back when we were writing the Second Machine Age, @amcafee and I would play a game where we try to name an occupation that could not be done by AI or robotics. Barber seemed like a good candidate.

21

208

34

20

149K

Christopher Porter @cpporter1000

almost 3 years ago

@n_hold Did it look like this?

1

0

52

Christopher Porter @cpporter1000

about 3 years ago

@CMihut Congratulations, Cris! That's great news!

1

0

31

Christopher Porter @cpporter1000

about 3 years ago

Excited to be a part of this!

SC Iowa STEM Region @SC_Iowa_STEM

about 3 years ago

We are so excited for @MooreDMPS Day @DrakeUniversity! Tomorrow -- 10am-1pm Thanks to all the Drake faculty and staff who went above and beyond to create programming and space for these 5th graders.

SC_Iowa_STEM's tweet photo. We are so excited for @MooreDMPS Day @DrakeUniversity! Tomorrow -- 10am-1pm
Thanks to all the Drake faculty and staff who went above and beyond to create programming and space for these 5th graders. https://t.co/TISEmpgAsM

1

6

0

740

1

5

1

0

305

cpporter1000 retweeted

François Chollet

@fchollet

about 3 years ago

Don't score AI using tests designed for humans. In particular because, with humans, the default assumption is that *they haven't already seen* the content you're giving them. With a LLM, the default assumption should be that, *if it's on the Internet, it's already been memorized*

38

2K

259

223

314K

cpporter1000 retweeted

Riley Goodside

@goodside

over 3 years ago

A thread of interesting Bing Search examples:

12

713

88

395

319K

cpporter1000 retweeted

Iowa Wolves @iawolves

over 3 years ago

🖥️ 𝙝𝙚𝙡𝙡𝙤, 𝙬𝙤𝙧𝙡𝙙! The Wolves' newest threads pay homage to Iowa's place in computing history and hit the floor TOMORROW ⚫️🟢 AUCTION: https://t.co/yxU4copzjr

iawolves's tweet photo. 🖥️ 𝙝𝙚𝙡𝙡𝙤, 𝙬𝙤𝙧𝙡𝙙!

The Wolves' newest threads pay homage to Iowa's place in computing history and hit the floor TOMORROW ⚫️🟢

AUCTION: https://t.co/yxU4copzjr https://t.co/u5QgfcpVne

3

54

10

1

32K

cpporter1000 retweeted

David Smerdon @dsmerdon

over 3 years ago

Why does chatGPT make up fake academic papers? By now, we know that the chatbot notoriously invents fake academic references. E.g. its answer to the most cited economics paper is completely made-up (see image). But why? And how does it make them? A THREAD (1/n) 🧵