Nithum @Nithum - Twitter Profile

almost 2 years ago

Check out our most recent Explorable "Can Large Language Models Explain Their Internal Mechanisms?" https://t.co/GQ6v9ZptAE

0

7

0

1

182

Nithum retweeted

Google AI

@GoogleAI

almost 2 years ago

Can large language models (LLMs) explain their internal mechanisms? Check out the latest AI Explorable on Patchscopes, an inspection framework that uses LLMs to explain the hidden representations of LLMs. Learn more → https://t.co/mvmix9hKs0

GoogleAI's tweet photo. Can large language models (LLMs) explain their internal mechanisms? Check out the latest AI Explorable on Patchscopes, an inspection framework that uses LLMs to explain the hidden representations of LLMs. Learn more → https://t.co/mvmix9hKs0 https://t.co/UGnFpLMJCJ

17

562

147

186

50K

Nithum retweeted

Google AI

@GoogleAI

almost 3 years ago

While large language models appear to have a rich understanding of the world, how do we know they’re not simply regurgitating from training data? Check out the latest AI Explorable on a phenomenon called grokking to learn more about how models learn. → https://t.co/Okc9GvJjuN

37

2K

464

591

319K

Nithum retweeted

Adam Pearce @adamrpearce

almost 3 years ago

Do Machine Learning Models Memorize or Generalize? https://t.co/Ln3xIZhKLs An interactive introduction to grokking and mechanistic interpretability w/ @ghandeharioun, @nadamused_, @Nithum, @wattenberg and @iislucas

20

1K

243

917

257K

Who to follow

Arnaud Doucet

@ArnaudDoucet1

Senior Staff Research Scientist @GoogleDeepMind. Previously @UniofOxford.

Pierre-Luc Bacon

@pierrelux

Prof. at @UMontrealDIRO @MILAMontreal

Nils Feldhus

@nfelnlp

Post-doctoral Researcher at BIFOLD / TU Berlin interested in interpretability and analysis of language models. Guest researcher at DFKI Berlin.

Nithum retweeted

iislucas (Lucas Dixon) @iislucas

about 3 years ago

Some of my thoughts on generative AI... and a reboot of the PAIR blog... https://t.co/lTE67mhDDL #responsibleai #hci #machinelearning #GenerativeAI

1

13

1

2

1K

Nithum retweeted

Adam Pearce @adamrpearce

about 3 years ago

Confidently Incorrect Models to Humble Ensembles by @Nithum, @balajiln and Jasper Snoek https://t.co/JNVYLq1Bib

1

30

7

4

4K

Nithum @Nithum

about 3 years ago

ML models sometimes make confidently incorrect predictions when they encounter out of distribution data. Ensembles of models can make better predictions by averaging away mistakes. https://t.co/GkO5tMseoo

0

2

0

74

Nithum retweeted

Andy Coenen

@_coenen

over 3 years ago

In partnership with @GoogleMagenta, we invited 13 professional writers to use Wordcraft, our experimental LaMDA-powered AI writing tool. We've published all of the stories written with the tool, along with a discussion on the future of AI and creativity. https://t.co/D3KK8DM1Lo

3

49

16

12

0

Nithum retweeted

Adam Pearce @adamrpearce

over 3 years ago

Most machine learning models are trained by collecting vast amounts of data on a central server. @nicki_mitch and I looked at how federated learning makes it possible to train models without any user's raw data leaving their device. https://t.co/qRHqbJ2VNL

0

64

26

21

0

Nithum retweeted

TensorFlow @TensorFlow

almost 4 years ago

🤔 We've come a long way with #NLP, but what have language models actually learned? Watch Senior Software Engineer at Google PAIR, Nithum Thain, discuss AI language model learnings → https://t.co/k1MbtojO9T

0

58

16

10

0

Nithum @Nithum

about 4 years ago

Check out our new explorable on machine learning calibration: Machine learning models express their uncertainty as model scores, but through calibration we can transform these scores into probabilities for more effective decision making. https://t.co/5fS21WM23A

0

118

32

42

0

Nithum retweeted

Martin Görner @martin_gorner

over 7 years ago

Beautiful "RNN with attention" tutorial from one of the authors of Google's troll-fighting AI @Nithum. https://t.co/82bVY0wcEZ. We presented this toxic comment detection model together in the "Tensorflow and modern RNNs without a PhD" talk. Excuse our French 🤬!