Martin Wattenberg @wattenberg - Twitter Profile

wattenberg retweeted

Laura Wattenberg, Baby Name Wizard @BNW

about 3 years ago

https://t.co/x0MHQHF7WA

0

7

1

5K

Martin Wattenberg @wattenberg

about 3 years ago

@filip_rejmus @viegasf Moreover, different heads have a different level of focus on position, which leads to "sharper" or "blurrier" spiral shapes. Finally, the input sequences we used are different lengths, so there are more points at earlier positions than later ones—that affects the shape as well.

0

7

0

1

119

Martin Wattenberg @wattenberg

about 3 years ago

Visualize transformer attention! AttentionViz, created by Catherine Yeh and expanded by Yida Chen, helps you explore transformer self-attention by visualizing query and key vectors in a joint embedding. Paper: https://t.co/GKom1BN5Zl Website: https://t.co/rCeA2e5xKv

wattenberg's tweet photo. Visualize transformer attention!

AttentionViz, created by Catherine Yeh and expanded by Yida Chen, helps you explore transformer self-attention by visualizing query and key vectors in a joint embedding.

Paper: https://t.co/GKom1BN5Zl
Website: https://t.co/rCeA2e5xKv https://t.co/KI7QEuUWi5

4

389

82

215

65K

Martin Wattenberg @wattenberg

about 3 years ago

@filip_rejmus @viegasf You'll notice that some spirals are more "connected" than others. These correspond to heads that pay attention to multiple nearby positions, so the queries / keys trace out a path between positional encodings. There's a kind of linear interpolation between position vectors. (2/n)

1

2

0

127

Who to follow

IEEE VIS

@ieeevis

The premier forum for visualization advances for academia, government, and industry. We invite you to share your research, insights, and enthusiasm at IEEE VIS

Jeffrey Heer

@jeffrey_heer

UW Computer Science Professor. Data, visualization & interaction. he/him. @uwdata @uwdub @vega_vis ex-@trifacta

MIT Visualization Group

@mitvis

We’re a research group at @MIT_CSAIL using data visualization as a petri dish to study intelligence augmentation.

Martin Wattenberg @wattenberg

about 3 years ago

Here's a close-up of hue and brightness heads in a vision transformer. Work done with tremendous help from Aoyu Wu and @chenxcynthia!

wattenberg's tweet photo. Here's a close-up of hue and brightness heads in a vision transformer. Work done with tremendous help from Aoyu Wu and @chenxcynthia! https://t.co/L1nll9gkS4

0

15

3

2

3K

Martin Wattenberg @wattenberg

about 3 years ago

Language models have some beautiful spiral plots reflecting positional patterns. And a vision model has heads that arrange images according to brightness and hues. But there’s a lot more to find! What else can you see?

wattenberg's tweet photo. Language models have some beautiful spiral plots reflecting positional patterns. And a vision model has heads that arrange images according to brightness and hues. But there’s a lot more to find! What else can you see? https://t.co/ZyW3cd00EI

1

26

6

5K

Martin Wattenberg @wattenberg

about 3 years ago

Toasters have blinking lights, cars have speedometers. Should chatbots have dashboards too? A speculative essay: The System Model and the User Model: Exploring AI Dashboard Design https://t.co/nurFiNd9sF

wattenberg's tweet photo. Toasters have blinking lights, cars have speedometers. Should chatbots have dashboards too?

A speculative essay: The System Model and the User Model: Exploring AI Dashboard Design
https://t.co/nurFiNd9sF https://t.co/ZBPMcYqUPO

1

25

4

5

4K

wattenberg retweeted

David Bau @davidbau

over 3 years ago

I want to show the NSF there would be broad support+utility for a "National Deep Inference" service for >100b LLMs. If your research would be enabled by an inference service on open LLMs w API access+overrides to internal activations, params, gradients: Please Like this thread!

16

465

49

26

77K

wattenberg retweeted

Harvard SEAS @hseas

over 3 years ago

Students explore the aesthetics of computing in new computer science course at SEAS. "CS73: Code, Data, and Art" is co-taught by @wattenberg and @viegasf, and teaches students how to create abstract art and communicate data sets through visualizations. https://t.co/se8D0Tkbqd

hseas's tweet photo. Students explore the aesthetics of computing in new computer science course at SEAS. "CS73: Code, Data, and Art" is co-taught by @wattenberg and @viegasf, and teaches students how to create abstract art and communicate data sets through visualizations. https://t.co/se8D0Tkbqd https://t.co/dX4JpGB7zg

0

26

8

1

0

Martin Wattenberg @wattenberg

over 3 years ago

Starting to get used to Mastodon. Find me at @[email protected]

1

9

2

1

0

Martin Wattenberg @wattenberg

over 3 years ago

For information on the application process, see https://t.co/11XIcnahCO

0

4

1

0

Martin Wattenberg @wattenberg

over 3 years ago

Thinking about grad school next year? Interested in visualization, machine learning interpretability, or human/AI interaction? Consider Harvard. @viegasf and I are continuing to build our lab!

wattenberg's tweet photo. Thinking about grad school next year? Interested in visualization, machine learning interpretability, or human/AI interaction? Consider Harvard. @viegasf and I are continuing to build our lab! https://t.co/HMO9PeMHhP

1

107

17

28

0

wattenberg retweeted

The Guardian

@guardian

over 3 years ago

The labyrinthine patterns traced by birds on the wing – in pictures https://t.co/dfhQk78xWC

5

48

12

5

0

Martin Wattenberg @wattenberg

over 3 years ago

I'm teaching with @OpenProcessing for the first time, and am completely impressed with how polished and friendly the system is. Every detail is on point. Last class a student spontaneously said, "OpenProcessing is just so great." I agree!

1

23

1

0

Martin Wattenberg @wattenberg

over 3 years ago

The project is a systematic study of how a neural network represents more features than it has neurons, under a variety of conditions. (Oh, and I can't tweet without a typo, apparently. That's @AnthropicAI of course!)

1

16

1

0

Martin Wattenberg @wattenberg

over 3 years ago

Tiny neural networks have a surprisingly rich inner life, and may hold clues to how their larger cousins work. This image: evolution of feature vectors during learning. Full story: https://t.co/5lULpDZQBE (in collaboration with the great interpretability team at @AnthopicAI)

wattenberg's tweet photo. Tiny neural networks have a surprisingly rich inner life, and may hold clues to how their larger cousins work. This image: evolution of feature vectors during learning. Full story: https://t.co/5lULpDZQBE (in collaboration with the great interpretability team at @AnthopicAI) https://t.co/1HhIzpYQZ2

11

683

86

283

0

wattenberg retweeted

Golan Levin @golan

almost 4 years ago

SMASHOMANCY is a "divination system for cracked phone screens" and @djbaskin retains pole position as one of my favorite living creatives.

0

11

5

4

0

Martin Wattenberg @wattenberg

almost 4 years ago

@gro_tsen I like this paper: https://t.co/mNpwfLI0EA Sample finding: If you ask people to review the last restaurant they went to, they're less polarized than if they freely choose a restaurant to review, evidence for your second hypothesis

0

1

0

Martin Wattenberg @wattenberg

almost 4 years ago

@sharoz @RuthRosenholtz Wonderful, thanks! Part of the explanation should say why the "3" doesn't stand out when everything's the same color. The shape strongly contrasts with any other individual digit in a side-by-side comparison. And in fact it would stand out if all the other digits were 1's and 7's

2

1

0

Martin Wattenberg

@wattenberg

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users