Stephen Witt @StephenWitt - Twitter Profile

Pinned Tweet

Stephen Witt @stephenwitt

6 months ago

The Thinking Machine has won the FT Business Book of the Year award!

9

37

3

5

5K

Stephen Witt @stephenwitt

24 days ago

@nabeelqu @MushtaqBilalPhD No question the Granta piece is slop, but it does lead to a more interesting question.... what can you detect that Pangram can not? And how?

0

51

Stephen Witt @stephenwitt

about 2 months ago

@ATabarrok @Pete_Monahan_JD Alex, why does the market fail here? Most insurance is competitively priced. My take is that purchasers of $1MM+ assets become insensitive to add-ons ("What's another 4 grand?") introduced at the end of the buying process. But perhaps there's more going on?

0

1

0

97

stephenwitt retweeted

Josh Glancy @joshglancy

6 months ago

The editors @timesculture asked a few of us to pick our best read of the year. I went for @stephenwitt's riveting biography of Jensen Huang, which helped me (sort of) understand chips but is also an extraordinary human study https://t.co/R6um94ShLe

0

8

3

7

3K

Who to follow

AIM

@AIM_UK

The Association of Independent Music is the non-profit trade body exclusively representing the UK's independent recorded music sector.

Dataminr

@Dataminr

Changing the world with real-time event and risk detection.

Mike Murphy

@DigDeepBSB

He/Him. @TheIceGarden EIC - @theixsports, Editor for @BlueshirtBanter | Creator: @TheirHC

Stephen Witt @stephenwitt

6 months ago

For the New York Times, I wrote about ChatGPT and the race to build the world's best friend. https://t.co/Q5s7JUhFB9

1

7

0

1

1K

Stephen Witt @stephenwitt

6 months ago

@gls_energy Thanks George... hope all is well

1

0

55

Stephen Witt @stephenwitt

6 months ago

The Thinking Machine has won the FT Business Book of the Year award!

9

37

3

5

5K

Stephen Witt @stephenwitt

7 months ago

@khoomeik To be fair, Nvidia's tensor cores do take a n-dimensional tensor and reduce it to many smaller two-dimensional mathematical objects for processing. That part is correct. Where I erred is that it is incorrect to label the resulting matrix multiplication as "two-dimensional."

1

7

0

576

Stephen Witt @stephenwitt

7 months ago

@ben_golub Update: I'm wrong about this.

8

268

2

9

39K

Stephen Witt @stephenwitt

7 months ago

@ben_golub I'm right about this. Tensors are downscaled to 2-D when the operation is executed in the computer. Read about it here: https://t.co/NryYqbhrQZ

6

9

0

2

8K

Stephen Witt @stephenwitt

7 months ago

@boazbaraktcs @aidanprattewart Actually, I really appreciate your time and insight. Thank you Boaz

1

69

0

1

21K

Stephen Witt @stephenwitt

7 months ago

@boazbaraktcs @aidanprattewart Okay, I formally retract this one. This would not be a two-dimensional multiply.

1

42

0

1

4K

Stephen Witt @stephenwitt

7 months ago

@boazbaraktcs @aidanprattewart Yes, sure, I mean it's all atoms and electrons in the end. But don't Nvidia's tensor cores operate at exactly the level of abstraction I'm discussing? Aren't they just little machines for producing infinity amounts of 4x4 matrix multiplies? Of simple, single-variable matrices?

6

14

0

2K

Stephen Witt @stephenwitt

7 months ago

@khoomeik @boazbaraktcs @aidanprattewart Okay, got it.

0

32

0

3

2K

Stephen Witt @stephenwitt

7 months ago

@boazbaraktcs @aidanprattewart Look, if you say I'm wrong—well, I believe you. But it still seems to me that processing the operation at the tensor core level happens in 2-d? I must be missing something.

8

13

0

5K

Stephen Witt @stephenwitt

7 months ago

The point I'm trying to make is that if you ask a data center to do a higher-dimensional tensor operation, it will break it apart and downscale it to 2-D at the level of the tensor core. So in this sense, most matrix multiplies *in the data center* are 2-D, regardless of what they look like to a programmer at a workstation. Now, maybe that's too reductive! Or maybe, alternatively, it's not even reductive enough. After all, below this level of abstraction the operations are even more primitive. But it seems worth noting that Nvidia's empire is built on this specific operation, at this specific level of abstraction. That's the theme of this article.

3

6

0

3

5K

Stephen Witt @stephenwitt

7 months ago

I don't dismiss it! It's probably the single-most important mathematical operation of our time. We are building nuclear power plants to do it. But it's hard, cumbersome and unwieldy. The reason Nvidia has a $5 trillion market capitalization is because the complexity of the operation grows by the cube of the elements. Now admittedly, non-commutativity is not *why* it's hard, but also: I didn't say that!

1

2

0

336

Stephen Witt @stephenwitt

7 months ago

@aidanprattewart @boazbaraktcs Again, tensor cores do everything in 2-D read about it here: https://t.co/NryYqbhrQZ

6

9

0

12

12K

Stephen Witt @stephenwitt

7 months ago

I guess the question is what level of abstraction do you want to focus on. I think for programmers at workstations this stuff is really elegant. I think if you have a Ph.D. in linear algebra you can see this as truly beautiful. But that's not what this article is about! It's about what's being executed *inside* the data center. At that level, it's essentially tensor cores spamming 2-d matrix multiplies to infinity. (And yes, below that, it's all one-d. And below that, it's electrons being pushed around.) It's like a building made of brick. The building is beautiful. The individual brick? Well... you kind of have to squint

1

0

879

Stephen Witt @stephenwitt

7 months ago

@ben_golub (note that the somewhat confusing 4x4x4 terminology refers to four 4x4 matrices, not one 4x4x4 matrix)

1

0

665

Stephen Witt @stephenwitt

7 months ago

@ben_golub Big reading comp fail here. Matrix multiplies for AI are indeed executed in two-d, regardless of the number of dimensions in the software framework. The operation below is the cogwheel of capitalism.

stephenwitt's tweet photo. @ben_golub Big reading comp fail here. Matrix multiplies for AI are indeed executed in two-d, regardless of the number of dimensions in the software framework.

The operation below is the cogwheel of capitalism. https://t.co/7wWRXbZ6FG

15

10

0

3

3K

Stephen Witt

@stephenwitt

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users