dron @_dron_h - Twitter Profile

dron @_dron_h

about 23 hours ago

@GoogleMagenta this is really cool work!! can't wait to play with it

0

4

0

80

_dron_h retweeted

Google Magenta Project

@GoogleMagenta

1 day ago

Introducing Magenta RealTime 2 (MRT2): the live music model you can play as an instrument. MRT2 offers MIDI and prompt controls, and runs natively on a MacBook with <200ms latency. Open weights. Open source inference engine. Suite of apps and plugins. Hear what it can do and try it out for yourself below 🧵

11

250

53

216

46K

dron @_dron_h

14 days ago

@pranav_vd @raymondmfeng raymond spotted!!!!

0

1

0

115

dron @_dron_h

14 days ago

@zacknovack ooh super cool!!

0

1

0

16

Who to follow

CMU CS + Math + ML | Deep Learning Researcher | Team Canada-ISEF 2023 | TEDx Speaker/Author.

Oscar Petrov

@oscar_petrov

multi-scale agency, biological intelligence, what it means to flourish | prev: @brown, �� @medialab

dron @_dron_h

15 days ago

@nabla_theta @QiaochuYuan a pointer is a Thing that points to another Thing, and even here i have created an implicit binding between the two Things when they need not be related fully abstracted, devoid of implementation or concrete ground

0

10

dron @_dron_h

15 days ago

@nabla_theta @QiaochuYuan closures and currying are also pretty emblematic to me? we're in a fuzzy space here but currying is one of those things that feels very "trivial" to me — it's a primitive! what is a pointer? what is a cache? what is a function? none of these are constrained (much)

1

0

14

dron @_dron_h

15 days ago

anti sae sae club

Goodfire

@GoodfireAI

15 days ago

The most popular way to interpret AI is missing the bigger picture. Models think in curved shapes. But sparse autoencoders (SAEs) work with straight lines. Can they still capture models’ curved neural geometry? Yes, but not how you might think! (1/7)

24

1K

150

760

169K

1

41

3

3K

_dron_h retweeted

Goodfire

@GoodfireAI

15 days ago

The most popular way to interpret AI is missing the bigger picture. Models think in curved shapes. But sparse autoencoders (SAEs) work with straight lines. Can they still capture models’ curved neural geometry? Yes, but not how you might think! (1/7)

24

1K

150

760

169K

dron @_dron_h

15 days ago

@nabla_theta @QiaochuYuan my CS education was extremely PL-coded though, it's possible this is more of a eurotheory thing

0

9

dron @_dron_h

15 days ago

@nabla_theta @QiaochuYuan ah. well to me haskell is (one of) the prime examples i have in mind about these abstractions

2

0

33

dron @_dron_h

22 days ago

@acapellascience as in most cases ai is not like other technologies! the scale/generality is very different from past forms of "cheating"

0

9

dron @_dron_h

22 days ago

@acapellascience the niche must emerge and we must make it so but the default case is death by optimization, there are forces that make this death desirable for some

1

0

13

dron @_dron_h

22 days ago

@prompt_Tunes yup this work is building on top of that! https://t.co/3DJvjD8nAQ

Sheridan Feucht @sheridan_feucht

22 days ago

But how does this addition mechanism actually work? In agreement with @NeelNanda5, @tianyi_zhou12, @thesubhashk, and others, we found that Llama calculates addition using Fourier features. Specifically, periods 2, 5, and 10 (also 20, 50, 100) stuck out, corroborating prior work.

sheridan_feucht's tweet photo. But how does this addition mechanism actually work? In agreement with @NeelNanda5, @tianyi_zhou12, @thesubhashk, and others, we found that Llama calculates addition using Fourier features. Specifically, periods 2, 5, and 10 (also 20, 50, 100) stuck out, corroborating prior work. https://t.co/AKbqKVs5xc

1

44

1

9

4K

0

24

2

4

3K

_dron_h retweeted

Goodfire

@GoodfireAI

22 days ago

Neural networks do math by rotating shapes. We found a shape-rotating calculator hidden inside an LLM – and it’s used for more than just math! (1/6)

122

4K

554

3K

933K

dron @_dron_h

22 days ago

@sheridan_feucht it's so good!

0

4

0

52

dron @_dron_h

22 days ago

look at those beautiful subgroups

Sheridan Feucht @sheridan_feucht

22 days ago

Next, we analyze the neurons in MLP 18 that are responsible for actually doing addition. We focus on a set of 28 MLP neurons found using DAS (Geiger et al., 2023). They form clear subgroups that fire at different frequencies and align with the Fourier probes that we trained!

sheridan_feucht's tweet photo. Next, we analyze the neurons in MLP 18 that are responsible for actually doing addition. We focus on a set of 28 MLP neurons found using DAS (Geiger et al., 2023). They form clear subgroups that fire at different frequencies and align with the Fourier probes that we trained! https://t.co/bNLIn1rwaX

1

50

1

17

14K

1

37

1

11

4K

_dron_h retweeted

Sheridan Feucht @sheridan_feucht

22 days ago

Neural networks have beautiful feature geometry, but do they have mechanisms that actually interface with those structures? At @GoodfireAI this spring, we discovered one: a re-usable addition mechanism that reads/writes to Fourier features from prior work. 🧵

7

246

41

112

63K

dron @_dron_h

23 days ago

@nabla_theta @QiaochuYuan this is my excuse for why i've bounced right off any math textbook i've ever opened

0

10

dron @_dron_h

23 days ago

@nabla_theta @QiaochuYuan so often explaining a CS-shaped concept often has the feeling of being obvious or trivial or horrendously abstract/devoid of (concrete) meaning — this is somewhat deliberate

2

0

22

dron

@_dron_h

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users