Introducing Magenta RealTime 2 (MRT2): the live music model you can play as an instrument.
MRT2 offers MIDI and prompt controls, and runs natively on a MacBook with <200ms latency.
Open weights. Open source inference engine. Suite of apps and plugins.
Hear what it can do and try it out for yourself below 🧵
@nabla_theta @QiaochuYuan a pointer is a Thing that points to another Thing, and even here i have created an implicit binding between the two Things when they need not be related
fully abstracted, devoid of implementation or concrete ground
@nabla_theta @QiaochuYuan closures and currying are also pretty emblematic to me? we're in a fuzzy space here but currying is one of those things that feels very "trivial" to me: it's a primitive!
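For anyone who wants the concrete version: a minimal Python sketch of currying built on a closure. The names `add`, `curry_add`, and `add_three` are made up for illustration and don't come from the thread.

```python
from functools import partial


def add(x, y):
    """Ordinary two-argument addition."""
    return x + y


def curry_add(x):
    """Curried form of add: take x now, return a function that takes y later."""
    def inner(y):
        # inner is a closure: it remembers x from the enclosing call
        return x + y
    return inner


add_three = curry_add(3)              # x is fixed to 3 inside the closure
result_curried = add_three(4)         # supplies y
result_partial = partial(add, 3)(4)   # functools.partial gives the same effect
```

Both results are the same number; the curried version just splits "take two arguments" into "take one, return a function that takes the other".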
what is a pointer? what is a cache? what is a function? none of these are constrained (much)
The most popular way to interpret AI is missing the bigger picture.
Models think in curved shapes. But sparse autoencoders (SAEs) work with straight lines.
Can they still capture models’ curved neural geometry? Yes, but not how you might think! (1/7)
@acapellascience the niche must emerge and we must make it so
but the default case is death by optimization, there are forces that make this death desirable for some
But how does this addition mechanism actually work? In agreement with @NeelNanda5, @tianyi_zhou12, @thesubhashk, and others, we found that Llama calculates addition using Fourier features. Specifically, periods 2, 5, and 10 (also 20, 50, 100) stuck out, corroborating prior work.
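To make "addition using Fourier features" concrete, here is a toy sketch. It is not the mechanism found in Llama, only the underlying arithmetic idea under simplifying assumptions: each integer is encoded as a point on the unit circle for periods 2, 5, and 10, and adding two numbers becomes composing rotations. All function names are hypothetical.

```python
import cmath
import math

PERIODS = (2, 5, 10)  # periods named in the thread; the rest is illustrative


def fourier_features(n, periods=PERIODS):
    """Encode integer n as one unit-circle point per period T: exp(2*pi*i*n/T)."""
    return [cmath.exp(2j * math.pi * n / T) for T in periods]


def add_in_fourier_space(fa, fb):
    """Multiplying unit complex numbers adds their angles, i.e. adds n mod T."""
    return [za * zb for za, zb in zip(fa, fb)]


def decode_mod(z, T):
    """Read the residue (n mod T) back off the angle of z."""
    angle = cmath.phase(z) % (2 * math.pi)  # map phase into [0, 2*pi)
    return round(angle * T / (2 * math.pi)) % T


a, b = 27, 48
summed = add_in_fourier_space(fourier_features(a), fourier_features(b))
residues = {T: decode_mod(z, T) for T, z in zip(PERIODS, summed)}
# residues holds (a + b) mod T for each period, computed without ever adding a and b directly
```

The period-10 feature alone pins down the last digit of the sum; periods 2 and 5 carry the same information split into two coarser pieces.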
Next, we analyze the neurons in MLP 18 that are responsible for actually doing addition. We focus on a set of 28 MLP neurons found using DAS (Geiger et al., 2023). They form clear subgroups that fire at different frequencies and align with the Fourier probes that we trained!
Neural networks have beautiful feature geometry, but do they have mechanisms that actually interface with those structures?
At @GoodfireAI this spring, we discovered one: a re-usable addition mechanism that reads/writes to Fourier features from prior work. 🧵
@nabla_theta @QiaochuYuan so often, explaining a CS-shaped concept has the feeling of being obvious or trivial or horrendously abstract/devoid of (concrete) meaning; this is somewhat deliberate