Alessandro Ferrari @BioAlessandro - Twitter Profile

Pinned Tweet

3 months ago

Building a compiler + HSL framework to turn @__tinygrad__ kernels into VHDL, and synthesize the perfect FPGA for a given compute graph. Tinygrad UOps -> KernelIR (my custom IR) -> Amaranth hardware modules

BioAlessandro's tweet photo. Building a compiler + HSL framework to turn @__tinygrad__ kernels into VHDL, and synthesize the perfect FPGA for a given compute graph.

Tinygrad UOps -> KernelIR (my custom IR) -> Amaranth hardware modules https://t.co/9YhncOo0s8

14

487

29

262

25K

Alessandro Ferrari

@BioAlessandro

1 day ago

I love when claude code jerks my whole UI because it finished writing a complete slop summary of the changes I am currently reading

0

20

Alessandro Ferrari

@BioAlessandro

10 days ago

Guy who puts🇻🇦in his bio not because he is religious but because he likes claude

0

31

Alessandro Ferrari

@BioAlessandro

25 days ago

@GaddipatiHarsha How about making a product that will receive upvotes? That feels like a better use of YC resources😂

1

3

0

179

Alessandro Ferrari

@BioAlessandro

about 1 month ago

New X banner

NASA's Kennedy Space Center

@NASAKennedy

about 1 month ago

The planet can spell your name – literally. 🔤🌍 This Earth Day, see your name written in landscapes captured by Landsat: https://t.co/kcP12dhsI2

NASAKennedy's tweet photo. The planet can spell your name – literally. 🔤🌍

This Earth Day, see your name written in landscapes captured by Landsat: https://t.co/kcP12dhsI2 https://t.co/z2Ubn42iY1

2K

183K

28K

77K

56M

0

3

0

190

Alessandro Ferrari

@BioAlessandro

about 1 month ago

@javi_2326 @andrewgwils Oh don’t worry, I’m not a math or physics major. Unfortunate reality for me as well.

0

75

Alessandro Ferrari

@BioAlessandro

about 1 month ago

@viennaCtrl_ @andrewgwils Everything is math

0

75

Alessandro Ferrari

@BioAlessandro

about 1 month ago

@predict_addict @andrewgwils Because I’m a cs major and consciously chose not to do math and physics knowing it would have probably been better. Hindsight 20/20

1

0

96

Alessandro Ferrari

@BioAlessandro

about 1 month ago

@adityaxprasad “Ex sweatshop worker, able to pull 20 hour days”

0

2

0

89

Alessandro Ferrari

@BioAlessandro

about 1 month ago

@adityaxprasad Biohacking

0

40

Alessandro Ferrari

@BioAlessandro

about 2 months ago

To settle the "buy GPUs vs rent" debate for side projects: once you buy that RTX PRO 6000, you will procrastinate and let it idle. If you're renting it for $2/hr, you will be more productive working on your side project than you will at your day job.

0

3

0

394

Alessandro Ferrari

@BioAlessandro

about 2 months ago

@datavorous_ An agent orchestration harness with a CEO, engineers, and validator agents enabling devs to push 150kloc/day? Seems like the right next step if this kid wants to stop shipping weekend side projects and make a real impact on the world!

0

1

0

125

Alessandro Ferrari

@BioAlessandro

about 2 months ago

@quantian1 Yea, the same way everyone can trivially setup FTP with a CVS system on top and recreate Dropbox

1

17

1

10K

Alessandro Ferrari

@BioAlessandro

about 2 months ago

PrisML team iteratively adding bits each couple weeks until they land on bf16 from first principles

Sahin Lale

@SahinLale

about 2 months ago

Turns out adding 0 helps :) Today we’re introducing Ternary Bonsai 🌳, a family of end-to-end 1.58-bit language models in 8B, 4B, and 1.7B sizes. Ternary Bonsai 8B is within 5% of Qwen 3 8B at 9x lower memory. Still tiny. Noticeably smarter

9

201

13

56

18K

0

3

0

381

Alessandro Ferrari

@BioAlessandro

about 2 months ago

The european mind (me) cannot comprehend that you would see an $18 flight ticket and your first instinct is buy all of them for $3400 Well played, well played

Alex Kehr

@alexkehr

about 2 months ago

the american mind (me) cannot comprehend european airline flight prices can i just book all 190 seats for $3400 and have a private 737 flight?

alexkehr's tweet photo. the american mind (me) cannot comprehend european airline flight prices

can i just book all 190 seats for $3400 and have a private 737 flight? https://t.co/xPCy5VJ9Zy

604

37K

296

1K

13M

0

5

0

358

Alessandro Ferrari

@BioAlessandro

about 2 months ago

I respect the A/B testing

Jessica Paquette @barrelshifter

about 2 months ago

corrupting the youth by having them fix random llvm bugs

4

124

11

5

3K

0

2

0

208

BioAlessandro retweeted

Justin Xia @justinqxia

about 2 months ago

Renting H100s from runpod to write tinygrad bounties like a medieval peasant paying a tithe to his feudal lord for a meager plot of compute. I toil day and night, hoping my bounty harvest is enough to win the respect of the king and avoid starvation

2

15

2

1

652

Alessandro Ferrari

@BioAlessandro

about 2 months ago

@itselouardi 😂have to keep it for intellectual honesty. I fucked up

1

4

0

64

Alessandro Ferrari

@BioAlessandro

about 2 months ago

It's nice that we could get Bonsai-family support so quickly, but this is a bit disingenuous. I have never contributed to tinygrad so I am not in a position to critique this, however this implementation unpacks the 1bit weights as float16 and runs computations on float16 instead of running custom kernels on the packed weights, nullifying a lot of the benefits of the Bonsai architecture. Q1_0 It is a packed 1-bit format: for each block of 128 weights, you store 16 bytes of bits and 2 bytes for a shared fp16 scale. 128 weights take 18 bytes total. If you unpack those same 128 weights into float16, that becomes 256 bytes (14x). This is basically unpacking the "bit-based llm" in normal float16 and running calculations that way. My understanding of that llama.cpp’s Bonsai support keeps the weights in the quantized Q1_0 representation and uses kernels that operate on that packed format, which is the whole point. Again, I do not mean this as a shot at the implementation itself. Getting support working this quickly is genuinely cool. I might also be misunderstanding some parts of this, hopefully not too much, but would love to be corrected.

the tiny corp

@__tinygrad__

about 2 months ago

Just merged an external PR for Bonsai-8B support (1 bit LLM). Because tinygrad has the correct abstractions, it was 5 lines. https://t.co/BLljWDANgq https://t.co/GlXWqPbYg5

7

249

12

70

36K

3

38

1

18

15K

Alessandro Ferrari

@BioAlessandro

about 2 months ago

Yea, just replied as well, I was totally misunderstanding. For some reason I thought that the ggml loading to tensor wouldn't be fused with the rest of the code (not sure why I would think that) so the scheduler only saw the multiplication with the float16 d, and separately saw the rest of the model. The memory usage doesn't lie

0

1

0

216

Alessandro Ferrari

@BioAlessandro

Last Seen Users on Sotwe

Trends for you

Most Popular Users