Fabrizio Milo @fabmilo - Twitter Profile

Pinned Tweet

Fabrizio Milo @fabmilo

10 months ago

Created with @NotebookLM after discussing these topics with @Cyndesama @tensorqt @Niccolg92 and few others

1

13

0

3

1K

Fabrizio Milo @fabmilo

about 3 hours ago

Rent compute add Dstack And serve for fun or profit

DailyPapers

@HuggingPapers

1 day ago

NVIDIA just released an optimized GLM-5.2 on Hugging Face A 753B parameter MoE with 1M context, quantized to NVFP4 for Blackwell GPUs— nearly matching FP8 accuracy.

HuggingPapers's tweet photo. NVIDIA just released an optimized GLM-5.2 on Hugging Face

A 753B parameter MoE with 1M context,
quantized to NVFP4 for Blackwell GPUs—
nearly matching FP8 accuracy. https://t.co/tjtk0dVPEW

41

2K

137

877

197K

0

1

0

55

Fabrizio Milo @fabmilo

about 4 hours ago

@katedeyneka I heard them too, seem just pyrotechnics 🧨

1

0

396

fabmilo retweeted

will brown

@willccbb

1 day ago

something has definitely shifted in the past few weeks. seeing a huge uptick in large enterprises wanting to secure compute and post-train their own models in house, frequently on top of GLM-5.2. everyone is starting to understand how open source wins.

90

2K

179

571

226K

Who to follow

Nassir Marrouche

@nmarrouche

Father. Cardiac Electrophysiologist. Personalizing heart disease using imaging and Digital Health. Tulane University. Tweets are my own.

srisatish

@srisatish

this will be fun. maker @h2oai, engineer, democratizing ai. https://t.co/ds9uTziDxL, ai for good, democratize! time is the only non-renewable resource.

Tim Higgins

@timkhiggins

@wsj columnist & @cnbc contributor | author of books about Tesla ("Power Play") & Apple ("iWar").

Fabrizio Milo @fabmilo

1 day ago

@rohanpaul_ai @sundeep Isnt this just “capacity” of a model?

0

1

0

72

Fabrizio Milo @fabmilo

2 days ago

@vytalow @ycombinator Interested

0

1

0

68

Fabrizio Milo @fabmilo

2 days ago

@quantbagel I wish I had that kind of problem

0

66

Fabrizio Milo @fabmilo

2 days ago

Surprised by this news. I wasn't bullish on modular after my experience with Swift-Tensorflow but now even less so. Congrats on the relative fast exit to Chris.

Modular

@Modular

3 days ago

We’re excited to announce that Modular has entered an agreement to be acquired by @Qualcomm. The future of unified compute has never been stronger. Read the full announcement: https://t.co/FiQUL5CvNj

11

271

40

161K

0

1

0

222

Fabrizio Milo @fabmilo

2 days ago

@eXist3nZ_89 @antirez

0

50

Fabrizio Milo @fabmilo

2 days ago

GPT 5.5 PRO made me smile. While asking to design new objective functions for LLMs he came back to me with one option labeled as "one of my favorite". Clearly an indication of human data underneath but for a second it gave a "human flair" as a response.

fabmilo's tweet photo. GPT 5.5 PRO made me smile. While asking to design new objective functions for LLMs he came back to me with one option labeled as "one of my favorite". Clearly an indication of human data underneath but for a second it gave a "human flair" as a response. https://t.co/idhwJetWZf

0

75

Fabrizio Milo @fabmilo

2 days ago

I was so excited for the good deal. Too good to be true.

0

47

Fabrizio Milo @fabmilo

2 days ago

@per_simmons_ Now we just need a MIT source code unreal engine system

0

90

Fabrizio Milo @fabmilo

2 days ago

They have support for quite a few models too https://t.co/2po3VofREK has anyone tried to fine tune one for your in-house model ?

Charles 🎉 Frye

@charles_irl

2 days ago

The no-longer-secret ingredient is DFlash by @zhijianliu_ and @jianchen1799. If you train a custom DFlash speculator on your data, you can get to lower latencies than any generic inference API can achieve. That's the benefit of owning your inference!

6

102

13

40

10K

0

1

0

1

177

Fabrizio Milo @fabmilo

2 days ago

Seems the whole inference market is trying to get off NVDA chips. What will be the operational friction to adopt non-off-the-shelves hardware?

OpenAI

@OpenAI

3 days ago

We’ve designed and built our first AI chip: Jalapeño. Designed from the ground up by OpenAI and brought to production with @Broadcom, Jalapeño is purpose-built for the LLM workloads powering ChatGPT, Codex, the API, and future agentic products. Chips are foundational to the AI economy. Building our own expands our full-stack platform from products to models to infrastructure, and will help us scale intelligence, serve more people, and expand access to AI.

OpenAI's tweet photo. We’ve designed and built our first AI chip: Jalapeño.

Designed from the ground up by OpenAI and brought to production with @Broadcom, Jalapeño is purpose-built for the LLM workloads powering ChatGPT, Codex, the API, and future agentic products.

Chips are foundational to the AI economy. Building our own expands our full-stack platform from products to models to infrastructure, and will help us scale intelligence, serve more people, and expand access to AI.

1K

22K

2K

3K

6M

0

3

1

0

143

Fabrizio Milo @fabmilo

3 days ago

@__tinygrad__ C + Cuda

0

66

Fabrizio Milo @fabmilo

3 days ago

@bernhardsson @modal How would you add a pricing service on top of it? Let’s say If I want to sell my own trained llm

0

104

Fabrizio Milo @fabmilo

7 days ago

@ivictorialiao Cards against humanity is still one of my favorite :D

2

0

64

Fabrizio Milo @fabmilo

7 days ago

Huge if actually works

alphaXiv

@askalphaxiv

9 days ago

Introducing autoresearch for arXiv papers Change 'arxiv' to 'autoarxiv' in any paper URL An agent deploys to resolve setup issues on the codebase, run a minimal reproduction, and estimate full replication cost. Read more below

47

3K

382

3K

476K

0

1

0

165

Fabrizio Milo @fabmilo

8 days ago

@MrAhmadAwais We are at a point where llms with the right skill.md can create a full binary that serves just one model . Checkout @antirez ds4 project. What local hardware do you have?

0

30

Fabrizio Milo @fabmilo

8 days ago

@willccbb If you know how those equations are derived is all a hack of approximations

0

54

Fabrizio Milo

@fabmilo

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users