Fred Bliss @fblissjr - Twitter Profile

Pinned Tweet

9 months ago

@sh_reya I've been a broken record for the past 3 years (and the prior 9 before it running and exiting a data consultancy, and the prior 10 before that in 'ETL' and data warehouse work) that at some point, everyone will care about the data side here. Not to mention the modeling side.

1

5

0

303

Fred Bliss @fblissjr

12 days ago

@AradhyeAgarwal And in different contexts. Thinking about it from both sides of the coin in LLM world and DiT world

0

1

0

61

Fred Bliss @fblissjr

12 days ago

@AradhyeAgarwal I like this question because the answer might change over time.

1

0

85

Fred Bliss @fblissjr

18 days ago

@fofrAI Appreciate when you show failure modes. Helps us all.

0

34

Who to follow

StarMorph

@StarmorphAI

Agentic typescript engineering + software design Claude | Codex | Hermes practicing local inference https://t.co/7XXyRHZsv2

Dustin @ EnvelopeBudget

@envelopebudget

Revolutionizing personal finance one envelope at a time 💌. Get your budget on track with our app, built for pros and beginners alike.

Fred Bliss @fblissjr

20 days ago

@francoisfleuret @hardmaru Yeah you guys always put out compelling stuff. Never boring and always forward thinking.

0

1

0

425

fblissjr retweeted

hardmaru

@hardmaru

21 days ago

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them independently. The trick? Treating the network’s forward pass like a diffusion model denoising a signal. This reinterpretation slashes the memory needed to train deep models. In our #ICLR2026 paper (https://t.co/PK5h0mqQSo), we matched end-to-end performance across ViTs, DiTs, and LLMs. We did this while training just one isolated block at a time.

154

6K

639

4K

743K

Fred Bliss @fblissjr

23 days ago

@AndrewCurran_ @emollick Inevitably everyone will retreat to their own private communities until whenever we centralize again happens. Feels like it will be like early bbs systems then irc channels then public niche forums then bam MySpace and Facebook

0

1

0

36

Fred Bliss @fblissjr

23 days ago

@karpathy @pankajmathur_ @shreyansj Titles like this are my favorite. Especially these days when a title can narrow how you’re perceived and seen and what you do. When it’s a lot of things on a small team even more so. When you cross business and technical most titles would infer non technical.

0

1

0

904

Fred Bliss @fblissjr

23 days ago

@fofrAI @adhik_Joshi Me neither but don’t see why it couldn’t.

0

18

Fred Bliss @fblissjr

24 days ago

@levie Every single thing you deploy in AI means more people to iterate and maintain it. It's never 'done', like any data or software project. Just means different skills.

0

42

Fred Bliss @fblissjr

27 days ago

@joannejang The harness is the model(s) and a whole lot of system behind it. Excited for generative UI. Google is doing it well. Claude getting there slowly. But would love to see more than chat and canvas

0

32

fblissjr retweeted

Carlos Santana

@DotCSV

28 days ago

Creo que muchos están enfocando mal el modelo de Gemini Omni al compararlo con Seedance 2.0 cuando conceptualmente son cosas distintas. Este es un modelo para editar vídeos (a la Nano Banana) como nunca antes habíamos tenido!

61

2K

138

814

543K

Fred Bliss @fblissjr

29 days ago

@pvncher And decomposing queries is the way. As granular as you need to and verifiers/subagents (whatever we wanna call clear context except what’s provided by the parent to succeed) to evaluate whatever that granular decomp is at each layer.

0

1

0

34

Fred Bliss @fblissjr

29 days ago

@pvncher Agree on serial orchestration

1

0

59

Fred Bliss @fblissjr

29 days ago

@AndrewCurran_ @inductionheads Also would explain why OpenAI is trying to buy a company that makes diffusion based language models(or did buy them… I’m a few days out of the loop)

0

2

0

45

Fred Bliss @fblissjr

29 days ago

@AndrewCurran_ @inductionheads Diffusion hybrid models is my bet. Crazy fast in a way that changes how you can now use LLMs. Which means new product experiences.

1

2

0

328

Fred Bliss @fblissjr

about 1 month ago

@dreamingtulpa For many years nonstop

0

17

Fred Bliss @fblissjr

about 1 month ago

@jonasgeiping @guinansu @kyutai_labs Or at least the two approaches remind me of each other. And moshi was the first time I saw how real voice to voice can work

0

22

Fred Bliss @fblissjr

about 1 month ago

@jonasgeiping @guinansu This is a really great idea. The prompt injection one and separating output and input is sorta like how @kyutai_labs first did moshi in a way? Full duplex?

1

0

150

fblissjr retweeted

Mario Zechner

@badlogicgames

about 1 month ago

oi #2!

0

36

1

17

9K

fblissjr retweeted

Jonas Geiping

@jonasgeiping

about 1 month ago

What do we gain? First off, we can improve latencies because we now overlap thinking, system inputs, tool use and even auditing calls (and we show this in the paper). Second, we find that the models we train in a clean ablation with this format actually have a significantly easier time withstanding prompt injections, because it is easier to separate input and output if they are separate streams.

jonasgeiping's tweet photo. What do we gain? First off, we can improve latencies because we now overlap thinking, system inputs, tool use and even auditing calls (and we show this in the paper).

Second, we find that the models we train in a clean ablation with this format actually have a significantly easier time withstanding prompt injections, because it is easier to separate input and output if they are separate streams.

2

52

6

20

14K

Fred Bliss

@fblissjr

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users