Emerson Segura @Emerson - Twitter Profile

Emerson Segura @emerson

about 21 hours ago

@micoolcho @SharpaRobotics Wow... how did it do that paper separation? It didn't even have to lick its own finger :)

1

0

56

Emerson Segura @emerson

about 21 hours ago

@micoolcho It's impressive, yet likely different RL policies for each config, so the RL does not need to know that the robot can be reconfigured, as long as the correct policy is active. So it should be an almost regular approach to making these run -- very clever mechanical engineering tho

1

0

14

Emerson Segura @emerson

1 day ago

@micoolcho @DirectDriveTech so clever

0

42

Emerson Segura @emerson

3 days ago

@MatthieuWyart @mattmiesnieks wow clever work

0

37

Emerson Segura @emerson

5 days ago

@ErenChenAI nice work

0

1

0

123

Emerson Segura @emerson

5 days ago

@scaling01 @GaryMarcus Nvidia will not have a moat like it has now (PyTorch, vLLM etc. don't need CUDA already); it's peak Nvidia today. No technical reason why Google and others can't match Anthropic/OAI at coding, and soon (look at Cursor already). Also, coding is the only killer app atm, yet other uses will emerge.

0

46

Emerson Segura @emerson

7 days ago

@chamath also it's over 60% "software assistant" -- a market that MSFT and Google will eat up once they get their poop together (as Cursor recently did)

0

12

Emerson Segura @emerson

7 days ago

@GaryMarcus There is a positive: all the paid-for and deployed GPUs (spent capex) will make AI more affordable -- like all the Sun servers, ISPs and fiber broadband investment in the original dot-com era. They didn't go to waste after the bubble burst; they were well utilized and grew later

0

2

0

418

Emerson Segura @emerson

7 days ago

@chris_j_paxton Clever hack! Yet not ideal.. even a paint job would help (zebra pattern etc). The new "Everything's Computer" is "Everything is Image" lol -- why add more dimensions to your VLA when ya can just cram force in as an image lol

0

70

Emerson Segura @emerson

7 days ago

@GaryMarcus we don't talk about this :) please don't pop the.. let's call it the "AI dot com" bubble -- the last time this happened it took the Nasdaq about 15 years to recover back to 5000

0

1

705

Emerson Segura @emerson

8 days ago

@elonmusk When do all the 220k GB300 go online?

0

4

2

0

50

Emerson Segura @emerson

8 days ago

@HedgieMarkets so they lost $500 million selling pizza? Wow, could have started Anthropic with that money..

0

125

Emerson Segura @emerson

8 days ago

@PTrubey @SakanaAILabs extreme example (if this works): u buy a regular 96GB GPU and train DeepSeekV4 at home, if you just wait long enough. Atm this is not possible: the min hardware needed is a GPU like the B200 to "fit" the entire model in VRAM, and atm that is at min one DGX (8xB200) server, about $500k

0

1

0

213

Emerson Segura @emerson

8 days ago

@DavidSHolz Power/electricity is why... also likely why OpenAI paused Sora. It's something like 10x to 20x the # of concurrent users that can be supported per GPU for LLMs vs videogen -- LLMs are efficient on compute vs diffusion (albeit diffusion-type models are currently irreplaceable)

0

83

Emerson Segura @emerson

8 days ago

Impressive work from Japan! (A new way to train LLMs with less VRAM required per compute cluster, while all-reduce is still needed)

hardmaru

@hardmaru

9 days ago

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them independently. The trick? Treating the network’s forward pass like a diffusion model denoising a signal. This reinterpretation slashes the memory needed to train deep models. In our #ICLR2026 paper (https://t.co/PK5h0mqQSo), we matched end-to-end performance across ViTs, DiTs, and LLMs. We did this while training just one isolated block at a time.

152

6K

644

4K

735K

0

1

0

153

Emerson Segura @emerson

8 days ago

@JayKapoorNYC @startupjag Hey: #1 robot backflips are not staged or teleop #2 VLAs are not used for most humanoid robot locomotion, like dancing, jumps, etc. There are people doing useful work in all those domains, and many years of work left, yet that's no reason to misinform and conflate facts..

1

3

0

221

Emerson Segura @emerson

8 days ago

@thestreamingdev what is this.. a prompt for what model?

1

0

103

Emerson Segura @emerson

8 days ago

@CongrongX nice work

0

401

Emerson Segura @emerson

8 days ago

@DominiqueCAPaul sounds fun!

0

1

0

30

Emerson Segura @emerson

9 days ago

@Scobleizer yep, 1) LLMs are not good enough out of the box for most enterprise cases. 2) Post-training is hard and requires both ML skills & clean domain-specific data. Enterprises and their consultants do not have the skills to implement this yet; for the next while, vertical startups will do well

0

1

0

27
