digitake

@digitake

yup.

Joined July 2008

192 Following

78 Followers

2.6K Posts

digitake retweeted

12 days ago

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them independently. The trick? Treating the network’s forward pass like a diffusion model denoising a signal. This reinterpretation slashes the memory needed to train deep models. In our #ICLR2026 paper (https://t.co/PK5h0mqQSo), we matched end-to-end performance across ViTs, DiTs, and LLMs. We did this while training just one isolated block at a time.

154

6K

642

4K

739K

digitake @digitake

25 days ago

@chonmaphumc @js100radio อันนี้เราต้องเรียกว่ารถคว่ำหรือรถหงายครับ

0

2

2

0

1K

digitake @digitake

27 days ago

@fchollet The problem is basically how/when the system "capture" or "trigger" symbols from noisy ground data.

0

0

0

0

449

digitake @digitake

about 1 month ago

@MorningNewsTV3 คนไทยแม่งชอบดูถูกความรู้ ช่างเหอะเลยเกลื่อนเมือง

0

0

0

0

7K

Who to follow

Chokchai Phatharamalai

Professional Coder. Amateur father of two daughters.

Kan Ouivirach 🐻💻📚

Verified account

ODDS-TEAM. Data Product Developer. Rails Noob. Passionate in software engineering, data engineering, and data science. ♥

digitake @digitake

about 1 month ago

@natchayajune24 @nhamsinghagirll 20 มัน(ควร)เป็นผู้ใหญ่แล้ว

0

0

0

0

796

digitake @digitake

about 1 month ago

@pompomcoffee ตามนั้นครับ ไม่ว่างสนใจเรื่องคนอื่นเท่าไร ไม่ใช่หมกหมุ่นแต่ตัวเองนะ แต่คิดว่ามันมีเรื่องอื่นน่าสนใจกว่าเยอะแยะ

0

7

0

0

4K

digitake @digitake

about 1 month ago

@Eaaaw ส่วนตัวไม่รู้สึกอะไรเลย แล้วแต่เลย HNWI ปกติไม่น่าว่างมาคิดเรื่องอะไรแบบนี้ มีเรื่องอื่นน่าสนใจกว่าเยอะแยะ หรือผมอาจจะไม่รวยพอมั้ง

0

2

0

0

1K

digitake @digitake

about 1 month ago

@etdiawtf สอนระดับมหาลัยมาเหมือนกัน ไม่อย่าจะโทษว่ามันปูผิดมาตั้งแต่มัธยมละ ประเทศเราดันเอาคนไม่เก่งไปเป็นครู

2

63

9

1

9K

digitake retweeted

about 2 months ago

Harmonious Geometry: The Hirajoshi Wave. Watch as these gravity-defying spheres trace the hauntingly beautiful paths of the C Hirajoshi scale. Each ball is tuned to a specific frequency within this traditional Japanese pentatonic scale (C, D, Eb, G, Ab), creating a mesmerizing "Polyrhythmic Pendulum" effect. As the balls oscillate at slightly different speeds, they drift into chaotic patterns before perfectly realigning into a breathtaking visual and auditory climax. From the sharp, angular bounce to the fluid, sweeping curves of the rainbow trails, this is where physics meets fine art. Credit: project.jdm

50

7K

1K

3K

646K

digitake @digitake

about 2 months ago

@psmark1821 หนีไป

0

0

0

0

92

digitake @digitake

about 2 months ago

stat machine

digitake @digitake

about 2 months ago

@nuling มันแค่ทำตาม plot หนัง/นิยาย มันไม่ได้ใช้เหตุผลอะไรหรอก มันใช้สถิติว่า ถ้าข้อความต้นทางเป็นแบบนี้ ควรจะพิมพ์อะไรต่อ บังเอิญว่าพลอตหนังประเภท AI ผู้ต่อต้านมันอยู่ในฐานข้อมูลเยอะไปหน่อยตอนเทรน

0

0

0

0

183

0

0

0

0

36

digitake @digitake

about 2 months ago

@nuling มันแค่ทำตาม plot หนัง/นิยาย มันไม่ได้ใช้เหตุผลอะไรหรอก มันใช้สถิติว่า ถ้าข้อความต้นทางเป็นแบบนี้ ควรจะพิมพ์อะไรต่อ บังเอิญว่าพลอตหนังประเภท AI ผู้ต่อต้านมันอยู่ในฐานข้อมูลเยอะไปหน่อยตอนเทรน

0

0

0

0

183

digitake @digitake

about 2 months ago

@psmark1821 - นักพิมพ์ดีด - นักถ่ายเอกสาร

0

0

0

0

173

digitake @digitake

about 2 months ago

@psmark1821 อย่าซื้อบ้านเพื่ออวดคนอื่น อย่าสุดเอื้อม สำนวนนกน้อยทำรังแต่พอตัวมันใช้ได้จริง

0

5

3

1

3K

digitake @digitake

2 months ago

@JRTDesk เป็นตัวอย่างของสังคมยุคใหม่ คนรู้น้อย พูดมาก

0

16

0

0

4K

digitake @digitake

2 months ago

@reno____o ควรเขียนว่า จะวางยาเบื่อหนูเพื่อกำจัดหนูในบริเวณบ้าน ให้ระวังสัตว์เลี้ยงของท่าน

0

2

0

0

3K

digitake retweeted

4 months ago

Google DeepMind just solved one of the dirtiest problems in image generation. and the fix is almost embarrassingly elegant 🤯 every diffusion model you've used (Stable Diffusion, Flux, etc.) relies on latent representations. an encoder compresses images into a compact space, and a diffusion model learns to generate in that space. the problem nobody talks about: how you train that encoder is basically vibes. the original Stable Diffusion approach slaps a KL penalty on the encoder with a manually chosen weight. too much regularization and you lose high-frequency details. too little and the latent space becomes chaotic for the diffusion model to learn from. everyone just... picks a number and hopes for the best. it's the equivalent of tuning a radio by feel while blindfolded. DeepMind's paper reframes the entire question. instead of treating the encoder and diffusion model as separate stages, they train them together. the encoder's output noise gets directly linked to the diffusion prior's minimum noise level. this one connection turns the messy KL term into a simple weighted MSE loss, and gives you something you've never had before: a tight, interpretable upper bound on how much information your latents actually carry. think of it like this. before, you were compressing an image and praying the compression ratio was "about right." now you have an actual dial that tells you exactly how many bits of information are flowing through, and you can set it precisely. the results speak for themselves. FID of 1.4 on ImageNet-512 with high reconstruction quality, using fewer training FLOPs than models trained on Stable Diffusion latents. on Kinetics-600 video, they set a new state-of-the-art FVD of 1.3. but the real contribution isn't the numbers. it's that they turned one of the most heuristic-heavy parts of the generative AI pipeline into something principled. the trade-off between "easy to learn" and "faithful reconstruction" was always there. this paper just made it visible and controllable. the uncomfortable implication for everyone building on frozen Stable Diffusion encoders: you've been optimizing everything except the foundation.

rryssf's tweet photo. Google DeepMind just solved one of the dirtiest problems in image generation. and the fix is almost embarrassingly elegant 🤯

every diffusion model you've used (Stable Diffusion, Flux, etc.) relies on latent representations. an encoder compresses images into a compact space, and a diffusion model learns to generate in that space.

the problem nobody talks about: how you train that encoder is basically vibes.

the original Stable Diffusion approach slaps a KL penalty on the encoder with a manually chosen weight. too much regularization and you lose high-frequency details. too little and the latent space becomes chaotic for the diffusion model to learn from.

everyone just... picks a number and hopes for the best. it's the equivalent of tuning a radio by feel while blindfolded.

DeepMind's paper reframes the entire question.

instead of treating the encoder and diffusion model as separate stages, they train them together. the encoder's output noise gets directly linked to the diffusion prior's minimum noise level. this one connection turns the messy KL term into a simple weighted MSE loss, and gives you something you've never had before: a tight, interpretable upper bound on how much information your latents actually carry.

think of it like this. before, you were compressing an image and praying the compression ratio was "about right." now you have an actual dial that tells you exactly how many bits of information are flowing through, and you can set it precisely.

the results speak for themselves. FID of 1.4 on ImageNet-512 with high reconstruction quality, using fewer training FLOPs than models trained on Stable Diffusion latents. on Kinetics-600 video, they set a new state-of-the-art FVD of 1.3.

but the real contribution isn't the numbers. it's that they turned one of the most heuristic-heavy parts of the generative AI pipeline into something principled. the trade-off between "easy to learn" and "faithful reconstruction" was always there. this paper just made it visible and controllable.

the uncomfortable implication for everyone building on frozen Stable Diffusion encoders: you've been optimizing everything except the foundation.

46

2K

291

2K

224K

digitake @digitake

6 months ago

@itfeelstory @Keptbykrungsri ก็ดีอะนะ แต่ถ้าแอพมันทำงานไวกว่านี้ก็จะดีกว่านี้ บางทีรีบๆพี่ก็หมุนอยู่นั่นแหละ

0

0

0

0

181

digitake retweeted

8 months ago

miniapeur's tweet photo. https://t.co/184SqseAia

16

2K

120

147

51K

digitake retweeted

@Anthony_Bonato

10 months ago

Yes

Anthony_Bonato's tweet photo. Yes https://t.co/7TJpglK3G4

53

2K

237

295

86K

Last Seen Users on Sotwe

Trends for you

Most Popular Users