Taha ⵣ @mlnomadpy - Twitter Profile

Pinned Tweet

6 days ago

during my time learning about contrastive/self-supervised learning, it always felt mythical on how it exactly works and what mechanism it introduce. I created these two blogs to explain my learning during the past few years, and simplify the concepts links below

5

297

33

298

19K

Taha ⵣ

@mlnomadpy

about 7 hours ago

@giffmana fr deepmind had much more than just a good chatbot they ipo'ed a bit early 💔

0

1

0

365

Taha ⵣ

@mlnomadpy

about 8 hours ago

@EnMaroc ما هذا الخراء

0

53

Taha ⵣ

@mlnomadpy

about 9 hours ago

imo deepmind should ipo too lol

1

3

0

20K

Who to follow

Kaito | 海斗

@_kaitodev

🇲🇦 | building the missing piece of video storytelling @joinodysser | prev. @_buildspace | 51.2k on IG | not from @uwaterloo

ACHRAF 🇲🇦🇵🇸

@aamri_achraf

Software Engineer Gamer & DIY lifestyle

baba bihi 💬🌾

@_Bihi23

Software engineer by day ⛅️ , everything else by night oo My bsky account: https://t.co/hQk5F8dy8R

Taha ⵣ

@mlnomadpy

about 10 hours ago

@osanseviero hollyyyyy mama

0

147

Taha ⵣ

@mlnomadpy

about 22 hours ago

@osanseviero GOATs

0

178

Taha ⵣ

@mlnomadpy

1 day ago

@mtschannen this huge!!!!!

0

2

0

637

mlnomadpy retweeted

Michael Tschannen @mtschannen

1 day ago

For the past years my research focus was on unifying models and training paradigms across modalities. Today I'm excited that we're releasing our latest model aligned with this theme: Gemma 4 12B, a dense encoder-free model which processes raw text, image, and audio inputs! 1/

mtschannen's tweet photo. For the past years my research focus was on unifying models and training paradigms across modalities. Today I'm excited that we're releasing our latest model aligned with this theme:

Gemma 4 12B, a dense encoder-free model which processes raw text, image, and audio inputs!

1/ https://t.co/4J2JKCtzU5

24

1K

125

512

99K

Taha ⵣ

@mlnomadpy

1 day ago

@googlegemma @0xEbaad

0

1

0

218

mlnomadpy retweeted

Google Gemma

@googlegemma

1 day ago

Meet Gemma 4 12B! A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license. Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇

googlegemma's tweet photo. Meet Gemma 4 12B!

A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.

Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇 https://t.co/gf4FZv0WZb

363

12K

2K

5K

3M

Taha ⵣ

@mlnomadpy

2 days ago

Lloyd R. Welch (1974). Lower Bounds on the Maximum Cross Correlation of Signals. IEEE Transactions on Information Theory, 20(3), 397–399. John J. Benedetto and Matthew Fickus (2003). Finite Normalized Tight Frames. Advances in Computational Mathematics, 18(2–4), 357–385. Vardan Papyan, X. Y. Han, David L. Donoho (2020). Prevalence of Neural Collapse During the Terminal Phase of Deep Learning Training. Proceedings of the National Academy of Sciences (PNAS), 117(40), 24652–24663. Tongzhou Wang and Phillip Isola (2020). Understanding Contrastive Representation Learning Through Alignment and Uniformity on the Hypersphere. ICML 2020 (PMLR 119), 9929–9939. arXiv:2005.10242.

0

56

Taha ⵣ

@mlnomadpy

2 days ago

“make the latent space better” is the vaguest advice in ml. but there’s a precise answer, and it predates deep learning by decades. a good latent space is the solution to a sphere-packing problem. the optimum has a name. new post + a runnable jax companion 🧵

1

11

1

7

586

Taha ⵣ

@mlnomadpy

2 days ago

📖 read: https://t.co/EQXaZHcJGh 💻 jax: https://t.co/drn4WOesE5

1

0

21

Taha ⵣ

@mlnomadpy

5 days ago

@giffmana @BangachevKiril amma work on the blog post for it and tag you big fan :D

1

0

285

Taha ⵣ

@mlnomadpy

6 days ago

during my time learning about contrastive/self-supervised learning, it always felt mythical on how it exactly works and what mechanism it introduce. I created these two blogs to explain my learning during the past few years, and simplify the concepts links below

5

297

33

298

19K

Taha ⵣ

@mlnomadpy

6 days ago

@BangachevKiril what i'm exploring is whether image representations can be modeled as a fiber bundle, where text captures the semantic base space and image-specific details are encoded in the fibers text would only need to approximate the shared semantic structure rather than the full image rep

0

34

Taha ⵣ

@mlnomadpy

6 days ago

@BangachevKiril yyyep, what i understand so far is that fully closing the modality gap may remove modality-specific information, especially from images.

1

0

27

Taha ⵣ

@mlnomadpy

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users