André Susano Pinto @ASusanoPinto - Twitter Profile

2 days ago

@mtschannen 🚀Gemma4 12B🚀 We made it great by training a simpler model. No vision or audio encoders. Easier said than done. Going from exploratory experiments to a final model is always interesting. Joint work with @mtschannen @AndreasPSteiner @confusezius @kmisiunas and the whole Gemma team.

ASusanoPinto retweeted

Michael Tschannen @mtschannen

over 1 year ago

Check out our detailed report about *Jet* 🌊 - a simple, transformer-based normalizing flow architecture without bells and whistles. Jet is an important part of JetFormer's engine ⚙️ As a standalone model it is very tame and behaves predictably (e.g. when scaling it up).

André Susano Pinto @ASusanoPinto

over 1 year ago

Making new simple things requires attention to detail, from numeric precision to unexpected bugs deep in the stack. But now there is a precedent that includes paper, numbers and code. Hope it helps people go hammer some nails🔨

André Susano Pinto @ASusanoPinto

over 1 year ago

Jet, the tool in JetFormer. A coupling normalizing flow where the blocks are powered by ViT. Simple, scalable and it works!

Alexander Kolesnikov @__kolesnikov__

over 1 year ago

With some delay, JetFormer's *prequel* paper is finally out on arXiv: a radically simple ViT-based normalizing flow (NF) model that achieves SOTA results in its class. Jet is one of the key components of JetFormer, deserving a standalone report. Let's unpack: 🧵⬇️ https://t.co/SIKfpEMMWf

ASusanoPinto retweeted

merve @mervenoyann

over 1 year ago

Welcome PaliGemma 2! 🤗 Google released PaliGemma 2, best vision language model family that comes in various sizes: 3B, 10B, 28B, based on Gemma 2 and SigLIP, comes with transformers support day-0 🎁 Saying this model is amazing would be an understatement, keep reading ✨

ASusanoPinto retweeted

Andreas Steiner @AndreasPSteiner

over 1 year ago

🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes. 1/7 https://t.co/NGy3mMM7sD

André Susano Pinto @ASusanoPinto

over 1 year ago

@AndreasPSteiner: "Let's just train and see where it goes..." Well, far... We had to write a 30+ page tech report for all of you to enjoy :)

Andreas Steiner @AndreasPSteiner

over 1 year ago

🚀🚀PaliGemma 2 is our updated and improved PaliGemma release using the Gemma 2 models and providing new pre-trained checkpoints for the full cross product of {224px,448px,896px} resolutions and {3B,10B,28B} model sizes. 1/7

André Susano Pinto @ASusanoPinto

over 1 year ago

@YugeTen @__kolesnikov__ We already knew we would like it. But we didn't know how :) The NF comes with two properties: it is invertible and has a computable logdet. Together they prevent the model from cheating by mapping all latents to a trivial point and then obtaining a perfect loss on the AR model that predicts that trivial output.
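
The two properties named in the reply above can be illustrated with a minimal sketch of a toy affine coupling layer. This is an illustration under stated assumptions, not Jet's implementation: the function names and the tiny conditioner (parameters `w` and `b`) are hypothetical stand-ins for the ViT blocks.

```python
import math

def coupling_forward(x, w=0.5, b=0.1):
    """Toy affine coupling: keep the first half, transform the second half."""
    half = len(x) // 2
    x1, x2 = x[:half], x[half:]
    # Hypothetical conditioner: scale and shift depend only on x1.
    s = math.tanh(w * sum(x1))   # log-scale, bounded by tanh
    t = b * sum(x1)              # shift
    y2 = [v * math.exp(s) + t for v in x2]
    # The Jacobian is triangular, so log|det| is the sum of the log-scales.
    logdet = s * len(x2)
    return x1 + y2, logdet

def coupling_inverse(y, w=0.5, b=0.1):
    """Exact inverse: recompute s and t from the untouched half."""
    half = len(y) // 2
    y1, y2 = y[:half], y[half:]
    s = math.tanh(w * sum(y1))
    t = b * sum(y1)
    x2 = [(v - t) * math.exp(-s) for v in y2]
    return y1 + x2

x = [1.0, 2.0, 3.0, 4.0]
y, logdet = coupling_forward(x)
x_rec = coupling_inverse(y)
print(y, logdet, x_rec)
```

Under the change-of-variables formula, log p(x) = log p_z(f(x)) + log|det J|. Squashing every input toward a single point would need a very negative log-scale, which drives the logdet term down and lowers the likelihood, so a collapsed latent is penalized rather than rewarded.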

André Susano Pinto @ASusanoPinto

over 1 year ago

Have you tried to get an autoregressive transformer to operate in a continuous latent space that is not fixed ahead of time but learned end to end from scratch? Enter JetFormer: https://t.co/NaQzHGvezm -- joint work with a dream team: @mtschannen and @__kolesnikov__

Michael Tschannen @mtschannen

over 1 year ago

Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)? We have been pondering this during summer and developed a new model: JetFormer 🌊🤖 https://t.co/ngvPzZvUYW A thread 👇 1/ https://t.co/04R4a1nbMu

André Susano Pinto @ASusanoPinto

over 6 years ago

It feels great to start adding diversity to the available pre-trained visual representations, especially when it has considerable impact on problems where examples are few or hard to collect.

Maxim Neumann @neu_maxim

over 6 years ago

We've looked into representation learning for #RemoteSensing with different datasets and fine-tuning using in-domain data. See paper with datasets and models included 🔋: https://t.co/TheaeAssWm with @ASusanoPinto, @XiaohuaZhai and @neilhoulsby. https://t.co/r7b4xuuDcZ

ASusanoPinto retweeted

Google AI @GoogleAI

over 6 years ago

We’re pleased to release the Visual Task Adaptation Benchmark (VTAB), a diverse, realistic, and challenging protocol to measure progress towards universal visual representations. Learn all about it below. https://t.co/PbORwSFPAg

André Susano Pinto @ASusanoPinto

about 7 years ago

@lc0d3r We started with the longer one, but by now it looks like #tfhub is winning. Shorter it is.

André Susano Pinto @ASusanoPinto

over 7 years ago

#TensorFlowHub helping fast experimentation and making ML models that go to space.

Sergii 🇺🇦 @lc0d3r

over 7 years ago · San Carlos

Amazing article showing how accessible #DeepLearning is becoming. Model trained with transfer learning and "#TensorFlow For Poets" codelab +#tfhub. Converted to #TFLite and now deployed on International Space Station🚀 - TensorFlow Lite is Going to Space - https://t.co/Jb8ykVxtRN https://t.co/oVuSmarsZt

ASusanoPinto retweeted

Andy Brock @ajmooch

over 7 years ago

BigGAN-deep pretrained models are now publicly available for download on TFHub! https://t.co/dZuYLwr8cJ

ASusanoPinto retweeted

TensorFlow @TensorFlow

over 7 years ago

A new, multilingual version of the Universal Sentence Encoder (USE) model is now available on #TFHub! Check it out here → https://t.co/N1JzuuX4MR https://t.co/xPD1d9AUxd

ASusanoPinto retweeted

Google DeepMind @GoogleDeepMind

over 7 years ago

The BigGAN generators from our paper https://t.co/QUYlE9IBsE are now available on TF Hub (https://t.co/GHM9pIgQPw). Try the Colab demo at: https://t.co/Ynyb9T9AAD

ASusanoPinto retweeted

ACM-W womENcourage @ACMwomENcourage

over 7 years ago

Enjoying the Workshop by Google engineer Elizabeth Kemp: Transfer Learning with TensorFlow Hub 👩‍💻 #womencourage18 #ML #TensorFlow

André Susano Pinto @ASusanoPinto

over 7 years ago

Our team hopes the new frontend helps more people find and use cutting-edge research modules :) #TensorFlowHub #transferlearning

TensorFlow @TensorFlow

over 7 years ago

We are launching a new web experience for TensorFlow Hub! Check out https://t.co/T8COqipES0 and explore our modules, including some new additions like the FasterRCNN for object detection. Learn more on the post ↓ https://t.co/Et5NjpoW8X

André Susano Pinto @ASusanoPinto

almost 8 years ago

Great to have image embedding modules trained on datasets other than just ImageNet.

TensorFlow @TensorFlow

almost 8 years ago

Winners of the @inaturalist Challenge 2017 released their model on #TensorflowHub showcasing advantages of transfer learning! #tfhub #transferlearning Check it out here ↓ https://t.co/ELrBIBWJUn
