Keyu Tian @keyutian - Twitter Profile

Keyu Tian @keyutian

about 3 years ago

@noahcjones here is a brief comparison

0

1

0

3K

Keyu Tian @keyutian

over 3 years ago

Now this paper has been accepted to #ICLR2023 as a spotlight🌟! I truly appreciate everyone's efforts. Check out our repo https://t.co/zjGbmLtZYI for the latest demo video and updates! Title: "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Sebastian Raschka

@rasbt

over 3 years ago

How can we leverage successful pretraining techniques from transformers to improve purely convolutional networks? The answer is *Sparse Convolutions*! Let's see what happens when purely convolutional networks are pretrained with 1.28 million unlabeled images ... 1/7

rasbt's tweet photo. How can we leverage successful pretraining techniques from transformers to improve purely convolutional networks? The answer is *Sparse Convolutions*!

Let's see what happens when purely convolutional networks are pretrained with 1.28 million unlabeled images ...

1/7 https://t.co/rr9qd7TWWo

9

487

102

263

140K

4

76

12

32

36K

keyutian retweeted

Python Trending 🇺🇦 @pythontrending

over 3 years ago

SparK - [ICLR'23 Spotlight] The first successful BERT-style pretraining on any *convolutional network*; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling" https://t.co/vdgxhmPxS1

0

6

2

4

10K

keyutian retweeted

/MachineLearning @slashML

over 3 years ago

: The first BERT-style pretraining on CNNs! #nlp #cnn https://t.co/L7GT2MFTNH

1

18

7

9

11K

Who to follow

Helin Xu

@helin_xu_

3D & Embodied AI @HaoSuLabUCSD | @sudoAI_ @UCSanDiego @Tsinghua_Uni

Matthew Siper

@MatthewSiper

Founder & builder behind @the_nof1’s agentic quant stack Alpha Arena · Alpha Chat · Agent Builder

Yunzhen Feng

@feeelix_feng

PhD at CDS, NYU. Ex-Intern at GenAI, FAIR @AIatMeta. Previously undergrad at @PKU1898

Keyu Tian @keyutian

over 3 years ago

@foozlefoo Thanks, we're glad SparK may be helpful to your project! Sorry we delayed the update for a few days because it's Lunar New Year today, we'll get back to you as soon as they're posted.

1

0

936

Keyu Tian @keyutian

over 3 years ago

PS: our original submission in Oct. 2022 was titled "Sparse and Hierarchical Masked Modeling for Convolutional Representation Learning"; we've changed it to the new one to more clearly express our vision for bringing the powerful BERT-style self-supervised learning to CNNs.

1

3

0

2

6K

Keyu Tian @keyutian

over 3 years ago

@rasbt Thank you so much! Your notes are really helpful in letting people know about what we do! And I'm glad to see this work has the opportunity to make a meaningful contribution to our community.

0

1

0

916

keyutian retweeted

Daisuke Okanohara / 岡野原大輔

@hillbig

over 3 years ago

画像認識の事前学習で、ViTは一部の入力をMaskし、それを予測するタスクが成功していたがConvは隣り合うパッチ間で重複があり情報が漏れ成功してなかった。SparKは出力も入力と同じマスクを維持するSparseConvを使うことで問題を解決、また復号器側も階層を持ちさらに改善 https://t.co/PWkAoOQXTl

0

61

9

19

14K

Keyu Tian @keyutian

over 3 years ago

@reidatcheson @rasbt Thanks for your interest! We are writing a google Colab tutorial to make it easy to play with our pre-trained models (e.g. to reconstruct the images you upload). We'll release it before this weekend and hope it can be helpful.

0

2

0

201

Keyu Tian @keyutian

over 3 years ago

@rasbt Yeah, that's true for point 2. It'd be better to say "CNNs achieved higher performance than Transformers". BTW, I really appreciate the contrastive idea, which seems more natural for image data than randomly masking. But it turns out the BERT-style pretraining is more effective.

0

182

keyutian retweeted

Sebastian Raschka

@rasbt

over 3 years ago

How can we leverage successful pretraining techniques from transformers to improve purely convolutional networks? The answer is *Sparse Convolutions*! Let's see what happens when purely convolutional networks are pretrained with 1.28 million unlabeled images ... 1/7

9

487

102

263

140K

keyutian retweeted

fly51fly @fly51fly

over 3 years ago

[CV] Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling K Tian, Y Jiang, Q Diao, C Lin, L Wang, Z Yuan [Peking University & Bytedance Inc & University of Oxford] (2023) https://t.co/TJwQep1Has #MachineLearning #ML #AI #CV [1/2]

fly51fly's tweet photo. [CV] Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling
K Tian, Y Jiang, Q Diao, C Lin, L Wang, Z Yuan [Peking University & Bytedance Inc & University of Oxford] (2023)
https://t.co/TJwQep1Has
#MachineLearning #ML #AI #CV
[1/2] https://t.co/erswznpp2X

1

30

10

17

9K

Keyu Tian @keyutian

over 3 years ago

@HappyToKnowThat @jamesr66a it's kinda like the number of neurons in a brain, the more, the smarter. And each parameter is a float in Java, so you can imagine how much memory it takes.

0

134

Keyu Tian @keyutian

over 3 years ago

@ducha_aiki Thanks for sharing😃! For those who may be interested in the code implementation, please see: https://github[dot]com/keyu-tian/SparK

0

2

0

113

Keyu Tian @keyutian

over 3 years ago

@Deep__AI Thanks for sharing😃! For those who may be interested in the code implementation, please see: https://github[dot]com/keyu-tian/SparK

0

3

0

240

keyutian retweeted

DeepAI

@DeepAI

over 3 years ago

🤩The first successful BERT-style #SelfSupervisedLearning on any convolutional network! #ResNet now enjoys masked autoencoding! 🚀A breakthrough paper "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling" by @keyutian et al. https://t.co/YmnyCsOKHj

1

8

3

3K

keyutian retweeted

Dmytro Mishkin 🇺🇦 @ducha_aiki

over 3 years ago

Designing BERT for convolutional networks: sparse and hierarchical masked modeling Keyu Tian, Yi Jiang, Qishuai Diao, Chen Lin, Liwei Wang, Zehuan Yuan tl;dr: create image, which looks to CNN same, as transformers -> MIM starts working https://t.co/g6zDN0BbEH

ducha_aiki's tweet photo. Designing BERT for convolutional networks: sparse and hierarchical masked modeling

Keyu Tian, Yi Jiang, Qishuai Diao, Chen Lin, Liwei Wang, Zehuan Yuan

tl;dr: create image, which looks to CNN same, as transformers -> MIM starts working
https://t.co/g6zDN0BbEH https://t.co/KuXqKd02iy

4

117

25

44

25K

Keyu Tian @keyutian

over 3 years ago

@kzmolikova @rasbt Thanks for noticing this, that is our work which's been on OpenReview since Oct. 2022 (https://openreview. net/forum?id=NRxydtWup1S), and uploaded to arxiv recently. It gives some different results from ConvNeXt v2, e.g., our method works pretty well on ConvNeXt v1 and ResNet.

0

2

0

126

Keyu Tian @keyutian

over 3 years ago

@DiChang10 @eccvconf cool, hope to see you on next ECCV!

0

178

Keyu Tian

@keyutian

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users