Guanghui Qin

@hiaoxui

Researcher at Microsoft Research. Ex Johne Hopkins.

Redmond, WA

Joined July 2017

121 Following

97 Followers

2 Posts

Guanghui Qin @hiaoxui

over 2 years ago

So what about compressing the input during encoding, into what we call “nuggets”? Surprisingly, we found that transformers can use 10% or even 5% of the tokens to represent the texts with negligible information loss. How did we achieve that?

216

Guanghui Qin @hiaoxui

over 2 years ago

We found that many approaches to long-range transformers (such as sparsified pattern, recurrence, and kernels) don’t actually translate into NLP task performance (https://t.co/54pkBWR4q2, #EACL2023). Can we fix it?

265

Guanghui Qin

@hiaoxui

Last Seen Users on Sotwe

Trends for you

Most Popular Users