Xiaodong Liu @AllenLao - Twitter Profile

AllenLao retweeted

over 1 year ago

Kudo to co-first authors Yu Gu, Robert Tinn, @kelvinih for driving the project to a big success, and big congrats to all the co-authors Michael R. Lucas, @naotous, @AllenLao, @TristanNaumann, @JianfengGao0217.

1

3

1

0

342

AllenLao retweeted

Jacob Andreas @jacobandreas

almost 3 years ago

jacobandreas's tweet photo. https://t.co/4EstogYCH8

0

112

20

13

56K

AllenLao retweeted

Geoffrey Hinton

@geoffreyhinton

about 3 years ago

In the NYT today, Cade Metz implies that I left Google so that I could criticize Google. Actually, I left so that I could talk about the dangers of AI without considering how this impacts Google. Google has acted very responsibly.

595

15K

3K

741

3M

Xiaodong Liu @AllenLao

about 3 years ago

@srchvrs @huggingface https://t.co/NMj7TvrmD8

0

2

0

207

Who to follow

Ofir Press

@OfirPress

I push the AI frontier by building tough benchmarks with amazing people. SWE-bench, SWE-agent, SciCode, AlgoTune. Postdoc @Princeton. PhD @nlpnoah @UW.

Caiming Xiong

@CaimingXiong

Co-founder of @Recursive_SI. ex-SVP of Salesforce AI Research | ex-MetaMind (Opinions are personal.)

Sewon Min

@sewon__min

Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp

Xiaodong Liu @AllenLao

about 3 years ago

@xiangrenNLP congratulations!

0

1

0

130

Xiaodong Liu @AllenLao

about 3 years ago

@alex_conneau @OpenAI congratulations!

0

118

AllenLao retweeted

Rada Mihalcea @radamihalcea

about 3 years ago

Drago loved his family and was a deeply caring father. His daughter, Victoria has a disability and requires extensive care. We are raising money to help Drago’s family to continue to provide Victoria with the care she needs. Any help will be appreciated!🙏🏼 https://t.co/AWhrWSHxdw

0

73

39

2

17K

AllenLao retweeted

Jeff Dean

@JeffDean

about 3 years ago

Bard is now available in the US and UK, w/more countries to come. It’s great to see early @GoogleAI work reflected in it—advances in sequence learning, large neural nets, Transformers, responsible AI techniques, dialog systems & more. You can try it at https://t.co/m9D7JYTHvU

27

707

117

30

340K

AllenLao retweeted

Yann LeCun

@ylecun

over 3 years ago

LLMs are still making sh*t up. That's fine if you use them as writing assistants. Not good as question answerers, search engines, etc. RLHF merely mitigates the most frequent mistakes without actually fixing the problem.

48

1K

195

160

435K

AllenLao retweeted

Tuo Zhao @tourzhao

over 3 years ago

Need scalable and efficient large language models for long sequences? Check our SPADE models in https://t.co/D190LCZw7U. By leveraging a state space layer, SPADE complements the lack of long-range dependency issue in transformer models using local attentions. (1/3)

tourzhao's tweet photo. Need scalable and efficient large language models for long sequences? Check our SPADE models in https://t.co/D190LCZw7U. By leveraging a state space layer, SPADE complements the lack of long-range dependency issue in transformer models using local attentions. (1/3) https://t.co/blgiDuW8jl

1

23

4

5

7K

AllenLao retweeted

MMitchell

@mmitchell_ai

almost 4 years ago

Q: @FAccTConference (main AI Ethics conf) was $10,000 short. They also turned down Google sponsorship due to G's continued refusal to address structural discrimination & trauma to me & @timnitGebru specifically. Is there any issue w/ me starting a GoFundMe to make up the diff?

4

94

10

0

AllenLao retweeted

Liam Fedus

@LiamFedus

almost 4 years ago

Today we're releasing all Switch Transformer models in T5X/JAX, including the 1.6T param Switch-C and the 395B param Switch-XXL models. Pleased to have these open-sourced! https://t.co/02YVX4dpUB All thanks to the efforts of James Lee-Thorp, @ada_rob, and @hwchung27

19

989

191

190

0

AllenLao retweeted

Yaqing Wang @Yaqing_Wang

about 4 years ago

🚨[New Paper] Check out our recent work on parameter-efficient fine-tuning. We introduce a new method to boost the performance of Adapter to outperform full model fine-tuning. Great collaboration with @subho_mpi, @AllenLao, Jing Gao, @AhmedHAwadallah and @JianfengGao0217.

0

27

7

3

0

Xiaodong Liu @AllenLao

about 4 years ago

@Skiminok @femtechie Do you guys provide any computational resources?

1

0

AllenLao retweeted

Nathan Benaich

@nathanbenaich

about 4 years ago

🤓In 2017, Google researchers introduced the Transformer in "Attention is all you need", which took AI by storm. 5 startups were born: @AdeptAILabs (🏦 @airstreet), Inceptive, @NEARProtocol, @CohereAI, CharacterAI. Only 1/8 authors remain @GoogleAI, another is at @OpenAI. 😉

nathanbenaich's tweet photo. 🤓In 2017, Google researchers introduced the Transformer in "Attention is all you need", which took AI by storm.

5 startups were born: @AdeptAILabs (🏦 @airstreet), Inceptive, @NEARProtocol, @CohereAI, CharacterAI.

Only 1/8 authors remain @GoogleAI, another is at @OpenAI.

😉 https://t.co/bUjhwKXdDk

17

1K

178

321

0

AllenLao retweeted

AI at Meta

@AIatMeta

about 4 years ago

Today Meta AI is sharing OPT-175B, the first 175-billion-parameter language model to be made available to the broader AI research community. OPT-175B can generate creative text on a vast range of topics. Learn more & request access: https://t.co/3rTMPms1vq

47

2K

643

267

0

AllenLao retweeted

Databricks AI Research

@DbrxMosaicAI

about 4 years ago

Today, an exciting paper from @MSFTResearch: Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer https://t.co/NQVigh6kUL While it's too early to say, this may be remembered as the single biggest efficiency advancement in hyperparameter tuning.

DbrxMosaicAI's tweet photo. Today, an exciting paper from @MSFTResearch:
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
https://t.co/NQVigh6kUL

While it's too early to say, this may be remembered as the single biggest efficiency advancement in hyperparameter tuning. https://t.co/cZqn67lnwK

3

205

38

64

0

AllenLao retweeted

Aleksa Gordić (水平问题)

@gordic_aleksa

about 4 years ago

[🥳new video🧠] "Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer" (μTransfer) paper explained! YT: https://t.co/DGI5tybyDL @TheGregYang @edwardjhu @ibab_ml @sidorszymon @AllenLao @merettm @WeizhuChen @JianfengGao0217 @MSFTResearch @OpenAI

gordic_aleksa's tweet photo. [🥳new video🧠] "Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer" (μTransfer) paper explained!

YT: https://t.co/DGI5tybyDL

@TheGregYang @edwardjhu @ibab_ml @sidorszymon @AllenLao @merettm @WeizhuChen @JianfengGao0217 @MSFTResearch @OpenAI https://t.co/SVWnz1VnKy

1

23

6

4

0

AllenLao retweeted

Microsoft Research

@MSFTResearch

over 4 years ago

When a neural network is too large to pretrain more than once, tuning its hyperparameters is practically impossible. Today, we announce μTransfer—a new technique that can tune the 6.7 billion parameter GPT-3 model using only 7% of the pretraining compute: https://t.co/RnS5HZboq0

6

464

90

77

0

Xiaodong Liu

@AllenLao

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users