Thinh Truong @ththinh_ - Twitter Profile

ththinh_ retweeted

@aryaman2020

7 months ago

i hate ML conference reviewers. i take back everything bad i ever said about ACL. every ACL reviewer i ever got was at least literate

14

470

18

27

36K

Thinh Truong @ththinh_

11 months ago

@lrzneedresearch 7557 and rejected too. AC even recommended accepting. What's the point of the review process if PC just dismissing everyone...

0

1

0

270

ththinh_ retweeted

CLS

@ChengleiSi

12 months ago

Are AI scientists already better than human researchers? We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs human experts. Main finding: LLM ideas result in worse projects than human ideas.

ChengleiSi's tweet photo. Are AI scientists already better than human researchers?

We recruited 43 PhD students to spend 3 months executing research ideas proposed by an LLM agent vs human experts.

Main finding: LLM ideas result in worse projects than human ideas.

12

633

182

227

153K

Thinh Truong @ththinh_

over 1 year ago

@Swarooprm7 my goat @quocleix

0

1

0

281

Who to follow

Adam Fisch

@adamjfisch

Research Scientist @ Google DeepMind | Formerly: PhD @ MIT EECS.

Abhishek Salian

@acvsalian

Building https://t.co/zlBA5gxH6U

Afshin Rahimi

@ashrayme

Applied Scientist @ Amazon

Thinh Truong @ththinh_

almost 2 years ago

also got interrogated at melbourne airport after 30 hrs on the plane. This sucks so much 🙃.

5

0

235

Thinh Truong @ththinh_

about 2 years ago

@lixin4ever Thank you for the release! One question out of curiosity: What is the motivation behind training LLM for SEA? I know that some languages like malay and indo share some similarities but they are generally disconnected (different script, morphology, culture, etc.)

1

0

286

ththinh_ retweeted

Khuyagbaatar Batsuren @khuyagbaatar_b

about 2 years ago

🚨 New paper on Subword Tokenization 🚨 - umLabeller, a new tool, classifies subword tokenization into morph 🤹 or alien 👽 - alien tokenization 🛸 leads to poorer generalizations than morphological tokenization for 3 downstream tasks. https://t.co/gsfLcb0blF (1/7)

khuyagbaatar_b's tweet photo. 🚨 New paper on Subword Tokenization 🚨

- umLabeller, a new tool, classifies subword tokenization into morph 🤹 or alien 👽

- alien tokenization 🛸 leads to poorer generalizations than morphological tokenization for 3 downstream tasks. https://t.co/gsfLcb0blF (1/7) https://t.co/rovSKGYn5q

3

52

16

24

9K

Thinh Truong @ththinh_

about 2 years ago

0-shot evaluation is important (if not more meaningful) when evaluating general capabilities of LLM.

(((ل()(ل() 'yoav))))👾

@yoavgo

about 2 years ago

"In other words, the modification are so simple that there is a rule to determine the label (e.g., adding 'X allegedly did Y' doe not entail 'X did Y'). " And yet the massively pretrained models do not capture this! I find it interesting and remarkable.

1

40

0

1

4K

0

1

0

74

Thinh Truong @ththinh_

about 2 years ago

11/ Please have a look at the paper if you find this interesting. Big thanks to my co-author and supervisors: @YuliaOtmakhova, @karinv, Trevor, @eltimster. Finally, I will be at NAACL (my first in-person conference after almost finishing my PhD). See you in Mexico!

0

2

0

84

Thinh Truong @ththinh_

about 2 years ago

Paper accepted to NAACL 2024 main conference. arxiv: https://t.co/7fyRBZSz7k In this work, we explore the interaction between two bottlenecks of LLMs: negation and tokenization (quoting @karpathy: "tokenization is the root of suffering").

ththinh_'s tweet photo. Paper accepted to NAACL 2024 main conference.
arxiv: https://t.co/7fyRBZSz7k
In this work, we explore the interaction between two bottlenecks of LLMs: negation and tokenization (quoting @karpathy: "tokenization is the root of suffering"). https://t.co/SsX4fICfW5

3

9

4

6

1K

Thinh Truong @ththinh_

about 2 years ago

10/ We see that models could clearly understand negation despite incorrect tokenization. This could be an interesting phenomenon to look at when discovering LLM interpretability. Also, as English is poor in morphology, we are eager to extend this analysis to other languages.

1

0

78

Thinh Truong

@ththinh_

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users