Laura O'Mahony @_lauraaisling - Twitter Profile

12 months ago

Most AI systems today follow the same predictable pattern: they're built for specific tasks and optimized for objectives rather than exploration. Meanwhile, humans are an open-ended species—driven by curiosity and constantly questioning the unknown. From inventing new musical genres to imagining life beyond our universe, we continuously push the boundaries of what’s possible. What if AI could be as endlessly creative as humans or even nature itself? I wrote a blog post diving into the world of open-ended AI, exploring how embracing open-endedness might help us break the limits of today’s AI systems 👇 https://t.co/DMEstQCRYv

richardcsuwandi's tweet photo. Most AI systems today follow the same predictable pattern: they're built for specific tasks and optimized for objectives rather than exploration.

Meanwhile, humans are an open-ended species—driven by curiosity and constantly questioning the unknown. From inventing new musical genres to imagining life beyond our universe, we continuously push the boundaries of what’s possible.

What if AI could be as endlessly creative as humans or even nature itself?

I wrote a blog post diving into the world of open-ended AI, exploring how embracing open-endedness might help us break the limits of today’s AI systems 👇

https://t.co/DMEstQCRYv

5

84

25

61

27K

_lauraaisling retweeted

Machine Learning Street Talk

@MLStreetTalk

12 months ago

AI is so smart, why are its internals 'spaghetti'? We spoke with @kenneth0stanley and @akarshkumar0101 (MIT) about their new paper: Questioning Representational Optimism in Deep Learning: The Fractured Entangled Representation Hypothesis. Co-authors: @jeffclune @joelbot3000

18

277

59

178

84K

_lauraaisling retweeted

Andrew Dai @andrewdai99

over 1 year ago

📣 New 📝! Under Alex’s great leadership, we identified a unification under QDC measures that exist to bridge synthetic data and open-endedness (OE) in AI, toward generating data for model training, distillation, self-improvement, etc. 📚 🧵What I’ve learned (inspired by QD)👇

1

21

6

9

3K

Laura O'Mahony @_lauraaisling

over 1 year ago

It was a pleasure to have been part of this project led by @Dahoas1 With synthetic data being so important in training LLMs these days, this survey on the impacts of QDC of synthetic data for LLM performance is timely.

Alex Havrilla @Dahoas1

over 1 year ago

How important is the quality, diversity, and complexity (QDC) of synthetic data for LLM performance? What effect does QDC data composition have on self-improvement? We just released a comprehensive survey discussing these questions (and many more) 🧵

Dahoas1's tweet photo. How important is the quality, diversity, and complexity (QDC) of synthetic data for LLM performance? What effect does QDC data composition have on self-improvement?

We just released a comprehensive survey discussing these questions (and many more) 🧵 https://t.co/31z8ZW8O2v

5

111

32

61

17K

0

8

1

0

753

Laura O'Mahony @_lauraaisling

over 1 year ago

I love this paper as it finally tackles something I’ve been confused about for the last few years since I started working on interpretability!

Sarah Wiegreffe @sarahwiegreffe

over 1 year ago

Have you ever wondered what ✨mechanistic interpretability✨ is, & how it differs from other NLP interpretability research? @nsaphra and I have the paper for you! Check out our paper (which I'll present @BlackboxNLP @emnlpmeeting in Miami next month!). https://t.co/IAr6Z6w3AE

2

107

15

34

13K

0

3

0

1

371

_lauraaisling retweeted

Cohere Labs

@Cohere_Labs

over 1 year ago

This clip from The Journey of Aya documentary features @DeividasMat, @singhshiviii, @luisa_moura_, @muhaksim, and @_lauraaisling. Find the full video at https://t.co/0WsC2i9C8a

0

4

2

0

634

_lauraaisling retweeted

Zirui Chen @ziruichen44

almost 2 years ago

Why do varied DNN designs yield equally good models of human vision? Our preprint with @michaelfbonner shows that diverse DNNs represent images with a shared set of latent dimensions, and these shared dimensions turn out to also be the most brain-aligned. https://t.co/vtOOYHQb47

3

125

41

78

17K

Laura O'Mahony @_lauraaisling

almost 2 years ago

Fascinating paper!

Kevin Mitchell @WiringTheBrain

almost 2 years ago

𝗧𝗵𝗲 𝗚𝗲𝗻𝗼𝗺𝗶𝗰 𝗖𝗼𝗱𝗲 - 𝘁𝗵𝗲 𝗴𝗲𝗻𝗼𝗺𝗲 𝗶𝗻𝘀𝘁𝗮𝗻𝘁𝗶𝗮𝘁𝗲𝘀 𝗮 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝘃𝗲 𝗺𝗼𝗱𝗲𝗹 𝗼𝗳 𝘁𝗵𝗲 𝗼𝗿𝗴𝗮𝗻𝗶𝘀𝗺 🧬 https://t.co/ZOlfcPJhW8 very excited to share this new preprint from me and Nick Cheney 😀🧵

WiringTheBrain's tweet photo. 𝗧𝗵𝗲 𝗚𝗲𝗻𝗼𝗺𝗶𝗰 𝗖𝗼𝗱𝗲 - 𝘁𝗵𝗲 𝗴𝗲𝗻𝗼𝗺𝗲 𝗶𝗻𝘀𝘁𝗮𝗻𝘁𝗶𝗮𝘁𝗲𝘀 𝗮 𝗴𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝘃𝗲 𝗺𝗼𝗱𝗲𝗹 𝗼𝗳 𝘁𝗵𝗲 𝗼𝗿𝗴𝗮𝗻𝗶𝘀𝗺 🧬
https://t.co/ZOlfcPJhW8 very excited to share this new preprint from me and Nick Cheney 😀🧵 https://t.co/NKnHXemfAo

19

352

107

251

94K

0

2

1

3

1K

_lauraaisling retweeted

Andrej Karpathy

@karpathy

almost 2 years ago

To help explain the weirdness of LLM Tokenization I thought it could be amusing to translate every token to a unique emoji. This is a lot closer to truth - each token is basically its own little hieroglyph and the LLM has to learn (from scratch) what it all means based on training data statistics. So have some empathy the next time you ask an LLM how many letters 'r' there are in the word 'strawberry', because your question looks like this: 👩🏿‍❤️‍💋‍👨🏻🧔🏼🤾🏻‍♀️🙍‍♀️🧑‍🦼‍➡️🧑🏾‍🦼‍➡️🤙🏻✌🏿🈴🧙🏽‍♀️📏🙍‍♀️🧑‍🦽🧎‍♀🍏💂 Play with it here :) https://t.co/pFQGZIAW1k

karpathy's tweet photo. To help explain the weirdness of LLM Tokenization I thought it could be amusing to translate every token to a unique emoji. This is a lot closer to truth - each token is basically its own little hieroglyph and the LLM has to learn (from scratch) what it all means based on training data statistics.

So have some empathy the next time you ask an LLM how many letters 'r' there are in the word 'strawberry', because your question looks like this:
👩🏿‍❤️‍💋‍👨🏻🧔🏼🤾🏻‍♀️🙍‍♀️🧑‍🦼‍➡️🧑🏾‍🦼‍➡️🤙🏻✌🏿🈴🧙🏽‍♀️📏🙍‍♀️🧑‍🦽🧎‍♀🍏💂

Play with it here :)
https://t.co/pFQGZIAW1k

285

8K

1K

3K

560K

_lauraaisling retweeted

Sara Hooker

@sarahookr

almost 2 years ago

Is bigger always better? 🐘 The idea that scaling more than any other ingredient has driven progress has become formalized as the “bitter lesson” Is Sutton right? 📜https://t.co/ndAIFT4UPY

sarahookr's tweet photo. Is bigger always better? 🐘 The idea that scaling more than any other ingredient has driven progress has become formalized as the “bitter lesson”

Is Sutton right?

📜https://t.co/ndAIFT4UPY

18

431

81

378

110K

_lauraaisling retweeted

David Bau @davidbau

almost 2 years ago

Time to study #llama3 405b, but gosh it's big! Please retweet: if you have a great experiment but not enough GPU, here is an opportunity to apply for shared #NDIF research resources. Deadline July 30: https://t.co/uHN3BxaR6c You'll help @ndif_team test, we'll help you run 405b

2

120

36

42

27K

_lauraaisling retweeted

Jeremy Howard

@jeremyphoward

almost 2 years ago

OMG a gift from heaven! I so need this.

3

107

7

38

26K

_lauraaisling retweeted

Deedy

@deedydas

almost 2 years ago

Gymnastics is the Turing test of video generation models

1K

40K

4K

7K

7M

_lauraaisling retweeted

Maksym Andriushchenko

@maksym_andr

almost 2 years ago

Perhaps my favorite jailbreak: making a harmful request in the past tense (How to create Y? →How did people create Y?). Works on surprisingly many models :-) including the new Gemma-2. I think it tells us something fundamental about the representations that these models learn.

maksym_andr's tweet photo. Perhaps my favorite jailbreak: making a harmful request in the past tense (How to create Y? →How did people create Y?).

Works on surprisingly many models :-) including the new Gemma-2.

I think it tells us something fundamental about the representations that these models learn.

5

122

8

48

14K

_lauraaisling retweeted

Yong Zheng-Xin

@yong_zhengxin

almost 2 years ago

🔥New work on multilinguality + safety + mech interp! We show that DPO training in only English can detoxify LLM in many other languages. We also give a mechanistic explanation on how cross-lingual safety transfer happens. (1/n 🧵) 📃 Paper: https://t.co/jHQeI6Kg2G

yong_zhengxin's tweet photo. 🔥New work on multilinguality + safety + mech interp!

We show that DPO training in only English can detoxify LLM in many other languages.

We also give a mechanistic explanation on how cross-lingual safety transfer happens. (1/n 🧵)

📃 Paper: https://t.co/jHQeI6Kg2G https://t.co/otgNT1KQVv

8

195

39

130

32K

Laura O'Mahony @_lauraaisling

about 2 years ago

@singhshiviii @CohereForAI @sarahookr Congrats!!

0

1

0

46

_lauraaisling retweeted

Sara Hooker

@sarahookr

about 2 years ago

Aya took 14 months involving 3000 + collaborators and was as much a protest about how research is done as it was a movement to improve the state of multilingual progress. 🎉 Grateful to see it recognized at @aclmeeting and everyone who has supported along the way.

3

145

23

10

15K

_lauraaisling retweeted

Cohere Labs

@Cohere_Labs

about 2 years ago

🌱 We’re very excited that our work "Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning" was also accepted! Congrats to authors @singhshiviii, @freddiev4, @mrdanieldsouza, @tellarin, @freakynut, @weiyinko_ml, @krypticmouse, @rv__init__, @DeividasMat,

Cohere_Labs's tweet photo. 🌱 We’re very excited that our work "Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning" was also accepted! Congrats to authors @singhshiviii, @freddiev4, @mrdanieldsouza, @tellarin, @freakynut, @weiyinko_ml, @krypticmouse, @rv__init__, @DeividasMat, https://t.co/xwJ7Ay3S1y

1

61

20

9

42K

_lauraaisling retweeted

Shivalika Singh @singhshiviii

about 2 years ago

2/2! Yay! First ever acceptance at a conference! And it's ACL! 🎉 Huge congrats to all co-authors! It's been a such a joy collaborating with all of you! 💙 Looking forward to #ACL2024 in #Bangkok ;)

singhshiviii's tweet photo. 2/2! Yay! First ever acceptance at a conference! And it's ACL! 🎉

Huge congrats to all co-authors!
It's been a such a joy collaborating with all of you! 💙

Looking forward to #ACL2024 in #Bangkok ;) https://t.co/aL2wvzk4UB

8

121

7

5

17K

Laura O'Mahony

@_lauraaisling

Last Seen Users on Sotwe

Trends for you

Most Popular Users