Not Me @davidschlangen - Twitter Profile

Pinned Tweet

over 3 years ago

It was fun while it lasted, etc etc. If you’re still here, don’t wait any longer and come over to … that other thing. It really isn’t that hard. If you identify as an “AI person”, https://t.co/GPbUxYlSLW is probably the server for you. I’m at [email protected]

0

2

1

4

0

Not Me @davidschlangen

over 1 year ago

Bob Dylan getting the Nobel Prize in Literature really paved the way for Geoff Hinton getting the Nobel Prize in Physics.

0

3

0

1

422

Not Me @davidschlangen

almost 2 years ago

I see another area is getting the treatment. (I was there for dialogue systems being discovered (in the way America was discovered).)

Matthew Guzdial @MatthewGuz

almost 2 years ago

It's funny to watch the stream of Google research/Deepmind papers that say they want to do automated game generation with 0 citations to anything in the area's 30+ year history.

5

156

11

22

21K

0

5

2

0

578

Not Me @davidschlangen

almost 2 years ago

Meta following Apple’s playbook?

0

281

Who to follow

EdinburghNLP

@EdinburghNLP

The Natural Language Processing Group at the University of Edinburgh.

Desmond Elliott

@delliott

Associate Professor at the University of Copenhagen. I work on Vision-Language models and Tokenization-free NLP. EMNLP 2026 PC.

Ehud Reiter

@EhudReiter

I am a computer scientist who works on natural language generation and evaluation, often in healthcare contexts. I teach at Aberdeen University.

Not Me @davidschlangen

almost 2 years ago

@techyalzay Yeah, we briefly looked into this and didn’t find an obvious easy source in the code of the page. (And the HELM webpage is… peculiar.) And it somehow feels wrong to have to scrape this data, which the producers should only have an interest in making accessible.

0

38

Not Me @davidschlangen

almost 2 years ago

Does anyone know how to get the HELM and Arena rankings in a machine readable format (and ideally, programmatically)? #lazyweb #LLMs

0

291

davidschlangen retweeted

Kranti @krantich_

almost 2 years ago

🥳This work has been accepted to #EMNLP 2024 Findings. Thanks to my co-authors @SherzodHakimov, @davidschlangen paper link: https://t.co/IWiMwvhOeB

krantich_'s tweet photo. 🥳This work has been accepted to #EMNLP 2024 Findings.
Thanks to my co-authors @SherzodHakimov, @davidschlangen

paper link: https://t.co/IWiMwvhOeB https://t.co/57w4Pte6pV

0

3

2

3

359

Not Me @davidschlangen

almost 2 years ago

@roman_klinger I’m old school, for me it’s only real when the notification letter arrives by post … erm, the email arrives. (Which now appears to be the case.) We did have a case recently however where something briefly was visible on OpenReview, and then the final decision was different.

0

1

0

100

Not Me @davidschlangen

almost 2 years ago

OpenReview hiding the EMNLP decisions until notifications have been sent out.

1

38

0

1

3K

davidschlangen retweeted

SIGdial @sigdial

almost 2 years ago

The SIGdial 2024 proceedings can now be found on the ACL Anthology 🎉 So many fantastic papers: https://t.co/CBY957o1Rh And if you are sharing your papers here, make sure to tag us @sigdial so we can repost it too! #SIGdial #SIGdial2024 @aclanthology

sigdial's tweet photo. The SIGdial 2024 proceedings can now be found on the ACL Anthology 🎉

So many fantastic papers:
https://t.co/CBY957o1Rh

And if you are sharing your papers here, make sure to tag us @sigdial so we can repost it too!

#SIGdial #SIGdial2024 @aclanthology https://t.co/Y7Q4BUHPvx

0

10

9

2

2K

Not Me @davidschlangen

almost 2 years ago

@wdavidmarx That’s hilarious. I don’t know what it means in Beck’s American context, but in Germany a reference to Heino in that situation would have been signalling an “I’m too cool to be embarrassed” attitude, because it could quite likely be true.

1

0

69

Not Me @davidschlangen

almost 2 years ago

Stay tuned for the full run. In the meantime, you can check out the clembench leaderboard here: https://t.co/Fbtc1Ceuo5

0

191

Not Me @davidschlangen

almost 2 years ago

Ok, whatever it is that @OpenAI has done to o1, it has payed off. At least on wordle, which used to be one of the hardest parts of our “conversational agency” benchmark. 4o: 23 (previous best) o1: 75.33 (Human expert players: 72)

davidschlangen's tweet photo. Ok, whatever it is that @OpenAI has done to o1, it has payed off. At least on wordle, which used to be one of the hardest parts of our “conversational agency” benchmark.

4o: 23 (previous best)
o1: 75.33
(Human expert players: 72) https://t.co/9cyzI5jJU3

1

5

1

3

839

Not Me @davidschlangen

almost 2 years ago

We still have to run the whole benchmark, mind you. This is slow and eye-wateringly expensive 🥹. (Actually, expensive & slow enough for there to be humans on the other side. 😅 )

1

0

212

davidschlangen retweeted

Sherzod Hakimov @SherzodHakimov

almost 2 years ago

"Reflection-Llama-3.1-70B" got first attention then frustration regarding the validity of the results. We benchmarked it with clembench and compared against stock model: Reflection-Llama-3.1-70B - 17/100 Meta-Llama-3.1-70B-Instruct - 39/100 It got worse.

0

1

0

354

Not Me @davidschlangen

almost 2 years ago

Observation was that neither linguistics nor NLP/AI cared too much about CL, leaving it free to reinvent itself. Slides: https://t.co/fJmm3MhHgG Video (if you really must): https://t.co/c9oOWumiFF

0

182

Not Me @davidschlangen

almost 2 years ago

Re: "ACL is (not) an AI conf.", was reminded that I did some similar soul searching some years ago. But a) openly prescriptive, b) coming to conclusion that domain to be claimed could be "linguistic intelligence".

davidschlangen's tweet photo. Re: "ACL is (not) an AI conf.", was reminded that I did some similar soul searching some years ago. But a) openly prescriptive, b) coming to conclusion that domain to be claimed could be "linguistic intelligence". https://t.co/vLfAfPB3Dx

1

2

0

1

314

Not Me @davidschlangen

almost 2 years ago

I guess “ACL is, or at Least Ought to be, Not Just an AI Conference” would have required a font that is too small. #ACL2024NLP

0

1

0

644

Not Me @davidschlangen

almost 2 years ago

@yoavgo Had the same impression @ EMNLP last year. My ad hoc expl was the demographic pyramid in a rapidly growing field — fewer senior people, who also travel less than they used to (inconvenience, guilt abt spent CO2 budget); lots of younger people who hv 2 go & don’t know <1k ppl cnfs

0

3

0

1K

Not Me @davidschlangen

almost 2 years ago

@rulimanurung Flooding predatory conferences with bogus work would of course be a sensible use case. But the results will just be that ARR is flooded with more bogus papers.

0

2

0

47

Not Me @davidschlangen

almost 2 years ago

You have to admire the dedication to the bit. They even went ahead and created a website and actual fake papers. Just to make a satirical point about what AI as a research field has become.

Sakana AI

@SakanaAILabs

almost 2 years ago

Introducing The AI Scientist: The world’s first AI system for automating scientific research and open-ended discovery! https://t.co/jC7g5GPVsE From ideation, writing code, running experiments and summarizing results, to writing entire papers and conducting peer-review, The AI Scientist opens a new era of AI-driven scientific research and accelerated discovery. Here are 4 example Machine Learning research papers generated by The AI Scientist. We published our report, The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery, and open-sourced our project! Paper: https://t.co/lTQ8UenFHk GitHub: https://t.co/Im53whVeAq Our system leverages LLMs to propose and implement new research directions. Here, we first apply The AI Scientist to conduct Machine Learning research. Crucially, our system is capable of executing the entire ML research lifecycle: from inventing research ideas and experiments, writing code, to executing experiments on GPUs and gathering results. It can also write an entire scientific paper, explaining, visualizing and contextualizing the results. Furthermore, while an LLM author writes entire research papers, another LLM reviewer critiques resulting manuscripts to provide feedback to improve the work, and also to select the most promising ideas to further develop in the next iteration cycle, leading to continual, open-ended discoveries, thus emulating the human scientific community. As a proof of concept, our system produced papers with novel contributions in ML research domains such language modeling, Diffusion and Grokking. We (@_chris_lu_, @RobertTLange, @hardmaru) proudly collaborated with the @UniOfOxford (@j_foerst, @FLAIR_Ox) and @UBC (@cong_ml, @jeffclune) on this exciting project.

SakanaAILabs's tweet photo. Introducing The AI Scientist: The world’s first AI system for automating scientific research and open-ended discovery!

https://t.co/jC7g5GPVsE

From ideation, writing code, running experiments and summarizing results, to writing entire papers and conducting peer-review, The AI Scientist opens a new era of AI-driven scientific research and accelerated discovery.

Here are 4 example Machine Learning research papers generated by The AI Scientist.

We published our report, The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery, and open-sourced our project!

Paper: https://t.co/lTQ8UenFHk
GitHub: https://t.co/Im53whVeAq

Our system leverages LLMs to propose and implement new research directions. Here, we first apply The AI Scientist to conduct Machine Learning research. Crucially, our system is capable of executing the entire ML research lifecycle: from inventing research ideas and experiments, writing code, to executing experiments on GPUs and gathering results. It can also write an entire scientific paper, explaining, visualizing and contextualizing the results.

Furthermore, while an LLM author writes entire research papers, another LLM reviewer critiques resulting manuscripts to provide feedback to improve the work, and also to select the most promising ideas to further develop in the next iteration cycle, leading to continual, open-ended discoveries, thus emulating the human scientific community. As a proof of concept, our system produced papers with novel contributions in ML research domains such language modeling, Diffusion and Grokking.

We (@_chris_lu_, @RobertTLange, @hardmaru) proudly collaborated with the @UniOfOxford (@j_foerst, @FLAIR_Ox) and @UBC (@cong_ml, @jeffclune) on this exciting project.

286

6K

1K

5K

3M

2

6

0

2

885

Not Me

@davidschlangen

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users