Magda Dubois @DubMagda - Twitter Profile

DubMagda retweeted

3 months ago

New from the Science of Evaluation Team at @AISafetyInst: a pipeline for rigorous transcript analysis. I think transcript analysis is still underrated, especially as model horizons are getting longer and task environments more complex.

2

19

3

11

1K

DubMagda retweeted

Arvindh Arun

@arvindh__a

9 months ago

Why does horizon length grow exponentially as shown in the METR plot? Our new paper investigates this by isolating the execution capabilities of LLMs. Here's why you shouldn't be fooled by slowing progress on typical short-task benchmarks... 🧵

arvindh__a's tweet photo. Why does horizon length grow exponentially as shown in the METR plot?

Our new paper investigates this by isolating the execution capabilities of LLMs.

Here's why you shouldn't be fooled by slowing progress on typical short-task benchmarks... 🧵 https://t.co/9fQfsqRtqd

14

267

32

237

56K

DubMagda retweeted

Konrad Rieck 🌈 @mlsec

11 months ago

We're excited to announce the Call for Papers for SaTML 2026, the premier conference on secure and trustworthy machine learning @satml_conf We seek papers on secure, private, and fair learning algorithms and systems. 👉 https://t.co/cPFitlsXu2 ⏰ Deadline: Sept 24

mlsec's tweet photo. We're excited to announce the Call for Papers for SaTML 2026, the premier conference on secure and trustworthy machine learning @satml_conf

We seek papers on secure, private, and fair learning algorithms and systems.

👉 https://t.co/cPFitlsXu2
⏰ Deadline: Sept 24 https://t.co/fwVvWBQHjN

0

38

15

10

6K

DubMagda retweeted

Sahar Abdelnabi 🕊

@sahar_abdelnabi

about 1 year ago

Hawthorne effect describes how study participants modify their behavior if they know they are being observed In our paper 📢, we study if LLMs exhibit analogous patterns🧠 Spoiler: they do⚠️ 🧵1/n

sahar_abdelnabi's tweet photo. Hawthorne effect describes how study participants modify their behavior if they know they are being observed

In our paper 📢, we study if LLMs exhibit analogous patterns🧠

Spoiler: they do⚠️
🧵1/n https://t.co/gb39HIIBIu

3

126

21

59

25K

Who to follow

Paul Sharp

@paul_b_sharp

Assistant Professor @BarIlanU || Computational Cognitive Science & Psychiatry

Bastien Blain

@Bastien__Blain

I develop some computational models and I then hope people comply with them. I sometimes do the same with neural data. App: https://t.co/bIbE7d8BK1

Rani Moran

@moran_rani

DubMagda retweeted

summerfieldlab @summerfieldlab.bsky.social @summerfieldlab

11 months ago

In a new paper, we examine recent claims that AI systems have been observed ‘scheming’, or making strategic attempts to mislead humans. We argue that to test these claims properly, more rigorous methods are needed.

summerfieldlab's tweet photo. In a new paper, we examine recent claims that AI systems have been observed ‘scheming’, or making strategic attempts to mislead humans. We argue that to test these claims properly, more rigorous methods are needed. https://t.co/n7W8qyY27n

4

84

25

32

17K

DubMagda retweeted

AI Security Institute

@AISecurityInst

11 months ago

Evaluating AI models is essential for improving their performance and understanding their risks. Increasingly, researchers are using “autograders” – having Large Language Models (LLMs) grade model outputs. But how do we know if these autograders are reliable? 🧵

1

66

6

23

5K

Magda Dubois @DubMagda

about 1 year ago

New paper introducing a framework to better quantify uncertainty in LLM evaluations (led by @LLuettgau🙌). A beta Python package (developed by @HarryCoppock🚀) is available if you want to try it out. ➡️Get in touch if you have any Qs/feedback! Paper: https://t.co/Nuv8xV5LOa

AI Security Institute

@AISecurityInst

about 1 year ago

Advanced AI systems require complex evaluations to measure abilities, but conventional analysis techniques often fall short. Introducing HiBayES: a flexible, robust statistical modelling framework that accounts for the nuances & hierarchical structure of advanced evaluations.

AISecurityInst's tweet photo. Advanced AI systems require complex evaluations to measure abilities, but conventional analysis techniques often fall short.
Introducing HiBayES: a flexible, robust statistical modelling framework that accounts for the nuances & hierarchical structure of advanced evaluations. https://t.co/DO27LNwn1c

2

53

11

25

7K

0

1

0

148

DubMagda retweeted

AI Security Institute

@AISecurityInst

about 1 year ago

🧵 Today we’re publishing our first Research Agenda – a detailed outline of the most urgent questions we’re working to answer as AI capabilities grow. It’s our roadmap for tackling the hardest technical challenges in AI security🧵 Today we’re publishing our first Research Agenda – a detailed outline of the most urgent questions we’re working to answer as AI capabilities grow. It’s our roadmap for tackling the hardest technical challenges in AI security🧵 Today we’re publishing our first Research Agenda – a detailed outline of the most urgent questions we’re working to answer as AI capabilities grow. It’s our roadmap for tackling the hardest technical challenges in AI security🧵 Today we’re publishing our first Research Agenda – a detailed outline of the most urgent questions we’re working to answer as AI capabilities grow. It’s our roadmap for tackling the hardest technical challenges in AI security.

AISecurityInst's tweet photo. 🧵 Today we’re publishing our first Research Agenda – a detailed outline of the most urgent questions we’re working to answer as AI capabilities grow.

It’s our roadmap for tackling the hardest technical challenges in AI security.

5

122

50

56

29K

DubMagda retweeted

Lennart Luettgau @LLuettgau

over 1 year ago

Excited to share our brand-new work shedding some light on the neural mechanisms behind one of human’s coolest cognitive feats: compositional generalization of structural knowledge! A Tweeprint-Thread 🧵 1/n

1

28

8

4

4K

DubMagda retweeted

Alexandr Wang

@alexandr_wang

almost 2 years ago

1/ New paper in Nature shows model collapse as successive model generations models are recursively trained on synthetic data. This is an important result. While many researchers today view synthetic data as AI philosopher’s stone, there is no free lunch. Read more 👇

alexandr_wang's tweet photo. 1/ New paper in Nature shows model collapse as successive model generations models are recursively trained on synthetic data.

This is an important result. While many researchers today view synthetic data as AI philosopher’s stone, there is no free lunch.

Read more 👇 https://t.co/vNcblJVR1W

43

661

90

508

272K

DubMagda retweeted

Felix Busch @Fel_Busch

almost 2 years ago

I am excited to share that our article *Navigating the European Union Artificial Intelligence Act for Healthcare* has just been published in @npjDigitalMed🚀 #AIRegulation #DigitalHealth #EUAIAct #MedicalDevices #Innovation #npjDigitalMedicine #AIinHealthcare

Fel_Busch's tweet photo. I am excited to share that our article *Navigating the European Union Artificial Intelligence Act for Healthcare* has just been published in @npjDigitalMed🚀
#AIRegulation #DigitalHealth #EUAIAct #MedicalDevices #Innovation #npjDigitalMedicine #AIinHealthcare https://t.co/9HaaSMauOt

1

28

8

5

2K

Magda Dubois @DubMagda

about 2 years ago

@AshBowler @OOssmy @Gelironald @PascoFearon Well done Aislinn!!

0

1

0

64

DubMagda retweeted

Matthew Nour @Matt_Nour

over 2 years ago

Paper out in @PNASNews! A 'cognitive mapping' lens on language in psychosis, using word embedding models, computational modelling, and MEG. A hint of what's to come at @OxPsychiatry and @UCLBrainScience... With @mcneural_, @YunzheNeuro, Ray Dolan. https://t.co/3uhL6z3eSw

5

122

37

33

20K

DubMagda retweeted

Lennart Luettgau @LLuettgau

almost 3 years ago

Preprint alert🚨! In this new paper we study how humans decompose dynamical subprocesses and leverage the abstracted subprocesses for compositional reuse of experience in new situations. https://t.co/9UsV5uAcPE Tweeprint to follow soon!

0

56

22

12

11K

DubMagda retweeted

Marcelo Mattar @marcelomattar

about 3 years ago

In our lab's latest paper, we introduce a novel modeling approach using RNNs to reveal the cognitive algorithms behind animal decision-making. Check out our preprint, led by UCSD PhD student @Ji_An_Li and co-authored by Marcus Benna: https://t.co/UVNLpHb3rA

3

97

28

20K

Magda Dubois @DubMagda

about 3 years ago

@julia_griem @BaskinSommers @forensicrg Congrats Julia !! 🥳

0

1

0

53

Magda Dubois @DubMagda

about 3 years ago

Congratulations to my academic sibling @AlisaLoosen for those (very) well-deserved three shiny balloons

0

19

0

2K

Magda Dubois @DubMagda

about 3 years ago

Wanna try out a (cool🦙) alternative to GPT?

Yann Dubois

@yanndubs

about 3 years ago

🦙Excited to share this demo of Alpaca 🔥Highlights: ~GPT3.5 performance for < 600$🔥 The goal was to have a simple model /training procedure that academics could study and improve with limited resources We achieved that by finetuning a 7B LLaMA on 52K generated instructions

5

431

56

157

215K

0

1

0

287

Magda Dubois @DubMagda

over 3 years ago

Postdoc position in Boston ⭐️ Great place and amazing person to work with !

0

1

0

667

DubMagda retweeted

Tobias Hauser @TobiasUHauser

over 3 years ago

A while ago we published this #RegisteredReport in @NatureComms - but was this format of pre-registration really useful? Find some answers in this Q&A with us and one of the reviewers: https://t.co/eaLVc6AZRp

0

2

1

0

Magda Dubois

@DubMagda

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users