Nando @nandomartinezp - Twitter Profile

over 1 year ago

New Paper: We unlock AI Evaluation with explanatory and predictive power through general ability scales! -Explains what common benchmarks really measure -Extracts explainable ability profiles of AI systems -Predicts performance for new task instances, in & out-of-distribution 🧵

lexin_zhou's tweet photo. New Paper: We unlock AI Evaluation with explanatory and predictive power through general ability scales!

-Explains what common benchmarks really measure
-Extracts explainable ability profiles of AI systems
-Predicts performance for new task instances, in & out-of-distribution
🧵

4

85

26

60

27K

NandoMartinezP retweeted

Lexin Zhou

@lexin_zhou

over 1 year ago

1/ New paper @Nature! Discrepancy between human expectations of task difficulty and LLM errors harms reliability. In 2022, Ilya Sutskever @ilyasut predicted: "perhaps over time that discrepancy will diminish" (https://t.co/HADDUztzhu, min 61-64). We show this is *not* the case!

lexin_zhou's tweet photo. 1/ New paper @Nature!

Discrepancy between human expectations of task difficulty and LLM errors harms reliability. In 2022, Ilya Sutskever @ilyasut predicted: "perhaps over time that discrepancy will diminish" (https://t.co/HADDUztzhu, min 61-64).

We show this is *not* the case! https://t.co/u2HYQbWE4j

19

1K

292

954

298K

NandoMartinezP retweeted

Wout Schellaert @WoutSchellaert

about 3 years ago

New and shiny AI systems have superseded the ones we reference (it took a while to publish), but our perspectives and suggestions for evaluating them have only become more relevant. Go have a read! 👽👽

2

5

3

0

992

Nando @NandoMartinezP

about 3 years ago

Científicos señalan los efectos catastróficos que puede causar la IA https://t.co/a7AVGbp90T via @eldiarioes

0

2

0

101

Who to follow

ETSINF UPV

@etsinfupv

ETS d'Enginyeria Informàtica UPV

Derp

@DerpMagician

Full-Stack dev specialized in Front-End, lover of js and sass, also pixelart y robot enthusiast

DanielSolo

@danielsolo

Data Science | Pasión por la Tecnología | Project Leader en Bootcamp CodiGo

NandoMartinezP retweeted

Ryan Burnell @DrRyanBurnell

about 3 years ago

Is it time to rethink how we perform system evaluations in AI? In our new @ScienceMagazine paper, we show that over-reliance on aggregate metrics and a lack of transparency in reporting threatens public understanding and hinders progress in the field. 1/8 https://t.co/kZMNCEALbG

5

179

40

61

116K

NandoMartinezP retweeted

Wout Schellaert @WoutSchellaert

about 4 years ago

📐Our Evaluation Beyond Metrics workshop at IJCAI got accepted... so prepare your cool papers! 💻https://t.co/RRRSJvt1hR With @LucyCheke, @DanajaRutar, @JohnJBurden, @DrRyanBurnell, @TomerUllman and twitterless Josh Tenenbaum, José Hernández-Orallo and Fernando Martínez-Plumed

WoutSchellaert's tweet photo. 📐Our Evaluation Beyond Metrics workshop at IJCAI got accepted... so prepare your cool papers!

💻https://t.co/RRRSJvt1hR

With @LucyCheke, @DanajaRutar, @JohnJBurden, @DrRyanBurnell, @TomerUllman and twitterless Josh Tenenbaum, José Hernández-Orallo and Fernando Martínez-Plumed https://t.co/r84tkAIHSc

2

14

7

0

Nando @NandoMartinezP

over 4 years ago

Our paper "Training on the Test Set: Mapping the System-Problem Space in AI" (https://t.co/rF76aUKOPN) is the runner up for the Blue Sky Awards in @RealAAAI 2022!

0

1

0

NandoMartinezP retweeted

Ursula von der Leyen

@vonderleyen

about 5 years ago

Artificial Intelligence is a fantastic opportunity for Europe. And citizens deserve technologies they can trust. Today we present new rules for trustworthy AI. They set high standards based on the different levels of risk.

66

752

217

19

0

NandoMartinezP retweeted

Gina Reynolds @EvaMaeRey

almost 6 years ago

🎉🎉🎉 I'm excited to introduce "a ggplot2 grammar guide". Here is part of the **visual table of contents** (viztoc). You can click through to get at-your-own-pace guidance from *flipbooks* showing code-output plot evolution! More in 🧵 1/ https://t.co/XCBwKTLfJo

15

2K

567

752

0

NandoMartinezP retweeted

Digital ECAI2020 @ECAI2020

about 6 years ago

Today we are happy to announce #DigitalECAI2020! A digital conference of the highest scientific level which will offer to the #AI community lots of possibilities to meet, debate and interact. Read our statement at: https://t.co/JeOHZ6Ejp4 Join us at https://t.co/r71h9Q3zhK!

0

54

43

1

0

NandoMartinezP retweeted

MUIinf UPV @MUIinfUPV

about 6 years ago

PREINSCRIPCIÓN @MUIinfUPV 20-21 ¡Hasta el 12 de junio! @etsinfupv @upv @UPVCampusAlcoy El MUIINF sigue con el mismo entusiasmo, empresas involucradas, alumnos extranjeros ya admitidos, y formación semi-presencial, que os aporta una gran flexibilidad https://t.co/VdoYxyxEMI

MUIinfUPV's tweet photo. PREINSCRIPCIÓN @MUIinfUPV 20-21 ¡Hasta el 12 de junio!
@etsinfupv @upv @UPVCampusAlcoy

El MUIINF sigue con el mismo entusiasmo, empresas involucradas, alumnos extranjeros ya admitidos, y formación semi-presencial, que os aporta una gran flexibilidad
https://t.co/VdoYxyxEMI https://t.co/6yFIAbf3d9

0

1

2

0

NandoMartinezP retweeted

Digital ECAI2020 @ECAI2020

over 6 years ago

🚨 UPDATE: In the light of the #COVID19 situation and having the health and safety of all the community as top priority, #ECAI2020 has been rescheduled to August 29-September 2. ➡️ Complete statement is at: https://t.co/B22P00e3b9 We look forward to seeing you next August!

6

39

34

1

0

NandoMartinezP retweeted

Centre for the Study of Existential Risk @CSERCambridge

over 6 years ago

Seeking paper submissions for the 1st Evaluating Progress in AI workshop at ECAI20. Experimental and theoretical papers on developing benchmarks, indicators, measuring progress, forecasting societal impacts of AI advances. Submission deadline March 20th! https://t.co/mFpGzART0K

0

2

3

0

NandoMartinezP retweeted

Lid.IA @lidiaconia

about 7 years ago

📡Happy to announce that our paper "Automated Data Transformation with Inductive Programming and Dynamic Background Knowledge" has been accepted at @ECMLPKDD #ECMLPKDD2019 👏😄👩‍💻 👨‍💻@NandoMartinezP @ceferra @UPV #VRAIN #VRAINUPV #DataScience

lidiaconia's tweet photo. 📡Happy to announce that our paper "Automated Data Transformation with Inductive Programming and Dynamic Background Knowledge" has been accepted at @ECMLPKDD #ECMLPKDD2019 👏😄👩‍💻

👨‍💻@NandoMartinezP @ceferra @UPV #VRAIN #VRAINUPV
#DataScience https://t.co/s2i5mvAv2o

0

15

2

0

Nando @NandoMartinezP

about 7 years ago

@dmonett @_KarenHao Sorry for the delay! I haven't connected on twitter much lately. The shinyApp was developed by Aiden (co-author), so I do not have the code. Send me an email ([email protected]) and I'll send you the data I collected (IJCAI, AAAI and AITopics).

0

1

0