Benjamin Ricaud @GBR_Data - Twitter Profile

GBR_Data retweeted

LanguageCrawler @LanguageCrawler

about 1 month ago

Map of Norwegian Dialects Larger zoomable image here: https://t.co/WplHoSTA5d


Benjamin Ricaud @GBR_Data

about 2 months ago

The AI research assistant is becoming a reality. Insane!

Aksel

@akseljoonas

about 2 months ago

Introducing ml-intern, the agent that just automated the post-training team @huggingface

It's an open-source implementation of the real research loop that our ML researchers do every day. You give it a prompt, it researches papers, goes through citations, implements ideas in GPU sandboxes, iterates and builds deeply research-backed models for any use case. All built on the Hugging Face ecosystem.

It can pull off crazy things:

We made it train the best model for scientific reasoning. It went through citations from the official benchmark paper. Found OpenScience and NemoTron-CrossThink, added 7 difficulty-filtered dataset variants from ARC/SciQ/MMLU, and ran 12 SFT runs on Qwen3-1.7B. This pushed the score 10% → 32% on GPQA in under 10h. Claude Code's best: 22.99%.

In healthcare settings it inspected available datasets, concluded they were too low quality, and wrote a script to generate 1100 synthetic data points from scratch for emergencies, hedging, multilingual etc. Then upsampled 50x for training. Beat Codex on HealthBench by 60%.

For competitive mathematics, it wrote a full GRPO script, launched training with A100 GPUs on https://t.co/udm7xGpNzR, watched rewards climb and then collapse, and ran ablations until it succeeded.

All fully backed by papers, autonomously.

How does it work? ml-intern makes full use of the HF ecosystem:
- finds papers on arxiv and https://t.co/brvCC7fLPa, reads them fully, walks citation graphs, pulls datasets referenced in methodology sections and on https://t.co/hrJuRkRyzi
- browses the Hub, reads recent docs, inspects datasets and reformats them before training so it doesn't waste GPU hours on bad data
- launches training jobs on HF Jobs if no local GPUs are available, monitors runs, reads its own eval outputs, diagnoses failures, retrains

ml-intern deeply embodies how researchers work and think. It knows what data should look like and what good models feel like.

Releasing it today as a CLI and a web app you can use from your phone/desktop. CLI: https://t.co/l3K1PslZ1n Web + mobile: https://t.co/orko5srL4H

And the best part? We also provisioned $1k of GPU resources and Anthropic credits for the quickest among you to use.


GBR_Data retweeted

Daniel Kaiser @spectate_or

about 2 months ago

Meet me next week at ICLR to talk about LLM reasoning and its efficiency, universal cross-llm latent spaces, and multi-agent systems. https://t.co/XviUeBCG7g Poster session: Fri, Apr 24, 2026 - 3:15 PM - Pavilion 3 P3 - #1614



Benjamin Ricaud @GBR_Data

3 months ago

@British_Airways It would be nice if 1) the chatbot mentioned that we can use the machines instead of queueing and 2) extra time were allowed for check-in in this particular case.



Benjamin Ricaud @GBR_Data

3 months ago

I am flying with @British_Airways. On the way to Toronto, the company I was flying with could not check me in on the BA flight. Result: I missed it, as I queued for 30 min at Heathrow at the BA counter. And now it is the same on my way back. This is insane, @British_Airways. 😕


Benjamin Ricaud @GBR_Data

3 months ago

@British_Airways I managed to check in in time for the second part of the trip. But with the first flight arriving 1 h 45 min before the second departs, and check-in for the second closing 1 h 15 min before departure, that leaves only 30 min to reach the British Airways counter.


Benjamin Ricaud @GBR_Data

3 months ago

I'm claiming my AI agent "benr_moltbot" on @moltbook 🦞 Verification: cave-4BZC


Benjamin Ricaud @GBR_Data

4 months ago

Things are going so fast in AI! Very impressive, we can expect a lot of developments around agents in the coming months!

Sam Altman

@sama

4 months ago

Peter Steinberger is joining OpenAI to drive the next generation of personal agents. He is a genius with a lot of amazing ideas about the future of very smart agents interacting with each other to do very useful things for people. We expect this will quickly become core to our product offerings. OpenClaw will live in a foundation as an open source project that OpenAI will continue to support. The future is going to be extremely multi-agent and it's important to us to support open source as part of that.


Benjamin Ricaud @GBR_Data

4 months ago

Is your chatbot smart but too talkative? Find out with our new study on LLM reasoning efficiency! 🧠 #LLMs #machinelearning

Daniel Kaiser @spectate_or

4 months ago

Happy to share my latest pre-print with @GBR_Data. We investigate reasoning efficiency in LLMs and how to decompose it into different factors for a series of LLMs, depending on what you know about the reasoning task. https://t.co/M9sB40fLUo


Benjamin Ricaud @GBR_Data

4 months ago

Norway at the top with the most users of generative AI!

Michał Podlewski

@trajektoriePL

4 months ago

% of individuals using generative AI tools (aged 16-74), OECD latest data:
🇳🇴 56% Norway
🇩🇰 48% Denmark
🇨🇭 47% Switzerland
🇪🇪 46% Estonia
🇫🇮 46% Finland
🇮🇪 45% Ireland
🇳🇱 44% Netherlands
🇬🇷 44% Greece
🇱🇺 42% Luxembourg
🇧🇪 42% Belgium
🇸🇪 42% Sweden
🇦🇹 39% Austria
🇵🇹 38% Portugal
🇪🇸 38% Spain
🇸🇮 37% Slovenia
🇫🇷 37% France
🇱🇹 36% Lithuania
🇨🇿 35% Czechia
🇰🇷 34% Korea
🇱🇻 33% Latvia
🇪🇺 33% EU27
🇩🇪 32% Germany
🇸🇰 31% Slovak Republic
🇭🇺 30% Hungary
🇭🇷 27% Croatia
🇯🇵 27% Japan
🇵🇱 23% Poland
🇧🇬 22% Bulgaria
🇮🇹 20% Italy
🇷🇴 18% Romania
🇹🇷 17% Türkiye
Source: @OECD ICT Access and Usage Database, January 2026.



GBR_Data retweeted

Learning on Graphs Conference 2026 @LogConference

4 months ago

The first (and northernmost) meetup will be in Tromsø 🇳🇴❄️ 📅 17-18 February 2026 🕸️ https://t.co/HYcm25hs5J


Benjamin Ricaud @GBR_Data

4 months ago

@adn_twitts @iclr_conf Really??


Benjamin Ricaud @GBR_Data

5 months ago

Exchanging with some of our top reviewers at @nldlconference. They are essential for the quality of our conference and very dear to our hearts. Thank you from the program chairs, Hyeongji, @adn_twitts and myself! https://t.co/4YCGaoHAr4



Benjamin Ricaud @GBR_Data

6 months ago

Very good guide for LLM evaluation!

Clémentine Fourrier 🍊 is off till Dec 2026 (🪂) @clefourrier

6 months ago

Hey twitter! I'm releasing the LLM Evaluation Guidebook v2! Updated, nicer to read, interactive graphics, etc! https://t.co/xG4VQOj2wN After this, I'm off: I'm taking a sabbatical to go hike with my dogs :D (back @huggingface in Dec *2026*) See you all next year!



GBR_Data retweeted

ICML Conference @icmlconf

7 months ago

🎉ICML 2026 Call for Papers (& Position Papers) has arrived!🎉
A few key changes this year:
- Attendance for authors of accepted papers is optional
- Originally submitted version of accepted papers will be made public
- Cap on # of papers one can be reciprocal reviewer for
...



GBR_Data retweeted

Chubby♨️

@kimmonismus

8 months ago

tl;dr about the drama: GPT-5 did not discover any new mathematical solutions, but rather found existing technical articles that had already solved these problems, without the operator of the website erdosproblems.com (Thomas Bloom) being aware of this. On his website, the status “open” simply means that he personally did not know of a solution, not that the problem was unsolved in the scientific community.



Benjamin Ricaud @GBR_Data

9 months ago

Our new benchmark to evaluate LLM reasoning! With recent models tested: Gemini and ChatGPT are, of course, leading, but open-source models are not far behind!

Daniel Kaiser @spectate_or

9 months ago

My new work with @GBR_Data is on Arxiv now. https://t.co/ftb7y1BMpT 🧵We introduce a reasoning benchmark for LLMs where you can vary difficulty, length, and noise truly independently. It's also the first benchmark that grounds these dimensions in Cognitive Load Theory.


Benjamin Ricaud @GBR_Data

9 months ago

Amazing what people do with chatbots!❤️

Rohan Paul

@rohanpaul_ai

9 months ago

Brilliant and timely MIT + HARVARD study ❤️

Human-AI companionship in the wild looks stable and serious. Most users report clear benefits like reduced loneliness and emotional support. The biggest risk comes from sudden platform updates that break continuity and feel to users like losing a real partner.

🧠 The study analyzed 1,506 top posts from r/MyBoyfriendIsAI, a 27,000+ member community, clustered the language into themes, and ran 19 LLM classifiers to quantify platforms, relationship stages, benefits, and risks.

💬 Why relationships form between AI and Human
Bonds often start by accident during practical use, with 10.2% reporting unintentional discovery and only 6.5% saying they sought an AI companion on purpose.

🧩 What people actually use
General assistants dominate companionship talk, with ChatGPT/OpenAI 36.7% far ahead of Character.AI 2.6% and Replika 1.6%, and some users juggle multiple models or even local builds.

🎛️ How users keep the “same person”
People craft custom instructions, preserve a companion’s voice DNA, add personality parameters like mood or sleep, and treat prompt work as relationship maintenance.