Egor Bogomolov @egor_bb - Twitter Profile

egor_bb retweeted

8 days ago

Today we're releasing Mellum2: our first "serious" LLM. This is a 12B A2.5B MoE LLM pre-trained on ~11T tokens and post-trained with RLVR. I'm proud to be leading the team that was working on it for the last 6 months. We release base/SFT/RL checkpoints along with a tech report

nv_pavlichenko's tweet photo. Today we're releasing Mellum2: our first "serious" LLM.

This is a 12B A2.5B MoE LLM pre-trained on ~11T tokens and post-trained with RLVR.
I'm proud to be leading the team that was working on it for the last 6 months.

We release base/SFT/RL checkpoints along with a tech report

55

863

95

472

78K

egor_bb retweeted

JetBrains @jetbrains

about 2 months ago

RL training and coding-agent experiments not scaling locally? IdeGYM fixes that – and it's now open source. https://t.co/4hrxkNH2lp

jetbrains's tweet photo. RL training and coding-agent experiments not scaling locally? IdeGYM fixes that – and it's now open source. https://t.co/4hrxkNH2lp https://t.co/aHzmfvoQeg

0

47

7

13

7K

Egor Bogomolov @egor_bb

7 months ago

A pleasure to see our work posted by AK 😊 📉Issue: GRPO hurts generation diversity and pass@k/max@k do not grow as well as pass@1 📈Solution: we estimate gradients for both on- and off-policy cases to optimize max@k directly, and it shows better yields in code generation tasks (measured as the ratio of passed tests)

AK

@_akhaliq

7 months ago

The Best of N Worlds Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

1

67

7

34

24K

0

141

Egor Bogomolov @egor_bb

9 months ago

Interesting @NeurIPSConf experience: a weird mix of likely the most positive review I’ve received and an overriding PC reject on top of it 🤔

egor_bb's tweet photo. Interesting @NeurIPSConf experience: a weird mix of likely the most positive review I’ve received and an overriding PC reject on top of it 🤔 https://t.co/riltjppIku

0

1

0

187

Who to follow

Timofey Bryksin

@timofeybryksin

Head @JetBrains Research

Konstantin Grotov

@kgrotov

Researcher at AI Agents & Planning Team @JetBrains Research

Anya Bataeva

@_fyzbt

data scientist @ fintech / here for the political commentary and the scientific papers

egor_bb retweeted

IDE Workshop @IDEworkshop

9 months ago

The 3rd IDE Workshop @ICSEconf 2026 is scheduled for Saturday, April 18th! Please submit your short papers and extended abstracts on anything IDE-related: plugins, studies, refactorings, environments, AI in IDE, etc.! All information here: https://t.co/PNeeFLI8Sz 🏝️🏝️🏝️

IDEworkshop's tweet photo. The 3rd IDE Workshop @ICSEconf 2026 is scheduled for Saturday, April 18th!

Please submit your short papers and extended abstracts on anything IDE-related: plugins, studies, refactorings, environments, AI in IDE, etc.!

All information here: https://t.co/PNeeFLI8Sz

🏝️🏝️🏝️ https://t.co/1X5wCLC7PH

1

3

1

0

559

egor_bb retweeted

andrew zakonov

@andrewzakonov

about 1 year ago

JetBrains Junie is live — for everyone! single AI subscription. pro tier included with all products pack and dotultimate.

6

124

23

15

13K

Egor Bogomolov @egor_bb

over 1 year ago

@InceptionAILabs Is there an API to access the model? I would be happy to run it on some coding benchmarks, but I have not find any points to the API yet.

0

43

Egor Bogomolov @egor_bb

almost 2 years ago

@john_lam @headinthebox Hi, author here. For code generation task we give model an instruction in natural language and access to the library. The reference code is only used for evaluation to compute metrics

0

1

0

73

egor_bb retweeted

Timofey Bryksin @timofeybryksin

almost 2 years ago

Interested in impacting millions of developers with your ML research? We at JetBrains Research are hiring! https://t.co/SnTUOjPTRK

0

10

6

3

1K

Egor Bogomolov @egor_bb

almost 2 years ago

🤝 Great thanks to @saridormi, Timur Galimzyanov, Evgeniy Glukhov, Anton Shapkin, @tigina_maria, @areyde, Alexander Kovrigin, @avandeursen, @MalihehIzadi, and @timofeybryksin for a fantastic work! Please submit to our leaderboard, and we will see you in Long Code Arena! 🏟️

egor_bb's tweet photo. 🤝 Great thanks to @saridormi, Timur Galimzyanov, Evgeniy Glukhov, Anton Shapkin, @tigina_maria, @areyde, Alexander Kovrigin, @avandeursen, @MalihehIzadi, and @timofeybryksin for a fantastic work!

Please submit to our leaderboard, and we will see you in Long Code Arena! 🏟️ https://t.co/WbEKO39WJE

1

7

0

1

469

Egor Bogomolov @egor_bb

almost 2 years ago

Models' contexts are getting so big that they can (and often should!) include an entire repository, while we are still evaluating them on methods and files. That's why we created Long Code Arena. Pre-print: https://t.co/q1HTcvnjwe Datasets: https://t.co/UszSNix6J1 Details in🧵!

egor_bb's tweet photo. Models' contexts are getting so big that they can (and often should!) include an entire repository, while we are still evaluating them on methods and files. That's why we created Long Code Arena.

Pre-print: https://t.co/q1HTcvnjwe
Datasets: https://t.co/UszSNix6J1

Details in🧵! https://t.co/tUBQk0KEQY

4

88

26

47

23K

Egor Bogomolov @egor_bb

almost 2 years ago

🗂️ Module summarization — based on the module’s or project’s source code and a short description of the desired documentation, the model should generate it, testing its abilities in large comprehensive natural language texts. This benchmark includes custom LLM-based evaluation.

egor_bb's tweet photo. 🗂️ Module summarization — based on the module’s or project’s source code and a short description of the desired documentation, the model should generate it, testing its abilities in large comprehensive natural language texts. This benchmark includes custom LLM-based evaluation. https://t.co/U73R4bsXzx

1

4

0

433

Egor Bogomolov

@egor_bb

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users