Jules Gagnon-Marchand @julesgm4 - Twitter Profile

9 months ago

@karpathy imports in the function also.. a surprising fraction of people argue that exceptions in Python are hard to read and that user facing errors should log then exit, vs raise exceptions.

0

49

Jules Gagnon-Marchand @julesgm4

9 months ago

@lateinteraction continuous, low level empathy and projection maybe, "what is this person doing / trying to achieve, what are they feeling"

0

96

Jules Gagnon-Marchand @julesgm4

about 1 year ago

@alexalbert__ having a button to systematically trigger search without having to ask the model to use its search tools every time would be nice

0

35

julesgm4 retweeted

Stella Li ✈️ ICML🇰🇷

@StellaLisy

about 1 year ago

⚙️ Looking closer into GRPO: there is a "clipping bias" that amplifies high-prior model behaviors. Code reasoning could be one of the magical behaviors for Qwen-Math💻 Empirically, we disabled clipping (fig.)-the gains disappeared‼️

StellaLisy's tweet photo. ⚙️ Looking closer into GRPO: there is a "clipping bias" that amplifies high-prior model behaviors.

Code reasoning could be one of the magical behaviors for Qwen-Math💻

Empirically, we disabled clipping (fig.)-the gains disappeared‼️ https://t.co/yXLvHgvetR

2

98

3

10

11K

Who to follow

David Dobre

@busycalibrating

PhD in LLM robustness and alignment @Mila_Quebec. Likes mountains.

Alex Ostapenko

@ostap__alex

Mila Quebec

Nikita Saxena (she/her)

@nikitasaxena02

Vision @GoogleDeepmind | @WiMLWorkshop | ex-@Mila_Quebec

Jules Gagnon-Marchand @julesgm4

about 1 year ago

@oh_that_hat I think a leg doesn't have memory of it's own internal state, & instead at best has residual internal effects of a previous internal state (which is different & I don't think I would qualify as consciousness)

0

95

Jules Gagnon-Marchand @julesgm4

about 1 year ago

@oh_that_hat and not so much a computation of the current state of its senses + its own internal previous computational state. it's sensory memory vs memory/ understanding of a level of of its computational state.

1

0

61

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@HannesStaerk Hello Hannes, was this recorded?

0

80

julesgm4 retweeted

Nathan Lambert

@natolambert

over 1 year ago

DeepSeek makes it quite clear how they trained R1. None of these steps alone are super surprising, but how to sequence and blend them together definitely is.

natolambert's tweet photo. DeepSeek makes it quite clear how they trained R1.
None of these steps alone are super surprising, but how to sequence and blend them together definitely is. https://t.co/csScVb47QP

11

614

79

404

65K

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@natolambert I also wonder if you leave some performance on the table with RLVR because of this, where some of the learning potential on the data is wasted on trying to deal with the prompting format

0

45

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@natolambert it makes the models trained with open-instruct perform worse on HuggingFace Lighteval for example

1

0

50

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@xhluca maybe you just ask the llm reranker I guess

1

0

49

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@xhluca I was thinking about the inverse problem. how do you detect if there's nothing interesting to retrieve? that's also an important problem

2

4

1

0

200

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@xhluca (not following this sub field anymore, it's possible that it all already exists) Also Salut Xing :))

1

0

68

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@xhluca also detecting which direction in a scientific area is unexplored

1

0

53

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@arianTBD @srush_nlp Hello Arian, do you have a link to the talk?

1

3

0

71

julesgm4 retweeted

Epoch AI

@EpochAIResearch

over 1 year ago

1/10 Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%.

EpochAIResearch's tweet photo. 1/10 Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%. https://t.co/sNVEB6SvyJ

53

2K

383

845

1M

Jules Gagnon-Marchand @julesgm4

over 1 year ago

@lvwerra

0

4

0

269

Jules Gagnon-Marchand

@julesgm4

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users