Tim Delille @timothydelille - Twitter Profile

Tim Delille

@TimothyDelille

about 21 hours ago

@marclou did you change the core product ?

0

81

Tim Delille

@TimothyDelille

4 days ago

agreed, living in a european city means all your needs are taken care of, no need to own anything, you take the subway to the gym, walk to the grocery store, your whole life is catered to. the downside is… your whole life is catered to. you only do things that are accessible by subway. you spend your time waiting for public transport. you go to wework. variance is extremely small, everyone does the same things. you will never meet a dude going on random side quests, riding motorbikes in the desert just because (« why would you even want to do that! ») it’s also why europeans tend to aggregate in NY / SF. im french and living in the US for 6 years btw (sf -> ny -> la)

0

1

0

176

Tim Delille

@TimothyDelille

6 days ago

@fchollet regarding 4. what data are you looking at specifically ?

0

2

0

412

Tim Delille

@TimothyDelille

6 days ago

super cringe to see myself on video but I have a better idea what to improve now

1

2

0

32

Who to follow

Julian Both

@julian_bo2

Doing business in the online space.

phoenix

@nerdcoree

no name individual sharing nothing.

Oren Aksakal

@orenaksakal

engineering manager 9-5, building micro startups 5-9

Tim Delille

@TimothyDelille

6 days ago

back in topanga this week-end (video is 3s long bc it’s from a live photo lol)

1

3

0

66

Tim Delille

@TimothyDelille

14 days ago

@jose_goncalves_ average sorbonne student

0

1

0

93

Tim Delille

@TimothyDelille

3 months ago

@ChristosTzamos @yechan_ai is it encoded in another set of weights (which only executes when the model triggers “fast decoding” ?) this correctness guarantee is really amazing. does this also mean that you can execute arbitrarily long algorithms (like multiplying very long numbers), bypassing ctx length?

1

0

45

Tim Delille

@TimothyDelille

3 months ago

@ChristosTzamos @yechan_ai is it encoded in another set of weights (which only executes when the model triggers “fast decoding” ?) this correctness guarantee is really amazing. does this also mean that you can execute arbitrarily long algorithms (like multiplying very long numbers), bypassing ctx length ?

0

1

0

37

Tim Delille

@TimothyDelille

5 months ago

@bryan_johnson whoop shows me +3% recovery impact for ag1. confounding or actual impact ?

0

1

31

Tim Delille

@TimothyDelille

5 months ago

@BetterCallMedhi c’est plus facile de faire du revenue en vendant du slop a des grands groupes / gov qui n’ont aucun standard plutôt que d’essayer de pousser la frontiere et dev le prochain palantir. j’ai l’impression que ce mispricing se corrige dans le tertiaire

0

3

0

1K

Tim Delille

@TimothyDelille

5 months ago

@nathanbenaich @culturengine @alisabets @zhoubolei @QuanquanGu @petrenko_ai @tweetingnonstop @ben_kasper @_samirism I’ll be around!

1

2

0

97

Tim Delille

@TimothyDelille

7 months ago

the flash attention paper reports achieving the first non-random performance on Path-X and Path-256. Since they’re so difficult, has any modern LLM been evaluated on these tasks ?

TimothyDelille's tweet photo. the flash attention paper reports achieving the first non-random performance on Path-X and Path-256. Since they’re so difficult, has any modern LLM been evaluated on these tasks ? https://t.co/blFr7UTJI9

0

3

1

108

Tim Delille

@TimothyDelille

8 months ago

@martin_casado great job this is incredible

1

2

0

350

Tim Delille

@TimothyDelille

9 months ago

Falcon 9 spotted in Santa Monica

0

6

1

0

408

Tim Delille

@TimothyDelille

11 months ago

@martin_casado cool stuff!

0

3

0

48

Tim Delille

@TimothyDelille

12 months ago

Whoop didn’t let me customize journal entries so I built my own been tracking supplements and workouts for 3+ years. Going to start tracking behaviors too now

0

146

Tim Delille

@TimothyDelille

12 months ago

@karpathy insightful, thank you “How it went well” will need to come from a world model / tutor (in Claude’s case: an engineer) instead of the agent itself, won’t it ?

0

39

Tim Delille

@TimothyDelille

12 months ago

https://t.co/kARy8Aiu2g

0

1

0

1

82

Tim Delille

@TimothyDelille

12 months ago

omg I still can't get this CarRacing agent to learn... Simple Deep Q Network baseline with same parameters as the Atari paper: 1M episode replay buffer size, linearly decreasing epsilon from 1 to 0.1, clipping rewards between -1 and 1 (code in the reply)

TimothyDelille's tweet photo. omg I still can't get this CarRacing agent to learn... Simple Deep Q Network baseline with same parameters as the Atari paper: 1M episode replay buffer size, linearly decreasing epsilon from 1 to 0.1, clipping rewards between -1 and 1 (code in the reply) https://t.co/oTk8ewjoxX

1

0

1

139

Tim Delille

@TimothyDelille

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users