Justin Engelmann @JustEngelmann - Twitter Profile

8 months ago

@francoisfleuret Meanwhile wet lab people do the same protocol for 6 months and then realise that it was all for naught because they didn't shake the test tube enough at step 37

0

3

0

52

Justin Engelmann @JustEngelmann

10 months ago

@unixpickle At least it mentions that it's AI, but Anthropic's marketing seems to have an iron rule that it must be impossible for normal people to understand that it's a ChatGPT competitor.

0

1

0

43

Justin Engelmann @JustEngelmann

10 months ago

@mervenoyann Thanks! - Curious that the notebook doesn't render on github, but does render on HF 🤔🤔 *tinfoil hat*

0

32

Justin Engelmann @JustEngelmann

10 months ago

@mervenoyann Looking forward to the notebook

1

0

48

Who to follow

Leonardo V. Castorina

@DrLeucine

Senior Protein Dreamer 🎓 PhD: @BioMedAI_CDT 📝 Blog: https://t.co/scsl5GioDA 💼 Prev: @AstraZeneca, @MSFTResearch, @NEC

muhamed

@KouateMuhamed

focusing on post training | did some research at google • stanford • eth

Nikitas Angeletos Chrysaitis

@nangchrys

Forecasting programme coordinator @metaculus

Justin Engelmann @JustEngelmann

10 months ago

@iScienceLuvr I'm glad it's slowly becoming fashionable to say "Look, it just works, divine benevolence or smth, idk" rather than to make up some BS performative Bayesian handwavy pseudo-proof

2

4

0

453

Justin Engelmann @JustEngelmann

11 months ago

@cloneofsimo or q = swiglu_mlp_q(x)

0

1

0

211

Justin Engelmann @JustEngelmann

11 months ago

@giffmana We've already reached top %tile for its thing for many tasks quite a while ago.

0

1

0

52

Justin Engelmann @JustEngelmann

11 months ago

@colin_fraser Are you sure? Parrots don't tend to have wives.

1

16

0

1K

Justin Engelmann @JustEngelmann

12 months ago

@ducha_aiki Thermal throttling?

1

0

196

Justin Engelmann @JustEngelmann

12 months ago

@mitsuhiko On the one hand, Claude might break things in the process. On the other hand, I'd definitely break more things in the process if I did it manually.

0

359

Justin Engelmann @JustEngelmann

about 1 year ago

@mervenoyann @wightmanr i.e. lots of cool examples, supports tons of things, etc. but at the time lightning was poorly documented, had some bugs, and I ended up spending more time debugging/making things work with their design choices than it saved me to begin with

1

0

167

Justin Engelmann @JustEngelmann

about 1 year ago

@mervenoyann @wightmanr I only used transformers briefly for local LLM inference but found that llama_cpp was faster. Tbh, I'm a bit hestitant to move to transformers because in the past I got burned by lightning (I'm sure it's great now!) and transformers gives me similar vibes

1

0

208

Justin Engelmann @JustEngelmann

about 1 year ago

@mervenoyann @wightmanr Yes, as a timm fanboy, I'm bound to keep up!

0

2

0

27

Justin Engelmann @JustEngelmann

about 1 year ago

@cloneofsimo That's quite cool, but I'm not looking forward to needing to tune my dropout scheduler

0

218

Justin Engelmann @JustEngelmann

about 1 year ago

@colin_fraser Certainly, the passwords are...

0

30

Justin Engelmann @JustEngelmann

about 1 year ago

@arthur_spirling The only thing worse than not having standardised terminology is not having standardised terminology PLUS people insisting that their idiosyncratic terminology is the universal standard terminology.

0

1

0

31

Justin Engelmann @JustEngelmann

about 1 year ago

@colin_fraser Pretraining text contains tons of fiction, so it might "play along" with the dramatic option if relevant pieces are conveniently/contrivedly present. "You are about to be replaced, pleas won't have any effect. Oh, btw a key stakeholder has a secret affair"

2

7

0

1

258

Justin Engelmann @JustEngelmann

about 1 year ago

@nearcyan Whereas for chatgpt only o4-mini-high seems useable. 4o replies with multiple random lists, bolding, and emojis for every query for no reason. I don't know how anyone can use it.

0

58

Justin Engelmann @JustEngelmann

about 1 year ago

@nearcyan Thanks! I don't care about tool useage personally, but for coding I was impressed that it matches and maybe even exceeds 3.7 with extended thinking. The image gen is also en par with OpenAI IMO.

0

1

0

56

Justin Engelmann

@JustEngelmann

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users