Giles Thomas

Verified account

@gpjt

On sabbatical / founded @PythonAnywhere, which found a home at @anacondainc / XP / Python / PSF Fellow / opinions my own

Lisbon, Portugal

Joined June 2007

199 Following

1.4K Followers

6.3K Posts

Pinned Tweet

about 1 month ago

The appendices in "Build an LLM (from Scratch)" have a lot of really useful stuff. Should I have read them before heading off on my own training runs? I think, on balance, no -- it would have saved time, but I think that by trying, failing, and trying again, I learned more. YMMV! https://t.co/EdtLC4AQ62

0

0

1

0

184

1 day ago

@teortaxesTex It's a good thing that education science has already shown us how teaching smart students should differ from teaching the less-smart ones. Otherwise we'd be completely lost.

0

0

0

0

101

1 day ago

@WarrenFahy @agraybee It's Han Houdini!

0

1

0

0

688

1 day ago

Hear me out: a model that reasons in liminal space.

0

0

0

0

32

2 days ago

@hunvreus Convincing. I've read stuff in corporate environments where I felt my brain cells dying from sentence to sentence. If an AI had rewritten it, I would have been saved that pain. But maybe it was a useful pain, like the one that makes you pull your hand away from a flame.

0

0

0

0

17

2 days ago

@kiffboet @mitsuhiko What downside do you see in syncing it with Dropbox?

1

1

0

0

86

2 days ago

@valsaven @mitsuhiko I'm thinking maybe file timestamping differences between OSes could trip things up, but yeah, maybe it's Syncthing vs Dropbox. I am planning to switch to Syncthing, will have to keep an eye out for problems like that!

1

2

0

0

56

2 days ago

@valsaven @mitsuhiko Maybe related to OS? I'm on Linux, and phone access is from Android

1

1

0

0

47

2 days ago

@valsaven @mitsuhiko Interesting! Never been bitten by that, myself.

1

1

0

0

142

4 days ago

@heynavtoor Worth noting that LLMs are trained to minimise cross entropy loss, which is (loosely) a measure of how surprising the real next token is when compared to its belief about what it should be.

0

3

0

2

759
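
A minimal sketch of the idea in the post above, assuming a toy three-token vocabulary and made-up logits (both hypothetical, not from the thread). For a single prediction, cross entropy loss is the negative log of the probability the model assigned to the token that actually came next, so a token the model found unlikely gives a larger loss.

```python
import math

def softmax(logits):
    # Subtract the max before exponentiating for numerical stability.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target_index):
    # Loss for one prediction: -log of the probability given to the real next token.
    probs = softmax(logits)
    return -math.log(probs[target_index])

# Hypothetical vocabulary and model scores, for illustration only.
vocab = ["cat", "dog", "mat"]
logits = [2.0, 0.5, 0.1]  # the model believes "cat" is the most likely next token

loss_expected = cross_entropy(logits, vocab.index("cat"))    # real token matches the belief
loss_surprising = cross_entropy(logits, vocab.index("mat"))  # real token was thought unlikely

print(f"loss when next token is 'cat': {loss_expected:.3f}")
print(f"loss when next token is 'mat': {loss_surprising:.3f}")
```

Training averages this quantity over many positions and adjusts the weights to reduce it, which is what "minimise cross entropy loss" refers to.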

4 days ago

Seeing double... https://t.co/D1yAFemTza

0

0

0

0

37

4 days ago

@teortaxesTex There's algorithmic overhang as well. After all, we have an existence proof that you can train an AGI for maybe $100k on a constant 100W. Downside is that it takes ~20 years and sometimes (existence proof again) produces Literally Hitler.

0

2

0

0

1K

4 days ago

@DanielJHannan "Reality" works well too...

0

1

0

0

151

4 days ago

I've spent some time learning JAX over the last month, and I have Thoughts: https://t.co/Tukcvl2WT2

0

1

0

0

56

5 days ago

@xeophon Weekdays are for gradient descent?

0

0

0

0

183

6 days ago

@teortaxesTex If I were to channel Claude, I'd say "the word 'civilian' is doing a lot of heavy lifting in that sentence".

0

6

0

0

213

6 days ago

@fredsted @levelsio Thanks! I'll give that a go next time I'm staying at a hotel

0

0

0

0

22

7 days ago

@levelsio Anecdotally, it sounds like Portugal spent at least some of its EU funds well -- I've heard that the fibre infrastructure that means that you can get 10G Internet to your home was paid for that way. Impressive if true.

1

1

0

0

697
