Jonas @LoosJonas - Twitter Profile

@IterIntellectus Some aspects are quite uncontroversial & universal although often fuzzy, e.g. "lying is usually bad", "being helpful tends to be good", "don't eradicate humanity", etc. - so I think that at least some progress is possible

0

2

0

70

Jonas

@LoosJonas

3 days ago

@VictorTaelin Maybe use AI for the first 95% just to see if sth is possible and then reimplement from scratch?

0

1

0

526

Jonas

@LoosJonas

18 days ago

I was thinking a bit about a new lock design that prevents picking by locking the pins in place before validation

0

2

0

30

Jonas

@LoosJonas

25 days ago

@che_shr_cat Interesting method! Just wondering - are there also cases where it's strictly worse than standard AdamW? Or is it always at least similarly effective?

0

1

0

373

Jonas

@LoosJonas

27 days ago

@ebarenholtz If the changes the layers introduce in the residual stream meaningfully correspond to geometric operations in the manifolds visualized in these fancy animations - then "they think in shapes" doesn't really sound like the worst description, or?

0

85

Jonas

@LoosJonas

27 days ago

@mariusmosbach @johnhewtt (semi-implicitly, their KL penalty apparently does smth similar - https://t.co/UoCQhvbCV0)

0

34

Jonas

@LoosJonas

27 days ago

@mariusmosbach @johnhewtt My first idea was to add this as an extra regularization loss, so I was positively surprised seeing that it apparently already learns this implicitly

1

0

30

Jonas

@LoosJonas

about 1 month ago

@tszzl Having them on a separate machine with remote control is such a blessing, except for all the anthropic outages

0

239

Jonas

@LoosJonas

about 1 month ago

@giffmana Emergent Goblification

0

1

0

143

Jonas

@LoosJonas

about 1 month ago

@bojie_li Apparently only ~1% improvement could be achieved by increasing question catalogue size, so it seems already good. Nice that the data is public!

LoosJonas's tweet photo. @bojie_li Apparently only ~1% improvement could be achieved by increasing question catalogue size, so it seems already good.

Nice that the data is public! https://t.co/iFOdEExjzf

0

1

0

39

Jonas

@LoosJonas

about 1 month ago

@bojie_li Awesome work! Did you check how size prediction accuracy scales with benchmark size? Would it make sense to double the number of questions to get better predictions, or is it saturated already?

1

2

0

3K

Jonas

@LoosJonas

about 1 month ago

@sytelus > Con: we will never know what models are thinking. I guess it would be relatively easy to build a translator. Will not be perfect, but even todays CoTs aren't necessarily 100% faithful.

0

3

0

37

Jonas

@LoosJonas

about 1 month ago

@kalomaze could it be that it's pretrained with all data in chat format and only this template?

0

243

Jonas

@LoosJonas

about 1 month ago

@VictorTaelin Nice! Would probably be helpful if it included more hard problems, because if already current models score ~90%, distingushing future models seems noisy

0

1

0

538

Jonas

@LoosJonas

about 1 month ago

@marvinsxtr @gabrieldernbach @TUBerlin @bifoldberlin @AIgnostics @ChariteBerlin nice project page and awesome to see code/model/data being public!

0

20

Jonas

@LoosJonas

Last Seen Users on Sotwe

Trends for you

Most Popular Users