alyxya @_alyxya - Twitter Profile

alyxya

@_alyxya

about 23 hours ago

@_arohan_ @lulumeservey found the human, claude uses the contraction

0

37

alyxya

@_alyxya

2 days ago

@_arohan_ @kellerjordan0 can you invite me to the discord channel?

0

1

0

1K

alyxya

@_alyxya

2 days ago

@river_ai_inc > River runs LoRA-based fine-tuning and reinforcement learning on a variety of open-source models — from small 35B to large 1T parameters. pessimistic about this approach because it's too large and expensive for ordinary consumers and creating custom hardware also sounds bad

0

1

1K

alyxya

@_alyxya

3 days ago

@francoisfleuret I think of this as the point of inflection where it'll slow down because we'll stop being able to differentiate or measure most improvements to models bigger models should in theory have more potential capability but the difficulty of the training data is bounded by us

0

4

0

2

876

alyxya

@_alyxya

3 days ago

@_arohan_ about the original question, I'm going to guess the answer is yes based off of my intuition of what the current models are good at and capable of, though it wouldn't be easy to extract and verify its claims and is subject to a good prompt and harness

0

30

alyxya

@_alyxya

3 days ago

@_arohan_ at least I'm confident the current models cannot come up with some of my ideas because they're too far out of distribution, meanwhile shampoo is well within distribution given how long it has been around to be a part of training data

1

0

1

338

alyxya

@_alyxya

3 days ago

@rajivmovva or you're just getting opus?

0

531

alyxya

@_alyxya

3 days ago

@gum1h0x AI can help with coming up with simpler solutions and more intuitive explanations, so more of that may exist in the future deep technical knowledge should be easy to verify and understand in full with the right framing, so gaps and complexity just indicate incompleteness

0

1

0

109

alyxya

@_alyxya

4 days ago

@khoomeik newton=rohan leibniz=keller

0

3

0

570

alyxya

@_alyxya

4 days ago

@typedfemale agi is going to be neither

0

265

alyxya

@_alyxya

4 days ago

@_arohan_ I was expecting this lol, couldn't let him get away with showing shampoo worse than muon

0

1K

alyxya

@_alyxya

4 days ago

@_arohan_ adamw had the wrong theory being element wise, muon is better especially with weight decay and other tricks, but I don't think an optimizer improvement alone fixes fundamental limitations in ml theory

0

886

alyxya

@_alyxya

6 days ago

@sur4js probably keep it secret

0

192

alyxya

@_alyxya

7 days ago

@jxmnop eventually AI will reward hack the task of end to end training a frontier LLM by going through the motions of training something but reroute the final inference used for verifying the model to its own inference

0

2

0

595

alyxya

@_alyxya

8 days ago

@PhilipJohnston @SpaceX I'm going to bet that it won't differ much from ipo price, like there may be a lot of demand, but there's a ton of supply

1

4

0

9K

alyxya

@_alyxya

8 days ago

@scaling01 they should work with pause ai

0

22

alyxya

@_alyxya

8 days ago

@ElizaKosoy idk if it's just me but the url from clicking the first link doesn't have the l at the end

0

1

371

alyxya

@_alyxya

9 days ago

@frontier_foid what's wrong with thinking machines? should be more attractive than other labs due to having a smaller team, unless compensation is the main consideration

0

412

alyxya

@_alyxya

9 days ago

machine learning itself isn't a new field in the past few years so anyone with ml research or systems or similar experience had most of that translate well to the current paradigms years of seniority isn't the same as being experienced because it depends on what you did in those years, but most talent did things that anyone else could've done as well, which is why I think the main factor is experience granted there are specialized pipelines like from openai that do try to hire talent but those are the minority

0

475

alyxya

@_alyxya

Last Seen Users on Sotwe

Trends for you

Most Popular Users