Will Rice @_Will_Rice - Twitter Profile

Will Rice

@_Will_Rice

about 2 months ago

@wandb

0

33

Will Rice

@_Will_Rice

about 2 months ago

Is making a team and paying for a seat the only way to get notifications for @wandb anymore?

1

0

122

Will Rice

@_Will_Rice

4 months ago

@karpathy Mostly agree, except I’ve totally had Claude Code suggest adding ignore comments to mypy errors after trying to fix them. So I think it does get stuck and give up. The difference is you can just say “don’t reference my other chats” and it will try again.

0

588

Will Rice

@_Will_Rice

10 months ago

@sedielem I put normalizing flow, but seeing flow-based in an abstract has changed over the years what I expect the paper to be about. Early in my career I would expect Real NVP, but now I would expect flow matching.

0

3

0

307

Will Rice

@_Will_Rice

10 months ago

I find deep research from @OpenAI super useful. However, I would like to see improvement in the output format. I always want it to be in markdown but it leaves out information when asked to convert it to markdown. If anyone has luck with getting the full output in a markdown file, I would appreciate any tips.

0

73

Will Rice

@_Will_Rice

10 months ago

what do you do when you get rate limited by your AI therapist?

0

1

0

44

Will Rice

@_Will_Rice

10 months ago

For comparison, I'd like to see the performance of a random sample from the same companies ChatGPT was given to pick from.

Rohan Paul

@rohanpaul_ai

11 months ago

It's going viral on Reddit. Somebody let ChatGPT run a $100 live share portfolio, restricted to U.S. micro-cap stocks. Did an LLM really beat the market? Over 4 weeks it gained 23.8%, while the Russell 2000 and the biotech ETF XBI rose only ~3.9% and 3.5%. Prompt + GitHub posted. Of course, it's short-term outperformance with a tiny sample size, and micro caps are highly volatile. Much more exhaustive analysis is needed, with a lot more info (like Sharpe ratios and longer back-testing), to explore whether an LLM can truly beat the market.

172

9K

591

19K

2M

0

1

0

51

Will Rice

@_Will_Rice

about 1 year ago

So we tuning on the test set now?

0

1

0

33

Will Rice

@_Will_Rice

almost 2 years ago

When is @astral_sh going to replace mypy? 😁

0

2

0

88

_Will_Rice retweeted

William

@williamwelsh

almost 2 years ago

@thdxr we heard you like abstracting so we abstracted your abstraction to give you more abstraction 🤩

0

32

2

0

12K

Will Rice

@_Will_Rice

almost 2 years ago

Someone remind me how many years it was between the first GAN paper and realistic faces that still had artifacts.

ROM 🥔

@i_dg23

almost 2 years ago

more detailed version

21

364

59

158

204K

0

2

0

165

Will Rice

@_Will_Rice

almost 2 years ago

@timClicks If you look at the code for the paper, there is a lot of Triton CUDA stuff

0

77

Will Rice

@_Will_Rice

about 2 years ago

@JeremiahDJohns It’s interesting that some of these came from sources already on the internet that are obviously trolls/satire. That actually might be fixable if you could rank existing answers based on truthfulness.

1

0

2K

Will Rice

@_Will_Rice

about 2 years ago

Not surprising to anyone that’s actually worked on generative models.

Riley Goodside

@goodside

about 2 years ago

GPT-3’s score on the MMLU benchmark was 40%. First release of GPT-4 scored 86%, and today GPT-4o is 89%. An increase of just 3%: that’s a full year of progress. If you plot the prior trend we were supposed to be at 100, maybe 120% by now. AI is hitting a wall. https://t.co/XshPQKjOw2

53

566

16

55

108K

0

1

0

169

Will Rice

@_Will_Rice

about 2 years ago

Unpopular opinion: AI agents are going to do some things well, but fail to generalize enough to be useful.

Nick Davidov

@Nick_Davidov

about 2 years ago

Unpopular opinion: AI agents are hard.

98

416

26

55

88K

0

1

0

107

Will Rice

@_Will_Rice

about 2 years ago

@jankosinski How do you know that it was accurately predicted?

1

0

223

_Will_Rice retweeted

Arun Rao

@sudoraohacker

about 2 years ago

Hot take: ML researchers are underestimating how quickly recent scaling laws may flatten out - it’s quite likely that what people see as an exponential function is a sigmoid, and the harsh reality of the physics of high energy costs and power plant construction restricts the expected benefits from pretraining ever larger models.

5

36

9

12

7K