Freddie Bickford Smith @fbickfordsmith - Twitter Profile

Pinned Tweet

11 months ago

There’s a lot of confusion around uncertainty in machine learning. We argue the "aleatoric vs epistemic" view has contributed to this and present a rigorous alternative. #ICML2025 with @janundnik @eleanortrollope @markvanderwilk @adamefoster @tom_rainforth 1/5

fbickfordsmith's tweet photo. There’s a lot of confusion around uncertainty in machine learning.

We argue the "aleatoric vs epistemic" view has contributed to this and present a rigorous alternative.

#ICML2025 with @janundnik @eleanortrollope @markvanderwilk @adamefoster @tom_rainforth

1/5 https://t.co/lFwojHjqNp

1

63

15

33

5K

fbickfordsmith retweeted

The Wall Street Journal

@WSJ

3 months ago

Anthropic, the AI company known for its devotion to safety, is scaling back that commitment https://t.co/qOdXDbP9zb

35

170

35

34

54K

fbickfordsmith retweeted

Jon Barron

@jon_barron

4 months ago

This idea that intelligence is solely a function of what you've observed since birth and not also a function of the 500 million years of evolution that preceded your birth is surprisingly sticky despite being demonstrably untrue.

57

1K

54

244

119K

Freddie Bickford Smith @fbickfordsmith

6 months ago

@SumedhaChugh @janundnik 11am on Friday :)

0

1

0

61

Who to follow

aj

@anndvision

postdoc @Columbia . member of @blei_lab . phd @UniofOxford . prev @OATML_Oxford , @PVG_McGill , intern @Meta . he / they

Andrew Campbell

@AndrewC_ML

Research Scientist, Google DeepMind. Previous: @Xaira_Thera, PhD @oxcsml

Tim G. J. Rudner

@timrudner

Assistant Professor, @UofT Statistics & CS | CIFAR AI Chair, @VectorInst Machine Learning, AI Safety, AI Governance Prev: Rhodes Scholar, @UniofOxford, @Yale

Freddie Bickford Smith @fbickfordsmith

6 months ago

Active testing enables label-efficient model evals but can be computationally expensive. We show how to reduce costs and scale up to LLMs. https://t.co/rXkpQrJ7DY Work led by Gabrielle Berrada. Find her at EurIPS, or @janundnik and me at NeurIPS in San Diego.

fbickfordsmith's tweet photo. Active testing enables label-efficient model evals but can be computationally expensive.

We show how to reduce costs and scale up to LLMs.

https://t.co/rXkpQrJ7DY

Work led by Gabrielle Berrada. Find her at EurIPS, or @janundnik and me at NeurIPS in San Diego. https://t.co/qDQrM3bCaT

3

14

7

8

7K

fbickfordsmith retweeted

Jannik Kossen @janundnik

6 months ago

Come and check out our work on how to evaluate LLMs with less compute and fewer labels! Find first author Gabrielle at EurIPS or Freddie and I at poster 110 at the 11 AM session on Friday.

0

6

1

597

fbickfordsmith retweeted

Dimitris Papailiopoulos

@DimitrisPapail

6 months ago

Grandest endorsement of doing a PhD that I've seen recently :)

9

819

43

114

57K

fbickfordsmith retweeted

Timothy Gowers @wtgowers @wtgowers

7 months ago

I crossed an interesting threshold yesterday, which I think many other mathematicians have been crossing recently as well. In the middle of trying to prove a result, I identified a statement that looked true and that would, if true, be useful to me. 1/3

61

2K

299

757

893K

fbickfordsmith retweeted

Toby Ord

@tobyordoxford

8 months ago

New post on RL scaling: Careful analysis of OpenAI’s public benchmarks reveals RL scales far worse than inference: to match each 10x scale-up of inference compute, you need 100x the RL-training compute. The only reason it has been cost-effective is starting from a tiny base. 🧵

tobyordoxford's tweet photo. New post on RL scaling:
Careful analysis of OpenAI’s public benchmarks reveals RL scales far worse than inference: to match each 10x scale-up of inference compute, you need 100x the RL-training compute. The only reason it has been cost-effective is starting from a tiny base.
🧵 https://t.co/ZwhDegc4NO

26

497

53

366

192K

fbickfordsmith retweeted

Helen Toner

@hlntnr

8 months ago

Every so often, OpenAI employees ask me how I see the co now. It's always tough to give a simple answer. Some things they're doing, eg on CoT monitoring or building out system cards, are great. But the dishonesty & intimidation tactics in their policy work are really not. E.g:

340

5K

624

1K

27M

fbickfordsmith retweeted

Oxford Statistics @OxfordStats

9 months ago

With @JesusOxford we are looking for a Professor of Statistics. Become part of a historic institution and a community focused on academic excellence, innovative thinking, and significant practical application. About the role: https://t.co/mzBacAoCiv Deadline: 15 September

OxfordStats's tweet photo. With @JesusOxford we are looking for a Professor of Statistics.

Become part of a historic institution and a community focused on academic excellence, innovative thinking, and significant practical application.

About the role: https://t.co/mzBacAoCiv
Deadline: 15 September https://t.co/vXWBslgcGb

0

11

9

1

1K

fbickfordsmith retweeted

Cas (Stephen Casper)

@StephenLCasper

9 months ago

Research on AI "sandbagging" is getting more popular recently. In this 🧵, I'll give some reasons that I think it's not a useful research paradigm. TL;DR, I think it's a confusing reframing of fairly well studied and previously solved problems.

5

80

4

87

16K

fbickfordsmith retweeted

David Krueger 🦥 ⏸️ ⏹️ ⏪

@DavidSKrueger

10 months ago

It's great the governments (and others) continue to demonstrate that the models companies release are incredibly insecure. It's terrible that governments aren't penalizing companies for releasing such insecure models, and instead just help them patch them.

0

19

3

5

2K

Freddie Bickford Smith @fbickfordsmith

10 months ago

@_rockt @andrewgwils More like (1) the future resembles the past and (2) you can capture the resemblances in your model, right? The existence of the future doesn’t imply you can predict it.

0

51

fbickfordsmith retweeted

Lorenz Kuhn @_lorenzkuhn

10 months ago

Just two years ago, our smartest models could barely solve the easiest competitive programming problems. Last week, our latest reasoning models achieved a gold medal score at the International Olympiads of Informatics. Competitive programming is one of the cleanest examples of scaling up RL training for LLMs. Soon, with experimental approaches like the one used for IMO, we might see similar scaling on real-world coding problems.

12

146

7

22

16K

fbickfordsmith retweeted

William MacAskill

@willmacaskill

10 months ago

Today I’m releasing an essay series called Better Futures. It’s been something like eight years in the making, so I’m pretty happy it’s finally out! It asks: when looking to the future, should we focus on surviving, or on flourishing?

willmacaskill's tweet photo. Today I’m releasing an essay series called Better Futures.

It’s been something like eight years in the making, so I’m pretty happy it’s finally out!

It asks: when looking to the future, should we focus on surviving, or on flourishing? https://t.co/qdQhyzlvJa

11

341

38

173

50K

Freddie Bickford Smith @fbickfordsmith

10 months ago

Join RainML! Lots of exciting work going on :)

Tom Rainforth @tom_rainforth

10 months ago

I have an opening for a 2-year postdoc in probabilistic machine learning and/or experimental design. The application deadline is the 3rd of September. See here for details and how to apply: https://t.co/ht9n9cEviw

0

39

11

8

3K

0

3

0

232

Freddie Bickford Smith @fbickfordsmith

10 months ago

Strongly recommend working with Rob!

Rob Cornish @rob_cornish

10 months ago

I'm looking for talented and ambitious PhD students to join me at Nanyang Technological University Singapore to work on safe and robust AI systems! Full scholarships covering tuition and a stipend are available, and are open to local and international students alike.

5

77

18

28

8K

0

1

0

186

Freddie Bickford Smith @fbickfordsmith

10 months ago

@ClementineDomi6 @GatsbyUCL @SaxeLab Congrats!

1

0

246

Freddie Bickford Smith @fbickfordsmith

11 months ago

Come and chat at the poster session today at 4:30-7pm :) Poster E-1403, East Exhibition Hall A-B

Freddie Bickford Smith @fbickfordsmith

11 months ago

There’s a lot of confusion around uncertainty in machine learning. We argue the "aleatoric vs epistemic" view has contributed to this and present a rigorous alternative. #ICML2025 with @janundnik @eleanortrollope @markvanderwilk @adamefoster @tom_rainforth 1/5

1

63

15

33

5K

0

4

0

221

fbickfordsmith retweeted

summerfieldlab @summerfieldlab.bsky.social @summerfieldlab

11 months ago

In a new paper, we examine recent claims that AI systems have been observed ‘scheming’, or making strategic attempts to mislead humans. We argue that to test these claims properly, more rigorous methods are needed.

summerfieldlab's tweet photo. In a new paper, we examine recent claims that AI systems have been observed ‘scheming’, or making strategic attempts to mislead humans. We argue that to test these claims properly, more rigorous methods are needed. https://t.co/n7W8qyY27n

4

84

25

32

17K

Freddie Bickford Smith

@fbickfordsmith

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users