Jasper Götting @JasperGeh - Twitter Profile

3 days ago

Why run AI biorisk evals at all? In his new post, @JasperGeh explains the biorisk evidence hierarchy (first-principles arguments, evals, uplift RCTs) and why evals provide the best evidence-per-dollar. Read the full post here: https://t.co/QeSZaE3Fzh

0

7

2

306

Jasper Götting @JasperGeh

14 days ago

@gleech The real beauty is in the ✨fields✨

0

40

JasperGeh retweeted

Cairo Smith

@cairoasmith

about 1 month ago

There's a common misconception that Brutalist buildings were unpainted, but thanks to microscopic analysis of the exteriors we can now recreate what they looked like in their prime.

443

39K

3K

7M

JasperGeh retweeted

Liv Boeree

@Liv_Boeree

about 1 month ago

Please call your congressperson NOW and ask them to vote NO on this today!

3

254

51

4

8K

JasperGeh retweeted

pedram.md

@pdrmnvd

5 months ago

oh you’re using claude code? everyone’s using open code. just kidding we’re all on amp code. we’re using cline, we’re using roo code. we just forked our own version of roo. we’re using kilo code. we were on coderabbit but their ceo yelled at us so now we’re using qorbit. apple just acquired them for $30bn so we just migrated our entire team to slash commands. one guy is still on aider. the PM is on loveable. he just shipped a new product on replit. the intern installed a slackbot that lets you chat with your spreadsheet. legal is still reviewing devin’s enterprise contract. we evaluated junie for three ukrainians using jetbrains. someone in slack just asked “has anyone tried amp?” we are using goose for scripts. next week we’re piloting augment code. the CTO heard good things about trae. our CEO is friends with the guy from conductor. our CFO resigned. our CISO said we’ve had fourteen supply chain attacks in the last week. we’re shipping the world’s most expensive todo app.

124

6K

487

1K

792K

JasperGeh retweeted

Andy Masley

@AndyMasley

7 months ago

Bro Tesla stole my meme

18

618

10

17

32K

JasperGeh retweeted

Andy Masley

@AndyMasley

8 months ago

Every day I find a new way of trying to get across just how ridiculously fake the problem of AI water use is

196

6K

575

2K

482K

JasperGeh retweeted

Luca Righetti

@lucafrighetti

9 months ago

How can we verify that AI ChemBio safety tests were properly run? Today we're launching STREAM: a checklist for more transparent eval results. I read a lot of model reports. Often they miss important details, like human baselines. STREAM helps make peer review more systematic.

2

84

16

27

14K

JasperGeh retweeted

Luca Righetti

@lucafrighetti

9 months ago

I've been procrastinating on this chart of all model card releases by OpenAI, GDM, and Anthropic:
• 4 cases of late safety results (out of 27, so ~15%)
• Notably, 2 cases where late results showed increases in risk
• The most recent set of releases in August were all on time

2

55

7

14

4K

JasperGeh retweeted

Andy Masley

@AndyMasley

9 months ago

This headline from the NYT, that came complete with a really nice photo series of the local impacted community, is I think a full lie rather than just misleading. It is not the case, anywhere, that data centers taking water has caused problems for communities.

27

2K

87

262

81K

JasperGeh retweeted

SecureBio

@SecureBio

9 months ago

The Nucleic Acid Observatory is hiring! Having built the technical core of an early warning system, we have now received funding to further scale our work, including owning initial outbreak response. We’re hiring for four roles, with a referral bonus of up to $6k. More details:

1

7

3

1

2K

JasperGeh retweeted

Andy Masley

@AndyMasley

10 months ago

Folks, I ran the numbers on the UK government's recommendation to delete old photos and emails to save water. Link below

17

671

62

81

105K

JasperGeh retweeted

David Manheim is @ MoreOnline

@davidmanheim

10 months ago

Hot take: Most AI-bioterrorism risk "debate" is people shadow-boxing over definitions, not different predictions about concrete points. I've talked to many other experts. We often don’t agree on what “enable” means, so people think the risk is uncertain in ways it is not. 🧵

2

42

5

17

5K

JasperGeh retweeted

Epoch AI

@EpochAIResearch

11 months ago

We have graded the results of @OpenAI's evaluation on FrontierMath Tier 1–3 questions, and found a 27% (± 3%) performance. ChatGPT agent is a new model fine-tuned for agentic tasks, equipped with text/GUI browser tools and native terminal access. 🧵

33

776

103

136

214K

JasperGeh retweeted

dynomight @dynomight7

11 months ago

New colors without shooting lasers into your eyes https://t.co/UzaP4kobZz

5

28

3

2K

Jasper Götting @JasperGeh

11 months ago

@lxrjl @AricFloyd Bayesian curls, of course

0

4

0

175

JasperGeh retweeted

Larus (Ven) ➡️ the Hadopelagic Zone 🍉 @NeonTetraploid

11 months ago

i've had this idea bouncing around in my head for a while now

39

10K

1K

344K

Jasper Götting @JasperGeh

11 months ago

0

73

JasperGeh retweeted

Siméon

@Simeon_Cps

11 months ago

A bit more detail on this:

1. Why is the dual deployment setup promising for bio?
a) Bioweapons-adjacent knowledge (e.g. virology) is useful for a tiny fraction of the population. Removing it from a general-purpose deployment does not actually curtail many benefits. The existence of platforms like the one below, which could apply high KYC requirements affecting just a few thousand users, makes it possible to get the benefits at minimal cost.
b) Bioweapons, on the other hand, are really bad, so it is really worth decreasing the marginal risk to close to zero.
c) Note that this method is less promising for cyber, which is very adjacent to code and which is more symmetric.

2. Why shouldn't we stick to existing methods like constitutional classifiers?
a) Because they don't really work as well as you'd want. A core reason why Anthropic made up this distinction in threat models between "universal" and "non-universal" jailbreaks, narrowing down their commitment to universal ones, is that they couldn't defend against jailbreaks in general. As a result they deployed their system, which likely increases the risk non-trivially, even assuming that it's resistant to all universal jailbreaks.

3. What are the technical challenges in doing this dual deployment setup?
a) The biggest uncertainty I have is whether you can get a model to be good at a domain just by feeding in most of the knowledge/corpus at a very late stage of the training run. If that is the case, then doing this dual late-stage training shouldn't be too painful. If not, that would make this process a lot costlier.
b) Another challenge to solve, but one which seems much more feasible, is to identify the overwhelming majority of bioweapons-relevant knowledge and reasoning. For bio, it seems easier than for some other domains, as it's pretty niche and identifiable.

0

28

5

7

11K

JasperGeh retweeted

SecureBio

@SecureBio

11 months ago

Experts underestimate the progress of AI and the potential implications for pandemic risk. We are proud to have contributed to a new study that shows how biosecurity experts and superforecasters think about biological risk and AI progress.

2

12

7

1

906
