Ben Edelman @EdelmanBen - Twitter Profile

about 1 month ago

This paper does a fantastic job conveying that: (1) Deep learning abounds in miraculous empirical regularities (2) A beautiful scientific theory has emerged over the past decade to explain the miracles (3) Yet most fundamental questions remain mysteries. The best is yet to come.

Jamie Simon @learning_mech

about 1 month ago

1/ Deep learning is going to have a scientific theory. We can see the pieces starting to come together, and it's looking a lot like physics! We're releasing a paper pulling together these emerging threads and giving them a name: learning mechanics. 🔨 https://t.co/92nSIHameW 🔧

learning_mech's tweet photo. 1/ Deep learning is going to have a scientific theory. We can see the pieces starting to come together, and it's looking a lot like physics!

We're releasing a paper pulling together these emerging threads and giving them a name: learning mechanics.

🔨 https://t.co/92nSIHameW 🔧 https://t.co/3cshMD33bl

53

2K

293

2K

303K

2

42

6

16

6K

Ben Edelman @EdelmanBen

2 months ago

@zicokolter At CAISI we started using the phrase "agent hijacking" for prompt injections of agents because it avoids the inevitable confusion about the prompt injection vs jailbreak distinction (not to even mention direct vs indirect), and conveys impact more directly for a lay audience.

0

1

0

1

63

Ben Edelman @EdelmanBen

2 months ago

@zicokolter Yep agreed it's all the same underlying vulnerability; instruction hierarchy-style distinctions (app developer / user / external content) are "just" an abstraction. (I was also involved with the new paper, btw)

1

0

50

Ben Edelman @EdelmanBen

2 months ago

@zicokolter Post where Simon Willison coined prompt injection: https://t.co/PRiBfAdE1V. Paper where Greshake et al. coined indirect prompt injection: https://t.co/W2hfu2TYfh

1

0

1

72

Ben Edelman @EdelmanBen

2 months ago

@zicokolter Fwiw, my understanding is that the original coinage of prompt injection was focused on contexts where the untrusted data comes from an untrusted user. Then Greshake et al. coined IPI to highlight the case where the attacker leverages data likely to be retrieved at inference time.

1

0

55

Ben Edelman @EdelmanBen

4 months ago

Excited to be part of this initiative. Join our team to advance the frontier of agent security research and standards! https://t.co/2v854Aydmk

Director Michael Kratsios

@mkratsios47

4 months ago

The future of AI is agentic, and America is leading the way to make it secure and interoperable. A new AI Agent Standards Initiative is launching this week @NIST to drive industry-led standards and open protocols that build trust and advance innovation. https://t.co/bS5oqvU8iu

140

2K

330

884

153K

0

9

0

3

673

EdelmanBen retweeted

Tony Wang

@TonyWangIV

4 months ago

Excited to share @NIST+CAISI’s initial public draft on how to run and report results of automated evals. If you have opinions on evals, we’d love your feedback — help us improve the AI evals ecosystem! Public comments accepted through March 31st via [email protected]. more in🧵

TonyWangIV's tweet photo. Excited to share @NIST+CAISI’s initial public draft on how to run and report results of automated evals.

If you have opinions on evals, we’d love your feedback — help us improve the AI evals ecosystem!

Public comments accepted through March 31st via ai800-2@nist.gov.

more in🧵 https://t.co/n9cEynIoyb

2

28

6

17

4K

EdelmanBen retweeted

Boaz Barak @boazbaraktcs

4 months ago

One of the best places if you have technical background and care about AI going well!

0

23

3

11

5K

EdelmanBen retweeted

Dwarkesh Patel

@dwarkesh_sp

4 months ago

Seems like a great opportunity for technical talent to come into government and help the USG make sound, technically informed decisions on AI

9

144

15

40

50K

EdelmanBen retweeted

Samuel Hammond 🦉

@hamandcheese

4 months ago

CAISI is hiring for a bunch of exciting new roles, from partnerships to technical experts in AI x bio / chem and more. They're serious about bringing in strong researchers & engineers and letting them do good work. Based in DC or SF: https://t.co/GsooeO3IxK

4

170

40

68

85K

Ben Edelman @EdelmanBen

4 months ago

My Agent Security team is hiring Research Engineers & Scientists. Other teams are hiring people with strong technical backgrounds too: Frontier Assessment, Cyber, Chem/Bio, Applied Systems, and Partnerships. Job postings are listed here: https://t.co/hG2mmUMiUH

0

10

1

5

657

Ben Edelman @EdelmanBen

4 months ago

People sometimes ask me how to leverage a technical background to jump into U.S. AI policy. As of this week my answer is straightforward: apply to join us at CAISI! We're a startup within government, and we're doing a hiring surge.

EdelmanBen's tweet photo. People sometimes ask me how to leverage a technical background to jump into U.S. AI policy. As of this week my answer is straightforward: apply to join us at CAISI! We're a startup within government, and we're doing a hiring surge. https://t.co/InGPQJ5E6Y

4

88

23

36

24K

Ben Edelman @EdelmanBen

4 months ago

The United States is the center of the AI revolution. We need dedicated public servants to ensure our government is smart on AI issues.

1

9

2

1

6K

Ben Edelman @EdelmanBen

4 months ago

At CAISI, we're the U.S. government's leading experts on agent security. We published this RFI so deployers, developers, and experts can provide insights that inform our research and NIST guidelines development. Responses due March 9th!

Peter Cihon @pcihon

4 months ago

CAISI has published an RFI about securing AI agents. It seeks insights from AI agent deployers, developers, and computer security researchers. Questions address the current threat landscape, mitigations, measurements, and other security considerations unique to AI agents.

1

13

6

2

3K

1

8

1

2

684

EdelmanBen retweeted

Peter Cihon @pcihon

5 months ago

CAISI is recruiting an intern to support an agent security standards project. Position closes Jan. 15 for a February start. Please help spread the word. Details in thread:

2

52

18

12

14K

Ben Edelman @EdelmanBen

5 months ago

@boazbaraktcs Since I organized this by model family branding (GPT) rather than developer (OpenAI), I think the move would be to add a separate o-series line. And don't get me started about Sonnet vs Opus

0

175

Ben Edelman @EdelmanBen

5 months ago

the AI race in one terrible graph

2

27

1

3

3K

Ben Edelman

@EdelmanBen

Last Seen Users on Sotwe

Trends for you

Most Popular Users