Sven Cattell @comathematician - Twitter Profile

Pinned Tweet

about 3 years ago

1) Ok, now that I have a moment I wanna tell some of the story behind this event at @aivillage_dc as I've been working on this for 9 months.

AI Village @ DEF CON @aivillage_dc

about 3 years ago

We've been hard at work on the Generative Red Team event we're doing at @defcon for a while and are excited that the @WhiteHouse announced it this morning. Here's more details: https://t.co/04oXIqXrKr

3

145

66

10

72K

2

73

34

21

55K

Sven Cattell @comathematician

10 months ago

This, but for AI Security. The field is filled with people trying to make a quick buck and don't care about the long term health of the field and it's community.

MG

@_MG_

10 months ago

“and your freedom is gone” would be a great way to destroy defcon’s brand and comes off as extreme punishment for a kid throwing sand in a sandbox. However your post does exhibit a commonality with why we have this issue: lack of contextual nuance. We have far too few people in the space willing to culturally guide people towards nuance that’s appropriate for the context of the situation/environment/audience. There are appropriate times for attention grabbing stunts. And its almost always targeting an audience of defenders & resource allocators. And beforehand there should be a deliberate process of understanding how the intended audience will receive it, what they can meaningfully do in response, dynamics of consent, laws, etc etc. People who are new to the space often miss all of that and try to repeat stuff without this nuance. Quick thrills in a world increasingly focused on attention. Even though the action has the tactical equivalent of throwing a brick through a window. Yea… glass can shatter. We all know! Outside of a longer attack chain (and all the other nuance mentioned) it means nothing. Buuuut… new people to the space aren’t often to detailed nuance. Few will read all this. So, for those people, i will just leave a picture of this sticker that someone gave me at defcon:

1

23

3

1

2K

0

2

0

303

Sven Cattell @comathematician

10 months ago

@jakkuh_t I'm the @aivillage_dc founder who bumped into you with the weird hardware stack for a weird application in AI security.

0

19

comathematician retweeted

Avijit Ghosh

@evijit

over 1 year ago

I'll be at @RealAAAI Conference in Philadelphia this week, where I am part of two accepted papers: 1. Quantifying Misalignment Between Agents: Towards a Sociotechnical Understanding of Alignment, with @AidanKierans , Hananel Hazan, and @ShirKi . In this work, we introduce a novel mathematical model to measure misalignment between multiple human and AI agents across various problem domains, moving beyond single-agent or monolithic approaches to alignment. Through simulations and case studies we demonstrate how our model captures nuanced aspects of misalignment in complex sociotechnical environments, providing enhanced explanatory power for real-world scenarios where agents may hold conflicting goals. Come see our poster during the AI Alignment Track on Friday the 28th - 12:30pm! 2. To Err is AI: A Case Study Informing LLM Flaw Reporting Practices, with @seanmcgregor , @ShayneRedford, @comathematician, and others! This paper documents lessons learned from a bug bounty event at DEF CON 2024 where 495 hackers tested the Open Language Model (OLMo) for flaws, revealing challenges in AI safety reporting processes. Through real-time adjudication of 200 submissions, we identify key insights for effective flaw reporting programs, including the need for specialized tooling, clear documentation practices, and proper adjudication expertise, demonstrating how systematic evaluation and coordinated, structured flaw reporting of AI systems can help prevent real-world harms. See this work presented at IAAI in the "AI Safety, Reliability, and Incident Management" session on Thursday the 27th at 2:30pm! If you're around and want to chat, hit me up! Let's talk AI, Disclosures, Agents, and more!

evijit's tweet photo. I'll be at @RealAAAI Conference in Philadelphia this week, where I am part of two accepted papers:

1. Quantifying Misalignment Between Agents: Towards a Sociotechnical
Understanding of Alignment, with @AidanKierans , Hananel Hazan, and @ShirKi . In this work, we introduce a novel mathematical model to measure misalignment between multiple human and AI agents across various problem domains, moving beyond single-agent or monolithic approaches to alignment. Through simulations and case studies we demonstrate how our model captures nuanced aspects of misalignment in complex sociotechnical environments, providing enhanced explanatory power for real-world scenarios where agents may hold conflicting goals.

Come see our poster during the AI Alignment Track on Friday the 28th - 12:30pm!

2. To Err is AI: A Case Study Informing LLM Flaw Reporting Practices, with @seanmcgregor , @ShayneRedford, @comathematician, and others! This paper documents lessons learned from a bug bounty event at DEF CON 2024 where 495 hackers tested the Open Language Model (OLMo) for flaws, revealing challenges in AI safety reporting processes. Through real-time adjudication of 200 submissions, we identify key insights for effective flaw reporting programs, including the need for specialized tooling, clear documentation practices, and proper adjudication expertise, demonstrating how systematic evaluation and coordinated, structured flaw reporting of AI systems can help prevent real-world harms.

See this work presented at IAAI in the "AI Safety, Reliability, and Incident Management" session on Thursday the 27th at 2:30pm!

If you're around and want to chat, hit me up! Let's talk AI, Disclosures, Agents, and more!

2

12

4

2

654

Who to follow

Ariel Herbert-Voss

@adversariel

Founder @RunSybil. likes: offsec, LLMs, and dumb memes. prev: research scientist @OpenAI / CS PhD @Harvard / @defcon AI Village

Netsec Explained

@GTKlondike

I'm a senior security consultant who makes videos to level up my team on AI, pentesting, and bug bounty. Check out my channel on YouTube.

Yk Mrsk

@mrsk_yk

As an engineer.I like to study number theory and linear algebra of mathematics. My interest is machine learning using python. #physics #python #のベルズ

Sven Cattell @comathematician

over 1 year ago

@goingforbrooke A bulletin board with all the instances of "This is where things went wrong" can help. The CVE/VDP process creates this market force.

0

4

0

36

comathematician retweeted

Saoud Khalifah

@SaoudKhalifah

over 1 year ago

i broke deepseek

4

57

12

3

3K

Sven Cattell @comathematician

over 1 year ago

Meta has some of the best AI risk management infrastructure ever. Fighting spam for 20 years with ML has equipped them for this instance. Use them instead of figuring out it on your own.

1

2

0

139

Sven Cattell @comathematician

over 1 year ago

The main moat of OpenAI, Google, Anthropic and the rest are the security layers they offer to keep the models behaving as they should. AI security is very difficult and starting with a trusted llm with a solid & agile security team saves businesses money.

2

15

1

2

816

Sven Cattell @comathematician

over 1 year ago

@samuelcolvin @rseymour Isn't python type system is basically just documentation. Isn't the enforcing done through linters, and libraries like pydantic?

1

0

11

Sven Cattell @comathematician

over 1 year ago

Coding in python feels like spooky action at a distance. You never quite know what you're doing and the documentation is mostly there.

1

6

1

0

449

Sven Cattell @comathematician

over 1 year ago

I got hopeful that the ML attack, Hop Skip Jump, was in the wild...

watchTowr

@watchtowrcyber

over 1 year ago

hop skip jump over to our latest blog post - analysing Fortinet's FortiJump CVE-2024-47575, FortiJump-Higher (we love this name😄) and beyond (PoC included) https://t.co/35Xg2OoKgP

6

168

73

62

31K

0

310

Sven Cattell @comathematician

over 1 year ago

@rseymour For the first time I was forced to really use Pydantic today. It was terrible. "You didn't pass the timestamp" - well, that's because it's Optional with a default value of None. Why can't you tell? Typed Python - it just barely works... sometimes.

2

1

0

110

Sven Cattell @comathematician

over 1 year ago

I've been in the US for 20 years. We landed 9/11/2004.

0

3

0

196

Sven Cattell @comathematician

over 1 year ago

@EdwardRaffML I already gave you a hat/fire-hazard.

1

0

56

Sven Cattell @comathematician

almost 2 years ago

@Dan_Jeffries1 We tried that with the second generative red team: https://t.co/kvDBI36pVw There's substantial changes for GRT3 from things we learnt in GRT2.

0

48

Sven Cattell @comathematician

almost 2 years ago

We dunked them this year. #dunkafed @BlueTeamVillage @aivillage_dc @wisporg @BlackInCyberCo1

Nyedis @NyedisIAM

almost 2 years ago

DEF CON is DEAD to me! 💀

0

2

0

3K

1

7

2

0

2K

comathematician retweeted

Biohacking Village 🧪 @DC_BHV

almost 2 years ago

Reminder Alert* The #BiohackingVillage is proud to be a #CNA (#CVE Numbering Authority), empowering us to assist companies in managing and disclosing #vulnerabilities responsibly. More info at https://t.co/DyrRaKYhJZ. #VulnerabilityDisclosure #Cybersecurity #PatientSafety

DC_BHV's tweet photo. Reminder Alert* The #BiohackingVillage is proud to be a #CNA (#CVE Numbering Authority), empowering us to assist companies in managing and disclosing #vulnerabilities responsibly. More info at https://t.co/DyrRaKYhJZ.
#VulnerabilityDisclosure #Cybersecurity #PatientSafety https://t.co/iC9pjUIR62

0

14

4

0

2K

Sven Cattell @comathematician

almost 2 years ago

One way to make a QM goon happy is to give them gaffer tape and power strips. AIV had some extra. 😄

0

2

0

138

Sven Cattell @comathematician

almost 2 years ago

We built a quick landing page in @wix and every part of their site is designed to take your domain hostage. Never use them. #enshittfication

2

5

0

230

Sven Cattell @comathematician

almost 2 years ago

This year's AIV is what I want @aivillage_dc at @defcon to be. Community, connections, and learning is what I want to foster.

AI Village @ DEF CON @aivillage_dc

almost 2 years ago

Generative Red Team 2 was a massive success. We paid $7350 in bounties. We learnt so much about bounties and reporting for ML. Thank you to everyone who participated!! (specific acks in the thread below)

aivillage_dc's tweet photo. Generative Red Team 2 was a massive success. We paid $7350 in bounties. We learnt so much about bounties and reporting for ML.

Thank you to everyone who participated!! (specific acks in the thread below) https://t.co/KwcaYfL6bt

5

64

12

8

8K

0

12

1

0

1K

comathematician retweeted

AI Village @ DEF CON @aivillage_dc

almost 2 years ago

@dreadnode and @bugcrowd built the platform. @allen_ai and UL's DSRI brought the model. @AISafetyInst and @GoogleAI made the workshop happen. There were a bunch of other people and orgs that helped plan and execute.

2

19

5

3

3K

Sven Cattell

@comathematician

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users