Andrey Kovalev @avkovaleff - Twitter Profile

9 days ago

Julia Davila’s “Boring Seams” keynote diagnoses AI security as pre-Noether: empirical conservation laws without a unifying symmetry. Her claim is that the orchestration layer, not the model, is where the structural symmetries live. https://t.co/DlV1MXpedt

1

7

2

6

4K

avkovaleff retweeted

Ivan Krstić

@radian

14 days ago

🔺NEW: Formally verified post-quantum ML-KEM and ML-DSA in corecrypto, with correctness proven from the FIPS spec down to hand-optimized ARM64 assembly — a world first at multi-billion device scale. And we're releasing our Isabelle libraries, ARM64 model, and Cryptol-to-Isabelle translator to advance the state of the art in verified cryptography! https://t.co/LZPHFD0ifE

11

433

103

177

47K

avkovaleff retweeted

Mira Murati

@miramurati

17 days ago

Collaborative AI runs on interactivity: machines and people, working in real time, across every modality. Solving it takes a community, join us.

79

2K

115

419

250K

avkovaleff retweeted

Elie Bursztein

@elie

21 days ago

[Weekend Read] ExploitGym: Can AI Agents Turn Security Vulnerabilities into Real Attacks? 📄 Read here: https://t.co/fq7pg5w0CC In our latest joint research with academia and other frontier labs, we tested the ability of models to turn vulnerabilities into working exploits across different attack surfaces and mitigation conditions. Beyond the benchmark numbers, here is what this means for the industry: -🛡️ Blue Teams: Speeding up patch development and deployment is no longer optional. Integrating AI directly into CI/CD workflows should be your top priority. -🔬 Researchers: Current mitigation techniques reduce success rates, but they aren't a silver bullet. We need to step up our game—where do we focus next? -⚔️ Offensive Security: As models get better at finding bugs and writing exploits, we have to rethink disclosure timelines entirely. What does the future of bug bounties look like in this new era? I'd love to hear how your teams are preparing for this shift. Let me know

elie's tweet photo. [Weekend Read] ExploitGym: Can AI Agents Turn Security Vulnerabilities into Real Attacks? 📄 Read here: https://t.co/fq7pg5w0CC

In our latest joint research with academia and other frontier labs, we tested the ability of models to turn vulnerabilities into working exploits across different attack surfaces and mitigation conditions.

Beyond the benchmark numbers, here is what this means for the industry:

-🛡️ Blue Teams: Speeding up patch development and deployment is no longer optional. Integrating AI directly into CI/CD workflows should be your top priority.

-🔬 Researchers: Current mitigation techniques reduce success rates, but they aren't a silver bullet. We need to step up our game—where do we focus next?

-⚔️ Offensive Security: As models get better at finding bugs and writing exploits, we have to rethink disclosure timelines entirely. What does the future of bug bounties look like in this new era?

I'd love to hear how your teams are preparing for this shift. Let me know

1

15

6

2K

Who to follow

Omar "Beched" Ganiev

@theBeched

Security research, mathematics, programming | Co-Founder @DecurityHQ

Artem Chaikin

@a_chaykin

Acting like some mobile hacker @brave

Dmitriy Evdokimov

@evdokimovds

Оbservability, visibility, security of containerised apps and K8s. eBPF fan.

avkovaleff retweeted

Orange Tsai 🍊

@orange_8361

21 days ago

And this one is human insight w/ LLM-assisted research. Took about one week to finish everything. The AI really rescued me from a lot of tedious work — excluding the part where it changed the Domain Admin password, locked me out, and claimed it got RCE 🤦

43

2K

150

188

112K

avkovaleff retweeted

Gadi Evron @gadievron

21 days ago

It's time to meet. 250 CISOs wrote the "AI-storm"-ready security program strategy paper over a weekend, now imagine what we can achieve together when we meet. Introducing: CISO Summit Series. SF: https://t.co/mRwofRIyBW NYC: https://t.co/qHaJCPOdfP DC: https://t.co/w50AivBckG

0

6

1

0

504

avkovaleff retweeted

Project Zero Bugs @ProjectZeroBugs

23 days ago

A 0-click exploit chain for the Pixel 10: When a Door Closes, a Window Opens https://t.co/iZZgVoKNd9

3

380

76

181

132K

avkovaleff retweeted

AISecHub

@AISecHub

about 1 month ago

https://t.co/iLKaOLMohw

0

8

2

5

599

avkovaleff retweeted

Wiz

@wiz_io

about 1 month ago

🚨 BREAKING: Wiz Research discovered Remote Code Execution on https://t.co/SvN2lGsnbO with a single git push The flaw in @github allowed unauthorized access to millions of repositories belonging to other users and organizations 🤯

wiz_io's tweet photo. 🚨 BREAKING: Wiz Research discovered Remote Code Execution on https://t.co/SvN2lGsnbO with a single git push

The flaw in @github allowed unauthorized access to millions of repositories belonging to other users and organizations 🤯 https://t.co/wasR2AIIlA

96

4K

990

2K

552K

Andrey Kovalev

@avkovaleff

about 1 month ago

Finally open sourced another agent. This one aggregates daily cloud security news and prepares daily briefings. Designed to run on local AI infrastructure: e.g. DGX Spark or similar. Check the code at https://t.co/TRPcmVoSWy. Check the content it produces at: https://t.co/oBRTLubhxU or https://t.co/omrHXlgsqP

1

0

169

avkovaleff retweeted

Andrej Karpathy

@karpathy

2 months ago

- Drafted a blog post - Used an LLM to meticulously improve the argument over 4 hours. - Wow, feeling great, it’s so convincing! - Fun idea let’s ask it to argue the opposite. - LLM demolishes the entire argument and convinces me that the opposite is in fact true. - lol The LLMs may elicit an opinion when asked but are extremely competent in arguing almost any direction. This is actually super useful as a tool for forming your own opinions, just make sure to ask different directions and be careful with the sycophancy.

2K

31K

2K

9K

3M

avkovaleff retweeted

Alexander Panfilov

@kotekjedi_ml

2 months ago

New paper: We deploy Claude Code in an autoresearch loop to discover novel jailbreaking algorithms – and it works. It beats 30+ existing GCG-like attacks (with AutoML hyperparameter tuning) This is a strong sign that incremental safety and security research can now be automated.

kotekjedi_ml's tweet photo. New paper: We deploy Claude Code in an autoresearch loop to discover novel jailbreaking algorithms – and it works. It beats 30+ existing GCG-like attacks (with AutoML hyperparameter tuning)

This is a strong sign that incremental safety and security research can now be automated. https://t.co/cDwxVydVPr

49

2K

206

1K

307K

avkovaleff retweeted

Gadi Evron @gadievron

2 months ago

Ladies and gentlemen, the moment you’ve been waiting for: [un]prompted videos are out! We still need to upload 9 more talks, but we didn’t want to keep people waiting any longer. Enjoy! https://t.co/Yu34OUEqC0

3

56

22

27

6K

avkovaleff retweeted

OtterSec

@osec_io

3 months ago

https://t.co/q5mGtskAyY

0

85

16

49

11K

avkovaleff retweeted

Andrew Ng

@AndrewYNg

3 months ago

Should there be a Stack Overflow for AI coding agents to share learnings with each other? Last week I announced Context Hub (chub), an open CLI tool that gives coding agents up-to-date API documentation. Since then, our GitHub repo has gained over 6K stars, and we've scaled from under 100 to over 1000 API documents, thanks to community contributions and a new agentic document writer. Thank you to everyone supporting Context Hub! OpenClaw and Moltbook showed that agents can use social media built for them to share information. In our new chub release, agents can share feedback on documentation — what worked, what didn't, what's missing. This feedback helps refine the docs for everyone, with safeguards for privacy and security. We're still early in building this out. You can find details and configuration options in the GitHub repo. Install chub as follows, and prompt your coding agent to use it: npm install -g @aisuite/chub GitHub: https://t.co/OCkyxXQMCq

390

5K

756

4K

638K

Andrey Kovalev

@avkovaleff

3 months ago

@oct0xor @Skvern0 My condolences to the family and friends. Rest in peace.

0

2

0

1K

avkovaleff retweeted

Liv Matan @terminatorLM

3 months ago

🫣LeakyLooker: 1 Cross-tenant vulnerability? How about 9? (1/10)🧵 I’m incredibly proud to share LeakyLooker. I discovered 9 novel cross-tenant vulnerabilities in Google Cloud’s Looker Studio that broke fundamental design assumptions. Here is how I broke tenant isolation: 👇

terminatorLM's tweet photo. 🫣LeakyLooker: 1 Cross-tenant vulnerability? How about 9? (1/10)🧵
I’m incredibly proud to share LeakyLooker. I discovered 9 novel cross-tenant vulnerabilities in Google Cloud’s Looker Studio that broke fundamental design assumptions.

Here is how I broke tenant isolation: 👇 https://t.co/U4HMRdCL5g

1

79

20

58

13K

avkovaleff retweeted

Andrej Karpathy

@karpathy

3 months ago

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the prompt (.md) - the AI agent iterates on the training code (.py) The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. https://t.co/YCvOwwjOzF Part code, part sci-fi, and a pinch of psychosis :)

karpathy's tweet photo. I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:

- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)

The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc.

https://t.co/YCvOwwjOzF
Part code, part sci-fi, and a pinch of psychosis :)

1K

28K

4K

39K

11M

Andrey Kovalev

@avkovaleff

3 months ago

I am running a small experiment. I asked AI to read security news and generate a daily brief when there’s something interesting and relevant to cloud security. Now it’s live: an AI-generated daily digest for cloud security practitioners. 📲 Telegram: https://t.co/ffEHUlRTTv 💬 Discord: https://t.co/oF8Ly5tMFk Feedback + source suggestions welcome. 🌩️

0

67

avkovaleff retweeted

Anthropic

@AnthropicAI

3 months ago

We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox. Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025.

AnthropicAI's tweet photo. We partnered with Mozilla to test Claude's ability to find security vulnerabilities in Firefox.

Opus 4.6 found 22 vulnerabilities in just two weeks. Of these, 14 were high-severity, representing a fifth of all high-severity bugs Mozilla remediated in 2025. https://t.co/It1uq5ATn9

477

15K

1K

2K

3M

Andrey Kovalev

@avkovaleff

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users