mufeed vh @mufeedvh - Twitter Profile

6 days ago

one of them has been disclosed with a CVE. more to come. besides, what’s google cooking? mythos? their own gemini variant? interesting times. https://t.co/zBjQDGig2O

mufeed vh

@mufeedvh

19 days ago

We're doing an experiment with open models @winfunction to see how far we can push them to find vulns in hardened targets. So far: - $4.5K in bounties from Chrome VRP with a few more pending, with the scans costing less than $100. - 2 CVEs in NGINX (CVE-2026-28755 & CVE-2026-42926). And watch out for the next release! - And 60ca500faea0fc70816bb9c53af3815e2af3e6c962b4b4ea63c33c62ebb4240d 👀 We're writing a blog on this soon.

mufeedvh's tweet photo. We're doing an experiment with open models @winfunction to see how far we can push them to find vulns in hardened targets. So far:

- $4.5K in bounties from Chrome VRP with a few more pending, with the scans costing less than $100.

- 2 CVEs in NGINX (CVE-2026-28755 & CVE-2026-42926). And watch out for the next release!

- And 60ca500faea0fc70816bb9c53af3815e2af3e6c962b4b4ea63c33c62ebb4240d 👀

We're writing a blog on this soon.

5

101

13

46

12K

0

19

3

2

3K

mufeed vh

@mufeedvh

13 days ago

@jmedeiros1337 @winfunction yes! we're currently running an experiment where we use "tiny" open models to find 0days in hardened software. will publish the blog soon! :) https://t.co/tCDmgF2tdk

mufeed vh

@mufeedvh

19 days ago

We're doing an experiment with open models @winfunction to see how far we can push them to find vulns in hardened targets. So far: - $4.5K in bounties from Chrome VRP with a few more pending, with the scans costing less than $100. - 2 CVEs in NGINX (CVE-2026-28755 & CVE-2026-42926). And watch out for the next release! - And 60ca500faea0fc70816bb9c53af3815e2af3e6c962b4b4ea63c33c62ebb4240d 👀 We're writing a blog on this soon.

5

101

13

46

12K

1

3

0

370

mufeed vh

@mufeedvh

13 days ago

We discovered the same vulnerability too. :) And @winfunction discovered 4 more remote RCE primitives in NGINX soon to be publicly disclosed. Anywho, we're hiring security researchers with a knack on taming LLMs. If you're interested in novel vulnerability research and autonomous exploitation with language models, DM me and I'll send you a fun CTF to solve. :)

mufeedvh's tweet photo. We discovered the same vulnerability too. :)

And @winfunction discovered 4 more remote RCE primitives in NGINX soon to be publicly disclosed.

Anywho, we're hiring security researchers with a knack on taming LLMs.

If you're interested in novel vulnerability research and autonomous exploitation with language models, DM me and I'll send you a fun CTF to solve. :)

Nebula Security

@nebusecurity

15 days ago

Introducing nginx-poolslip, a fresh RCE for the the latest nginx release 1.31.0. nginx-rift has been patched, but our security agent Vega has found a new 0 day. We will release the full technical writeup with ASLR bypass 30 days after the patch on https://t.co/LAhOC5UHrp.

28

1K

258

797

477K

7

106

18

43

21K

mufeed vh

@mufeedvh

13 days ago

@winfunction just gonna put this here. and watch out for the next release. https://t.co/UX4f4sAQRQ

1

0

1

550

Who to follow

MS8

@MohammedShine8

Chapter Lead @AutoSecResGroup-KERALA Information Security Learner/Bugbounty Hunter. Tweets are mine and doesn't represent my employer

Vijith Vellora

@vijithvellora

Security Analyst | Former Assistant commander @Cyberdomekerala 🇮🇳 | Hacker by Profession & Passion. Acknowledged by Google, Sony, Nokia, AT&T, IBM, Etc.

Thejus!

@Tjz_kr1

security analyst | a rational mind

mufeed vh

@mufeedvh

19 days ago

@0xAsm0d3us ofc, would love to! will dm you.

0

1

0

89

mufeed vh

@mufeedvh

19 days ago

a long-overdue life update: i moved to bangalore!

2

30

0

1

1K

mufeed vh

@mufeedvh

19 days ago

@k7agar yes sir, scouting both offices and talent! and def, dming you.

0

1

0

95

mufeed vh

@mufeedvh

19 days ago

@0xAsm0d3us @winfunction thanks! 🫡❤️

0

1

0

423

mufeed vh

@mufeedvh

19 days ago

We're doing an experiment with open models @winfunction to see how far we can push them to find vulns in hardened targets. So far: - $4.5K in bounties from Chrome VRP with a few more pending, with the scans costing less than $100. - 2 CVEs in NGINX (CVE-2026-28755 & CVE-2026-42926). And watch out for the next release! - And 60ca500faea0fc70816bb9c53af3815e2af3e6c962b4b4ea63c33c62ebb4240d 👀 We're writing a blog on this soon.

5

101

13

46

12K

mufeedvh retweeted

mufeed vh

@mufeedvh

19 days ago

We're doing an experiment with open models @winfunction to see how far we can push them to find vulns in hardened targets. So far: - $4.5K in bounties from Chrome VRP with a few more pending, with the scans costing less than $100. - 2 CVEs in NGINX (CVE-2026-28755 & CVE-2026-42926). And watch out for the next release! - And 60ca500faea0fc70816bb9c53af3815e2af3e6c962b4b4ea63c33c62ebb4240d 👀 We're writing a blog on this soon.

5

101

13

46

12K

mufeed vh

@mufeedvh

21 days ago

oh that's me! the first on the list is our 4th finding in NGINX. maybe we should do more of this @winfunction.

wavefnx

@wavefnx

21 days ago

The software that runs in the veins of modern society is fragile, every proper Engineer knows that, C just makes it worse. This affects 0.6.27 ... 1.30.0, so pretty much everything until yesterday. I know some of you are still using affected versions so update to 1.31.0.

wavefnx's tweet photo. The software that runs in the veins of modern society is fragile, every proper Engineer knows that, C just makes it worse.

This affects 0.6.27 ... 1.30.0, so pretty much everything until yesterday.

I know some of you are still using affected versions so update to 1.31.0. https://t.co/432ewlIjjX

2

27

1

6

17K

1

10

1

0

738

mufeed vh

@mufeedvh

about 1 month ago

@Tocelot 🙂‍↕️❤️

0

45

mufeed vh

@mufeedvh

about 1 month ago

Love the Claudia reference as the first thing here. We loved working on Claudia but couldn't balance working on security research projects and Claudia at once. Fun fact, we invented "SKILLS" before it was even a thing. There was a feature in Claudia called "AGENTS" where users could share and install system prompts for specific tasks via their GitHub repos, just like the skill marketplaces concept in Claude now. See here: https://t.co/86a0PTryFZ And Anthropic did talk to us after the launch of Claudia but unfortunately I can't reveal more about it but damn was it some tough decision.

mufeedvh's tweet photo. Love the Claudia reference as the first thing here.

We loved working on Claudia but couldn't balance working on security research projects and Claudia at once.

Fun fact, we invented "SKILLS" before it was even a thing. There was a feature in Claudia called "AGENTS" where users could share and install system prompts for specific tasks via their GitHub repos, just like the skill marketplaces concept in Claude now.

See here: https://t.co/86a0PTryFZ

And Anthropic did talk to us after the launch of Claudia but unfortunately I can't reveal more about it but damn was it some tough decision.

Jon Lai

@Tocelot

about 1 month ago

a16z @speedrun request for startups: GUIs for Agents we’re still in the MS-DOS era of agents today - CLI, terminal sessions, file directories deleted by openclaw etc. while a small slice of silicon valley are power users, we're SO early for the rest of the world at Speedrun, we’re looking for bold founders excited to bring the power of agents to normies everywhere. there's a whole slew of products to be built here - from agent builders to marketplaces to managed infrastructure one broad idea we’re excited about are visual abstraction layers for agents. if you don't know exactly what you want, a command line / chat interface is paralyzing - you need to see options 1 example - think of a GUI or visual command center inspired by strategy games (ex. Factorio) where agents and workflows are represented graphically. skills, tools, MCP connections, background processes, etc could all be configured and shown visually in a workspace on UX, strategy games have long perfected agent management. zoom to get a birds-eye view of your agents, batch and queue orders via shortcuts, assign agents in multiplayer etc. a well-designed agent command center would make multi-agent orchestration for normies feel easy & intuitive most folks today still haven't moved beyond ChatGPT. the potential is enormous - just as Windows unlocked mass-market use of personal computers, the right visual abstraction layer could unlock agentic work for everyone - from individuals to enterprise teams if you share our vision, we'd love to chat!

279

1K

92

1K

198K

2

15

3

1

2K

mufeed vh

@mufeedvh

about 1 month ago

@Tocelot @speedrun author of claudia here, love the reference! https://t.co/Hhnk1mgp8X

mufeed vh

@mufeedvh

about 1 month ago

Love the Claudia reference as the first thing here. We loved working on Claudia but couldn't balance working on security research projects and Claudia at once. Fun fact, we invented "SKILLS" before it was even a thing. There was a feature in Claudia called "AGENTS" where users could share and install system prompts for specific tasks via their GitHub repos, just like the skill marketplaces concept in Claude now. See here: https://t.co/86a0PTryFZ And Anthropic did talk to us after the launch of Claudia but unfortunately I can't reveal more about it but damn was it some tough decision.

2

15

3

1

2K

0

5

1

0

292

mufeed vh

@mufeedvh

about 2 months ago

@Felix_Josemon @gregisenberg very soon :)

0

1

0

22

mufeedvh retweeted

mufeed vh

@mufeedvh

about 2 months ago

During our YC (@ycombinator S24) batch, we had the awesome opportunity to meet @paulg and talk about what we're building: An autonomous AI hacker. To showcase a fun demo, I remember opening my laptop in the Uber to his home and challenging our agents to find vulnerabilities in the old HackerNews codebase written in Arc. For those unfamiliar, Arc is a programming language designed by PG and Robert Morris. And the old HN codebase is written in Arc. We only got to talk about it with him but we just redid the experiment with our improved harness for fun! And we wrote a blog about it: https://t.co/IxVhtqDjSg

1

17

2

1

1K

mufeed vh

@mufeedvh

about 2 months ago

During our YC (@ycombinator S24) batch, we had the awesome opportunity to meet @paulg and talk about what we're building: An autonomous AI hacker. To showcase a fun demo, I remember opening my laptop in the Uber to his home and challenging our agents to find vulnerabilities in the old HackerNews codebase written in Arc. For those unfamiliar, Arc is a programming language designed by PG and Robert Morris. And the old HN codebase is written in Arc. We only got to talk about it with him but we just redid the experiment with our improved harness for fun! And we wrote a blog about it: https://t.co/IxVhtqDjSg

1

17

2

1

1K

mufeedvh retweeted

Dwayne

@CtrlAltDwayne

about 2 months ago

Everyone is talking about Mythos, but GPT-5.4 is actually shaping up to be a more capable model than people realize. This N-Day bench has GPT-5.4 at the top, followed by GLM-5.1 and interestingly beating Opus 4.6 so far. Crazy to think Spud is a bigger leap than this.

CtrlAltDwayne's tweet photo. Everyone is talking about Mythos, but GPT-5.4 is actually shaping up to be a more capable model than people realize. This N-Day bench has GPT-5.4 at the top, followed by GLM-5.1 and interestingly beating Opus 4.6 so far. Crazy to think Spud is a bigger leap than this. https://t.co/6fKcv0VVzg

6

65

2

5

4K

mufeedvh retweeted

winfunc

@winfunction

about 2 months ago

Vulnerability benchmarks rot. Cases leak into training data, scores measure memorization. We built N-Day-Bench: tests LLMs on finding real vulnerabilities in real repos, refreshed monthly from live GitHub advisories. Blinded judging. All traces public. Very interestingly, the latest model from @Zai_org, GLM 5.1 performs really well! Link: https://t.co/K3foq0DfMt

winfunction's tweet photo. Vulnerability benchmarks rot. Cases leak into training data, scores measure memorization.

We built N-Day-Bench: tests LLMs on finding real vulnerabilities in real repos, refreshed monthly from live GitHub advisories. Blinded judging. All traces public.

Very interestingly, the latest model from @Zai_org, GLM 5.1 performs really well!

Link: https://t.co/K3foq0DfMt

2

7

3

839

mufeed vh

@mufeedvh

about 2 months ago

seconding this. have seen "fix" commits for vulns with zero impact but in claude's words, a critical in all caps. with the same model being the triager now, it's out there with a CVE tag. some findings are at best "hardening", it ain't bad per se, best practice or whatever. a solid exploit poc should be conveying impact, these cvss scores don't cut it.

1

0

59

mufeed vh

@mufeedvh

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users