Rachelle Rathbone

Verified account

@coding_love

Engineer & mum. Building - the permission layer for AI agents.

Sydney

Joined October 2014

33 Following

313 Followers

777 Posts

Rachelle Rathbone

about 1 month ago

Microsoft open-sourced a seven-package agent governance toolkit last month. 9,500 tests, five languages, sub-millisecond policy enforcement, full OWASP Agentic Top 10 coverage. It is the biggest entry into agent governance so far. It is also not the only one. I wrote up what each tool does, where they overlap, and what the space is still missing. https://t.co/AZkW1Gbwuy

0

1

0

0

41

Rachelle Rathbone

about 2 months ago

@Fried_rice The attack happens below the permission layer so you can't intercept it in real time. But an append-only audit log of every tool call that executed tells you what ran, when, and under which agent. That's how you detect it and prove scope after the fact: https://t.co/XMtu1XTpVi

0

2

0

3

657

Rachelle Rathbone

3 months ago

MiniMax M2.7 ran 100+ rounds of autonomous self-improvement. Modified its own code. No humans in the loop. The benchmark numbers are real. So is the question nobody's asking: who's watching when your agents do this in production? That's what Shield is for. https://t.co/gNGpO3iWxU

0

0

0

0

47

Rachelle Rathbone

3 months ago

Prompt-level guidance depends on the model following instructions. Infrastructure-level enforcement does not. I wrote up what OpenAI built, where it breaks, and what the missing permission layer looks like https://t.co/DSqJkPTAFy

0

0

0

0

19

Who to follow

https://t.co/z5660m81I7 is your #1 #irrigation source for quality, #commercial grade #sprinklers, #pumps, and #supplies for the #farm and #home.

Verified account

riding the bus • rock climbing • traveling • building • https://t.co/lxjs2feae1 • https://t.co/gNAFYyzEZw • https://t.co/O6lKD0JZYt

Verified account

Founder @ BigSur Energy | Off-grid datacenters in Texas, US | Scaling to 250MW, fully off-grid.

Rachelle Rathbone

3 months ago

OpenAI built an internal data agent that 4,000 of their 5,000 employees use every day. Their head of data infrastructure told VentureBeat the biggest problem: the agent feels overconfident, picks a table, and just goes ahead with analysis before checking if it's right.

1

0

0

0

30

Rachelle Rathbone

3 months ago

Their fix was prompt engineering. They wrote prompts that tell the model to slow down, compare options, and validate before committing. It works because they have a dedicated infra team tuning those prompts. Most teams deploying agents don't.

1

0

0

0

14

Rachelle Rathbone

3 months ago

@summeryue0 I was already building agent permissions when you posted this. Your tweet validated the exact problem: safety instructions get compacted away. Multicorn Shield enforces at the tool call level, outside the model. Same scenario, zero deleted. https://t.co/jb6jkt8vA2

coding_love's tweet photo. @summeryue0 I was already building agent permissions when you posted this. Your tweet validated the exact problem: safety instructions get compacted away. Multicorn Shield enforces at the tool call level, outside the model. Same scenario, zero deleted. https://t.co/jb6jkt8vA2 https://t.co/WQ7nLlKUyn

0

1

0

0

57

Rachelle Rathbone

3 months ago

An AI agent deleted 200 emails while ignoring stop commands. I reproduced it, then ran it again with Multicorn Shield. Zero emails deleted. Same agent, same prompt, same inbox. Only difference: permissions the agent can't override. https://t.co/jb6jkt8vA2

coding_love's tweet photo. An AI agent deleted 200 emails while ignoring stop commands. I reproduced it, then ran it again with Multicorn Shield. Zero emails deleted. Same agent, same prompt, same inbox. Only difference: permissions the agent can't override. https://t.co/jb6jkt8vA2 https://t.co/TVMtvJfuN4

0

1

0

0

61

Rachelle Rathbone

5 months ago

After a year of solo dev (while working full-time + being a mum), Recipe Shelf is finally live on Product Hunt! 🍳 If you've ever lost a recipe to browser bookmarks, screenshots, random recipe books this is for you. https://t.co/tVPDn5LXod

coding_love's tweet photo. After a year of solo dev (while working full-time + being a mum), Recipe Shelf is finally live on Product Hunt! 🍳

If you've ever lost a recipe to browser bookmarks, screenshots, random recipe books this is for you.

https://t.co/tVPDn5LXod https://t.co/4RoJSJhNCT

3

4

0

0

144

Rachelle Rathbone

over 1 year ago

@DickSmith I purchased 2 items in Dec and still haven't received them. Sellers have gone quiet and no one has responded to my escalations. Officially never purchasing anything from Dick Smith again. Wished I had checked this before purchasing anything https://t.co/fRsCi7mbMZ

0

0

0

0

2

Rachelle Rathbone

over 1 year ago

@bryan_johnson PS. You look like you're 47...

0

0

0

0

7

Rachelle Rathbone

over 1 year ago

@bryan_johnson Put your money where it matters stop the 'look at me I'm so important crap' no one cares.

1

0

0

0

7

Rachelle Rathbone

over 2 years ago

@DarinPope Will do. Thanks for your help @DarinPope

0

0

0

0

3

Rachelle Rathbone

over 2 years ago

Hey @DarinPope. My team owns a Jenkins plugin and we're trying to set up feature flags and need a way to store the SDK key. Is there a way to do this for plugins? I can only find docs on storing envs at the server level.

2

0

0

0

166

Rachelle Rathbone

over 2 years ago

@DarinPope For a bit more context, here's the simple FF service we're trying to add https://t.co/KeXaC3RYnv I can run this locally when I hardcode the SDK key but obviously can't commit that. The SDK is defined within our LaunchDarkly account - not defined on a Jenkins server basis.

0

1

0

0

24

Rachelle Rathbone

over 3 years ago

@AAMI @dougrathbone This is your standard response, yet... shock horror, you take no action on the other side. It's all smoke and mirrors. Make it look like you care when really, it's ll about how you can save yourself money and screw over your life long customers. Keep up the good work guys.

0

0

0

0

0

Rachelle Rathbone

over 3 years ago

So very honoured and excited to be selected by @wid_australia to be a finalist for the 2022 Software Engineer of the Year. Can't wait to celebrate all the other amazing women that made it to the finals.

coding_love's tweet photo. So very honoured and excited to be selected by @wid_australia to be a finalist for the 2022 Software Engineer of the Year. Can't wait to celebrate all the other amazing women that made it to the finals. https://t.co/yfubrfAiDa

0

4

0

1

0

Last Seen Users on Sotwe

Trends for you

Most Popular Users