James Brady @james_elicit - Twitter Profile

Two years ago we published our stance on AI in engineering interviews. Core question: does the candidate understand the code? That question still holds—but it's no longer where we gather our most interesting signal.

1

0

152

Who to follow

Reka

@RekaAILabs

An AI research and product company 🫠. We are a team of scientists and engineers building state-of-the-art multimodal models 😻

Ethan Perez

@EthanJPerez

Alignment team lead at Anthropic

Abhi Venigalla

@ml_hardware

Researcher @Databricks. Former @MosaicML, @CerebrasSystems. Addicted to all things compute.

James Brady

@james_elicit

2 months ago

Worst pattern: yolo --dangerously-skip-permissions, accept every diff without reviewing. The code kind-of-works but tells us nothing we couldn't learn from the model itself. Surprise finding: plan-first thinking is way rarer than expected. Most candidates jump straight to "make the model write code."

1

0

102

James Brady

@james_elicit

3 months ago

Same pattern as alarm fatigue in hospitals—when monitors beep constantly, nurses learn to ignore them. The safety system becomes the danger. Auto mode could be the right trade-off: let the model assess risk rather than prompting on everything. Cautiously hopeful!

0

91

James Brady

@james_elicit

3 months ago

Anthropic just announced auto mode for Claude Code permissions—I am unreasonably excited about this! For the last few weeks, Claude Code's permission system has been quietly training us to be **less** safe.

1

0

205

James Brady

@james_elicit

3 months ago

What the team actually did: • Attempt to preempt the permission model—custom allow/deny lists, hooks to block destructive commands, mining past transcripts for things we'd been approving on autopilot • Switched to pi ($1K in API tokens, still better than the prompt loop) • Moved work to remote agents (Devin, Niteshift) partly just to avoid the babysitting • Probably ran with --dangerously-skip-permissions without telling me, tbh The safety mechanism was producing the opposite of safety.

1

0

115

James Brady

@james_elicit

3 months ago

@ntkris Getting increasingly hard to find (although obvs not impossible) to find things which don't fit the latter two categories – at least in white-collar settinga

0

15

James Brady

@james_elicit

3 months ago

We're automating @elicitorg. When everyone goes on holiday in December, the company keeps running. Bugs triaged. Features shipped. Metrics dutifully updated by robots who don't know it's Christmas. Step one: figure out what to automate. I set up a Notion database, asked people to decompose their work into triggers, steps, outputs. Turns out this was the wrong question…

1

13

1

6

2K

james_elicit retweeted

Elicit

@elicitorg

3 months ago

The Elicit API is now available in preview for Pro and Teams users. You can search 138M+ papers and generate Research Reports from your code, scripts, or AI tools. Get your API key at https://t.co/Q3r5PggChI and check out https://t.co/yMxHqkK0Ob

elicitorg's tweet photo. The Elicit API is now available in preview for Pro and Teams users. You can search 138M+ papers and generate Research Reports from your code, scripts, or AI tools.

Get your API key at https://t.co/Q3r5PggChI and check out https://t.co/yMxHqkK0Ob https://t.co/rwtBWExpAM

4

38

7

22

8K

James Brady

@james_elicit

3 months ago

@VivaLaPanda @voooooogel *First* token was already bork for me!

0

1

0

20

James Brady

@james_elicit

3 months ago

@jerhadf I sent message IDs etc. to Brittney T

0

3

0

931

James Brady

@james_elicit

3 months ago

Is anybody else getting absolutely bonkers hallucinations from Claude!? I just tried to check a couple of things off my todo list 😅

james_elicit's tweet photo. Is anybody else getting absolutely bonkers hallucinations from Claude!?

I just tried to check a couple of things off my todo list 😅 https://t.co/BoYCFuyJkt

5

45

2

18

20K

James Brady

@james_elicit

3 months ago

@arm1st1ce Full transcript https://t.co/QWsRL40Cj9

3

12

0

4

1K

James Brady

@james_elicit

3 months ago

@ntkris We get a lot of value from spending time with customers to see what's *actually* slowing them down – pretty hard to shadow yourself though! … or is it easy? I don't know at this poitn

1

0

15

James Brady

@james_elicit

3 months ago

We're hiring for other people who don't want to do boring stuff! DM me or check out https://t.co/Q9I5XDyqu2

1

2

0

2

131

James Brady

@james_elicit

3 months ago

We're running a retreat next week to automate as many of these rote tasks as we can. Demos on Friday. More on what we learn soon. Starting lesson: don't ask people to map their jobs. Ask what they'd love to never do again. What's the most tedious part of your job?

1

0

1

131

James Brady

@james_elicit

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users