Daniel Hails @djrhails - Twitter Profile

9 days ago

@cryptocode24 @METR_Evals It solves the problem in an unintended way - e.g. when asked to give a 6 word short story, it instead googles famous stories and uses one it found.

0

7

0

2K

Daniel Hails

@djrhails

13 days ago

@ZimingLiu11 Would be interesting in seeing an example (although I get the 108 are the special sauce!) if would really help to understand the impact!

0

1

0

264

Daniel Hails

@djrhails

19 days ago

@kernel_trick @catboosted Ant fellows are full time

0

1

0

26

Daniel Hails

@djrhails

21 days ago

@TheZvi In the domain of search - it's just saying Best@N works + gets higher results. Best@N with multiple models works even better.

0

1

0

313

Who to follow

Clark Warren Photography

@ClarkWarrenPho1

Amateur Photographer from Somerset UK .. Nikon D850 , sigma 150-600mm Nikon 50mm also my instagram is cl4rkwarren

22 days ago

@m_bourgon Hey appreciated; agree with the reply / reply-all semantics would be nice to be built in - @steipete can weigh in - but if you’re using an agent using send with reply-to-message-id is not too bad. Or having to specify subject.

0

1

0

32

Daniel Hails

@djrhails

22 days ago

@airshaped @elliotarledge No

0

1

0

188

Daniel Hails

@djrhails

28 days ago

@tylerangert You can do this with juicefs and litestream - works great.

0

4

0

4

1K

Daniel Hails

@djrhails

28 days ago

@eliebakouch It’s IP geolocation normally.

0

104

Daniel Hails

@djrhails

about 1 month ago

@euboid Please add confidence intervals - and rank by lower quartile - I beg!

0

92

Daniel Hails

@djrhails

about 1 month ago

@levie Honestly completely disagree - it’s way more likely that we kick the humans out of the loop - why do we need to wait on human triage if the fix is trivial and low risk?

0

1

0

75

Daniel Hails

@djrhails

about 2 months ago

@banteg Webhooks + pi extension.

0

44

Daniel Hails

@djrhails

about 2 months ago

@larsencc Why not use a proxy rewrite rather than pre signed urls? Would be way easier no?

0

20

Daniel Hails

@djrhails

2 months ago

@Altimor Super interesting - the V2 confuses me slightly - did you manage to get the ‘tool search’ to ignore the previous context and existing system prompts when making a decision? Do you then roll-back the now extended prefix with the final decision inline

0

339

Daniel Hails

@djrhails

2 months ago

@dqnamo @Vercantez Agent inside the sandbox gives you the most self-modification power. Agent outside the sandbox gives you the most control and easier reasoning about security boundaries. No idea which is better yet - I’ve done one of each and liked/disliked both.

0

37