byron

@String_The0rist

Human being. Almost unique, though remarkably not.

Johannesburg, South Africa

Joined October 2016

459 Following

33 Followers

1.2K Posts

byron @String_The0rist

24 days ago

@Undeference @kai_fell @Jonathan_Blow Interesting. This study itself basically proves that it's own results are not generally useful unless you're using ChatGPT4o and I assume with the weightings tested at that time. Perhaps a tool that performs instantaneous benchmarks across all models would be quite useful.

0

0

0

0

17

byron @String_The0rist

25 days ago

@kai_fell @Jonathan_Blow It kinda actually is. Maybe not the "imagine" part. But priming the model's disposition and personality traits in a prompt can result in measurably improved outputs. https://t.co/FoKTVAUnEo

1

0

0

0

220

byron @String_The0rist

25 days ago

@Jonathan_Blow I would use codex (chatgpt's coding model), it still works as a language model, but is stricter and more literal in following instructions thoroughly

0

0

0

0

546

byron @String_The0rist

29 days ago

@bettercallsalva Interesting, Is that because of the larger context window? .. I've found that very detailed prompting (priming its mindset) is needed to prevent drift and breaking work down into chunks that won't exceed multiple context windows is essential.

1

0

0

0

7

Who to follow

Alexandre Almeida

@The_MRC Career Development Award fellow | @Cambridge_Uni @CamVetSchool | Bioinformatics, Genomics, Microbiome | https://t.co/69MnQYyAgL

Science Ph.D. Lived in MN/WA/NJ/NY/CA/IL/NM/VA/AR. I was able to retire at 56 since the S & P 500 was very good to me. Life goal: set foot on every continent.

@namanatulshukla

Founder @casuroai

byron @String_The0rist

29 days ago

Been working with Claude Opus and Sonnet 4.6 and 4.7 for about 4 months, and I've consistently been finding the value of a cross model deliberation workflow. Codex is also consistently better at converging on evidence based solutions than Claude.

String_The0rist's tweet photo. Been working with Claude Opus and Sonnet 4.6 and 4.7 for about 4 months, and I've consistently been finding the value of a cross model deliberation workflow. Codex is also consistently better at converging on evidence based solutions than Claude. https://t.co/j0TQLzTwop

1

0

0

0

40

byron @String_The0rist

29 days ago

@stelloprint @kunchenguid @theo Ah, ok. Wezterm behaves as the frontend to my custom agent-agnostic wrapper, so it would be hard to replace it without lua

0

1

0

0

22

byron @String_The0rist

30 days ago

@kunchenguid @theo yeah, i've been trying to avoid building tools and workflows that depend on third party plugins and solutions. does it work well for you though?

0

0

0

0

15

byron @String_The0rist

30 days ago

@lliu54827 @heygurisingh totally! by the way, do you know of any good ways to run slash commands in claude programmatically, like from a hook for example?

1

0

0

0

10

byron @String_The0rist

30 days ago

@kunchenguid @theo do you have scripts attached to opencode hooks? I didn't see any native hook support. I'm playing with openclaude, but so far, codex and claude cli offer native support. (you can override anthropic base url and point claude cli to deepseek or other compatible models

1

0

0

0

57

byron @String_The0rist

30 days ago

@stelloprint @kunchenguid @theo lua script supported on cmux?

1

0

0

0

20

byron @String_The0rist

30 days ago

@AishwaryaDevv Build your own. There is way more power in agility with the wild fluctuations right now, don't lock yourself in to any ecosystem yet. We're in a bubble right now. What works for me now: AI-agnostic CLI wrapper with custom built Wezterm interface. Use hooks for enforcement

0

1

0

2

544

byron @String_The0rist

30 days ago

@examaddaorg @X F) Takes way too long to produce mediocre at best, more often, buggy code. Codex has both speed and precision. Claude's output quality does not justify the time spent.

1

1

0

0

20

byron @String_The0rist

30 days ago

@examaddaorg @X E) Inconsistent performance. One day it works well, the next day it's lobotomised.

1

1

0

0

11

byron @String_The0rist

about 1 month ago

@jonasfroeller shhhhhh

0

1

0

0

40

byron @String_The0rist

about 1 month ago

@thsottiaux its back for me, thanks

0

0

0

0

424

byron @String_The0rist

about 1 month ago

@lliu54827 @heygurisingh true, hooks are the way. hard-block actions, use well written error messages as guidance to steer claude. hard-block superpowers skills and override with local versions to chain them in workflows. a single correct skill use like brainstorming ignites the workflow engine

1

0

0

0

46

byron @String_The0rist

about 1 month ago

@fcoury Not yet for me

0

0

0

0

63

byron @String_The0rist

about 1 month ago

@Jayyanginspires use openai's plugin for claude code. Ask claude to get codex to review its work, or run its questions past codex.

0

0

0

0

177

byron @String_The0rist

about 1 month ago

@LuxVeritasAeter @keithwhor @beezlebuddy lol

0

0

0

0

19

byron @String_The0rist

about 1 month ago

@keithwhor Claude Opus 4.7 has been really idiotic lately. Been trying to debug an issue with it for hours, and it just leads itself down a random goose chase following its own red herrings. I ask it to run it past codex and in one sweep codex obliterates Claude's entire belief system.

0

2

0

0

882

Last Seen Users on Sotwe

Trends for you

Most Popular Users