TheCoderBTW @TheCoderBtw - Twitter Profile

Pinned Tweet

1 day ago

Here is the latest project I'm working on right now:

Orrery - An autonomous AI coding-loop engine

With a live orbital visualizer to watch it run

Fully open source under the MIT License

Link is in the comments https://t.co/PM0mZGoeMx

1

0

49

TheCoderBTW

@TheCoderBtw

10 minutes ago

@theo Gonna try this😅

0

14

TheCoderBTW

@TheCoderBtw

20 minutes ago

@tmuxvim It gets me more than my girl gets me😅

0

5

TheCoderBTW

@TheCoderBtw

23 minutes ago

@haider1 How did OpenAI manage to do this? And Anthropic, where are you guys?

0

28

TheCoderBTW

@TheCoderBtw

25 minutes ago

@HamelHusain What can it do that Claude can't?

0

4

TheCoderBTW

@TheCoderBtw

28 minutes ago

@araseb_ Building in public

0

5

TheCoderBTW

@TheCoderBtw

29 minutes ago

@argofowl That's a wild take

0

24

TheCoderBTW

@TheCoderBtw

30 minutes ago

@anshnanda Do you have any before and after metrics on this?

0

1

0

459

TheCoderBTW

@TheCoderBtw

43 minutes ago

@theo @grok what's your opinion?

1

0

11

TheCoderBTW

@TheCoderBtw

about 2 hours ago

@argofowl Here we go again...

0

17

TheCoderBTW

@TheCoderBtw

about 2 hours ago

@morganlinton What are the benefits of this?

1

0

13

TheCoderBTW

@TheCoderBtw

about 2 hours ago

@0ximjosh Eventually they'll become the new game-changing product that you never intended to build

0

1

0

24

TheCoderBTW

@TheCoderBtw

about 2 hours ago

@bruvimtired @claudeai @cursor_ai Try Fable with the /advisor command. Keep us updated ;)

0

27

TheCoderBTW

@TheCoderBtw

about 2 hours ago

@kr0der @trq212 already mentioned this. They will definitely add Fable back to subscriptions when they figure out the compute capacity issues

0

22

TheCoderBTW

@TheCoderBtw

about 2 hours ago

@omarsar0 Absolutely amazing

0

3

TheCoderBTW

@TheCoderBtw

about 3 hours ago

@MattPenny99 @gregisenberg Why is that?

0

15

TheCoderBTW

@TheCoderBtw

about 4 hours ago

@DeepakNesss I'll look into this. Thank you

0

1

0

10

TheCoderBTW

@TheCoderBtw

about 9 hours ago

@PawelHuryn Why skip high on Fable?

1

0

54

TheCoderBTW

@TheCoderBtw

about 9 hours ago

@trq212 This is super useful. I'm currently using Fable as my orchestrator for almost every task. Fable with low effort acts as the sub-agents. Super nice

0

1K

TheCoderBTW

@TheCoderBtw

about 9 hours ago

@VulcanBench Agreed. Efficiency matters just as much as raw scores. Looking forward to seeing more models in your benchmarks

0

4

TheCoderBTW

@TheCoderBtw

about 10 hours ago

Showing token usage on benchmarks alongside model performance is going to be really useful

VulcanBench

@VulcanBench

about 12 hours ago

Okay, I'm just going to come out and say it. We have to start sharing token use alongside model performance.

I don't think benchmarks are as useful if you see one model is 6% more accurate than another, but don't know if one uses 600% more tokens than the other.

A good model should have a balance of accuracy, coupled with strong token use.

This is why I share this in all of my benchmarks. Take this one from yesterday. If you just looked at the results you would say, oh GLM 5.2 High ties Fable 5.2 Low and Sonnet 5 High.

But the reality is, to tie them both, it had to use 7,628% more tokens and the cost, 596% more.

Most benchmarks would just show all these models against each other with one accuracy score. This doesn't tell the whole story.

We can do better.
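
For readers who want to sanity-check the "X% more" figures quoted above, here is a minimal sketch of the arithmetic. The function names and the token counts are hypothetical and chosen only for illustration; they are not taken from VulcanBench's data or tooling. The one fact it relies on is that "100% more" means twice as much, so "7,628% more" corresponds to roughly 77x.

```python
def pct_more(value: float, baseline: float) -> float:
    """Return how many percent larger `value` is than `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (value - baseline) / baseline * 100.0


def pct_more_to_multiplier(pct: float) -> float:
    """Convert 'X% more' into a multiplier: 100% more means 2x."""
    return 1.0 + pct / 100.0


# Percentages quoted in the tweet, converted to multipliers.
token_multiplier = pct_more_to_multiplier(7628)  # about 77x the tokens
cost_multiplier = pct_more_to_multiplier(596)    # about 7x the cost

# Hypothetical token counts, for illustration only (not benchmark data).
baseline_tokens = 1_000
heavy_tokens = 77_280
print(f"{pct_more(heavy_tokens, baseline_tokens):.0f}% more tokens")
print(f"tokens: {token_multiplier:.2f}x, cost: {cost_multiplier:.2f}x")
```

Reading the numbers as multipliers makes the point of the tweet easier to see: two models with the same accuracy score can differ by well over an order of magnitude in tokens spent.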

6

52

9

4

4K

1

4

0

43

TheCoderBTW

@TheCoderBtw

about 10 hours ago

@synthwavedd my Pro sub expired June 25 waiting for exactly this. if it really lands July 7 I missed it by two weeks, classic

0

1

0

155
