shaurya @0oAstro - Twitter Profile

@scaling01 better than gpt-5.5 on swe bench pro, ehh? also svg-bench being better than opus, that too being much smaller than opus. big if true and the capabilities carry on and not just be another "bench-maxxed" model. BrowserComp looks promising for computer use agents

0

838

shaurya

@0oAstro

4 days ago

@theo openclaw aint arch, nanobot maybe.

0

1

0

171

shaurya

@0oAstro

5 days ago

@tenobrus Got the jackpot ehh?

0

1

0

82

shaurya

@0oAstro

5 days ago

The truth is... I am Iron Man.

Chris

@ChrissGPT

5 days ago

Hell yeahhhh

25

76

1

13

15K

0

1

0

52

shaurya

@0oAstro

6 days ago

@julien_c ill do one better % of difference between Opus 4.6 and Opus 4.8 :)

0

1

0

4K

shaurya

@0oAstro

7 days ago

@paulg @agupta Just have both models talk to each other solving things around

0

40

shaurya

@0oAstro

7 days ago

@benhylak say hello to self-hosted bifrost, your own private llm gateway

0

4

0

1

286

shaurya

@0oAstro

7 days ago

@LLMJunky thats more than 32k$ compared to the api pricing just for ref. (might be much more than that)

0

1

0

196

shaurya

@0oAstro

7 days ago

@yacineMTB i did a similar thing but albeit with obsidian markdown files as source of truth for website content, yours is definitely much cooler. https://t.co/qdc60gpbhW

0

513

shaurya

@0oAstro

7 days ago

all this for 100$, absolutely insane.

0

1

0

41

shaurya

@0oAstro

8 days ago

@thsottiaux Speaking on everyone's behalf, this still makes up a case for limit reset

0

334

shaurya

@0oAstro

8 days ago

@fardeentwt You forgot the furry suit part

0

1

0

174

shaurya

@0oAstro

8 days ago

@photon_hq what will a GTM do with those coding subs

1

2

0

468

shaurya

@0oAstro

9 days ago

@mil000 tbf if you are not using all the bells and whistles of salesforce + have not done a slop work on the codebase, it is pretty easy task to do

0

116

shaurya

@0oAstro

9 days ago

@scaling01 Claude code aint even that great of a harness to begin with + these models especially deepseek really shine when you turn into a subagents hoard trying to solve a problem simply because they are so cheap to run

0

3

0

132

shaurya

@0oAstro

9 days ago

@scaling01 Mythos 90% seems a stress tho (an ignorant being who is yet to try out mythos and is basing his views based on what his friends using it said)

0

2

0

723