Brian Turcotte @coldopn - Twitter Profile

1 day ago

That tracks with the test. Grok Build 0.1 was strong on this specific bug-detection task and cheap doing it, but detection and full feature implementation across a repo are different jobs, and the ranking changes depending on which one you're asking for. Opus carrying the heavier repo-understanding work matches what we see too. It's the reason Code Reviewer keeps all of them on tap instead of betting on one. Use Grok where it's sharp, reach for Opus where it isn't.

1

0

52

Brian Turcotte

@coldopn

1 day ago

Grok Build 0.1 cost $0.08 and outperformed 4 frontier models on the hardest bug in our code review test. It also tied for second place on overall detection against Opus 4.8, GPT-5.5, Gemini 3.1 Pro, and Sonnet 4.6. More expensive no longer equals better.

https://t.co/E0mqBjAWtU

7

57

3

9

10K

Brian Turcotte

@coldopn

1 day ago

Exactly right. No single model catches everything, and the misses don't fully overlap, so running two or three in review surfaces more than any one of them alone. That's the whole reason Code Reviewer is model-agnostic. Pick a few, let them disagree, and the disagreements are usually where the real bugs are.

0

1

0

42
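The "let them disagree" idea in the post above can be sketched in a few lines. This is only an illustration, not how Code Reviewer works internally: the function name `merge_findings`, the model names, and the finding identifiers are all hypothetical, and it assumes each reviewer's output has already been reduced to a set of comparable finding IDs.

```python
from collections import defaultdict


def merge_findings(findings_by_model):
    """Split findings into those every model reported and those only some did.

    findings_by_model maps a model name to a set of finding IDs
    (hypothetical identifiers, e.g. "auth.py:42:null-deref").
    Returns (agreed, disputed), where disputed maps each finding
    to the sorted list of models that reported it.
    """
    reporters = defaultdict(set)
    for model, findings in findings_by_model.items():
        for finding in findings:
            reporters[finding].add(model)

    all_models = set(findings_by_model)
    agreed = {f for f, models in reporters.items() if models == all_models}
    disputed = {
        f: sorted(models)
        for f, models in reporters.items()
        if models != all_models
    }
    return agreed, disputed


# Placeholder data, not taken from the test described in the thread.
example = {
    "model_a": {"bug1", "bug2"},
    "model_b": {"bug2", "bug3"},
}
agreed, disputed = merge_findings(example)
print("agreed:", agreed)
print("disputed:", disputed)
```

The disputed set is the part a human reviewer would look at first, since it holds everything at least one model missed.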

Brian Turcotte

@coldopn

1 day ago

Full breakdown with per-category detection rates, cost per catch, and what each model got right and wrong: https://t.co/bFc41XKQKA Tested inside Code Reviewer in @kilocode: 500+ models, pay per token at provider cost, and swap whenever you want.

0

4

0

271

Brian Turcotte

@coldopn

1 day ago

The cost math is what makes this hard to ignore. Grok and Sonnet tie on detection, but Grok gets there for about a third of the price - roughly 3x cheaper per catch than Sonnet, and nearly 9x cheaper than Opus. The usual logic where cheap means less useful just doesn't apply here.

1

0

317
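For readers who want to redo the cost math above with their own runs, here is a minimal sketch. The function names and the sample figures are assumptions for illustration only; the per-model costs and catch counts from the actual test are in the linked breakdown, not here. It shows why a tie on detection makes the cost-per-catch ratio equal to the price ratio.

```python
def cost_per_catch(total_cost_usd, bugs_caught):
    """Average spend per detected bug for one review run."""
    if bugs_caught <= 0:
        raise ValueError("bugs_caught must be positive")
    return total_cost_usd / bugs_caught


def times_cheaper(cheap_cost_per_catch, pricey_cost_per_catch):
    """How many times cheaper the first model is per catch."""
    return pricey_cost_per_catch / cheap_cost_per_catch


# Placeholder numbers, NOT the figures from the test in the thread.
# Both hypothetical models catch the same number of bugs (a tie on
# detection), so the per-catch ratio equals the ratio of total cost.
cheap = cost_per_catch(total_cost_usd=2.0, bugs_caught=4)
pricey = cost_per_catch(total_cost_usd=6.0, bugs_caught=4)
print(times_cheaper(cheap, pricey))
```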

Brian Turcotte

@coldopn

1 day ago

Let's goooooo! 🔥

Scott Breitenother

@s_breitenother

1 day ago

Kilo Code v7 for VS Code is @ProductHunt's #1 Product of the Month for May, and three of the top four spots in the OSS category for the year so far are also from @kilocode. 🎉 Thank you to every developer who voted us up. It genuinely made our month!

https://t.co/xrYIKwtqpZ

0

7

0

533

0

1

0

154

Brian Turcotte

@coldopn

1 day ago

@olearycrew Setting a calendar reminder to check the pop on day one

0

2

0

29

coldopn retweeted

Bryan Catanzaro

@ctnzr

2 days ago

Nemotron 3 Ultra is now the best open weight model on https://t.co/EJXiSfWv2O 💚

16

375

40

62

55K

Brian Turcotte

@coldopn

6 days ago

@alibaba_cloud So excited to be a part of this! Thanks to the @alibaba_cloud team for hosting me! 🚀☺️

0

1

0

24

Brian Turcotte

@coldopn

6 days ago

This is why I think the model ecosystem will only keep expanding. When you can compete with other frontier models on quality at a fraction of the price, it's a signal, not an outlier. @xai shipping a model this good and cost-efficient means the frontier isn't one lab's to own anymore.

0

25

3

1

3K

Brian Turcotte

@coldopn

6 days ago

@olearycrew Are you gonna do it with the Transatlantic accent too?

1

0

9

coldopn retweeted

Brendan O’Leary

@olearycrew

6 days ago

What if next time I'm on stage I just give this presentation word for word in the same voice? Would anyone notice? https://t.co/U1gIKINgFY

2

1

540

Brian Turcotte

@coldopn

7 days ago

This one's been on the request list for a while. A lot of you are already on X Premium+ or SuperGrok and have been asking how to get those models into Kilo without paying for them twice. Now you can. Grok Build 0.1 included.

Kilo

@kilocode

7 days ago

grok-build-0.1 is available in Kilo right now. Built for speed and agentic coding. If you have SuperGrok or X Premium+, you can route to it from the Kilo IDE extension or CLI. Go break something interesting. https://t.co/ICLE1GDzXU

https://t.co/UDYtykeh6t

4

64

5

8

6K

0

2

0

123

Brian Turcotte

@coldopn

8 days ago

The whole app took less than 5 minutes and $0.35 to create. It’s crazy how far coding models have come since this time last year. Shoutout to @xai 🚀

0

1

0

123

Brian Turcotte

@coldopn

8 days ago

Grok Build 0.1 just blew my mind in Kilo Code. $0.35 for a 3D, holographic @SpaceX Starship simulator. 1 prompt with the rocket specs. 1 prompt with the launch animation details. Pretty remarkable stuff from the @xai team.

6

39

5

13

11K

Brian Turcotte

@coldopn

8 days ago

My second prompt instructed Kilo on the launch animation. I told it that I wanted the camera to follow the rocket until the booster separated, then follow the booster back to the landing pad. Another one-shot.

1

0

166

Brian Turcotte

@coldopn
