Niko E. @nefthy - Twitter Profile

1 day ago

@benhylak There is more than one way to have an agent use a browser. That is not going to be a differentiator. Codex is still great.

0

141

Niko E. @nefthy

1 day ago

@GaryMarcus @Kasparov63 Prices have been consistently going up, not down in subsequent releases.

0

2

Niko E. @nefthy

2 days ago

@mattpocockuk I just do "please resolve this merge conflict" and it just works most of the time with GPT 5.5. I'd be are curious to know what else you add.

0

24

Niko E. @nefthy

2 days ago

@ryancarson I use the older flash lite versions for structured data extraction tasks, but for agent work I find the tool calls reliability is unacceptable.

0

6

Who to follow

anthony v martinez

@vntTony562

est.96 forever. PACKERS -Dodgers-Lakers, UFC #GoPackGo #Local1607

Alex

@OSAT8867

https://t.co/Y3d92wUQFj

2 days ago

@Yuchenj_UW There are no signs of the Gemini models getting usable for agents on the horizon.

0

6

Niko E. @nefthy

3 days ago

@thsottiaux No

0

1

0

4

Niko E. @nefthy

3 days ago

@kr0der Try Kimi 2.6 for copy, Opus is not even close.

0

8

Niko E. @nefthy

4 days ago

@gilbert_jc ARR is driven by corporate decisions. They don't necessarily reflect model quality. Jira has a very high ARR and is a disaster of a product. It just appeals to CTOs

0

2

0

48

Niko E. @nefthy

4 days ago

@enjojoyy I rarely get it to do anything longer than 15m. But tbh. I'm pretty much in the loop so I don't mind not having to go through 10k lines of code.

0

10

Niko E. @nefthy

5 days ago

@catalinmpit Isn't fail2ban a bit superfluous if you have passwords disabled?

0

11

Niko E. @nefthy

5 days ago

@thegenioo And you had no version control to undo the slop?

1

0

139

Niko E. @nefthy

5 days ago

@shikhr_ 4.10 is a choice

0

2

Niko E. @nefthy

5 days ago

@thsottiaux Just keep it easy on the pricing. The last hikes where 🌶️

0

76

Niko E. @nefthy

6 days ago

@thsottiaux Benchmarks are not trustworthy. Gemini models do good in benchmarks, but they are not unusable in coding agents. I try stuff myself.

0

10

Niko E. @nefthy

6 days ago

Bye bye and good luck🤞

0

7

0

324

Niko E. @nefthy

7 days ago

@JuiceSharp @mattpocockuk Agree very much about tests and code in the same session is bad. I find that forcing the agent to do proper TDD helps a lot to get better tests. It tends to assume that the original test was correct and the code needs fixing was more than of it does all the tests at the end.

0

16

Niko E. @nefthy

9 days ago

@masonddudley @theo That presupposes a design system, but once you got it, it's a delight.

0

4

Niko E. @nefthy

9 days ago

@masonddudley @theo I created an AI slop skill which is more out less a documentation for the frontend components of the design system we use and it started producing great results. Create a quick excalidraw sketch of what you want and get a ready usable competent.

1

0

58

Niko E. @nefthy

10 days ago

@KaiXCreator I have preferences at points in time, and I pick the best model I can get my hands on. I'd recommend leaving the fan things for team sports.

0

16

Niko E. @nefthy

10 days ago

@rxhit05 My theory is that it's like football, you stay with your team no matter what. Also for corporate environments deepseek feels riskier than anthropic because there is no big company backing it and China. Not very rational considering the amount of rugs pulled by anthropic.

0

146

Niko E.

@nefthy

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users