Dejan Honderd @Dej_h - Twitter Profile

15 days ago

@Tomodovodoo Damn, unfortunate about the deprecation. Maybe because of the knowledge cutoff it learned how to write before the AI-infestation lol

0

13

Dejan Honderd

@Dej_h

17 days ago

@Tomodovodoo How did you test it?

1

0

20

Dejan Honderd

@Dej_h

21 days ago

@ClaudeDevs This is huge for longer agent flows. Being able to keep the expensive context cached while changing the system instructions mid-run is basically made for deep research / coding agents.

0

1

0

72

Dejan Honderd

@Dej_h

21 days ago

@benhylak This is exactly why I love reading models’ reasoning traces while they’re working. Sad to see that most providers are trying to abstract/simplify that away now.

0

2

0

158

Dejan Honderd

@Dej_h

21 days ago

@Nathanone Haven't looked at PC development like that yet. But the main insight for me is to keep what you build model-agnostic & focus on everything around the model as well.

1

0

17

Dejan Honderd

@Dej_h

about 1 month ago

Building a retrieval/classification system made me distrust “use the best model”. Best for what? Sometimes embeddings fail, lexical search wins, reranking helps, or the benchmark lies. Find the failing layer before swapping models like Pokémon.

2

0

148
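A minimal sketch of the "find the failing layer" idea from the tweet above, assuming each pipeline stage (lexical, embedding, reranked) can be scored separately against a small labelled set. The stage names, query IDs, and document IDs are hypothetical toy data, and recall@k is just one possible metric:

```python
from typing import Dict, List, Set


def recall_at_k(ranked: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the relevant documents that appear in the top-k results."""
    if not relevant:
        return 0.0
    return len(set(ranked[:k]) & relevant) / len(relevant)


def stage_report(
    stages: Dict[str, Dict[str, List[str]]],
    gold: Dict[str, Set[str]],
    k: int,
) -> Dict[str, float]:
    """Mean recall@k per pipeline stage, averaged over every query in gold."""
    report = {}
    for name, results in stages.items():
        scores = [recall_at_k(results.get(q, []), rel, k) for q, rel in gold.items()]
        report[name] = sum(scores) / len(scores)
    return report


# Hypothetical toy data: two queries with one relevant document each.
gold = {"q1": {"d1"}, "q2": {"d7"}}
stages = {
    "lexical": {"q1": ["d1", "d3", "d4"], "q2": ["d7", "d2", "d9"]},
    "embedding": {"q1": ["d3", "d4", "d1"], "q2": ["d2", "d9", "d5"]},
    "reranked": {"q1": ["d1", "d3", "d4"], "q2": ["d2", "d9", "d5"]},
}

report = stage_report(stages, gold, k=1)
weakest = min(report, key=report.get)  # stage with the lowest mean recall
print(report, weakest)
```

Scoring the stages side by side shows where relevant documents drop out, which is a cheaper first check than swapping the underlying model.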

Dejan Honderd

@Dej_h

22 days ago

@AntoineGenesi @_catwu Smart, probably gives better ROI on your tokens.

0

1

0

76

Dejan Honderd

@Dej_h

22 days ago

@alexshander03 Agent-as-judge for long-horizon evals makes sense in theory. Any public benchmarks showing it beats LangSmith offline evals or MLflow trajectory scoring on real traces & datasets?

0

41

Dejan Honderd

@Dej_h

22 days ago

@OpenAIDevs Major development for building in-house MCP servers at companies safely without all the extra scaffolding for auth

0

1

0

362

Dejan Honderd

@Dej_h

24 days ago

Latency is currently 1–3s total depending on length. Might switch to local transcription models like Whisper later to try to bring it even lower. Very lightweight (~15 MB runtime), whole package only 1.38 MB zipped.

0

1

0

72

Dejan Honderd

@Dej_h

24 days ago

Watching @yacineMTB tweet so much using transcription made me realize how much typing is slowing my thinking. So I built my own simple native Windows transcription tool that pastes right where your cursor is focused. https://t.co/aY5khmSaKm

2

1

0

120

Dejan Honderd

@Dej_h

25 days ago

One example I’m working on is a single-stock analysis flow: resolve the listing, run a background deep-research job with an agentic loop, stream tool calls/progress to the user, then return a cited report and continue in chat against that company/report context. LangChain/LangGraph handles enough boring plumbing that I can focus on the research workflow itself.

0

58
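A rough sketch of the "stream progress, then return a report" shape described in the tweet above, using plain `asyncio` rather than LangChain/LangGraph. The `Event` and `Report` types, the step names, and the stubbed tool calls are all hypothetical; a real flow would call actual research tools and attach citations:

```python
import asyncio
from dataclasses import dataclass, field
from typing import AsyncIterator, List


@dataclass
class Event:
    kind: str  # "progress" or "report"
    payload: str


@dataclass
class Report:
    company: str
    findings: List[str] = field(default_factory=list)


async def research(company: str, steps: List[str]) -> AsyncIterator[Event]:
    """Run each (stubbed) research step, streaming progress, then the final report."""
    report = Report(company)
    for step in steps:
        yield Event("progress", f"running {step}")
        await asyncio.sleep(0)  # stand-in for a real tool call
        report.findings.append(f"{step}: done")
    yield Event("report", "; ".join(report.findings))


async def collect(company: str, steps: List[str]) -> List[Event]:
    """Consume the stream; a real UI would forward each event as it arrives."""
    return [event async for event in research(company, steps)]


events = asyncio.run(collect("ACME", ["filings", "news"]))
for event in events:
    print(event.kind, event.payload)
```

Modelling the job as an async generator lets the caller forward tool-call progress to the user while the final report arrives as the last event on the same stream.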

Dejan Honderd

@Dej_h

26 days ago

The deeper trap is that sycophancy makes the path of least resistance feel productive. The model adds nuance and a clarifying question so it feels like a real exchange, then ships code & writes some basic tests. You don't notice you stopped thinking, because nothing externally flags it.

Dejan Honderd

@Dej_h

26 days ago

@addyosmani Doing zero/minimal critical thinking and just delegating to agents all day also sucks the joy out of engineering.

0

148

0

1

0

112

Dejan Honderd

@Dej_h

26 days ago

@JohnGal43951639 Not per se a benchmark, but I really like the sheer size of the Uco3D benchmark and variety of data per entry for 3D reconstruction & segmentation. https://t.co/afOBmqQ28C

0

1

0

25

Dejan Honderd

@Dej_h

27 days ago

@JohnGal43951639 Thank you, haven't seen MTEB before, I recognise most of the top models, but am excited to try/benchmark a few I haven't used before on the domain dataset.

0

1

0

33

Dejan Honderd

@Dej_h

27 days ago

@levelsio @dcbuilder Running that exact routing stack on my self-hosted Linux box. Tailscale lets me SSH & redeploy my server even from my phone using Termux tasks lol. CF is there to expose only the ports you want people to access + all the nice anti-bot/attacker tools it gives for free.

0

1

110

Dejan Honderd

@Dej_h

29 days ago

@alexfredo87 Looks good, when is the repo coming out?

0

1

0

258

