“Why did we go with this auth approach again?”
You dig through old chats trying to remember the reasoning.
Even with .cursorrules and manual notes, things slip.
The “why” lives in too many places.
How are you capturing decision history these days?
Been seeing a lot of AI products adopt interactive no-code graph interfaces for building workflows and agents.
Is this the future of AI products, with conversational interfaces adopting them too?
@thsottiaux Benchmarks are useful as a reference point. They’re my first impression, and really just an assumption of how a model should perform. My preference is always to try the model in my own workflows and read real user experiences. That’s what actually tells you if it’s valuable.
@theo They’re just dropping the un-nerfed version of Opus 4.7 (which was already the un-nerfed version of 4.6) anyway. You’re not really missing out on much, tbh…
My Claude Code sub expires tomorrow. I barely use it, but I still had it installed on my Windows PC so I used it to debug some crashing earlier.
They hard cut me off over 24 hours early.
@cjzafir DeepSeek V4 over GLM 5.1? (I see Kimi as more of a DeepSeek-style alternative.) Also, how do you handle long sessions in codex? I’ve noticed context degradation after several hours. How often do you start a fresh session, whether by duration or by amount of work?
I fine-tuned a 6B model for under $250 with Codex 5.5 and Deepseek v4 pro.
The model beat GPT-OSS 120B and Qwen 3-32B on all benchmarks.
This would've cost me $5,000+ if Deepseek v4 wasn't here.
The Codex 5.5 pro plan is enough to run 6-8 hr sprints as an orchestrator, using the Deepseek v4 pro model to hand-write training material, run tests, submit reports, and iterate.
This opened up a lot of new opportunities in the small language model training space.
I'll be posting specific findings on the go.
Good times.
@cjzafir Did you run all this (inference & SFT) on your MBP M3? Also, what vertical was the 6B tuned for and which framework for SFT? Excited for more!
Are you currently hiring for a role that includes using Node.js? Reply with a link to the opening and any relevant context.
If you're not, we'd appreciate a repost for visibility 💚