yapit.md turns urls and pdfs (even research papers) into clean, listenable markdown. @kepano's defuddle handles websites, a vision LLM handles PDFs and e.g. gives math spoken alt text. Free TTS in your browser, open source, self-hostable - and I didn't make UI/UX an afterthought.
@thdxr yes, especially since you can just think out loud.
and send that when ure stuck.
or even if you arrive at something precise, gives more nuance.
I used to have up to 5k token prompts back in jan/dec when i vibecoded stuff.
currently i like to be more in the loop, think twice
yapit.md turns urls and pdfs (even research papers) into clean, listenable markdown. @kepano's defuddle handles websites, a vision LLM handles PDFs and e.g. gives math spoken alt text. Free TTS in your browser, open source, self-hostable - and I didn't make UI/UX an afterthought.
@dexhorthy claw has a 30 minute heartbeat. agent gets unconditionally invoked. mutiply by a million claw instances and you have yourself a nice load on the infra.
@VictorTaelin good idea. so far ive had this spread out in task (issue) and knowledge files, but a) claude is lazy b) he's (absolutely) right because the 200k ctx window is too small c) I thought about a centralized file like goals/decisions.md, but questions might be more natural for this
yess this is exactly what I wanted to build myself - or am still building because, looking at the repo, i think this can be much more bitter-lesson-pilled. the main challenge i face is sloppyfication without careful oversight, but i bet 10-100x longer contexts completely solve this. I've also only used my setup interactively due to this. Having the agent dream and so on in the background is also simply bottlenecked on context (and token costs...) still, in my experience.