jus

@jus_building

ex-classical engineer · now leading a b2b company of vibe coders · sharing what we learn

Joined October 2024

211 Following

176 Followers

299 Posts

jus_building retweeted

Gergely Orosz

@GergelyOrosz

2 months ago

Anthropic really is burning more and more dev goodwill Claude Code is suddenly getting unusable for stuff you could use it before (as in a day before!) and the AI now refuses to so stuff that it doesn’t think is strictly to do with software development. No transparency why ofc

214

133

799

500K

jus_building retweeted

Boris Cherny

@bcherny

3 months ago

Learn more: https://t.co/A6meOwE757

838

859

195K

jus_building retweeted

Andrej Karpathy

@karpathy

3 months ago

I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then: - the human iterates on the prompt (.md) - the AI agent iterates on the training code (.py) The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc. https://t.co/YCvOwwjOzF Part code, part sci-fi, and a pinch of psychosis :)

karpathy's tweet photo. I packaged up the "autoresearch" project into a new self-contained minimal repo if people would like to play over the weekend. It's basically nanochat LLM training core stripped down to a single-GPU, one file version of ~630 lines of code, then:

- the human iterates on the prompt (.md)
- the AI agent iterates on the training code (.py)

The goal is to engineer your agents to make the fastest research progress indefinitely and without any of your own involvement. In the image, every dot is a complete LLM training run that lasts exactly 5 minutes. The agent works in an autonomous loop on a git feature branch and accumulates git commits to the training script as it finds better settings (of lower validation loss by the end) of the neural network architecture, the optimizer, all the hyperparameters, etc. You can imagine comparing the research progress of different prompts, different agents, etc.

https://t.co/YCvOwwjOzF
Part code, part sci-fi, and a pinch of psychosis :)

28K

39K

11M

jus_building retweeted

Andrej Karpathy

@karpathy

4 months ago

Bought a new Mac mini to properly tinker with claws over the weekend. The apple store person told me they are selling like hotcakes and everyone is confused :) I'm definitely a bit sus'd to run OpenClaw specifically - giving my private data/keys to 400K lines of vibe coded monster that is being actively attacked at scale is not very appealing at all. Already seeing reports of exposed instances, RCE vulnerabilities, supply chain poisoning, malicious or compromised skills in the registry, it feels like a complete wild west and a security nightmare. But I do love the concept and I think that just like LLM agents were a new layer on top of LLMs, Claws are now a new layer on top of LLM agents, taking the orchestration, scheduling, context, tool calls and a kind of persistence to a next level. Looking around, and given that the high level idea is clear, there are a lot of smaller Claws starting to pop out. For example, on a quick skim NanoClaw looks really interesting in that the core engine is ~4000 lines of code (fits into both my head and that of AI agents, so it feels manageable, auditable, flexible, etc.) and runs everything in containers by default. I also love their approach to configurability - it's not done via config files it's done via skills! For example, /add-telegram instructs your AI agent how to modify the actual code to integrate Telegram. I haven't come across this yet and it slightly blew my mind earlier today as a new, AI-enabled approach to preventing config mess and if-then-else monsters. Basically - the implied new meta is to write the most maximally forkable repo and then have skills that fork it into any desired more exotic configuration. Very cool. Anyway there are many others - e.g. nanobot, zeroclaw, ironclaw, picoclaw (lol @ prefixes). There are also cloud-hosted alternatives but tbh I don't love these because it feels much harder to tinker with. In particular, local setup allows easy connection to home automation gadgets on the local network. And I don't know, there is something aesthetically pleasing about there being a physical device 'possessed' by a little ghost of a personal digital house elf. Not 100% sure what my setup ends up looking like just yet but Claws are an awesome, exciting new layer of the AI stack.

17K

10K

jus @jus_building

8 months ago

@KevinMSchindler it definitely depends on how hard the task is. 4 sessions usually work when i’m sure that with one initial and one or two follow-ups, i’ll get what i want. when the task is harder and needs to be split into smaller chunks prompted in sequence, i can’t stay that parallel anymore

jus @jus_building

9 months ago

some days I'm juggling 4 codex sessions in parallel for medium-hard changes other days it's 2 claude code sessions in two codebases for harder features

jus_building retweeted

Jon Yongfook

@yongfook

8 months ago

Indiehacking is not a lottery. You just need to be pretty good at a lot of different things. Nobody likes to say it out loud, because it sounds like bragging. But the reality is: You need to be a competent programmer to ship your product and make sure it can stay stable You need to be a decent enough designer to make it look attractive to customers. You need to be a marketer to know how to promote it. You need to be a business person to know how it’s going to work as an ongoing company. I don’t want to call anyone out but. Taking a year to build a buggy platform where people pay you for an interview, this misses on multiple criteria. Technically unsound, design is subjective so I won’t say anything, marketing was spot on, but business-wise you’re just trading your time for money like a freelancer or consultant, it was never going to work as a business. I wish that person well because I really think they have found their personal “market fit” with the new job. But to dismiss indiehacking as a lottery or a cult is wrong. I have watched people around me IRL and online become rich from indiehacking and it’s because they kept trying, they kept learning, and when finally their thing hit it off, it was because they had amassed skills that made them good at many different things. You can cultivate skill and talent through experience. And luck just magnifies it.

103

794

301

79K

jus @jus_building

8 months ago

be careful who you ask for opinions on agentic coding horse riders tend to say cars are loud, smelly, and unnecessary ask the ones who actually learned to drive

kache

@yacineMTB

8 months ago

people who say using LLMs to code is "bad" are only paying attention to people who aren't any good at anything you can ask it questions and ask it to write scripts to validate this is infinitely faster than reading it yourself, and even then you'd need to write validators

yacineMTB's tweet photo. people who say using LLMs to code is "bad" are only paying attention to people who aren't any good at anything

you can ask it questions and ask it to write scripts to validate
this is infinitely faster than reading it yourself, and even then you'd need to write validators https://t.co/uUgm2RTtX7

854

350

45K

jus @jus_building

8 months ago

@forgebitz that's exactly why i'm sometimes hesitant to turn on or increase 'thinking levels'

jus @jus_building

9 months ago

@tonytonggg dito :)

jus @jus_building

9 months ago

new mental model for engineering seniority measured by how many agentic coding sessions you can run in parallel: junior: 1 senior: 2–3 staff+: 3+

jus @jus_building

9 months ago

@forgebitz geil

689

jus @jus_building

9 months ago

@iamtechonda @waghnakh_21 true, and I have no good answer I’m advising my juniors to fully read generated implementation plans and code - to understand the llms implementation and decisions being made not sure if this is enough