"За первый квартал падение 51%. Такого не было даже в 90-е»
Boglaev, director of the Cherepovets Foundry and Mechanical Plant, has reported a catastrophic collapse in machine-building output.
Fixing the situation is practically impossible, because the budget has no money for ⤵️
@matthew_d_green All this shit is stupid as hell. What stops a 10-14 year old from asking an older brother/sister/friend to hand over their phone, or to sign in to a PC with their own account? This is incredibly stupid.
@stalkermustang Code was always the spec: just at a lower abstraction level. Machine code -> assembler -> C -> Java/Python/frameworks -> LLM instructions. The developer’s job was always to project vague product intent into millions of small decisions: DB, caching, data structures, edge cases, failure modes.
@stalkermustang With agents, the question is who makes those decisions. If the LLM keeps asking the human, the human is still the developer, just prompting instead of coding. If the LLM makes good decisions itself, the human role shifts to intent, architecture, and review.
Some things never change. If you don’t understand this one, you don’t understand what’s happening in AI.
Marcus, 1998: neural nets have trouble generalizing far beyond the data.
Marcus, 2001, 2012, 2019, 2022, etc.: neural nets have trouble generalizing far beyond the data.
Apple, 2025: neural nets have trouble generalizing far beyond the data.
Meta/Stanford/Harvard, 2026: neural nets have trouble generalizing far beyond the data.
The creators of SWE-Bench just dropped a really simple new benchmark every LLM gets 0% on.
ProgramBench asks: can models recreate real executable programs (ffmpeg, SQLite, ripgrep) from scratch with no internet?
We are far from saturated on model quality.