Nike spent millions on “Breaking2,” an attempt by Eliud Kipchoge to break the 2-hour marathon.
Kipchoge did it in Nikes in 2019, but it was with lasers and pacemakers. It didn’t count.
Today, 2 men do it officially. Both in adidas. The hits keep coming.
@Tartismalar So is the question of whether this incident is open to interpretation itself closed to interpretation?
The question of what VAR can and cannot intervene in has become completely arbitrary.
@gsultratrftr Absolutely right. Unfortunately, because of Yunus, Icardi and Lang, we played with 8 men. The coach needed to switch to an alternative (striker-less) formation with Sallai, Asprilla, İlkay and even Nhaga. From this point on, Icardi’s exit from the squad has to be managed as well as possible.
Our submarines TCG I.İnönü and TCG Dumlupınar, which took part in the BLUE SEA exercise held on 1-3 April 1953, were on passage back to Gölcük after the exercise and entered the Çanakkale Strait (the Dardanelles) at 00:01 on the night of 4 April 1953, with TCG Dumlupınar in the lead and TCG I.İnönü behind. +
Karpathy accidentally shipped the org chart for every AI-augmented company in 2030.
Three files. program.md is the human writing strategy in plain English. https://t.co/rrgrQfMOGG is the agent executing, iterating, and shipping code. https://t.co/zkuCCCk43j is the locked evaluation layer that neither the human nor the agent can touch mid-run.
That third file is the one worth studying.
In most companies deploying AI agents today, the person who sets the goal also controls how success is measured. The marketing team picks the KPI, runs the campaign, and reports the results. The PM defines the metric, ships the feature, and presents the dashboard. The incentive to subtly shift the goalposts is built into the structure.
Karpathy separated goal-setting from evaluation by making https://t.co/zkuCCCk43j immutable. The agent optimizes val_bpb. The agent cannot redefine val_bpb. The agent cannot swap in a friendlier dataset. The agent cannot adjust the tokenizer to make its numbers look better. It either improved on the locked metric or it gets reverted. No narrative. No context. No "well, if you look at it this way."
That's why the results held. 700 experiments, 20 kept, and when Karpathy applied those 20 improvements to a model twice the size, every single one transferred. The gains were real because the agent had zero ability to make fake gains look real.
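The keep-or-revert rule described above can be sketched in a few lines. This is a toy illustration, not Karpathy's actual code: `EVAL_CONFIG`, `evaluate`, `propose`, and `run_experiments` are hypothetical names, and the quadratic score is only a stand-in for val_bpb (lower is better). The point is that the agent's proposal step never gets a handle on the evaluator, and the evaluator's settings are read-only.

```python
import random
from types import MappingProxyType

# Hypothetical locked evaluation settings. MappingProxyType gives a
# read-only view, so code holding EVAL_CONFIG cannot reassign its keys.
EVAL_CONFIG = MappingProxyType({"metric_name": "val_bpb", "optimum": 0.3})


def evaluate(params):
    """Toy stand-in for the locked metric (lower is better)."""
    return (params["lr"] - EVAL_CONFIG["optimum"]) ** 2 + 1.0


def propose(params, rng):
    """Toy stand-in for the agent: perturb one hyperparameter."""
    return {"lr": params["lr"] + rng.uniform(-0.1, 0.1)}


def run_experiments(params, propose_fn, n_trials, seed=0):
    """Keep a candidate only if it beats the best locked score so far."""
    rng = random.Random(seed)
    best = evaluate(params)
    kept = 0
    for _ in range(n_trials):
        candidate = propose_fn(params, rng)
        score = evaluate(candidate)
        if score < best:
            # Improvement on the locked metric: keep the change.
            params, best = candidate, score
            kept += 1
        # Otherwise the candidate is discarded (reverted).
    return params, best, kept
```

Calling `run_experiments({"lr": 1.0}, propose, n_trials=50)` returns the final parameters, the best score, and how many changes were kept. Nothing in `propose` can change how `evaluate` scores a candidate, which is the separation the thread is describing.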
Shopify's CEO ran the same architecture overnight. 37 experiments, 19% quality improvement, smaller model beating a larger one. The pattern transferred because the evaluation was trustworthy.
Now scale the principle. A sales team where the AI agent writes outbound sequences, an independent system scores reply quality, and a human sets the targeting criteria. A product team where the agent ships variants, a locked analytics pipeline measures retention, and a PM writes the experiment brief. A recruiting team where the agent screens candidates, a calibrated rubric scores them, and a hiring manager defines the role.
The separation Karpathy built into 630 lines of Python is the same separation every company will need when agents do the execution. Whoever controls the eval controls the outcome. Lock it down or the agent will find the shortest path to a number that means nothing.
They came to Istanbul twice and were hosted in the best possible way; we went there for one away game and everything that could happen to us did. And that sociopath Arne Slot can keep going on about atmosphere and whatnot. May you never see a good day; you don’t deserve Van Dijk either, give him to us.
@zagortenay I already told you I don’t like some of Okan’s behavior, but none of it compares to the theater that disgraceful Arne Slot deliberately put on yesterday. When even the entire press in the Netherlands is talking about it, how can you say you didn’t see anything? Has “Liverpool fanaticism” blinded you?
Liverpool has no players with character left apart from Van Dijk and Salah. And Arne Slot’s behavior and comments are pure clown material. I hope he gets fired soon, as he deserves.