Grant me the serenity to accept the problems where I cannot offload my understanding, courage to let the LLM go burrrr on the problems where I can, and the wisdom to know the difference.
The more I do it, the more it feels like there's some misalignment between LLMs and statistical work. Maybe it's over-abstraction leading to making it harder to turn knobs and experiment? Maybe the harnesses are oriented toward shipping code and not sitting on a problem?
No company has ever been more relatable than Dunkin giving up on the “healthy” protein menu strategy and hard pivoting directly into putting Oreos on everything.
Every single “the AI deleted prod!” post I’ve seen could have very easily been avoided by thinking about the principle of least privilege even one time.
The conclusions here feel wrong to me. The two lessons I see are:
1. Don't run agents anywhere they might be able to access production environment credentials - it's on you to know which credentials those are
2. Keep tested backups that are independent from your production host
Quick question, does gpt 5.5 still talk to me like it’s trying to prove to me that it’s read every single book on computer programming?
“We need to leverage a shim to reduce the risk surface area” got it
In case you were wondering how this performed, near-perfect temps meant no weather effect, leaving just the random-walk estimate.
Top-3 average was 2:02:30, ~0.7 posterior SDs faster than the 2:05:10 median, inside the 95% interval (1:57:55–2:12:29).
Based on the attire on the Green Line skewing more Hokas and On Cloudmonsters than a typical workday, I can confirm today is the Boston Marathon.
Just uploaded a new blog post looking at how much of finish-time variation is attributable to weather.
Link in thread
@StatsbyLopez There's actually pretty good availability! At least through 2019 easily available. See: https://t.co/tyFiTAyihM
I scoped it down to top-3 for the sake of grounding to existing work. and to save a bit of sanity for a pedagogical-leaning post. Definitely could be much richer.
Based on the attire on the Green Line skewing more Hokas and On Cloudmonsters than a typical workday, I can confirm today is the Boston Marathon.
Just uploaded a new blog post looking at how much of finish-time variation is attributable to weather.
Link in thread