I'm pleased to announce that next week I'll be joining the Cleveland Guardians R&D team!
Thanks to everyone who has helped me to develop in the world of baseball analytics over the past few years.
@s8mb Beyond the GPS tracking issues that others have mentioned, their estimated arrival times strongly underestimate the impact of traffic.
Something I explored here:
https://t.co/OUX1FAX6Ts
We're looking for a data scientist to join us at the Cleveland Guardians R&D department!
I've really enjoyed working here with lots of great people for the past 2+ years. If you're interested in baseball analysis and solving tough problems with data then apply at the link below:
As I've left the public space, PitchingBot has a new home on FanGraphs.
https://t.co/X7l4hWhmVo
Check out the primer here if you want to know more about what goes on under the hood:
https://t.co/FR0d4Nn71q
@tangotiger Fair enough, I thought the graphs above avoided that by changing the samples to still include players on the "bad" side of random variation.
@levy_cameron@tangotiger@TaiyoKeilin@SassoCT I agree the regular season results are mostly selection bias.
I redid some of the postseason analysis comparing to the odds ratio wOBA of the batter-pitcher matchup and there seemed to be something real there (didn't publish that though, I'll see if I can dig it out)
@VorosM Hard to say much.
Note that the Phillies game would have come under "Cloudy" using this definition of weather. Fastballs show a bit more of a significant change from clear -> cloudy compared to other pitches.
@tangotiger@Light_Saberist One could argue that hitters have more control over whether they pull flyballs than pitchers have over BABIP.
https://t.co/6w8Velxe71
Whether the effect is significant enough to be worth including is another question.
Also, batters have a reliable skill at getting their flyballs to fly further than the model expects when controlling for LA, EV, stadium, & weather.
Year on Year R^2 = 0.2 for min. 50 flyballs which is small but significant.
Here are the batters who gain/lose the most distance.