@peekyblinders @PoloGzzHS @Jonnyinmd @Math_files In the last round, there are only 2 people left - you and the other guy who also won 32 rounds in a row.
You're basically competing to see who comes first and who comes second in the whole world. Your opponent definitely won't be a baby, an elderly person, or anyone remotely easy.
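A quick sanity check of the arithmetic, as a rough sketch. It assumes a single-elimination bracket and a world population of roughly 8 billion (both are my assumptions, and the function names are just for illustration):

```python
def rounds_needed(players: int) -> int:
    """Rounds of single elimination needed to get from `players` down to one winner.

    Uses integer bit tricks instead of floating-point log2 to avoid rounding issues.
    """
    return (players - 1).bit_length()


def players_left(bracket_size: int, rounds_played: int) -> int:
    """Players remaining after `rounds_played` rounds, each round halving the field."""
    return bracket_size >> rounds_played


world = 8_000_000_000          # assumed rough world population
total_rounds = rounds_needed(world)
bracket = 2 ** total_rounds    # smallest power-of-two bracket that fits everyone

print(total_rounds)                # rounds needed to crown one winner
print(players_left(bracket, 32))   # field size once 32 rounds are done
```

Under those assumptions the bracket takes 33 rounds, so after 32 wins in a row only the two finalists remain.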
@michael_waples @0xdippo @thsottiaux There is a "waste" of tokens in the previous cycle, yes. But you also get an additional fresh cycle. So it's more like -80 and +100.
Notice that this is why, when the total number of days is preserved, the timeline with the extra reset logically ends on a higher cycle number than the one without.
@michael_waples @0xdippo @thsottiaux The above is about "accessible" tokens, as in the maximum you can use given the same timeline.
But how about "experientially"? Say we have a pattern of using 80 tokens on Thursday and 20 on Friday post-reset.
@athiestboi It's completely fair to ask others to make their case if they so choose.
Atheism being the default position is more about one's own internal evaluation, and is not applicable in situations where one tries to convince another of a position.
@ivanburazin 400s under 30 here. Though my albumin and SHBG were so high that my effective free T was as good as being just in the 100s.
Wild stuff, because I only checked when gyms were all closed during the COVID lockdown and I "had nothing better to do".
@TakeThatNurses @LinchZhang You actually can't tell if he used diameter or radius here, because the radius is half the diameter. So 1.5 is the constant that remains after eliminating both the radius/diameter and/or the conversion factor of 0.5/1.
@spirodonfl @cmuratori At the very least, I would expect college grads to be well aware on some level of the social harms caused by social media, increasing polarisation due to media bubbles built by recommendation engines, etc.
@spirodonfl @cmuratori I agree that AI is the clearest mirror right now, but I wouldn't go so far as to say that the harms of Big Tech are something that the audience (i.e. college grads, not random Gen Alphas) is that oblivious to either.
Not necessarily Meta buying VPN companies, but at...
@thsottiaux Also, it is much easier to reach over the desk and ask a friend how he got the model or harness to work the way he likes it best.
Benchmarks don't give that. They just give general scores for untuned models. There are no next steps or suggestions for what to try next.
@thsottiaux Friends are an underrated but big factor. It is inevitable that your closest connections are close because you have something in common - what you work on, how you work, think, or communicate, etc.
A friend's success is often a better predictor of your own success than benchmarks.
@Sabre9186 @cmuratori I think the problem with "make use of this hindsight" is highlighted by another part of the video - "how? what? why?"
There was no post-mortem about why the bad thing happened. No advice on what to take special note of. Not even a brief hint of decision pitfalls.
@tomieinlove @kaywisehero @Perrid13 You can have a 2-sided polygon, and 2-sided polygons do exist; that does not mean that removing 2 sides from a rectangle produces a digon or a 2-sided polygon.
@slopwareindy @koltregaskes By existing benchmarks.
While it is true that Sonnet and Opus are not in the same class, AI companies do try to make it seem like the difference is not huge in their Sonnet/mini classes based on benchmarks.
SWE-bench Verified: Sonnet 4.6 79.6% vs Opus 4.6 80.8%
@josephathomas @theo Though a cutoff more than 24 hours early is probably too much for timezone difference to fully explain - with very rare exceptions, which should not apply since neither is at either extreme end of the UTC offset range.
@laurie_guilbeau @NepsisVT @todayyearsold But since the question is only about the number of sides left, does it still need to be a shape?
For example, we would say that a flat piece of paper has 2 sides - front and back - even if it isn't a particular shape - since it represents a plane.
@weswinder To be fair, asking the model to self-identify is already the wrong approach.
How many more variations of Gemini 3 Pro identifying itself as Gemini 1.5 Pro do we need?
Would be better if he just pulled up an official source for the model list and checked.
@christoaivalis @lior_eth To be clear, I'm only talking about optimisation from what is sufficient/typical. So we aren't comparing an athlete with PEDs but given zero food vs an athlete without PEDs but given optimal food.
Obviously, zero food/rest/training are potentially deal-breaking on their own.
@christoaivalis @lior_eth For a non-athlete, it might be hard to gauge what feels subjective when we toss around labels like "small" and "huge" edges.
But I'd propose that an edge can be considered huge if its effect is greater than the sum of what is obtainable via food, rest, and training optimisation.