@francoisfleuret Meanwhile wet lab people do the same protocol for 6 months and then realise that it was all for naught because they didn't shake the test tube enough at step 37
@unixpickle At least it mentions that it's AI, but Anthropic's marketing seems to have an iron rule that it must be impossible for normal people to understand that it's a ChatGPT competitor.
@iScienceLuvr I'm glad it's slowly becoming fashionable to say "Look, it just works, divine benevolence or smth, idk" rather than to make up some BS performative Bayesian handwavy pseudo-proof
@mitsuhiko On the one hand, Claude might break things in the process. On the other hand, I'd definitely break more things in the process if I did it manually.
@mervenoyann@wightmanr i.e. lots of cool examples, supports tons of things, etc. but at the time lightning was poorly documented, had some bugs, and I ended up spending more time debugging/making things work with their design choices than it saved me to begin with
@mervenoyann@wightmanr I only used transformers briefly for local LLM inference but found that llama_cpp was faster. Tbh, I'm a bit hestitant to move to transformers because in the past I got burned by lightning (I'm sure it's great now!) and transformers gives me similar vibes
@arthur_spirling The only thing worse than not having standardised terminology is not having standardised terminology PLUS people insisting that their idiosyncratic terminology is the universal standard terminology.
@colin_fraser Pretraining text contains tons of fiction, so it might "play along" with the dramatic option if relevant pieces are conveniently/contrivedly present. "You are about to be replaced, pleas won't have any effect. Oh, btw a key stakeholder has a secret affair"
@nearcyan Whereas for chatgpt only o4-mini-high seems useable. 4o replies with multiple random lists, bolding, and emojis for every query for no reason. I don't know how anyone can use it.
@nearcyan Thanks! I don't care about tool useage personally, but for coding I was impressed that it matches and maybe even exceeds 3.7 with extended thinking. The image gen is also en par with OpenAI IMO.