@tszzl LoC is a useless metric when it is a metric you are specifically aiming for or when talking about multiple languages, if you work on similar stuff and no one purposefully manipulates LoC count, it is actually pretty comparable
@jacobsali123@stupidtechtakes Yeah, again, an attacker can just check if you are already signed up on the signup form, on like 90% of pages ive ever checked
@mhdcode Hmm usually i would agree, but here, idk. Anywhere where 3.1 pro or flash are at the top - totally, absolutely true. But 3.5 flash scoring between kimi and qwen is honestly pretty realistic, it is a bad model for google, but it isnt completely trash
@cnakazawa But at least you now get to say a weird acronym every time, plus your coworkers will all just call it "Chat", which will drive you up the wall
At least that is what chat told me
@birdabo that guy keeps saying a lot of things and like 10% are true, search for 5.6 on his profile, he already claimed like 3 times that it would release the next day and it didnt
@crypt0lake I mean you cant really FORCE a model to reason a lot, if the user says "Hi, can you help me with a question" reasoning for 16000 tokens would be a waste of time
@scaling01 Extrapolating from only these 2 data points and with a method i wont publish, i have determined that we will have AGI by thursday, june 4, 2026