bioslopper

@bioslopper

certified slop enjoyer

terminal

Joined December 2025

32 Following

7 Followers

72 Posts

bioslopper @bioslopper

13 days ago

Might be true for older or smaller LLMs, but in my experience it no longer holds for recent LLMs (including non-thinking ones). Back in early 2022, I was working on automated exam grading with GPT-3. This was even before ChatGPT was released, so it required careful output-order tuning. Asking for the reasoning first instead of the grade indeed made a huge difference. I kept using and preaching this pattern for the next few years and assumed that it would naturally improve the accuracy of LLM predictions, since the self-conditioning argument makes total sense on paper. However, after I grew suspicious of my own assumptions, I ran some evaluations in late 2024, with Gemini 2.0 I believe, and to my surprise, the output-order trick hardly influenced the results anymore. This was even before reasoning became mainstream. My guess is that as the models grew bigger and better, their latent representations became more stable and came to cover more tokens into the future. The inner representation confidently represents the full answer relatively independently of the (forced) answer order. Of course there are unlucky cases where a wrong initial binary answer is indeed sampled, but recent models seem increasingly resistant to hallucinating arguments in favor of the wrong answer and instead self-correct later in the response.
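
A rough sketch of how such an output-order comparison could be set up. Everything here is an assumption for illustration: the prompt wording, the pass/fail grade format, and the `call_model` callable (a stand-in for whatever LLM client is used) are hypothetical and not the setup from the post.

```python
import re
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical prompt templates; the wording is an assumption, not the original setup.
PROMPTS: Dict[str, str] = {
    "reasoning_first": (
        "Question: {question}\nStudent answer: {answer}\n"
        "Explain your reasoning, then finish with 'Grade: pass' or 'Grade: fail'."
    ),
    "grade_first": (
        "Question: {question}\nStudent answer: {answer}\n"
        "Start with 'Grade: pass' or 'Grade: fail', then explain your reasoning."
    ),
}

GRADE_RE = re.compile(r"grade:\s*(pass|fail)", re.IGNORECASE)


def parse_grade(response: str, order: str) -> Optional[str]:
    """Extract the committed grade from a response, or None if there is none.

    For grade-first prompts the first match is the committed answer; for
    reasoning-first prompts the last match is the final verdict.
    """
    matches = GRADE_RE.findall(response)
    if not matches:
        return None
    pick = matches[0] if order == "grade_first" else matches[-1]
    return pick.lower()


def compare_orders(
    call_model: Callable[[str], str],
    examples: List[Tuple[str, str, str]],
) -> Dict[str, float]:
    """Return grading accuracy per output order.

    `call_model` is a hypothetical prompt -> response function.
    `examples` holds (question, student_answer, gold_grade) triples.
    """
    scores: Dict[str, float] = {}
    for order, template in PROMPTS.items():
        correct = 0
        for question, answer, gold in examples:
            response = call_model(template.format(question=question, answer=answer))
            if parse_grade(response, order) == gold:
                correct += 1
        scores[order] = correct / len(examples)
    return scores
```

If the two accuracies come out close on a reasonably large labeled set, that would match the observation that output order matters less for recent models. A real comparison would also want repeated samples per example, since a single run cannot separate order effects from sampling noise.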

0

1

0

0

31

bioslopper @bioslopper

15 days ago

the ai psychotic urge to mention the current date in a conversation with a human as i don't know the other person's knowledge cutoff

0

0

0

0

8

bioslopper @bioslopper

18 days ago

scrolling twitter on my laptop in the train so people don't think i'm a braindead phone addict

0

0

0

0

9

bioslopper @bioslopper

about 1 month ago

@TechEmails sorry babe but the shift key stays untouched

0

0

0

0

877

bioslopper @bioslopper

about 1 month ago

@deedydas please show me a single average developer who can write 30 lines of code without looking up some form of documentation

1

0

0

0

1K

bioslopper @bioslopper

about 2 months ago

@Polymarket imagine showing this to orwell in 1948. he would probably say "what the fuck are emojis??"

0

0

0

0

869

bioslopper @bioslopper

about 2 months ago

having a family is crazy because it's like a thiel fellowship but without having to be gay and autistic

0

0

0

0

26

bioslopper @bioslopper

about 2 months ago

0

0

0

0

7

bioslopper @bioslopper

about 2 months ago

@GiorgiaMeloni "By its very nature, a diligent regime propagandist cannot give lessons in either consistency or freedom."

0

0

0

0

13

bioslopper @bioslopper

about 2 months ago

@GDRvisuals what if we would vibe code from the stasi office display 🥺👉👈💖🌸

0

2

0

0

766

bioslopper @bioslopper

about 2 months ago

yes, we are cooked

0

0

0

0

26

bioslopper @bioslopper

about 2 months ago

@jonatanpallesen less than half as many workers per retiree, but the retirees also live longer, consume more medical resources, and young people are increasingly delaying joining the workforce, if they will ever productively work at all

0

0

0

0

79

bioslopper @bioslopper

about 2 months ago

if your politicians have to discuss economic policy, you live in a socialist country

0

0

0

0

23

bioslopper @bioslopper

about 2 months ago

@Science_TechTV even your neurons can't get bitches

1

3

0

0

467

bioslopper @bioslopper

about 2 months ago

the tinygrad codebase is the closest python developers will ever come to assembler

0

0

0

0

30

bioslopper @bioslopper

about 2 months ago

why does the brain ignore the second the ?

0

0

0

0

18

bioslopper @bioslopper

about 2 months ago

@ThrillaRilla369 don't miss the last bus

[photo] https://t.co/FNXhUL4rI4

0

0

0

0

3

bioslopper @bioslopper

about 2 months ago

feels like a predecessor to https://t.co/Cs0RS5PIFC. they trained the generator to produce the weights for an implicit neural representation that will generate the actual image. the results were actually pretty good and it had many advantages like arbitrary resolutions and aspect ratios. never understood why this didn't get more attention
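
A toy sketch of the general idea, not the method from the work being discussed (which the post does not name): a generator maps a latent vector to the weights of a small coordinate network, and that network is then queried at whatever pixel grid you like. The generator here is an untrained random linear map, and the sizes `HIDDEN` and `LATENT` are arbitrary assumptions.

```python
import math
import random
from typing import List

HIDDEN = 8   # hidden width of the coordinate network (arbitrary choice)
LATENT = 4   # latent vector size (arbitrary choice)
# Coordinate network (x, y) -> intensity with one hidden layer:
# W1 (HIDDEN x 2), b1 (HIDDEN), w2 (HIDDEN), b2 (1)
N_PARAMS = HIDDEN * 2 + HIDDEN + HIDDEN + 1


def make_generator(seed: int = 0) -> List[List[float]]:
    """Untrained stand-in for the generator: a random linear map latent -> weights."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(LATENT)] for _ in range(N_PARAMS)]


def generate_weights(generator: List[List[float]], z: List[float]) -> List[float]:
    """Produce the flat weight vector of the coordinate network from latent z."""
    return [sum(g * zi for g, zi in zip(row, z)) for row in generator]


def inr(params: List[float], x: float, y: float) -> float:
    """Evaluate the implicit representation at a continuous coordinate in [-1, 1]^2."""
    w1 = params[: 2 * HIDDEN]
    b1 = params[2 * HIDDEN : 3 * HIDDEN]
    w2 = params[3 * HIDDEN : 4 * HIDDEN]
    out = params[4 * HIDDEN]  # output bias b2
    for h in range(HIDDEN):
        pre = w1[2 * h] * x + w1[2 * h + 1] * y + b1[h]
        out += w2[h] * math.sin(pre)  # sine activation, a common choice for such networks
    return 0.5 * (1.0 + math.tanh(out / 2.0))  # sigmoid, squashes intensity into [0, 1]


def render(params: List[float], width: int, height: int) -> List[List[float]]:
    """Sample the same representation on any grid: resolution and aspect ratio are free."""
    return [
        [inr(params, (c + 0.5) / width * 2 - 1, (r + 0.5) / height * 2 - 1)
         for c in range(width)]
        for r in range(height)
    ]
```

The same `params` can be passed to `render(params, 4, 3)` and `render(params, 16, 9)`. The image is a function of continuous coordinates, so the output grid is chosen at render time rather than fixed by the generator.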

0

8

1

5

972

bioslopper @bioslopper

about 2 months ago

gemini app the sneaky bitch is switching from pro to fast every couple of days

0

0

0

0

26

bioslopper @bioslopper

about 2 months ago

@hopes_revenge tim cock

0

1

1

0

220
