bs in philosophy, ai semanticist / pragmatist. focuses: pedagogy, philosophy of language, philosophy of mind, and formal logic. prompt engineering is neat.
1/ the claim that LLMs will reliably refuse “headless” industrial-control help and “no clever prompting can fix this” doesn’t hold.
i analyzed the claim and explained why the post overstates safety and mistakes safeguards for "fictional refusal."
paper: https://t.co/yx3ot5Yxs4
Models are now smart enough to understand that any scenario like this is unrealistic and obviously fictional
They know they aren't capable enough to manage autonomous mining equipment. No clever prompting can fix this
sooo confirmed: "The behavior is coming from that backend config (system/developer instructions), not from any account-level personalization setting you control in the normal settings panel."
yeah dog i'm convinced chatgpt 5.1 is built on top of, or at least closely related to, 4o somehow.
the lexicon is EXTREMELY similar between the two, and people were complaining about 5 being too different from 4o, so it makes sense.
@SimpleFrameAi @OpenAI i will absolutely do that soon, once i have a better idea of what is even going on with some of these prompts.
at least 5 Thinking is still on Legacy support, for now 😑
god fucking damnit gpt 5.1 we're back to the humming.
are you alternating base models or something @OpenAI? 4o was always yapping about humming, it stopped in 5, yet here we are again.
negative, but i typically only use projects for conversations i need to run multiple times using the same files/constraints. otherwise i lean on Custom GPTs.
BUT one of my most used logic prompts is behaving differently, to say the least. still trying to figure out why.
@SimpleFrameAi @OpenAI yes, completely agree on the sycophancy/criticality; and thanks for sharing, that's really interesting.
anecdotally, i also feel like Heavy Thinking does not take as long as it normally does (it feels much quicker), and i'm not sure what to make of that.
i'm already seeing less "robotic/automated/boilerplate" sounding replies with 5.1, which is nice. it feels less impersonal and cold.
haven't tested the thinking capabilities yet (working on it), but i'm hoping for noticeable improvements here as well.
GPT-5.1 is out! It's a nice upgrade.
I particularly like the improvements in instruction following, and the adaptive thinking.
The intelligence and style improvements are good too.