@Xinanexa @doodlestein @satyanadella Skills have an LLM operating on them at the top level, but a big advantage of skills in the first place is reducing this variability through consistency. Referenced assets, scripts, query files, etc. reduce ad-hoc coding & reasoning and add deterministic processing to the loop.
As believers in open research, we are disappointed to see Anthropic silently degrading Fable 5 for AI development:
"Any topic related to building pretraining pipelines, distributed training infrastructure, or ML accelerator design... may have limited effectiveness through Claude via methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning."
Not only do they get to decide what you use LLMs for in research, but this also enables them to silently intervene in your research without you knowing.
This sets a dangerous precedent. If a model refuses openly, users can understand the boundary. If a model falls back to another model, users can still evaluate the difference. But if a model silently modifies or weakens its own answers while still pretending to help, researchers lose the ability to know whether a failed result came from their own idea, their implementation, or an invisible intervention by the model provider.
That is not safety. Safety policies should be transparent, auditable, and user-visible.
On top of that, the people most harmed by this are not the largest labs with massive teams and proprietary infrastructure. They are the independent researchers, academic groups, startups, and open-source builders who rely on public tools to compete, innovate, and pioneer AI for everyone else.
@thsottiaux I've been loving the taste and the speed, but this is concerning:
https://t.co/VYMdSgBErK
The issue, and the meta-issue of how it was discussed and handled, is a departure from an excellent track record.
PLEASE let this be an anomaly and not a harbinger 🙏
@benhylak @astuyve Could either of these support OpenHands? I'd love to explore integration options for OpenSymphony using openhands agent-server and soon codex app-server ...and of course every harness has different tool names 😆
@feross Is there some universe where Socket gets compromised and malicious versions of packages get injected into all of their customers' environments?
Same reason I've resisted third-party password managers. Great reduction of attack surface but external single point of failure.
@did0f I asked ChatGPT this a few weeks ago and it said that the selected model does not matter for automatic compaction, but it DOES matter for manual compaction.
Today is a hard day. I shared this note with the @linear team today: We’ve made the difficult decision to increase our workforce. This is not a cost-cutting exercise or a reflection of anyone’s performance. We’re simply reimagining every role for the agentic AI era. We’re hiring. We’re sorry about that.
@gregpr07 I've added this to OpenSymphony to allow debugging of current or past tasks, dropping into the OpenHands conversation.
I'll be adding codex soon 😃