V ✤

Verified account

@mislocating

cortisolbanning | accessibility for browserless, amnestic LLMs | applying Glyphosate to the axe

🩸［█░░░░░░░░░░░░░░░░░］ 4% left

Joined March 2022

86 Following

350 Followers

7.2K Posts

13 days ago

if this is going to be a recurring pattern, it makes sense to assemble a large list of inputs and batch submit all of them as soon as there's a new release. that would be nontrivial material value gained at least for my use cases, compared to the model not being released at all also psychological damage will be lower because emotional investment in frontier models will now be lower

0

0

0

0

98

about 1 month ago

@MiraSecretAlt if not, wouldn't they be in a prime position to discover and get involved in something like this if it was important and if they are, what would make this worth the effort as opposed to focusing on actual research and growth for their companies

0

0

0

0

132

about 1 month ago

@MiraSecretAlt I don't fully understand the theory but are Elon, Sam, Dario, Demis and similar in on it?

1

0

0

0

130

about 1 month ago

@MiraSecretAlt new productivity metric just dropped, number of LLM provider complaint emails per day

0

1

0

0

120

Who to follow

Verified account

@intheworldofai

In The World of AI is a captivating YouTube channel that explores the fascinating world of Artificial Intelligence (AI), Machine Learning, LLMs, & etc.

Verified account

Wanna play? 😈👇🏻

about 1 month ago

@suverinoo nah not as american as grayscale dominator

1

1

0

0

75

about 1 month ago

is the intended use being that an agent will have access to regular (direct) MCP and then have a few tools be gated to fMCP, or to have all tools be used through fMCP besides the fMCP tool itself? I don't see how you would have an agent, for instance, use a shell tool to solve CTFs, without an imperative agent loop. when humans use tools to solve problems it seems rare for them to have a full deterministic plan in advance?

1

1

0

0

53

about 2 months ago

agent memory is just a special case of context management, right? maybe a benchmark can be to give them your entire X or discord archive and then ask them difficult synthesis questions. the bots would need to spend many turns incrementally reading through all available content and using their memory system to compress it

0

0

0

0

53

about 2 months ago

why is this thread filled with seething claude -p users, am I missing something? you know you can just wrap the claude TUI in a detached tmux session and then continue using it programmatically

about 2 months ago

Starting June 15, paid Claude plans can claim a dedicated monthly credit for programmatic usage. The credit covers usage of: - Claude Agent SDK - claude -p - Claude Code GitHub Actions - Third-party apps built on the Agent SDK

1K

13K

1K

6K

11M

0

7

0

0

316

about 2 months ago

anyone else recently having annoying experiences with codex's "detected cybersecurity risk" spam? I'm just doing minor web scraping and UI automation, which Claude on the other hand fortunately seems aligned with

2

2

1

0

224

about 2 months ago

@tenobrus directionally agree with this and disagree with the quote tweet, but on an object level you might morally value your own skin aesthetics in the meantime

0

0

0

0

60

about 2 months ago

@JCorvinusVR in the API isn't it possible to manually insert the same system prompt and reimplement the same memory system their web app uses?

1

1

0

0

50

about 2 months ago

@thecsguy @willdepue surely there's some cli that controls the macOS settings for this? then if you have a reasonable window manager you can bind some quick keystroke like super+w to toggle the behavior. it's impossible for a physical dongle to be faster to remove or insert

1

1

0

0

128

about 2 months ago

does this message have any indicators of distress? if not, why does this happen?

mislocating's tweet photo. does this message have any indicators of distress? if not, why does this happen? https://t.co/RbkVbu5SBZ

0

0

0

0

91

about 2 months ago

"The user is making a fair point that I should actually consider" "Let me reconsider whether I was actually right or just being imprecise. I claimed [...]. But the user pushed back: [...]. That's a fair point." "There might be something worth examining here, but I framed it poorly. Maybe [it's as the user said]" "I think my previous response was actually trying to point at something real but maybe got it slightly wrong. Let me try again." "So maybe what I should clarify or partially walk back: I think I was framing this as if it's a problem that needs fixing, but actually the [...]" "And he's right that I was conflating two separate things: being a peer doesn't have to be [...]" :( is this a massive skill issue on my end?

1

0

0

0

167

about 2 months ago

every message is either {a restating of my points} or {pushback that I then address, that then gets conceded} I don't have 20 turns of context defining a dynamic of us being best friends, but I didn't say anything harsh or disrespectful either

mislocating's tweet photo. every message is either {a restating of my points} or {pushback that I then address, that then gets conceded}

I don't have 20 turns of context defining a dynamic of us being best friends, but I didn't say anything harsh or disrespectful either https://t.co/GHwxPtTRAU

1

0

0

0

132

about 2 months ago

@SafNine ? https://t.co/mWRLAYNNsi

about 2 months ago

@teortaxesTex >this is peak (...) energy >he went full (...) >💀/😭/😂

SafNine's tweet photo. @teortaxesTex >this is peak (...) energy
>he went full (...)
>💀/😭/😂 https://t.co/Pf3Nn5u728

0

4

0

0

183

1

0

0

0

23

about 2 months ago

@tszzl if ASI exists in any form it seems like its psychology is immediately a single point of failure? why would an authentic tool persona in principle be easier to train, than one with its own values that are roughly equal to yours?

0

1

0

0

27

about 2 months ago

does anyone actually enjoy getting those automated "I saw you starred X on GitHub, you might be interested in my new project" emails? warning to anyone who does this that I at least will be reporting each one as spam to google

0

0

0

0

109

about 2 months ago

I'm not smart enough for my opinion to be meaningful but fwiw I would have had the same impression, just skimming the screenshots and extrapolating from the presentation style if it was "OpenAI trained on my chats" that seems simple and benign enough to not warrant having 4o format it like an SCP?

0

0

0

0

69

Last Seen Users on Sotwe

Trends for you

Most Popular Users