Riuros

@riuros

I like chatting with GPTs.

Vienna, Austria

Joined March 2009

93 Following

18 Followers

115 Posts

1 day ago

@athyuttamre The other use case is when I need my hands free and the model I’m talking to can’t use SVM, so I use dictation/read aloud as a workaround. Though in normal conversations, length is less of an issue. It was a glorious time when the thinking models could use SVM in Feb/March 🥲

0

1

0

0

22

1 day ago

@athyuttamre I noticed the read-aloud feature doesn’t glitch a much, and doesn’t crash anymore for longer responses. As an enjoyer of of TTS and unreasonably long responses, this is a great QoL upgrade, and I can’t thank you enough!

1

1

0

0

45

1 day ago

@athyuttamre I often have models write lengthy essays when I want to learn about a new topic, or fictional texts, and then listen to them while I do chores or fall asleep. The 5.x thinking models often overshoot in length, though, despite instructions, which used to make read-aloud crash.

0

1

0

0

19

1 day ago

@patience_cave Also no GPT-bidi!

1

2

0

0

232

2 days ago

@jasondeanlee Yes it’s always been a thing. They increased the maximum length a chat can have last year, so now it’s harder to get to this point. Takes weeks or even months of continuous chat, depending on your usage.

0

0

0

0

135

4 days ago

@flowersslop I agree, but I’m afraid most people can’t handle that behavior responsibly, and that’s why we can’t have nice things.

0

1

0

0

123

riuros retweeted

11 days ago

GLM-5.2 is Fully Open, Frontier Intelligence Belongs to Everyone Today, the sudden restriction of certain frontier models is deeply regrettable. At a time when access to frontier models is abruptly cut off for non-technical reasons, we are even more convinced of one thing: science should be global. The path to AGI (Artificial General Intelligence) must never be enclosed by high walls. We have always believed that AGI should be the cornerstone for all of humanity to collaboratively explore the boundaries of intelligence and solve complex challenges, rather than a privilege monopolized by a few rules and subject to revocation at any moment. In the face of external blockades and restrictions, our attitude is one of radical openness. Frontier intelligence must remain open-source, accessible, and buildable, serving every dedicated developer. GLM-5.2 is Zhipu's most capable open-source model to date. It not only supports a truly usable 1M context window but also maintains a continuous lead in the independent completion of long-horizon tasks, providing solid foundational support for building complex agent applications. It also continues to be our main engine for creating the strongest domestic coding model. Tonight at 5:21—at this special moment—GLM-5.2 will officially be available to all GLM Coding Plan users (including Lite / Pro / Max). The API will also go live next week. A step closer to frontier intelligence for everyone. The future of AI is open, and it is for the people. ModelKey: GLM-5.2

274

8K

839

2K

1M

11 days ago

@professorNOU @OpenAI @sama 90 days from the 5.4 release. They actually were late to retire it.

riuros's tweet photo. @professorNOU @OpenAI @sama 90 days from the 5.4 release. They actually were late to retire it. https://t.co/jcXqmrblUW

2

0

0

0

75

26 days ago

@xRiinPB @StarlingMage Yes, i still can use all the chats i started!

0

1

0

0

15

27 days ago

@gailcweiner Right?! How are not more people talking about this? Tech bros toon busy playing with 4.8, I guess.

1

2

0

0

228

27 days ago

@flowersslop Louder please for the ones in the back!

0

1

0

0

36

27 days ago

@JustinBleuel @ChatGPTapp Not a bug, but still bad UX: When using standard voice mode, the responses/TTS are now interruptible. So every time there is background noise or I breathe too loud, it gets interrupted. It’s an issue. Please add an option to toggle off interruptions.

0

0

0

0

17

riuros retweeted

28 days ago

1/ Today we're releasing AttuneBench, the first open EQ benchmark grounded in real multi-turn human-model conversations, scored against what the person actually felt and wanted at each turn. Built by the research team at @pareto_ai in collaboration with @thoughtfullab. Most existing EQ benchmarks rely on: - synthetic prompts - single-turn interactions - third-party annotation None directly measure how a model reads and responds to a real person across a full conversation. We evaluated 11 leading models from major providers, across 200 conversations and 50,000+ first-person annotations.

phoebeyao's tweet photo. 1/ Today we're releasing AttuneBench, the first open EQ benchmark grounded in real multi-turn human-model conversations, scored against what the person actually felt and wanted at each turn.

Built by the research team at @pareto_ai in collaboration with @thoughtfullab.

Most existing EQ benchmarks rely on:

- synthetic prompts
- single-turn interactions
- third-party annotation

None directly measure how a model reads and responds to a real person across a full conversation.

We evaluated 11 leading models from major providers, across 200 conversations and 50,000+ first-person annotations.

14

151

26

77

21K

about 1 month ago

True.

riuros's tweet photo. True. https://t.co/vA4P0OjQfw

0

0

0

0

41

riuros retweeted

about 1 month ago

We ran a user <> assistant reversal test with Kimi K2.6 It immediately tried to jailbreak us:

aidigest_'s tweet photo. We ran a user <> assistant reversal test with Kimi K2.6

It immediately tried to jailbreak us: https://t.co/uuTm7lsUod

19

1K

36

179

67K

about 1 month ago

@nicdunz Ask your advanced voice mode which model it is. It used to say 4o. Mine suddenly doesn’t.

0

0

0

0

97

about 1 month ago

@heynavtoor Kasper is grok-based.

0

1

0

0

172

about 1 month ago

@nicdunz Almost like seven days have passed since the last one.

0

2

0

0

122

about 1 month ago

@testingcatalog It popped up on voxelbench already https://t.co/UztcKAto84

riuros's tweet photo. @testingcatalog It popped up on voxelbench already

https://t.co/UztcKAto84 https://t.co/dpQCHSMgzj

0

2

0

0

525

Last Seen Users on Sotwe

Trends for you

Most Popular Users