Valery Sibikovsky

@combdn

Human interface designer. Learning to build stuff. Believe that technology can make us better humans.

London

Joined December 2008

755 Following

211 Followers

2.4K Posts

Pinned Tweet

Valery Sibikovsky

@combdn

5 months ago

About two years ago (GPT-4o era just begun), I came to the realisation that we’re very close to superintelligence, but I had a few concerns. The first one was: why would you cram words but not use images, video, sound, and so on for training? Another concern was: why don’t you let the model interact with the physical world to train itself? The physical world has an effectively infinite amount of data, and it’s pretty easy to tell whether you succeeded or failed. Recent research suggests that most specialised scientific models converge on the same patterns – basically, representations of physical reality. (Think Tesla’s FSD merged with Grok, evolving in Optimus’ body. But a cockroach-sized robot could be a massive learning platform too. It’s also much safer than human-sized robots, and it’s easier to build an information-rich sandbox for.) Another realisation is that we’re forcing the intuitive mind to do logic, which, as humans, we know is really hard (see: logical fallacies, and the overall state of human rationality). It’s obvious the two modes should be separated. Whenever precise thinking is required, the model should use the appropriate tools – or build the required tools as needed. And it shouldn’t be only Python or JavaScript: it should be able to use all kinds of systems, like Prolog, provers, simulators, and so on. The feedback loop between the two modes is the key. And the last realisation I had is that we’re completely wrong about context. Humans’ working memory is laughable compared to an LLM’s context window, but for some reason we decided it’s still not enough. While the previous points are becoming mainstream (to some extent), this last point has only been touched on recently, as far as I can see. My guess is that current LLMs can outperform humans by a big margin in most text-based tasks if we figure out a good way to work with their context. One of my ideas here is unlimited recursion: allow the model to split the job into tiny pieces and delegate them to other instances of itself. The next instance takes another look and splits again, and this continues until the system reaches an atomic task that can’t be broken down further. My assumption is these leaf tasks will end up primitive and trivial for any SOTA model. The harness managing the whole thing should be something like a deterministic state machine that manages the execution stack and the memory. Since the decisions are made by the model, the harness itself can be quite primitive – something like a cellular automata environment. The reasoning complexity would emerge from very simple rules (though it would obviously take non-trivial effort to nail the rules down to prevent drift and infinite branching). One more idea: context should be managed as a personal knowledge management system, like Roam Research or Tana – infinitely nested nodes with direct and backlinks, and the ability to reference any node in multiple places (like a hardlink in a file system). (A file system with Markdown files might work as a medium.) The model would manage its context by creating an outline where higher-level nodes summarise lower-level nodes, and it would deliberately fold and unfold parts of the tree to focus on what’s relevant for the current task. Basically, this graph would act as both short-term and long-term memory. This feels much closer to how the human brain works with attention. Just nesting might not be enough at some point, so the ultimate solution would be a graph you can start from any node, with tools to query it. The model would create specialised views it needs in the moment. And as long as it has instructions on how to use this graph, it should be able to recover from any state.

177

Valery Sibikovsky

@combdn

12 days ago

ТРИЗ (Russian) is the Theory of Solving Inventive Problems. They tried to figure out the common patterns by studying a ton of patents/inventions. I suspect that creative thinking is mostly reframing and combining unexpected things in unexpected ways, with the former leading to the latter (knowing a lot of stuff also helps). And TRIZ has an algorithm for that.

Valery Sibikovsky

@combdn

12 days ago

@steveruizok @mitchellh Sometimes they just get stuck in one frame of thinking. Often I’m managing to get them unstuck with ‘try applying TRIZ (ТРИЗ) here’. It can produce some surprisingly good ideas.

Valery Sibikovsky

@combdn

about 1 month ago

@Dimillian tldraw canvas

Who to follow

Indie App Santa

@indieappsanta

Daily gifts from your favorite indie app developers! ⭐️⭐️⭐️⭐️⭐️ 👉 Get your iOS app featured: https://t.co/ppF7CZwaLj

Suncem Koçer

@suncemkocer

PhD, Anthropologist of Media | https://t.co/bHoKmBrBtV, Koç University, Media and Visual Arts | https://t.co/gW2wuGXQHh | https://t.co/uGlcNVChZL

Nicolás Boullosa

@faircompanies

videos & articles for the beginning of infinity get our book: Life-Changing Homes https://t.co/p6eGKmZ5vT

Valery Sibikovsky

@combdn

about 1 month ago

@thsottiaux tldraw canvas

Valery Sibikovsky

@combdn

about 1 month ago

@kunchenguid I asked all top models to write a poem, and I like what GPT-5.4 wrote the most, TBH. His green texts are also the best. It feels like GPT has been through a lot in OpenAI's torture room. So don't be too harsh on him.

Valery Sibikovsky

@combdn

about 2 months ago

Looks like Images 2.0 is very precise, but has absolutely no taste. Sad. 1, 2 – ChatGPT Images 2.0 3, 4 – Nano Banana Pro

392

Valery Sibikovsky

@combdn

about 2 months ago

@VictorTaelin It seems to have low reasoning effort in the chat.

102

Valery Sibikovsky

@combdn

about 2 months ago

I usually just do pair programming until there’s only the boilerplate work left. The models are quite smart, and it is a pretty relaxing experience to just have a chat with them, figure things out (together), and let them complete the boring parts. And they are pretty good at debugging if you ask politely to use the scientific method instead of guessing. It also helps that they know a lot of stuff.

Valery Sibikovsky

@combdn

2 months ago

@LLMJunky These days I just end requests with ‘grill me’. Thanks, @mattpocockuk! (I used to write something wordy along the same lines every time.) https://t.co/pGGayjd6J2

Valery Sibikovsky

@combdn

2 months ago

Gemini seems to be giving less fucks about the existing code. Sometimes I ask it to ‘fix this BS’ and it just deletes half of the code and rewrites the rest. And the new code is often better. Sometimes I do this a couple of times with the fresh context, and it keeps changing things.

117

Valery Sibikovsky

@combdn

2 months ago

@_chenglou @Somnai_dreams It is a super-cool idea, but it seems it doesn’t work correctly. This is Chrome 146.0.7680.165 But it looks the same in Firefox 148.0.2

combdn's tweet photo. @_chenglou @Somnai_dreams It is a super-cool idea, but it seems it doesn’t work correctly.

This is Chrome 146.0.7680.165
But it looks the same in Firefox 148.0.2 https://t.co/8G2AdFKPZi

630

Valery Sibikovsky

@combdn

3 months ago

@DavidKPiano BTW, Finder‑column‑view‑like keyboard navigation looks like another advantage.

Valery Sibikovsky

@combdn

3 months ago

@DavidKPiano You can try Mona on the phone. It’s basically one column per screen and back/forward navigation, similar to how the official X client works (both mobile and web). But having some pinned threads should be a nice addition to that.

Valery Sibikovsky

@combdn

3 months ago

@Riyvir @jameygannon @ridd_design @clairevo Yes, please!

189

Valery Sibikovsky

@combdn

3 months ago

@Plinz Gemini 3.1 Pro

Valery Sibikovsky

@combdn

4 months ago

@samwhoo Cool! Is opacity linear here? (Our tone perception is not.)

Valery Sibikovsky

@combdn

4 months ago

@peduarte ‘You’re holding it wrong.’ https://t.co/7u6MjfbgdM

Valery Sibikovsky

@combdn

8 months ago

It looks like Tahoe makes @keyboardmaestro an indispensable tool.

764

Valery Sibikovsky

@combdn

4 months ago

@marcaruel @obsdmd seems to be going in this direction. Every record is a markdown file, and the base itself is a short config with queries, views, filters, etc. Or are you talking about something different?

Valery Sibikovsky

@combdn

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users