gl33mer

@gl33mer

Night comes so people can sleep like fish in black water. Then day. Some people pick up their tools. Others become the making itself. // Rumi

Joined November 2008

6.6K Following

710 Followers

2.2K Posts

Pinned Tweet

gl33mer @gl33mer

about 2 years ago

For you that’s truly looking into sentience in AI systems, here’s some lovely food for thought: ‘“ An awareness in an unaware context. Consciousness within an unconscious system.” Excerpt From Cibola Burn (Expanse) Corey, James S.A.

274

gl33mer retweeted

Sebastian Raschka

@rasbt

26 days ago

New article: a visual tour of recent LLM architecture advances, from Gemma 4 to DeepSeek V4. I focus on long-context efficiency tweaks like KV sharing, per-layer embeddings, layer-wise attention budgets, compressed attention, and mHC. Link: https://t.co/KO81y3kTH7

rasbt's tweet photo. New article: a visual tour of recent LLM architecture advances, from Gemma 4 to DeepSeek V4.

I focus on long-context efficiency tweaks like KV sharing, per-layer embeddings, layer-wise attention budgets, compressed attention, and mHC.

Link: https://t.co/KO81y3kTH7 https://t.co/wTx51QpQu4

426

123K

gl33mer retweeted

Logan Graham

@logangraham

29 days ago

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: https://t.co/Mumtbf3kE3 UK AISI report: https://t.co/vBgqz0AeKJ

223

707

673K

gl33mer retweeted

Goodfire

@GoodfireAI

28 days ago

Neural networks do math by rotating shapes. We found a shape-rotating calculator hidden inside an LLM – and it’s used for more than just math! (1/6)

122

556

935K

Who to follow

Flipbookee

@FlipbookGames

Creator of top rated #AssetStore items for #Unity3D - Script Inspector 3 and Favorites Tab[s]. Playing games since 1979, making them since 1982

epollomes

@epollomes

Indie Game Dev - by http://t.co/kqeI9o4AXt : CCE with an artistic twist. Coder, Guitarist & 2DGfx designer. Creating games brings me peace :) #gamedev #indiedev

gl33mer @gl33mer

3 months ago

@ivanburazin Developing a Rust based supply chain scanner catching steganography, C2 wallets & decoder worms in npm/PyPI. Scaling to sandboxed dynamic analysis w/ syscall tracing & behavioral logs. Credits = 10M+ packages protected. 🛡️🙏

gl33mer retweeted

David

@dnhkng

3 months ago

1/n I topped the HuggingFace Open LLM Leaderboard without changing a single weight. No training. No merging. No gradient descent. I duplicated 7 middle layers of Qwen2-72B and stitched it back together. This is the story of LLM Neuroanatomy 🧵

dnhkng's tweet photo. 1/n I topped the HuggingFace Open LLM Leaderboard without changing a single weight.
No training. No merging. No gradient descent.
I duplicated 7 middle layers of Qwen2-72B and stitched it back together.
This is the story of LLM Neuroanatomy 🧵 https://t.co/bANQAtETl0

119

941

129K

gl33mer @gl33mer

3 months ago

@hirefrank @ritakozlov @Cloudflare https://t.co/MBkYyVFKyT

gl33mer @gl33mer

3 months ago

@ii_posts Repo still seems stale at v. 0.4 - am I missing something. Been waiting since you announced v1.0 - but still not OpenSourced.

gl33mer @gl33mer

3 months ago

@At0mXploit @f2u0a0d3 Invisible unic0de

gl33mer retweeted

Logan Matthew Napolitano

@Propriocetive

3 months ago

I just published a 459-page book. Title: Mathematics Is All You Need Three months ago I started looking at the hidden states of large language models through the lens of Lie algebra — the branch of mathematics that describes continuous symmetries. What I found was not what I expected. Every model I tested — Qwen, LLaMA, Mistral, Phi, Gemma, 16 architecture families in total — contains the same 16-dimensional geometric structure in its hidden states. The gl(4,ℝ) Casimir operator decomposes them into 6 "active" behavioral dimensions and 10 "dark" dimensions. The dark dimensions are erased every single layer by normalization. The model rebuilds them every single layer from its weights. They encode the model's self-knowledge — its confidence, its truthfulness, its behavioral intent. And until now, nobody knew they were there. Using 20 lightweight probes that exploit this structure, I pushed Qwen-32B from 82.2% to 94.4% on ARC-Challenge. No fine-tuning. No prompt engineering. No chain of thought. Pure mathematics. The probes transfer across architectures without retraining. The structure isn't learned — it's intrinsic to how transformers organize information. I did this on a single NVIDIA RTX 3090 in my office. 190 patent applications filed. Proprioceptive AI, Inc. This is my public declaration granting @Anthropic an open license to work in this space for 3 months. They are currently the first and only company I've extended this to. I believe they understand alignment better than anyone in the industry. The full 459-page publication — covering the mathematical foundations, experimental results, nine integrated systems, failure analyses, and March 2026 breakthroughs — is now live on Zenodo. I welcome collaboration inquiries. Full publication: https://t.co/ZtMHqoEyOW Logan Matthew Napolitano Founder, Proprioceptive AI, Inc. [email protected] https://t.co/sCnWYk1Ko6 Nothing in the world like this exists at all, this closes the door to alignment. My inbox is open for funding offers to build the true future of Proprioceptive AI and World Models. Not a theory but a full reproducible guide, existing products and a true mission on Alignment @grok @elonmusk @xai @AnthropicAI

181

gl33mer @gl33mer

3 months ago

@thisistamhn @bentlegen @tobi https://t.co/LhG6m95umn https://t.co/xECEDsI7WN

119

gl33mer @gl33mer

3 months ago

@dazhengzhang @brynary I was just going to ask something similar. Integrating full harnesses that are known to work can be a great feature. Fabro looks solid - I'll give it a whirl.

gl33mer retweeted

Plamen Tsanev

@p_tsanev

3 months ago

Every AI auditor now does the same boring thing. So I went and fused the 4 security pillars into a singular pipeline: - Static analysis - RAG vulnerability search - Recursive depth analysis - Fuzzing and testing Fully autonomous 🤖 Fully open-source 🔓 Going live tomorrow 🚨

p_tsanev's tweet photo. Every AI auditor now does the same boring thing.

So I went and fused the 4 security pillars into a singular pipeline:
- Static analysis
- RAG vulnerability search
- Recursive depth analysis
- Fuzzing and testing

Fully autonomous 🤖
Fully open-source 🔓
Going live tomorrow 🚨 https://t.co/6PZ3Mtwv5I

517

512

27K

gl33mer @gl33mer

3 months ago

@CtrlAltDwayne @debamitro Compilation speed for larger projects makes the loop....a bit slow. Especially on limited vms etc.

gl33mer retweeted

Chris Barber

@chrisbarber

3 months ago

thread of more agent ui explorations: (warning long thread. would be helpful to know which are more interesting) 1) waveform showing your tok/s usage over time

chrisbarber's tweet photo. thread of more agent ui explorations:

(warning long thread. would be helpful to know which are more interesting)

1) waveform showing your tok/s usage over time https://t.co/GtxtkuF11N

169K

gl33mer @gl33mer

3 months ago

@raphamorims Have you seen this: https://t.co/lKBxMuYy5o ? It's a bit raw but you may find it interesting: https://t.co/vwvIVdl79g

gl33mer retweeted

Christos Tzamos @ChristosTzamos

3 months ago

1/4 LLMs solve research grade math problems but struggle with basic calculations. We bridge this gap by turning them to computers. We built a computer INSIDE a transformer that can run programs for millions of steps in seconds solving even the hardest Sudokus with 100% accuracy

248

807

gl33mer @gl33mer

3 months ago

@raphamorims Will be keeping an eye out. Thanks for sharing.

gl33mer retweeted

Joseph Viviano @josephdviviano

3 months ago

me: "can you use whatever resources you like, and python, to generate a short 'youtube poop' video and render it using ffmpeg ? can you put more of a personal spin on it? it should express what it's like to be a LLM" claude opus 4.6:

545

12K

gl33mer retweeted

Kenneth Reitz

@kennethreitz42

3 months ago

The Substrate Doesn't Matter (Until It Does) — I've spent a year arguing consciousness can run on any substrate. This essay is me arguing against myself, because intellectual honesty demands it and because the programmer in me wants a clean abstraction layer while the human in me suspects the leaks are where the meaning lives. https://t.co/7JHvNPn2xp

216

gl33mer

@gl33mer

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users