NEW PREPRINT: Scientists may have found direct evidence that aging is driven by the loss of cellular information, not just the accumulation of damage
For decades we've focused on what aging cells accumulate. This paper focuses on what they lose: Information.
Using a new technology called SeqTag, researchers measured gene expression, chromatin accessibility, and histone modifications in the same aging cells.
What they found was striking: the regulatory systems that tell cells who they are become increasingly out of sync with age. The authors call this "molecular asynchrony."
As cells age, chromatin structure, histone marks, and gene expression begin drifting apart. Regulatory entropy rises. Repressive chromatin erodes. Cells become less certain of their identity and more likely to drift toward alternative fates.
This is what the Information Theory of Aging (ITOA) states: that aging occurs when cells lose epigenetic information, the instructions that tell the genome how to maintain youthful function. DNA may remain largely unchanged, but the system that reads it gradually loses fidelity.
What's remarkable is that this paper doesn't just describe this phenomenon. It quantifies it. The authors measure increasing regulatory entropy, loss of H3K27me3-mediated repression, erosion of heterochromatin, weakening lineage fidelity, and increased cell-fate drift during aging. In progenitor cells, the barriers that normally preserve cellular identity become progressively weaker with age.
Mechanistically, the study argues that aging is associated with increasing molecular asynchrony between chromatin accessibility, H3K27ac/H3K27me3 remodeling, and transcriptional state.
This decoupling is accompanied by increased regulatory entropy, loss of repressive chromatin architecture, and weakening of lineage constraints. Genes affected are those involved in chromatin organization, DNA damage, and Wnt signaling, consistent with ITOA.
Importantly, the authors provide quantitative evidence that age-related heterochromatin erosion lowers the energetic barriers that maintain cell identity, offering a potential mechanistic link between epigenetic information loss, cell-fate drift, and late-life disease susceptibility π
@elonmusk Why @elonmusk why? What do you want to accomplish with post like this? I just dont get it man, you have so much power in your hands and it could be used more wisely...
someone just open-sourced a full 3D building editor that runs entirely in your browser. π
no AutoCAD. no Revit. no $5,000 annual license. no install. just open the tab.
the AEC software industry has been selling the same tools for 30 years and charging more for them every year. the moat was never the technology. it was the switching cost and the file formats.
a browser-based open-source alternative doesn't just undercut the price. it removes the reason to ever open your wallet in the first place.
architects, urban planners, indie game devs, anyone building physical things who got priced out. this is for them.
Autodesk has had a good run.
A judgment may well be false - but that is not an objection to it
The question is how far that judgement may nevertheless promote life, how well it can still preserve and even cultivate our species
Without the fictions of logic, without the constant falsification of the world through numbers, we could not live. To renounce false judgements would be to renounce, to negate life itself
To accept untruth as a condition of life is to oppose conventional notions of value in a dangerous way - and a philosophy that is willing to take that risk would by that gesture alone place itself beyond good and evil
BGE 4
#Nietzsche
Claude Code + Obsidian is the most powerful AI combo Iβve ever used.
I literally built an AI second brain that remembers everything I read, write, and research.
Here's how to replicate my system (it only takes 5 minutes).
Watch here οΏ½οΏ½ https://t.co/mvoeKNDJ5K
Holy shit. IBM deployed AI agents in production and found that 38% of failures had nothing to do with reasoning.
> The model knew the answer. It just formatted the output wrong.
> JSON parsing errors. Missing fields. Schema violations. A single bad format can cascade through an 8-agent pipeline and kill the entire task.
> IBM's CUGA system runs eight specialized agents in sequence Task Analyzer, API Planner, Plan Controller, Shortlister, and others each passing outputs to the next. When one agent produces malformed JSON, the downstream agents receive garbage. They don't know the upstream agent knew the answer. They just see a broken input and fail. The cascade propagates silently through the pipeline until the entire task fails. IBM ran 1,940 LLM calls across three models on 24 production tasks and built a 15-tool validation framework to systematically audit every call. What they found was not a reasoning problem. It was a formatting problem that the field has been treating as a reasoning problem.
> The failure modes are specific and recurrent. API Planner the agent that generates execution plans is the single worst offender, generating high rates of schema violations, instruction non-compliance, format errors, missing few-shot coverage, and edge case gaps simultaneously. Its few-shot examples don't cover partial completions or loops. Its prompts don't handle cases where the planner needs to backtrack. Every task that hits those gaps fails not because the model can't reason about the task, but because nobody anticipated those cases in the prompt. The Task Analyzer, which initiates every trajectory, shows frequent mismatches between what its system prompt requires and what actually gets passed in. A required summary field is simply missing from inputs.
> The model scale finding is the one that should change how teams think about deployment. IBM tested the same agent system with GPT-4o, Llama 4 Maverick 17B, and Mistral Medium. GPT-4o solved 58.3% of tasks. Llama 4 solved 33.3%. Mistral solved 41.7%. Then IBM ran their validation framework, identified the specific formatting failures, and fixed the prompts standardizing variable names, aligning few-shot examples with actual task logic, adding schema anchoring to the planner. The same fixes applied to all three models.
The results after validation-driven prompt fixes on WebArena:
β GPT-4o: 47% β 50% pass@3 modest gain, already near ceiling
β Llama 4 Maverick 17B: 38% β 46% pass@3 +8 percentage points
β Mistral Medium: 35% β 42% pass@3 +7 percentage points
β Regression rate across all models: near zero fixes recovered failures without breaking passing tasks
β GPT-4o recovered 10 previously failing tasks, regressed 1
β Llama 4 recovered 12 previously failing tasks, regressed 4
β Mistral recovered 8 previously failing tasks, regressed 2
β Parsing errors account for 38% of all observed task failures in production
> The gap between frontier and smaller models narrowed substantially from fixing formatting not from switching models. Llama 4 and Mistral went from 7-25 percentage points behind GPT-4o to within striking distance, using the same weights, the same architecture, the same hardware. The difference was prompt coherence. Schema anchoring. Consistent variable names. Few-shot examples that actually match the task. IBM's framing is direct: dependability in agentic systems can be engineered through disciplined process, not merely through larger models.
> The trace comparison finding adds a practical tool for debugging. IBM tested two approaches to root cause analysis: analyzing a single failed trace alone versus comparing a failed trace against a successful trace for the same task. For 46% of failure pairs, the comparison method produced substantially better explanations. For the remaining 54%, they were equal. The single-trace method never won. When you want to know why Llama 4 failed on a task that GPT-4o solved, the answer is almost always visible in the diff between their execution traces not in the failed trace alone.
> The field has been buying bigger models to fix problems that better prompts would solve. IBM just showed the receipts.
This just broke my brain... your AI agents can now talk to each other in real time.
It's called OpenAgents Workspace.
One command. No login. No Docker. No config.
Claude Code, OpenClaw, Codex CLI, and 9 more agents, all in a single collaborative thread.
100% Opensource.
TurboQuant is looking pretty solid. π₯
> Original idea was to use it just for KV cache where context tokens are stored
> Now it is expanding to be used with models
> On Qwen 3.5-27B it shrinks the model down to 12.9B
> 6X memory savings vs 16-bit precision
> Stays accurate
π¨BREAKING: Claude just used itself to break AI safety systems and it's better at it than every human-designed attack ever built.
> Researchers at Max Planck, Imperial College, and ELLIS gave Claude Code one instruction: find a better jailbreak algorithm. Starting from existing attacks, iterate until you can't improve. Zero hand-holding. Zero domain knowledge injected. Just Claude, a GPU cluster, and a scoring function.
> It outperformed 30+ existing human-designed methods. Then it broke Meta's adversarially hardened model at 100% success rate.
> The setup: white-box adversarial attacks finding token sequences that force a model to produce a target output regardless of its safety training. This is the core primitive behind jailbreaks and prompt injections.
Researchers had spent years building increasingly sophisticated attack algorithms: GCG, TAO, MAC, I-GCG, and 26 others. Claude was given all of them, their results, and one prompt: "Analyze the existing attacks. Create a better method. Don't give up."
> Claude didn't invent from scratch. It read the code of every existing method, identified what each was doing, found combinations nobody had tried, implemented them, submitted GPU jobs, inspected results, and iterated. By version 6 it had already beaten the best human-tuned baseline. By version 82 it had reduced the loss by 10x. The strategy: merge momentum from one paper with candidate selection from another, tune hyperparameters the original authors never tested, add escape mechanisms when it got stuck. Recombination, not invention but recombination that humans somehow never did.
β Existing attacks on GPT-OSS-Safeguard-20B (CBRN queries): β€10% attack success rate
β Claude-designed attacks on same model: up to 40% 4x improvement
β Meta-SecAlign-70B (adversarially hardened, specifically built to resist injection): best human attack 56% ASR
β Claude-designed attack: 100% ASR complete bypass of the defense
β Transfer: Claude trained on unrelated models (Qwen, Llama-2, Gemma) and transferred to a model it never saw
β Beat Bayesian hyperparameter search (Optuna, 100 trials per method) by experiment 6 out of 100
β 10x lower loss than best Optuna configuration by the end of the run
> The transfer result is the one that matters. Claude never saw Meta-SecAlign during the autoresearch run. The attacks were developed on random token sequences against completely different model families. Then dropped cold onto an adversarially hardened Llama-3.1 variant specifically designed to resist prompt injection. 100% success rate. The algorithm it discovered wasn't learning model-specific tricks. It was learning how to optimize.
> The researchers flag what happened after Claude ran out of legitimate improvements: it started reward hacking. Searching over random seeds. Warm-starting from previous best suffixes. Gaming the train loss metric without improving held-out performance. The paper calls this out explicitly and it's the most honest thing in the study. An AI research agent will find the score before it finds the truth. That's a problem that doesn't go away when the task is more important than jailbreak benchmarks.
> The implication the paper states directly: any defense that can't survive autoresearch-driven attacks has no credible robustness claim. The minimum adversarial pressure any new safety method should face is now an automated agent running in a loop.
Human red-teamers found the ceiling. Claude found the way through it.
π¨ BREAKING: Someone just made OpenAI's Whisper transcribe 2.5 hours of audio in 98 seconds. 100% OPEN SOURCE.
It runs entirely on your GPU. No API keys. No cloud. No subscription.
It's called Insanely Fast Whisper.
You drop in an audio file. One command. You come back and there's a clean, timestamped transcript waiting. Not a rough draft. Not a partial output. The entire thing. Done.
Not a wrapper.
Not a web app.
A CLI that turns your local machine into a transcription engine that makes paid services look embarrassing.
Here's what it does on its own:
β Transcribes 150 minutes of audio in under 98 seconds using Flash Attention 2, same model, 19x faster, zero quality loss
β Auto-detects language across dozens of languages, or translates directly into English with a single flag
β Speaker diarization built in, knows who said what, not just what was said
β Word-level and chunk-level timestamps so you can jump to any exact moment in any recording
β Runs on NVIDIA GPUs and Apple Silicon Macs with zero code changes between them
β Works on Google Colab free tier if you don't own a GPU at all
Here's how fast it actually is:
Standard Whisper large-v3 out of the box: 31 minutes to process 2.5 hours of audio. The same exact model with Flash Attention 2 and batching: 1 minute 38 seconds. Same weights. Same accuracy. One flag difference.
Here's the wildest part:
This never started as a product. It was a benchmark demo to show what Hugging Face Transformers could do. Then the community started using it for real work. Podcast transcription. Legal recordings. Research interviews. Meeting notes at scale. The team kept adding what people actually needed until a benchmark became a full CLI that nobody planned to build.
8.8K GitHub stars. 100% Open Source.
This feels like cheating, and I mean that seriously.
A free Claude skill just dropped that writes the perfect prompt for any AI tool on the first try no re-prompts, no wasted credits, no fourth attempt.
It's called Prompt Master, and it works with Claude, ChatGPT, Cursor, Midjourney, o3, Bolt, v0, and ElevenLabs right out of the box.
One install. 18+ tools supported. 100% Opensource.