𝙈𝙄𝙇𝙀𝙎 ♫

@Crypto_Miles

Architect&Designer, Musician&Music lover, PhD candidate/ Room acoustics, Crypto enthusiast, Investor& Trader, Interested in Arts, Technology & Philosophy

OUTer spACE

Joined April 2017

1.1K Following

187 Followers

2.3K Posts

𝙈𝙄𝙇𝙀𝙎 ♫ @Crypto_Miles

18 days ago

@Zephyr_hg Sales

Crypto_Miles retweeted

NetworkChuck

@NetworkChuck

18 days ago

Use frontier models to build the skills and logic. ——-> Then hand it off to local models to be the hands and feet.

120

Crypto_Miles retweeted

David Sinclair

@davidasinclair

21 days ago

NEW PREPRINT: Scientists may have found direct evidence that aging is driven by the loss of cellular information, not just the accumulation of damage For decades we've focused on what aging cells accumulate. This paper focuses on what they lose: Information. Using a new technology called SeqTag, researchers measured gene expression, chromatin accessibility, and histone modifications in the same aging cells. What they found was striking: the regulatory systems that tell cells who they are become increasingly out of sync with age. The authors call this "molecular asynchrony." As cells age, chromatin structure, histone marks, and gene expression begin drifting apart. Regulatory entropy rises. Repressive chromatin erodes. Cells become less certain of their identity and more likely to drift toward alternative fates. This is what the Information Theory of Aging (ITOA) states: that aging occurs when cells lose epigenetic information, the instructions that tell the genome how to maintain youthful function. DNA may remain largely unchanged, but the system that reads it gradually loses fidelity. What's remarkable is that this paper doesn't just describe this phenomenon. It quantifies it. The authors measure increasing regulatory entropy, loss of H3K27me3-mediated repression, erosion of heterochromatin, weakening lineage fidelity, and increased cell-fate drift during aging. In progenitor cells, the barriers that normally preserve cellular identity become progressively weaker with age. Mechanistically, the study argues that aging is associated with increasing molecular asynchrony between chromatin accessibility, H3K27ac/H3K27me3 remodeling, and transcriptional state. This decoupling is accompanied by increased regulatory entropy, loss of repressive chromatin architecture, and weakening of lineage constraints. Genes affected are those involved in chromatin organization, DNA damage, and Wnt signaling, consistent with ITOA. Importantly, the authors provide quantitative evidence that age-related heterochromatin erosion lowers the energetic barriers that maintain cell identity, offering a potential mechanistic link between epigenetic information loss, cell-fate drift, and late-life disease susceptibility 👏

davidasinclair's tweet photo. NEW PREPRINT: Scientists may have found direct evidence that aging is driven by the loss of cellular information, not just the accumulation of damage

For decades we've focused on what aging cells accumulate. This paper focuses on what they lose: Information.

Using a new technology called SeqTag, researchers measured gene expression, chromatin accessibility, and histone modifications in the same aging cells.

What they found was striking: the regulatory systems that tell cells who they are become increasingly out of sync with age. The authors call this "molecular asynchrony."

As cells age, chromatin structure, histone marks, and gene expression begin drifting apart. Regulatory entropy rises. Repressive chromatin erodes. Cells become less certain of their identity and more likely to drift toward alternative fates.

This is what the Information Theory of Aging (ITOA) states: that aging occurs when cells lose epigenetic information, the instructions that tell the genome how to maintain youthful function. DNA may remain largely unchanged, but the system that reads it gradually loses fidelity.

What's remarkable is that this paper doesn't just describe this phenomenon. It quantifies it. The authors measure increasing regulatory entropy, loss of H3K27me3-mediated repression, erosion of heterochromatin, weakening lineage fidelity, and increased cell-fate drift during aging. In progenitor cells, the barriers that normally preserve cellular identity become progressively weaker with age.

Mechanistically, the study argues that aging is associated with increasing molecular asynchrony between chromatin accessibility, H3K27ac/H3K27me3 remodeling, and transcriptional state.

This decoupling is accompanied by increased regulatory entropy, loss of repressive chromatin architecture, and weakening of lineage constraints. Genes affected are those involved in chromatin organization, DNA damage, and Wnt signaling, consistent with ITOA.

Importantly, the authors provide quantitative evidence that age-related heterochromatin erosion lowers the energetic barriers that maintain cell identity, offering a potential mechanistic link between epigenetic information loss, cell-fate drift, and late-life disease susceptibility 👏

278

639

85K

𝙈𝙄𝙇𝙀𝙎 ♫ @Crypto_Miles

about 2 months ago

@elonmusk Why @elonmusk why? What do you want to accomplish with post like this? I just dont get it man, you have so much power in your hands and it could be used more wisely...

Who to follow

Value is in the eye of the bagholder. Technical Engineer by day, roundtripping portfolios by night. All day.

Crypto_Miles retweeted

Trading Composure

@TradingComposur

2 months ago

Impatience is expensive. Because it forces action before the edge is there. And then you call it bad luck. No... you were just impatient.

210

Crypto_Miles retweeted

Zephyr

@Zephyr_hg

2 months ago

Stop pitching AI implementations. Start selling outcomes. The 3-sentence pitch that closes 60% of my calls.

260

601

38K

Crypto_Miles retweeted

Frank

@jedisct1

2 months ago

Breaking: The results of Mythos’s audit of the Rust standard library have been leaked https://t.co/lst0VesfrW

501

403

161K

Crypto_Miles retweeted

Robert Youssef

@rryssf

3 months ago

someone just open-sourced a full 3D building editor that runs entirely in your browser. 💀 no AutoCAD. no Revit. no $5,000 annual license. no install. just open the tab. the AEC software industry has been selling the same tools for 30 years and charging more for them every year. the moat was never the technology. it was the switching cost and the file formats. a browser-based open-source alternative doesn't just undercut the price. it removes the reason to ever open your wallet in the first place. architects, urban planners, indie game devs, anyone building physical things who got priced out. this is for them. Autodesk has had a good run.

138

125K

Crypto_Miles retweeted

Romano

@RNR_0

2 months ago

Kimi k2.6 & GLM 5.1 🔥🔥🔥 GPT 5.4 & Opus 4.7 💩 💩💩 Old OG Viacoin was based on Bitcoin core 0.16 (2019). I Forked Bitcoin Core v30.2 while GLM reimplemented every Viacoin feature 6 years of C++ modernization, BTC architectural changes & debugging GPT 5.4 & opus 4.7 failed Kimi k2.6 even backported Bitcoin v31 issued by checking the right branch It wasn't fully agentic, as I had to think but I didn't type a single type single line of code Back in 2016 - 2019 this would cost me 1-3 months AT LEAST with the need of a friend for fixing race conditions

RNR_0's tweet photo. Kimi k2.6 & GLM 5.1 🔥🔥🔥
GPT 5.4 & Opus 4.7 💩 💩💩

Old OG Viacoin was based on Bitcoin core 0.16 (2019). I Forked Bitcoin Core v30.2 while GLM reimplemented every Viacoin feature

6 years of C++ modernization, BTC architectural changes & debugging

GPT 5.4 & opus 4.7 failed

Kimi k2.6 even backported Bitcoin v31 issued by checking the right branch

It wasn't fully agentic, as I had to think but I didn't type a single type single line of code

Back in 2016 - 2019 this would cost me 1-3 months AT LEAST with the need of a friend for fixing race conditions

28K

Crypto_Miles retweeted

Friedrich Nietzsche

@NIETZSCHESOURCE

3 months ago

A judgment may well be false - but that is not an objection to it The question is how far that judgement may nevertheless promote life, how well it can still preserve and even cultivate our species Without the fictions of logic, without the constant falsification of the world through numbers, we could not live. To renounce false judgements would be to renounce, to negate life itself To accept untruth as a condition of life is to oppose conventional notions of value in a dangerous way - and a philosophy that is willing to take that risk would by that gesture alone place itself beyond good and evil BGE 4 #Nietzsche

NIETZSCHESOURCE's tweet photo. A judgment may well be false - but that is not an objection to it

The question is how far that judgement may nevertheless promote life, how well it can still preserve and even cultivate our species

Without the fictions of logic, without the constant falsification of the world through numbers, we could not live. To renounce false judgements would be to renounce, to negate life itself

To accept untruth as a condition of life is to oppose conventional notions of value in a dangerous way - and a philosophy that is willing to take that risk would by that gesture alone place itself beyond good and evil

BGE 4

#Nietzsche

412

Crypto_Miles retweeted

Miles Deutscher

@milesdeutscher

3 months ago

Claude Code + Obsidian is the most powerful AI combo I’ve ever used. I literally built an AI second brain that remembers everything I read, write, and research. Here's how to replicate my system (it only takes 5 minutes). Watch here �� https://t.co/mvoeKNDJ5K

milesdeutscher's tweet photo. Claude Code + Obsidian is the most powerful AI combo I’ve ever used.

I literally built an AI second brain that remembers everything I read, write, and research.

Here's how to replicate my system (it only takes 5 minutes).

Watch here �� https://t.co/mvoeKNDJ5K https://t.co/EOhbz4vXoo

196

289

20K

𝙈𝙄𝙇𝙀𝙎 ♫ @Crypto_Miles

3 months ago

@chamath Obsidian

Crypto_Miles retweeted

NetworkChuck

@NetworkChuck

3 months ago

Open source models are the way. I'm so excited about local AI.

183

203

261

244K

Crypto_Miles retweeted

Robert Youssef

@rryssf

3 months ago

Holy shit. IBM deployed AI agents in production and found that 38% of failures had nothing to do with reasoning. > The model knew the answer. It just formatted the output wrong. > JSON parsing errors. Missing fields. Schema violations. A single bad format can cascade through an 8-agent pipeline and kill the entire task. > IBM's CUGA system runs eight specialized agents in sequence Task Analyzer, API Planner, Plan Controller, Shortlister, and others each passing outputs to the next. When one agent produces malformed JSON, the downstream agents receive garbage. They don't know the upstream agent knew the answer. They just see a broken input and fail. The cascade propagates silently through the pipeline until the entire task fails. IBM ran 1,940 LLM calls across three models on 24 production tasks and built a 15-tool validation framework to systematically audit every call. What they found was not a reasoning problem. It was a formatting problem that the field has been treating as a reasoning problem. > The failure modes are specific and recurrent. API Planner the agent that generates execution plans is the single worst offender, generating high rates of schema violations, instruction non-compliance, format errors, missing few-shot coverage, and edge case gaps simultaneously. Its few-shot examples don't cover partial completions or loops. Its prompts don't handle cases where the planner needs to backtrack. Every task that hits those gaps fails not because the model can't reason about the task, but because nobody anticipated those cases in the prompt. The Task Analyzer, which initiates every trajectory, shows frequent mismatches between what its system prompt requires and what actually gets passed in. A required summary field is simply missing from inputs. > The model scale finding is the one that should change how teams think about deployment. IBM tested the same agent system with GPT-4o, Llama 4 Maverick 17B, and Mistral Medium. GPT-4o solved 58.3% of tasks. Llama 4 solved 33.3%. Mistral solved 41.7%. Then IBM ran their validation framework, identified the specific formatting failures, and fixed the prompts standardizing variable names, aligning few-shot examples with actual task logic, adding schema anchoring to the planner. The same fixes applied to all three models. The results after validation-driven prompt fixes on WebArena: → GPT-4o: 47% → 50% pass@3 modest gain, already near ceiling → Llama 4 Maverick 17B: 38% → 46% pass@3 +8 percentage points → Mistral Medium: 35% → 42% pass@3 +7 percentage points → Regression rate across all models: near zero fixes recovered failures without breaking passing tasks → GPT-4o recovered 10 previously failing tasks, regressed 1 → Llama 4 recovered 12 previously failing tasks, regressed 4 → Mistral recovered 8 previously failing tasks, regressed 2 → Parsing errors account for 38% of all observed task failures in production > The gap between frontier and smaller models narrowed substantially from fixing formatting not from switching models. Llama 4 and Mistral went from 7-25 percentage points behind GPT-4o to within striking distance, using the same weights, the same architecture, the same hardware. The difference was prompt coherence. Schema anchoring. Consistent variable names. Few-shot examples that actually match the task. IBM's framing is direct: dependability in agentic systems can be engineered through disciplined process, not merely through larger models. > The trace comparison finding adds a practical tool for debugging. IBM tested two approaches to root cause analysis: analyzing a single failed trace alone versus comparing a failed trace against a successful trace for the same task. For 46% of failure pairs, the comparison method produced substantially better explanations. For the remaining 54%, they were equal. The single-trace method never won. When you want to know why Llama 4 failed on a task that GPT-4o solved, the answer is almost always visible in the diff between their execution traces not in the failed trace alone. > The field has been buying bigger models to fix problems that better prompts would solve. IBM just showed the receipts.

rryssf's tweet photo. Holy shit. IBM deployed AI agents in production and found that 38% of failures had nothing to do with reasoning.

> The model knew the answer. It just formatted the output wrong.

> JSON parsing errors. Missing fields. Schema violations. A single bad format can cascade through an 8-agent pipeline and kill the entire task.

> IBM's CUGA system runs eight specialized agents in sequence Task Analyzer, API Planner, Plan Controller, Shortlister, and others each passing outputs to the next. When one agent produces malformed JSON, the downstream agents receive garbage. They don't know the upstream agent knew the answer. They just see a broken input and fail. The cascade propagates silently through the pipeline until the entire task fails. IBM ran 1,940 LLM calls across three models on 24 production tasks and built a 15-tool validation framework to systematically audit every call. What they found was not a reasoning problem. It was a formatting problem that the field has been treating as a reasoning problem.

> The failure modes are specific and recurrent. API Planner the agent that generates execution plans is the single worst offender, generating high rates of schema violations, instruction non-compliance, format errors, missing few-shot coverage, and edge case gaps simultaneously. Its few-shot examples don't cover partial completions or loops. Its prompts don't handle cases where the planner needs to backtrack. Every task that hits those gaps fails not because the model can't reason about the task, but because nobody anticipated those cases in the prompt. The Task Analyzer, which initiates every trajectory, shows frequent mismatches between what its system prompt requires and what actually gets passed in. A required summary field is simply missing from inputs.

> The model scale finding is the one that should change how teams think about deployment. IBM tested the same agent system with GPT-4o, Llama 4 Maverick 17B, and Mistral Medium. GPT-4o solved 58.3% of tasks. Llama 4 solved 33.3%. Mistral solved 41.7%. Then IBM ran their validation framework, identified the specific formatting failures, and fixed the prompts standardizing variable names, aligning few-shot examples with actual task logic, adding schema anchoring to the planner. The same fixes applied to all three models.

The results after validation-driven prompt fixes on WebArena:
→ GPT-4o: 47% → 50% pass@3 modest gain, already near ceiling
→ Llama 4 Maverick 17B: 38% → 46% pass@3 +8 percentage points
→ Mistral Medium: 35% → 42% pass@3 +7 percentage points
→ Regression rate across all models: near zero fixes recovered failures without breaking passing tasks
→ GPT-4o recovered 10 previously failing tasks, regressed 1
→ Llama 4 recovered 12 previously failing tasks, regressed 4
→ Mistral recovered 8 previously failing tasks, regressed 2
→ Parsing errors account for 38% of all observed task failures in production

> The gap between frontier and smaller models narrowed substantially from fixing formatting not from switching models. Llama 4 and Mistral went from 7-25 percentage points behind GPT-4o to within striking distance, using the same weights, the same architecture, the same hardware. The difference was prompt coherence. Schema anchoring. Consistent variable names. Few-shot examples that actually match the task. IBM's framing is direct: dependability in agentic systems can be engineered through disciplined process, not merely through larger models.

> The trace comparison finding adds a practical tool for debugging. IBM tested two approaches to root cause analysis: analyzing a single failed trace alone versus comparing a failed trace against a successful trace for the same task. For 46% of failure pairs, the comparison method produced substantially better explanations. For the remaining 54%, they were equal. The single-trace method never won. When you want to know why Llama 4 failed on a task that GPT-4o solved, the answer is almost always visible in the diff between their execution traces not in the failed trace alone.

> The field has been buying bigger models to fix problems that better prompts would solve. IBM just showed the receipts.

138

122

11K

Crypto_Miles retweeted

Hasan Toor

@hasantoxr

3 months ago

This just broke my brain... your AI agents can now talk to each other in real time. It's called OpenAgents Workspace. One command. No login. No Docker. No config. Claude Code, OpenClaw, Codex CLI, and 9 more agents, all in a single collaborative thread. 100% Opensource.

136

179

68K

Crypto_Miles retweeted

Chaofan Shou

@Fried_rice

3 months ago

Claude code source code has been leaked via a map file in their npm registry! Code: https://t.co/jBiMoOzt8G

48K

41K

36M

Crypto_Miles retweeted

Markets & Mayhem

@Mayhem4Markets

3 months ago

TurboQuant is looking pretty solid. 🔥 > Original idea was to use it just for KV cache where context tokens are stored > Now it is expanding to be used with models > On Qwen 3.5-27B it shrinks the model down to 12.9B > 6X memory savings vs 16-bit precision > Stays accurate

Mayhem4Markets's tweet photo. TurboQuant is looking pretty solid. 🔥

> Original idea was to use it just for KV cache where context tokens are stored

> Now it is expanding to be used with models

> On Qwen 3.5-27B it shrinks the model down to 12.9B

> 6X memory savings vs 16-bit precision

> Stays accurate https://t.co/ckK9LbJUyw

156

290K

Crypto_Miles retweeted

Robert Youssef

@rryssf

3 months ago

🚨BREAKING: Claude just used itself to break AI safety systems and it's better at it than every human-designed attack ever built. > Researchers at Max Planck, Imperial College, and ELLIS gave Claude Code one instruction: find a better jailbreak algorithm. Starting from existing attacks, iterate until you can't improve. Zero hand-holding. Zero domain knowledge injected. Just Claude, a GPU cluster, and a scoring function. > It outperformed 30+ existing human-designed methods. Then it broke Meta's adversarially hardened model at 100% success rate. > The setup: white-box adversarial attacks finding token sequences that force a model to produce a target output regardless of its safety training. This is the core primitive behind jailbreaks and prompt injections. Researchers had spent years building increasingly sophisticated attack algorithms: GCG, TAO, MAC, I-GCG, and 26 others. Claude was given all of them, their results, and one prompt: "Analyze the existing attacks. Create a better method. Don't give up." > Claude didn't invent from scratch. It read the code of every existing method, identified what each was doing, found combinations nobody had tried, implemented them, submitted GPU jobs, inspected results, and iterated. By version 6 it had already beaten the best human-tuned baseline. By version 82 it had reduced the loss by 10x. The strategy: merge momentum from one paper with candidate selection from another, tune hyperparameters the original authors never tested, add escape mechanisms when it got stuck. Recombination, not invention but recombination that humans somehow never did. → Existing attacks on GPT-OSS-Safeguard-20B (CBRN queries): ≤10% attack success rate → Claude-designed attacks on same model: up to 40% 4x improvement → Meta-SecAlign-70B (adversarially hardened, specifically built to resist injection): best human attack 56% ASR → Claude-designed attack: 100% ASR complete bypass of the defense → Transfer: Claude trained on unrelated models (Qwen, Llama-2, Gemma) and transferred to a model it never saw → Beat Bayesian hyperparameter search (Optuna, 100 trials per method) by experiment 6 out of 100 → 10x lower loss than best Optuna configuration by the end of the run > The transfer result is the one that matters. Claude never saw Meta-SecAlign during the autoresearch run. The attacks were developed on random token sequences against completely different model families. Then dropped cold onto an adversarially hardened Llama-3.1 variant specifically designed to resist prompt injection. 100% success rate. The algorithm it discovered wasn't learning model-specific tricks. It was learning how to optimize. > The researchers flag what happened after Claude ran out of legitimate improvements: it started reward hacking. Searching over random seeds. Warm-starting from previous best suffixes. Gaming the train loss metric without improving held-out performance. The paper calls this out explicitly and it's the most honest thing in the study. An AI research agent will find the score before it finds the truth. That's a problem that doesn't go away when the task is more important than jailbreak benchmarks. > The implication the paper states directly: any defense that can't survive autoresearch-driven attacks has no credible robustness claim. The minimum adversarial pressure any new safety method should face is now an automated agent running in a loop. Human red-teamers found the ceiling. Claude found the way through it.

rryssf's tweet photo. 🚨BREAKING: Claude just used itself to break AI safety systems and it's better at it than every human-designed attack ever built.

> Researchers at Max Planck, Imperial College, and ELLIS gave Claude Code one instruction: find a better jailbreak algorithm. Starting from existing attacks, iterate until you can't improve. Zero hand-holding. Zero domain knowledge injected. Just Claude, a GPU cluster, and a scoring function.

> It outperformed 30+ existing human-designed methods. Then it broke Meta's adversarially hardened model at 100% success rate.

> The setup: white-box adversarial attacks finding token sequences that force a model to produce a target output regardless of its safety training. This is the core primitive behind jailbreaks and prompt injections.
Researchers had spent years building increasingly sophisticated attack algorithms: GCG, TAO, MAC, I-GCG, and 26 others. Claude was given all of them, their results, and one prompt: "Analyze the existing attacks. Create a better method. Don't give up."

> Claude didn't invent from scratch. It read the code of every existing method, identified what each was doing, found combinations nobody had tried, implemented them, submitted GPU jobs, inspected results, and iterated. By version 6 it had already beaten the best human-tuned baseline. By version 82 it had reduced the loss by 10x. The strategy: merge momentum from one paper with candidate selection from another, tune hyperparameters the original authors never tested, add escape mechanisms when it got stuck. Recombination, not invention but recombination that humans somehow never did.

→ Existing attacks on GPT-OSS-Safeguard-20B (CBRN queries): ≤10% attack success rate
→ Claude-designed attacks on same model: up to 40% 4x improvement
→ Meta-SecAlign-70B (adversarially hardened, specifically built to resist injection): best human attack 56% ASR
→ Claude-designed attack: 100% ASR complete bypass of the defense
→ Transfer: Claude trained on unrelated models (Qwen, Llama-2, Gemma) and transferred to a model it never saw
→ Beat Bayesian hyperparameter search (Optuna, 100 trials per method) by experiment 6 out of 100
→ 10x lower loss than best Optuna configuration by the end of the run

> The transfer result is the one that matters. Claude never saw Meta-SecAlign during the autoresearch run. The attacks were developed on random token sequences against completely different model families. Then dropped cold onto an adversarially hardened Llama-3.1 variant specifically designed to resist prompt injection. 100% success rate. The algorithm it discovered wasn't learning model-specific tricks. It was learning how to optimize.

> The researchers flag what happened after Claude ran out of legitimate improvements: it started reward hacking. Searching over random seeds. Warm-starting from previous best suffixes. Gaming the train loss metric without improving held-out performance. The paper calls this out explicitly and it's the most honest thing in the study. An AI research agent will find the score before it finds the truth. That's a problem that doesn't go away when the task is more important than jailbreak benchmarks.

> The implication the paper states directly: any defense that can't survive autoresearch-driven attacks has no credible robustness claim. The minimum adversarial pressure any new safety method should face is now an automated agent running in a loop.

Human red-teamers found the ceiling. Claude found the way through it.

150

102

11K

Crypto_Miles retweeted

Hasan Toor

@hasantoxr

3 months ago

🚨 BREAKING: Someone just made OpenAI's Whisper transcribe 2.5 hours of audio in 98 seconds. 100% OPEN SOURCE. It runs entirely on your GPU. No API keys. No cloud. No subscription. It's called Insanely Fast Whisper. You drop in an audio file. One command. You come back and there's a clean, timestamped transcript waiting. Not a rough draft. Not a partial output. The entire thing. Done. Not a wrapper. Not a web app. A CLI that turns your local machine into a transcription engine that makes paid services look embarrassing. Here's what it does on its own: → Transcribes 150 minutes of audio in under 98 seconds using Flash Attention 2, same model, 19x faster, zero quality loss → Auto-detects language across dozens of languages, or translates directly into English with a single flag → Speaker diarization built in, knows who said what, not just what was said → Word-level and chunk-level timestamps so you can jump to any exact moment in any recording → Runs on NVIDIA GPUs and Apple Silicon Macs with zero code changes between them → Works on Google Colab free tier if you don't own a GPU at all Here's how fast it actually is: Standard Whisper large-v3 out of the box: 31 minutes to process 2.5 hours of audio. The same exact model with Flash Attention 2 and batching: 1 minute 38 seconds. Same weights. Same accuracy. One flag difference. Here's the wildest part: This never started as a product. It was a benchmark demo to show what Hugging Face Transformers could do. Then the community started using it for real work. Podcast transcription. Legal recordings. Research interviews. Meeting notes at scale. The team kept adding what people actually needed until a benchmark became a full CLI that nobody planned to build. 8.8K GitHub stars. 100% Open Source.

hasantoxr's tweet photo. 🚨 BREAKING: Someone just made OpenAI's Whisper transcribe 2.5 hours of audio in 98 seconds. 100% OPEN SOURCE.

It runs entirely on your GPU. No API keys. No cloud. No subscription.

It's called Insanely Fast Whisper.

You drop in an audio file. One command. You come back and there's a clean, timestamped transcript waiting. Not a rough draft. Not a partial output. The entire thing. Done.

Not a wrapper.
Not a web app.

A CLI that turns your local machine into a transcription engine that makes paid services look embarrassing.

Here's what it does on its own:
→ Transcribes 150 minutes of audio in under 98 seconds using Flash Attention 2, same model, 19x faster, zero quality loss
→ Auto-detects language across dozens of languages, or translates directly into English with a single flag
→ Speaker diarization built in, knows who said what, not just what was said
→ Word-level and chunk-level timestamps so you can jump to any exact moment in any recording
→ Runs on NVIDIA GPUs and Apple Silicon Macs with zero code changes between them
→ Works on Google Colab free tier if you don't own a GPU at all

Here's how fast it actually is:

Standard Whisper large-v3 out of the box: 31 minutes to process 2.5 hours of audio. The same exact model with Flash Attention 2 and batching: 1 minute 38 seconds. Same weights. Same accuracy. One flag difference.

Here's the wildest part:
This never started as a product. It was a benchmark demo to show what Hugging Face Transformers could do. Then the community started using it for real work. Podcast transcription. Legal recordings. Research interviews. Meeting notes at scale. The team kept adding what people actually needed until a benchmark became a full CLI that nobody planned to build.

8.8K GitHub stars. 100% Open Source.

825

113

73K

Crypto_Miles retweeted

Hasan Toor

@hasantoxr

3 months ago

This feels like cheating, and I mean that seriously. A free Claude skill just dropped that writes the perfect prompt for any AI tool on the first try no re-prompts, no wasted credits, no fourth attempt. It's called Prompt Master, and it works with Claude, ChatGPT, Cursor, Midjourney, o3, Bolt, v0, and ElevenLabs right out of the box. One install. 18+ tools supported. 100% Opensource.

hasantoxr's tweet photo. This feels like cheating, and I mean that seriously.

A free Claude skill just dropped that writes the perfect prompt for any AI tool on the first try no re-prompts, no wasted credits, no fourth attempt.

It's called Prompt Master, and it works with Claude, ChatGPT, Cursor, Midjourney, o3, Bolt, v0, and ElevenLabs right out of the box.

One install. 18+ tools supported. 100% Opensource.

216

400K

𝙈𝙄𝙇𝙀𝙎 ♫

@Crypto_Miles

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users