Tom Leranth

@xkey

Father; Applied Mathematician; Dev (C/C++/Julia/Python/Go/smidgeons of Elixir & Rust); Algorithms; Cryptography; H/w & S/w; Amat Photographer; Powerlifter

Midwest

Joined March 2008

489 Following

194 Followers

2.5K Posts

Pinned Tweet

Tom Leranth @xkey

over 8 years ago

“A conversation is a kind of illusion. I don’t know what is going on in your brain. All I can know is what I’m thinking.”-Hiroshi Ishiguro

xkey retweeted

Phoronix

@phoronix

1 day ago

OpenCV 5.0 Released With Rewritten DNN Engine, Built-In LLM & VLM Support https://t.co/RPcmbJyAlN

xkey retweeted

Cliff Pickover

@pickover

1 day ago

"I looked round the trees. The thin net of reality. These trees, this sun. I was infinitely far from home. The profoundest distances are never geographical." ~John Fowles, The Magus

pickover's tweet photo. "I looked round the trees. The thin net of reality. These trees, this sun. I was infinitely far from home. The profoundest distances are never geographical."

~John Fowles, The Magus https://t.co/boX9nAGxnC

xkey retweeted

JuliaHub

@JuliaHub_Inc

10 days ago

How do you improve vehicle #simulation when key physics are difficult to model directly? This #webinar with Dr. Michael Hoffmann explores how Scientific Machine Learning in #Dyad helps engineers combine test data with physics-based models to uncover missing behaviour, improve ride model fidelity, and avoid the trade-off between rigid first-principles models and black-box AI. A practical session for simulation and vehicle dynamics teams. Register now - https://t.co/uMuekBOGg7 #julialang #Dyad #ScientificMachineLearning #VehicleDynamics #Simulation #Engineering #SciML

199

Who to follow

Outshift by Cisco

@outshiftbycisco

Outshift by Cisco is the incubation engine delivering what’s next and new for @Cisco: Emerging technologies that achieve innovative results.

Vaishak Belle

@vaishakbelle

AI and Science. Faculty at U of Edinburgh. Write and think on: https://t.co/aUJYgRUKkc

mat

@ballforest

Hopfield Networks / Associative Memory / Kernel methods / Dynamical Systems / Information Geometry

xkey retweeted

Vik Paruchuri

@VikParuchuri

11 days ago

Announcing Surya OCR 2: - 650M params - 83.3% olmocr bench score (top under 3B) - 87% on internal 91-lang benchmark - 5 pages/s on RTX 5090 - Runs on CPU, GPU, MPS

VikParuchuri's tweet photo. Announcing Surya OCR 2:

- 650M params
- 83.3% olmocr bench score (top under 3B)
- 87% on internal 91-lang benchmark
- 5 pages/s on RTX 5090
- Runs on CPU, GPU, MPS https://t.co/ve0EcGknqw

484

301

50K

Tom Leranth @xkey

8 days ago

@MsVictoriaVixen 🫂

xkey retweeted

Artem Y

@artem_aero

12 days ago

Run @tenstorrent kernels on your CPU. ttsim is a bit-exact, full-system simulator of Wormhole and Blackhole. The whole tt-metal stack on Linux, Mac (UTM/QEMU) or Windows (WSL2) — numerics match silicon bit-for-bit. Bring up kernels of explore the architecture. No silicon required. First of its kind. Now Open Source https://t.co/AtcZ9L9e6T

231

107

21K

xkey retweeted

UCLA Mathematics @uclamath

2 months ago

"A team of UCLA computer scientists and mathematicians has been awarded a three-year, $5 million grant by the Defense Advanced Research Projects Agency to develop artificial intelligence tools aimed at transforming how mathematical discoveries are made." https://t.co/pJ3PKoD1F1

312

xkey retweeted

Quansight @quansightai

16 days ago

free-threaded Python in 2025: 🧵 asyncio: 6x faster with 6 workers 🧵 NumPy arrays: ~1000x faster at scale 🧵 PyTorch inference: 7x lower latency Ralf Gommers keynote covers all of it, including a live SciPy + JAX demo → https://t.co/0xeXQbMNfK #Python #FreethreadedPython

722

xkey retweeted

Tom Kalil

@tkalil2050

about 1 month ago

I am very proud of the progress that @RenPhilanthropy and its partners have made in our first two years – $533 million in funding for science and technology, launching more than 20 programs, training 45 scientists and innovators to design transformative, ARPA-style programs, partnering with the UK, German and Japanese governments, and publishing our first 10 playbooks on the “how” of innovation. Our field strategists are accelerating progress in areas such as AI for fundamental math, AI for early literacy, geologic hydrogen, stabilizing the Arctic climate, low-cost space telescopes, and open-source software for life sciences. Thanks to all the philanthropists, foundations, and governments that have supported our work – and please reach out if you’d like to collaborate. The best is yet to come!

13K

xkey retweeted

Vipul Vaibhaw

@vaibhaw_vipul

about 1 month ago

I have been diving deep into Hyperbolic geometry, I wrote about it. https://t.co/WXGyGczJqM

168

136

xkey retweeted

Ineffable Intelligence @IneffableLabs

about 1 month ago

Introducing Ineffable Intelligence. Led by David Silver, we're assembling the best engineers and researchers in the world to make first contact with superintelligence. We’ll be solving the hardest problems in AI on the way. Come join us. https://t.co/zUuvPJGmcq

IneffableLabs's tweet photo. Introducing Ineffable Intelligence. Led by David Silver, we're assembling the best engineers and researchers in the world to make first contact with superintelligence. We’ll be solving the hardest problems in AI on the way. Come join us.

https://t.co/zUuvPJGmcq https://t.co/pkmwDkJWbt

160

622

349K

xkey retweeted

Boost C++ | Open Source Libraries

@Boost_Libraries

about 1 month ago

Boost Blueprint 036: Boost.Pool. Fast, fixed-size-block memory allocation with O(1) allocate and deallocate — no fragmentation, no per-object header overhead, no free-list traversal object_pool<T> constructs and destroys typed objects; singleton_pool gives you thread-safe global pools; pool_allocator plugs directly into STL containers. All memory returned to the system in one shot on pool destruction Level up your C++ architecture. Follow @Boost_Libraries for the #BoostBlueprint series #cpp

Boost_Libraries's tweet photo. Boost Blueprint 036: Boost.Pool. Fast, fixed-size-block memory allocation with O(1) allocate and deallocate — no fragmentation, no per-object header overhead, no free-list traversal

object_pool<T> constructs and destroys typed objects; singleton_pool gives you thread-safe global pools; pool_allocator plugs directly into STL containers. All memory returned to the system in one shot on pool destruction

Level up your C++ architecture. Follow @Boost_Libraries for the #BoostBlueprint series

#cpp

Tom Leranth @xkey

about 1 month ago

@nmlinguaphile He can get wrecked.

xkey retweeted

Sakana AI

@SakanaAILabs

about 1 month ago

We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system! Blog: https://t.co/36Ud311KCP Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task. Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes. 🐟 Fugu Mini: High-speed orchestration optimized for latency 🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning Apply for the beta test here: https://t.co/1fjuAha7ci

SakanaAILabs's tweet photo. We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system!

Blog: https://t.co/36Ud311KCP

Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task.

Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes.

🐟 Fugu Mini: High-speed orchestration optimized for latency
🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning

Apply for the beta test here: https://t.co/1fjuAha7ci

707

161

335

366K

xkey retweeted

MIT CSAIL

@MIT_CSAIL

about 2 months ago

Today, MIT & the IMO released MathNet, the world’s largest dataset of International Math Olympiad problems & solutions 🌍 MathNet is 5x larger than previous datasets & is sourced from over 40 countries across 4 decades: https://t.co/vvojP7Fu9t

MIT_CSAIL's tweet photo. Today, MIT & the IMO released MathNet, the world’s largest dataset of International Math Olympiad problems & solutions 🌍

MathNet is 5x larger than previous datasets & is sourced from over 40 countries across 4 decades: https://t.co/vvojP7Fu9t https://t.co/qpuJOgmyfH

544

196K

xkey retweeted

Fermat's Library

@fermatslibrary

about 2 months ago

Here's a surprising appearance of e: the limit when n approaches infinity of the nth root of the least common multiple of 1,2,…,n

fermatslibrary's tweet photo. Here's a surprising appearance of e: the limit when n approaches infinity of the nth root of the least common multiple of 1,2,…,n https://t.co/mUOkHDwGBV

354

106

254K

xkey retweeted

Alex Kontorovich

@AlexKontorovich

about 2 months ago

Congratulations to the AMR @AMathRes on receiving a major donation from the Sergey Brin Family Foundation and from Michael Brin! These funds will be used to sustain and grow the AMR's diamond open access journals. If you too value the work we're doing in open science, please donate here: https://t.co/hVosmwp9hY

AlexKontorovich's tweet photo. Congratulations to the AMR @AMathRes on receiving a major donation from the Sergey Brin Family Foundation and from Michael Brin! These funds will be used to sustain and grow the AMR's diamond open access journals.

If you too value the work we're doing in open science, please donate here: https://t.co/hVosmwp9hY

xkey retweeted

Khai Loong Aw

@khai_loong_aw

about 2 months ago

Today's best AI needs orders of magnitude more data than a human child to achieve visual competence. We introduce the Zero-shot World Model (ZWM), an approach that substantially narrows this gap. Even when trained on the first-person experience of a single child, BabyZWM matches state-of-the-art models on diverse visual-cognitive tasks – with no task-specific training, i.e., zero-shot. 🧵

khai_loong_aw's tweet photo. Today's best AI needs orders of magnitude more data than a human child to achieve visual competence.

We introduce the Zero-shot World Model (ZWM), an approach that substantially narrows this gap. Even when trained on the first-person experience of a single child, BabyZWM matches state-of-the-art models on diverse visual-cognitive tasks – with no task-specific training, i.e., zero-shot. 🧵

359

263

39K

xkey retweeted

David Golchinfar

@DavidGFar

2 months ago

We taught a 1.3M parameter model to play DOOM. It outperforms LLMs up to 92,000x its size. Happy Easter Monday! Here's our Easter egg release: SauerkrautLM-Doom-MultiVec-1.3M. 17.8 average points per episode. We benchmarked our tiny model against GPT-4o-mini (via OpenAI API), Nemotron-120B, Qwen3.5-27B, and Gemini Flash Lite (via OpenRouter API) on VizDoom's defend_the_center: - Our model: 17.8 avg points/episode, 31ms per decision, runs on CPU - Gemini Flash Lite: 0.8 avg points/episode (920ms latency) - Qwen3.5-27B: 0.67 avg points/episode (13.3s latency) - Nemotron-120B: 0.6 avg points/episode (8.9s latency) - GPT-4o-mini: 0.0 avg points/episode (just dodges, never engages) The architecture: ModernBERT-Hash We took hash embeddings (Svenstrup et al. 2017), previously only applied to the original BERT architecture (see @neumll 's BERT-Hash models), and brought them to ModernBERT, adding rotary position embeddings, alternating local/global attention, Flash Attention 2 support, and learned depth embeddings from VizDoom's depth buffer. The result is a 5-layer encoder with a 75-token character-level tokenizer (no BPE, every ASCII character is one token, preserving spatial structure), attention pooling, and a 4-action classification head. Total: 1,319,300 parameters, ~5MB on disk, 31ms inference on CPU. Trained on 31K frames of a human playing DOOM for about 2 hours. That's it. Fully open source. Everything you need to reproduce this: Model weights: https://t.co/bBvtlYFq2l Training data (31K frames): https://t.co/AyEXw4mwbp Code, training scripts, benchmark framework: https://t.co/GmPnTbQgAL Full paper with methodology included in the repo. Why does this matter beyond the fun factor? Small specialized models can decisively beat general-purpose LLMs at real-time control tasks. Not by a small margin, by 22x on average points per episode. At 1/400th the latency. On a CPU. For free. This has real implications for robotics, autonomous systems, game AI, and any domain where you need sub-100ms decisions on edge hardware. The future of AI isn't exclusively large. It's appropriately sized. Thank you to my co-authors Daryoush Vaziri (University of Applied Sciences Bonn-Rhein-Sieg) and Alexander Marquardt (Nara Institute of Science and Technology, CARE Laboratory) for their contributions to this work. Built with VizDoom, PyTorch, HuggingFace Transformers, and the ModernBERT architecture by @benjamin_warner , @antoine_chaffin, @ClavierBenjamin et al. Hash embedding approach inspired by NeuML's BERT-Hash models. #AI #DOOM #GameAI #SmallModels #OpenSource #ModernBERT #SauerkrautLM #VAGOSolutions #Easter #TinyML

DavidGFar's tweet photo. We taught a 1.3M parameter model to play DOOM. It outperforms LLMs up to 92,000x its size.

Happy Easter Monday! Here's our Easter egg release: SauerkrautLM-Doom-MultiVec-1.3M.

17.8 average points per episode.

We benchmarked our tiny model against GPT-4o-mini (via OpenAI API), Nemotron-120B, Qwen3.5-27B, and Gemini Flash Lite (via OpenRouter API) on VizDoom's defend_the_center:

- Our model: 17.8 avg points/episode, 31ms per decision, runs on CPU
- Gemini Flash Lite: 0.8 avg points/episode (920ms latency)
- Qwen3.5-27B: 0.67 avg points/episode (13.3s latency)
- Nemotron-120B: 0.6 avg points/episode (8.9s latency)
- GPT-4o-mini: 0.0 avg points/episode (just dodges, never engages)

The architecture: ModernBERT-Hash

We took hash embeddings (Svenstrup et al. 2017), previously only applied to the original BERT architecture (see @neumll 's BERT-Hash models), and brought them to ModernBERT, adding rotary position embeddings, alternating local/global attention, Flash Attention 2 support, and learned depth embeddings from VizDoom's depth buffer.

The result is a 5-layer encoder with a 75-token character-level tokenizer (no BPE, every ASCII character is one token, preserving spatial structure), attention pooling, and a 4-action classification head. Total: 1,319,300 parameters, ~5MB on disk, 31ms inference on CPU.

Trained on 31K frames of a human playing DOOM for about 2 hours. That's it.

Fully open source. Everything you need to reproduce this:

Model weights: https://t.co/bBvtlYFq2l
Training data (31K frames): https://t.co/AyEXw4mwbp
Code, training scripts, benchmark framework: https://t.co/GmPnTbQgAL
Full paper with methodology included in the repo.

Why does this matter beyond the fun factor?

Small specialized models can decisively beat general-purpose LLMs at real-time control tasks. Not by a small margin, by 22x on average points per episode. At 1/400th the latency. On a CPU. For free.

This has real implications for robotics, autonomous systems, game AI, and any domain where you need sub-100ms decisions on edge hardware. The future of AI isn't exclusively large. It's appropriately sized.

Thank you to my co-authors Daryoush Vaziri (University of Applied Sciences Bonn-Rhein-Sieg) and Alexander Marquardt (Nara Institute of Science and Technology, CARE Laboratory) for their contributions to this work.

Built with VizDoom, PyTorch, HuggingFace Transformers, and the ModernBERT architecture by @benjamin_warner , @antoine_chaffin, @ClavierBenjamin et al. Hash embedding approach inspired by NeuML's BERT-Hash models.

#AI #DOOM #GameAI #SmallModels #OpenSource #ModernBERT #SauerkrautLM #VAGOSolutions #Easter #TinyML

217

172

41K

xkey retweeted

Jeff Dean

@JeffDean

2 months ago

Today we're releasing Gemma 4, our new family of open foundation models, built on the same research and technology as our Gemini 3 series. These models set a new standard for open intelligence, offering SOTA reasoning capabilities from edge-scale (2B and 4B w/ vision/audio) up to a 26B parameter MoE model and a 31B dense model. By releasing Gemma 4 under the Apache 2.0 license, we hope to enable more innovation across the research and developer communities. Our earlier Gemma 3 models were downloaded 400M times and over 100,000 variants of those models have been published, so we're excited to see what the community will do with the even better Gemma 4 models! Learn more at https://t.co/BW6O3Gr8bc and https://t.co/8M0XSQSP4u Great work by everyone involved! #Gemma4 #AI #OpenSource #ML

180

176

100K

Tom Leranth

@xkey

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users