Human biology matters. Scientists and AI need human data to understand health and disease.
Crownlands is open sourcing Gateway 4M, the largest single-cell tissue dataset ever released from living humans, to advance research on brain aging and neurodegeneration.
Thrilled to share that Manuel Tran is joining Valinor! Manuel joins us from Roche and TUM, where he spent his time as an ML scientist building generative models for various modalities like histopathology, transcriptomics, and proteomics.
Among others, Manuel led the development of three models that are worth calling out:
LoReTTa is a self-supervised pre-training algorithm that enables a single multimodal transformer to operate across heterogeneous data types — images, text, audio, genomics, transcriptomics, and proteomics — even when the training data never contained all modality combinations simultaneously. This is a critical bottleneck in healthcare, where complete multi-modal patient records are rarely available. Validated on cancer molecular data from 7,030 TCGA patients, it outperformed GPT, BERT, and CLIP on survival prediction across all modality combinations, including those entirely absent from training.
HistoGPT is a vision-language model for dermatopathology report generation that operates on full gigapixel WSIs rather than the ≤1024px patches earlier generative models were limited to. It couples a pathology vision encoder to BioGPT via Flamingo-style gated cross-attention, keeping the backbone frozen and training only the XATTN blocks. It does zero-shot tumor subtype/thickness/margin prediction and exposes gradient×attention saliency maps that localize each generated token back to tissue.
Phoenix is the more recent and ambitious model: single-cell spatial transcriptomics predicted from H&E via latent flow matching. It’s a 1.2B-parameter conditional flow-matching transformer (DiT-style, with an MLP-Mixer autoencoder for the gene latent) conditioned on pathology foundation model image features, trained on 22.2M Xenium cell-image/expression pairs. The notable result is generalization: with cohort-level train/test splits, it transfers zero-shot to unseen organs and tissues and improves Spearman correlation by 35–173% over baselines that otherwise collapse to the mean. It scales to a 9,544-patient TCGA atlas and fine-tunes cleanly to sarcoma and mouse PDAC.
Manuel’s deep expertise in pathology foundation models, generative architectures, and getting models to actually generalize across the messy reality of clinical data will be critical as we scale our multimodal virtual patient models.
It’s funny….every AI startup deck claims a data moat. 5% actually have one. Would your data be impossible to replicate even if a competitor raised $500M tomorrow? If yes cool you have a business.
Why AI Progress Suddenly Feels Real - my conversation with @yanndubs, who co-leads the Post-Training Frontiers team at @OpenAI
00:00 - Intro
01:30 - Why recent AI progress feels like a step function
04:13 - Model reliability & the emotional rollercoaster of shipping GPT-5.5
07:33 - How OpenAI structures vertical and horizontal teams
09:49 - Improving model efficiency and test-time compute
12:32 - Yann's journey from Switzerland to OpenAI
15:37 - Reasoning in 2026: Real-world utility vs verifiable rewards
18:34 - GPT-5.5 Thinking vs Pro: Scaling test-time compute
20:09 - How reasoning models become more efficient
23:23 - Pre-training scaling and overcoming the data wall
27:03 - Multimodal data, synthetic data, and embodied AI
31:05 - Demystifying mid-training and post-training
37:21 - Does RL create new capabilities in AI?
38:53 - The challenges and frontier of scaling RL
43:09 - Is building AI models a craft or a strict science
48:21 - How AI models generalize across different domains
54:18 - How reinforcement learning cures AI hallucinations
56:04 - Negative generalization and conflicting instructions
58:05 - Can RL scale to law, medicine, and the broader economy?
1:00:19 - The evaluation bottleneck and Model as a Judge
1:04:21 - Continuous AI progress & continual learning
1:08:49 - Will foundation models eat the agent harness
1:11:23 - Why startups should focus on the last mile of AI
New blackboard lecture w @reinerpope
How do chips actually work – starting with basic logic gates, and working up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do.
0:00:00 – Building a multiply-accumulate from logic gates
0:16:20 – Muxes and the cost of data movement
0:25:59 – How systolic arrays work
0:39:00 – Clock cycles and pipeline registers
0:51:40 – FPGAs vs ASICs
1:03:14 – Cache vs scratchpad
1:07:16 – Why CPU cores are much bigger than GPU cores
1:11:49 – Brains vs chips
1:15:22 – A GPU is just a bunch of tiny TPUs
Look up Dwarkesh Podcast on YouTube/Spotify/etc to watch. Enjoy!
Following up on the suggestion from Will Sawin, here is an illustration of the new configurations that disprove Erdos' unit distance conjecture (made with the help of ChatGPT 5.5 Thinking).
SpaceX, Cerebras, & OpenAI/Anthropic have made it obvious that the surest path to building something generational is to pursue something extremely difficult to the point of absurdity