Iโve always believed the No.1 application of AI should be to improve human health.
That work started with AlphaFold, and now at @IsomorphicLabs with the mission to reimagine drug discovery and one day solve all disease!
We are turbocharging that goal with $2.1B in new funding.
Science lost a true visionary this week.
We honor the life and profound legacy of Dr. J. Craig Venter, whose bold leadership fundamentally changed modern genomics and ignited the field of synthetic biology.
From accelerating the mapping of the human genome to creating the first synthetic cell, his drive to turn discovery into real-world impact will inspire generations to come.
#CraigVenter #Genomics #SyntheticBiology #ScienceLegacy #JCVI #DNA #SciencePioneer #HumanGenome
https://t.co/8WgMgJjWRw
Welcome home Reid, Victor, Christina, and Jeremy! ๐ซถ
The Artemis II astronauts have splashed down at 8:07pm ET (0007 UTC April 11), bringing their historic 10-day mission around the Moon to an end.
Hear from the man himself ๐คฉ
In his first interview as @UNC_Basketball's head coach, Michael Malone joins @JonesAngell & @jadamlucas in the @carolina_pod studio!
Full Interview โก๏ธ https://t.co/vylShAFWHU
Liftoff.
The Artemis II mission launched from @NASAKennedy at 6:35pm ET (2235 UTC), propelling four astronauts on a journey around the Moon.
Artemis II will pave the way for future Moon landings, as well as the next giant leap โ astronauts on Mars.
Resilience is the word of the day! ๐
The latest @MBAMortgage Chart of the Week shows the overall homeownership rate holding steady at 65.3%.
While the 35โ44 demographic navigates a shifting market, this stability creates a solid foundation for new solutions and opportunities in the year ahead. ๐
Insights from Dr. Eddie Seiler here:
https://t.co/nT4UEVhw6e
#HousingMarket #MortgageBankers #Homeownership #Economy #MBA #HousingMarket #Growth
The Huajiang Grand Canyon Bridge opened to traffic for the first time on Sunday.
The bridge will showcase engineering capabilities and boost the goal of becoming a world-class tourist destination.
https://t.co/BbgipKuRMj
Exciting breakthroughs with Biomni-R0: enabling agentic automation in biomedical research tasks with higher accuracy.
Using reinforcement learning to hill-climb biomedical reasoning agents to expert-level.
Technical report: https://t.co/4YGaXJZW5B
Looking forward to collaborating with the OSS code and to enhance life science agentic models' ability to transfer learned reasoning skills to entirely new biomedical problems, towards a general biomedical reasoning agent.
Thanks
@RyanLi0802
@KexinHuang5@ProjectBiomni@shiyi_c98@NovaSkyAI@jure
#BiomniR0
#BiomedicalAI
#ReinforcementLearning
#OpenSource
#AIAgent
#AIforScience
#Biotech
#Stanford
#UCBerkeley
Reinforcement learning leads to better AI scientist agents! ๐ By training models end-to-end with multi-turn RL, weโre seeing breakthroughs in reasoning and problem-solving for real biomedical research.
Excited to introduce Biomni-R0 โ an agentic LLM trained with this approach. On 10 real research tasks, it nearly doubles performance over its open-source base model and even surpasses closed-source frontier models by >10%. A scalable path to expert-level AI in biomedicine.
Led by @RyanLi0802 @KexinHuang5@ProjectBiomni with exciting collaboration with the SkyRL team @shiyi_c98@NovaSkyAI.
Learn more: https://t.co/bEmkZGbG07 โ open sourcing soon!
Declared this Sunday, July 6th, as a Day of Prayer in Texas in response to the floods in the Hill Country.
I invite Texans to join me in prayer for the communities affected by this disaster.
The INVEST AMERICA ACT. Passed July 4, 2025. Accounts established & funded July 4, 2026.
Because every child deserves to share in the upside of America.
Happy birthday America! Never perfect. Always rising.
A new cover for SUPER AGERS after making the NYT bestseller list. Thanks to you for making it the #1 ranked new non-fiction book on Amazon.
https://t.co/2LU5uH821R
Good read: The Leaderboard Illusion: https://t.co/68usghEsaL
Big Tech commercially dependent on marketing model performance for revenues putting their best models out on Chatbot Arena is not surprising.
I would argue against prohibiting score retraction after submission and instead encourage proprietary, open-weights, and open-source models to also do testing of as many model variants as resource permit and publish all model benchmarks.
Agree on a non-repudiation audit trail for all models published on Chatbot Arena.
Also important to clearly and conspicuously disclose that real world model performance / model mileage will vary, as Chatbot Arena is not how most real world model applications are utilized.
There's a new paper circulating looking in detail at LMArena leaderboard: "The Leaderboard Illusion"
https://t.co/tVMrx68zwa
I first became a bit suspicious when at one point a while back, a Gemini model scored #1 way above the second best, but when I tried to switch for a few days it was worse than what I was used to. Conversely as an example, around the same time Claude 3.5 was a top tier model in my personal use but it ranked very low on the arena. I heard similar sentiments both online and in person. And there were a number of other relatively random models, often suspiciously small, with little to no real-world knowledge as far as I know, yet they ranked quite high too.
"When the data and the anecdotes disagree, the anecdotes are usually right." (Jeff Bezos on a recent pod, though I share the same experience personally). I think these teams have placed different amount of internal focus and decision making around LM Arena scores specifically. And unfortunately they are not getting better models overall but better LM Arena models, whatever that is. Possibly something with a lot of nested lists, bullet points and emoji.
It's quite likely that LM Arena (and LLM providers) can continue to iterate and improve within this paradigm, but in addition I also have a new candidate in mind to potentially join the ranks of "top tier eval". It is the @openrouter LLM rankings:
https://t.co/N1NCZyVCv3
Basically, OpenRouter allows people/companies to quickly switch APIs between LLM providers. All of them have real use cases (not toy problems or puzzles), they have their own private evals, and all of them have an incentive to get their choices right, so by choosing one LLM over another they are directly voting for some combo of capability+cost. I don't think OpenRouter is there just yet in both the quantity and diversity of use, but something of this kind I think has great potential to grow into a very nice, very difficult to game eval.
Congrats to the @AIatMeta team on the launch of their new Llama 4 open-weights models. For the U.S. to win the AI race, we have to win in open source too, and Llama 4 puts us back in the lead.
New 2h11m YouTube video: How I Use LLMs
This video continues my general audience series. The last one focused on how LLMs are trained, so I wanted to follow up with a more practical guide of the entire LLM ecosystem, including lots of examples of use in my own life.
Chapters give a sense of content:
00:00:00 Intro into the growing LLM ecosystem
00:02:54 ChatGPT interaction under the hood
00:13:12 Basic LLM interactions examples
00:18:03 Be aware of the model you're using, pricing tiers
00:22:54 Thinking models and when to use them
00:31:00 Tool use: internet search
00:42:04 Tool use: deep research
00:50:57 File uploads, adding documents to context
00:59:00 Tool use: python interpreter, messiness of the ecosystem
01:04:35 ChatGPT Advanced Data Analysis, figures, plots
01:09:00 Claude Artifacts, apps, diagrams
01:14:02 Cursor: Composer, writing code
01:22:28 Audio (Speech) Input/Output
01:27:37 Advanced Voice Mode aka true audio inside the model
01:37:09 NotebookLM, podcast generation
01:40:20 Image input, OCR
01:47:02 Image output, DALL-E, Ideogram, etc.
01:49:14 Video input, point and talk on app
01:52:23 Video output, Sora, Veo 2, etc etc.
01:53:29 ChatGPT memory, custom instructions
01:58:38 Custom GPTs
02:06:30 Summary
Link in the reply post ๐