Excited to be at CVPR in Denver for the two workshop days, June 3 and 4, to present our research on foundation models in radiology! We'll be at the FMV workshop on June 3. Feel free to contact me for a chat about our research!
Today @cdancette from @raidium_med stopped by to talk about Curia, a multi-modal open-weights foundation model for radiology built in the style of DINOv2, with tons of downstream applications.
Do check it out!
I am starting a venture on top of LeRobot!
We’re at a pivotal time. AI is moving beyond digital to the physical world. Embodied AI will change our surroundings in ways we can barely imagine. This technology holds the potential to empower everyone. It must not be controlled by just a few.
This conviction led me to propose an ambitious open-source AI robotics project to Thom, Clem, and Julien back in 2024. Hugging Face, home to a community of millions of AI builders and a team of experts who brought us transformers, datasets, and the Hugging Face Hub, was the perfect place to launch LeRobot.
I’m incredibly grateful for all the support that allowed me to build LeRobot alongside an amazing team and community. In such a short time, we built one of the most adopted open-source robotics platforms, used by startups, universities, and research labs. It is helping countless people take their first steps in robotics. Together, we’ve even assembled the world’s largest open robotics dataset. And this is only the beginning for LeRobot!
Building on this momentum, I now feel the urgency to start something new on top of LeRobot. It will push the limits of what robots are capable of and commoditize them within society. Like LeRobot, it will start in Paris, leveraging its vibrant international AI scene. Stay tuned!
As LeRobot continues to expand, it’s now in the best possible hands with @AractingiMichel, @pepijn2233 and Steven Palma taking the lead. Watching the team deliver exceptional results over the last weeks has been one of my most rewarding experiences. Their creativity, dedication, and ability to ship fast are proving just how strong the team is today!
I am extremely grateful to the many people who contributed to building LeRobot at Hugging Face and within its powerful community. Many thanks to Thom, Clem, Julien, Simon, Rob, Michel, Pepijn, Steven, Gloria, Adil, Martino, Caroline, Marine, Mishig, Guillaume, Pablo, Lysandre, Arthur, Quentin, Florent, Brigitte, Victor, Marina, Mustafa, Francesco, Jess, Jade, Ville, Leo, Max, Julien, Alexander, Flavien, Raphael, Adina, Tao, Dana, Batu, Olivier, Matthieu, Eugene, Theo, Guilherme, Hynek, Loubna, Clémentine, Merve, Vaibhav, Anna, Jeff, Adrien, Emily, Johanne, Adrien and others. There are too many of you to name you all!
Thanks again and see you soon!!! :)
~ Remi
Excited to share our latest work at @raidium_med
Curia: A Multi-Modal Foundation Model for Radiology.
Pre-print: https://t.co/ek2NkukzZT
We are releasing our base model's weights, Curia-B, to the research community: https://t.co/bSGP75u1Sm
Congrats to the whole Raidium team, @SovanKhlt, Antoine Saporta, Helene Philippe, @Elferodie_bis, Baptiste Callard, Théo Danielou, Léo Alberge, Léo Machado, Daniel Tordjman, Julie Dupuis, Korentin Le Floch, @Phylliade, @paulherent, and @Beleyem
Second,
RAPS-3D: Efficient interactive segmentation for 3D radiological imaging (https://t.co/5VQqQNGiLy) by Théo Danielou.
We propose an efficient 3D-native promptable segmentation model for organ segmentation. It will be presented this week at MIUA in Leeds, UK!
Happy to share two publications by our team!
First, RadSAM: Segmenting 3D radiological images with a 2D promptable model by @SovanKhlt. We show how to adapt SAM for 3D CT images using iterative inference. https://t.co/mQAGXluRiQ
It will be presented at MICCAI 2025 in Seoul!
The Worldwide @LeRobotHF hackathon is in 2 weeks, and we have been cooking something for you…
Introducing SmolVLA, a Vision-Language-Action model with a lightweight architecture, pretrained on community datasets, with an asynchronous inference stack, to control robots 🧵
We are organizing a CVPR in Paris event on the 6th of June. It will feature poster sessions for papers accepted at CVPR, and keynotes from Alexei Efros, @dlarlus and @AlexAlahi.
You can register here: https://t.co/XkuHXSONYe
We release a large-scale study to answer the following:
- Is late fusion inherently better than early fusion for multimodal models?
- How do native multimodal models scale compared to LLMs?
- Can sparsity (MoEs) play a detrimental role in handling heterogeneous modalities? 🧵
Want to check out the source for the "AlexNet" paper? Google has made the code from Alex Krizhevsky, @ilyasut, and @geoffreyhinton's seminal "ImageNet Classification with Deep Convolutional Neural Networks" paper public, in partnership with the Computer History Museum.
As I said in the press release, "Google is delighted to contribute the source code for the groundbreaking AlexNet work to the Computer History Museum".
https://t.co/62Ilp7jaeT