@SenSanders The existential risk is sitting right before your eyes in a house made by former slaves. The existential risk is all those who support that. And those who voted for misogyny and racism, not to mention stupidity. There is the very present risk.
I will be in Rio. I am looking to talk to former and current colleagues and students interested in working on vison language word models and human-agent interactions in the physical world.
Heading to ICLR in Rio 🇧🇷?
We’re hosting our first networking mixer on April 24.
Meet AMI’s technical team and cofounders, and learn more about what we’re building.
Food, drinks, and great conversation included.
Register at https://t.co/PQmb6wAoFZ
Heading to ICLR in Rio 🇧🇷?
We’re hosting our first networking mixer on April 24.
Meet AMI’s technical team and cofounders, and learn more about what we’re building.
Food, drinks, and great conversation included.
Register at https://t.co/PQmb6wAoFZ
@amilabs AMI:
The final frontier.
These are the voyages of a new AI enterprise.
Its 5-year mission:
To explore & learn about strange new worlds,
To seek out & support new life and new civilizations,
To boldly go where no man or woman has gone before.
I am hiring researchers and builders for our #Paris team to build advanced machine intelligence that is fundamentally human-centered. https://t.co/kHJhaNUP0Y
Meta’s former chief AI scientist has long argued that human-level AI will come from mastering the physical world, not language. His new startup, AMI, plans to prove it. https://t.co/jCFpoorFOn
I am happy to share that I have joined forces with @ylecun and fellow founders as Co-Founder and Chief Research & Innovation Officer at AMI - Advanced Machine Intelligence. I will lead research initiatives that push AI to be genuinely human-centered - AI that perceives, learns, reasons and acts like we do and in our best interest. I am thankful for the trust placed in us and deeply aware of the responsibility we share to making the world a better place through our work everyday. Join us!
Advanced Machine Intelligence (AMI) is building a new breed of AI systems that understand the world, have persistent memory, can reason and plan, and are controllable and safe.
We’ve raised a $1.03B (~€890M) round from global investors who believe in our vision of universally intelligent systems centered on world models. This round is co-led by Cathay Innovation, Greycroft, Hiro Capital, HV Capital, and Bezos Expeditions, along with other investors and angels across the world.
We are a growing team of researchers and builders, operating in Paris, New York, Montreal and Singapore from day one.
Read more: https://t.co/kyVAL7EoFx
AMI - Real world. Real intelligence.
People tend to either overestimate or underestimate the power of LLMs. No, LLMs do not know the world first hand but are second hand learners from linguistic, symbolic and visual descriptions of human knowledge of the world. No, LLMs are not just “parrots” trying to string words together, NGrams are. LLMs can make correlations from the words they trained to predict. When you know these two fundamental characteristics of LLMs you would neither be surprised by hallucinations nor overly impressed or reliant on LLMs for everything. #LLM #llmhallucinations #GenerativeAI #LanguageModel
See our paper on arXiv (https://t.co/ALynmtziVn) -- Action100M: A Large-scale Video Action Dataset
by Delong Chen (@Delong0_0), Tejaswi Kasarla (@tkasarla_), Yejin Bang (@yejin_bang), Mustafa Shukor (@MustafaShukor1), Willy Chung (@willyhcchung), Jade Lei Yu, Allen Bolourchi (@AllenBolourchi), Théo Moutakanni (@TheoMoutakanni), and Pascale Fung (@pascalefung).
Dataset can be acceesed from this repo: https://t.co/ac9BnEmOn0
We release Action100M, the hero behind VL-JEPA. It is a large dataset with O(100 million) dense action annotations on HowTo100M procedural videos. We hope it serves as a robust data foundation to advance physical world modeling research.
Introducing VL-JEPA: Vision-Language Joint Embedding Predictive Architecture for streaming, live action recognition, retrieval, VQA, and classification tasks with better performance and higher efficiency than large VLMs.
• VL-JEPA is the first non-generative model that can perform general-domain vision-language tasks in real-time, built on a joint embedding predictive architecture.
• We demonstrate in controlled experiments that VL-JEPA, trained with latent space embedding prediction, outperforms VLMs that rely on data space token prediction.
• We show that VL-JEPA delivers significant efficiency gains over VLMs for online video streaming applications, thanks to its non-autoregressive design and native support for selective decoding.
• We highlight that our VL-JEPA model, with an unified model architecture, can effectively handle a wide range of classification, retrieval, and VQA tasks at the same time.
by @Delong0_0@MustafaShukor1@TheoMoutakanni@willyhcchung Jade Lei Yu Tejaswi Kasarla @AllenBolourchi@ylecun@pascalefung
https://t.co/oUnjCaMKVv
My father is gone suddenly on Oct 21st 2025. A painter, director, educator and a pioneer in Chinese animation, he was also a devoted father and husband to the very end, he is missed but will be remembered forever.
Thanks @_akhaliq for sharing!
More about our VLWM:
- Non-pixel-generative world model that reasons in abstract semantic space
- Learned from 20k hours of unlabeled egocentric / web procedural videos with 5.7M action steps
- System-2 planning with reasoning by cost-guided plan search
Congrats to the whole team! @TheoMoutakanni@willyhcchung@yejin_bang , @ZiweiJi184538@AllenBolourchi@pascalefung
Thanks @_akhaliq for sharing!
More about our VLWM:
- Non-pixel-generative world model that reasons in abstract semantic space
- Learned from 20k hours of unlabeled egocentric / web procedural videos with 5.7M action steps
- System-2 planning with reasoning by cost-guided plan search
Congrats to the whole team! @TheoMoutakanni@willyhcchung@yejin_bang , @ZiweiJi184538@AllenBolourchi@pascalefung