🚀 Day 0: Warming up for #OpenSourceWeek!
We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcing 5 repos, sharing our small but sincere progress with full transparency.
These humble building blocks in our online service have been documented, deployed and battle-tested in production.
As part of the open-source community, we believe that every line shared becomes collective momentum that accelerates the journey.
Daily unlocks are coming soon. No ivory towers - just pure garage-energy and community-driven innovation.
Meet Hibiki, our simultaneous speech-to-speech translation model, currently supporting 🇫🇷➡️🇬🇧.
Hibiki produces spoken and text translations of the input speech in real-time, while preserving the speaker’s voice and optimally adapting its pace based on the semantic content of the source speech.
Based on objective and human evaluations, Hibiki outperforms previous systems for quality, naturalness and speaker similarity and approaches human interpreters. 🧵
Excited to announce LTX-Video!
Our new text-to-video model generates stunning, high-quality videos faster than real-time—5 seconds of 24fps video at 768x512 in just 4 seconds on an Nvidia H100! ⚡
We’re open-sourcing the code & weights. Check out the results 🎥👇
When you’re building a new product, you’re often thinking about all the new things people are going to be able to do with it.
But there’s a better question to ask: What are people going to stop doing once they start using your product?
What does your product replace? What are they switching from? How did they do the job before your product came along? Whatever it is, it’s already being done some other way.
Habit, momentum, familiarity, anxiety of the unknown – these are incredibly hard bonds to break. When you try to sell someone something, you have to overcome those bonds and break the grip of that gravity.
So, when you’re thinking about your product, think about what it replaces, not just what it offers. What are you asking people to leave behind when they move forward with you? How hard will that be for them? How can you help them overcome everything that’s tugging them in the opposite direction?
🎥 Today we’re premiering Meta Movie Gen: the most advanced media foundation models to-date.
Developed by AI research teams at Meta, Movie Gen delivers state-of-the-art results across a range of capabilities. We’re excited for the potential of this line of research to usher in entirely new possibilities for casual creators and creative professionals alike.
More details and examples of what Movie Gen can do ➡️ https://t.co/M19x2ndwnr
🛠️ Movie Gen models and capabilities
Movie Gen Video: 30B parameter transformer model that can generate high-quality and high-definition images and videos from a single text prompt.
Movie Gen Audio: A 13B parameter transformer model that can take a video input along with optional text prompts for controllability to generate high-fidelity audio synced to the video. It can generate ambient sound, instrumental background music and foley sound — delivering state-of-the-art results in audio quality, video-to-audio alignment and text-to-audio alignment.
Precise video editing: Using a generated or existing video and accompanying text instructions as an input it can perform localized edits such as adding, removing or replacing elements — or global changes like background or style changes.
Personalized videos: Using an image of a person and a text prompt, the model can generate a video with state-of-the-art results on character preservation and natural movement in video.
We’re continuing to work closely with creative professionals from across the field to integrate their feedback as we work towards a potential release. We look forward to sharing more on this work and the creative possibilities it will enable in the future.
📣📣📣We are excited to announce the release of Open-Sora Plan v1.1.0.
🙌Thanks to ShareGPT4Video's capability to annotate long videos, we can generate higher quality and longer videos.
🔥🔥🔥We continue to open-source all data, code, and models!
https://t.co/C28gHbiPrU
We've been in the kitchen cooking 🔥 Excited to release the first @AIatMeta LLama-3 8B with a context length of over 1M on @huggingface - coming off of the 160K context length model we released on Friday!
A huge thank you to @CrusoeEnergy for sponsoring the compute. Let us know if you want to work with our team on custom models or automating business workflows: https://t.co/QCCHZ2Qzt2
🔗 https://t.co/HOd6SwSq28
The hidden mechanism behind AI regulations:
1. Claim that AI can do everything .
2. Raise tons of money from investors.
3. Tell governments that AI is very dangerous and that open source AI should be regulated out of existence.
....
4. Profits!