Seamless4MT: Massive Multilingual Multimodal Machine Translation.
Language translation + speech recognition + speech synthesis in a single model: speech-to-speech, text-to-text, speech-to-text and text-to-speech.
Works for 100 languages.
Code available under CC-BY-NC license.
From Meta - Fundamental AI Research.
Blog post: https://t.co/lyYF0b0rjr
Demo: https://t.co/Kwh2mIHuE6
Paper: https://t.co/kM7PUGprEX
Code: https://t.co/78jhrGt7T1
📢 Today, we announce that Metaco becomes part of @Ripple, joining forces in providing exceptional mission-critical software infrastructure solutions that empower institutions to thrive in the rapidly evolving digital asset economy.
To learn more: https://t.co/U68za92n6o
'Can AI influence us to do more than risk our jobs?'
A snippet from Yuval's keynote presentation on April 29 2023 on 'AI and the future of humanity' at the Frontiers Forum.
Full video: https://t.co/J6Uo99nfxp
In the coming months, we’re launching a new tool called "About this image." It will show important context like when the image was first indexed by Google, where it may have first appeared, and other background information so you can get a better understanding of whether an image is reliable — or if you need to take a second look. Learn more: https://t.co/sxo0QEmZIi #GoogleIO #Google
«Nous ne sommes pas comme les dinosaures qui ont été détruits par un astéroïde venu de l’espace.
Nous avons créé les problèmes.
Nous pouvons les résoudre.»
L’historien @harari_yuval invité exceptionnel de @RTSmonde ce lundi à 8h10.
With great scale comes great responsibility 🇪🇺
Extra #DSA obligations as of 25 August for:
AliExpress
Amazon Store
AppStore
Bing
Booking
Facebook
Google Maps
Google Play
Google Search
Google Shopping
Instagram
LinkedIn
Pinterest
Snapchat
TikTok
Twitter
Wikipedia
YouTube
Zalando
We’re releasing GPT-4 — a large multimodal model (image & text in, text out) which is a significant advance in both capability and alignment.
Still limited in many ways, but passes many qualification benchmarks like the bar exam & AP Calculus: https://t.co/L6VGJ0WfFv
*If* GPT-4 is multimodal, we can predict with reasonable confidence what GPT-4 *might* be capable of, given Microsoft’s prior work Kosmos-1:
- Visual IQ test: yes, the ones that humans take!
- OCR-free reading comprehension: input a screenshot, scanned document, street sign, or any pixels that contain text. Reason about the contents directly without explicit OCR. This is extremely useful to unlock AI-powered apps on multimedia web pages, or “text in the wild” from real world cams.
- Multimodal chat: have a conversation about a picture. You can even provide “follow-up” images in the middle.
- Broad visual understanding abilities, like captioning, visual question answering, object detection, scene layout, common sense reasoning, etc.
- Audio & speech recognition (??): wasn’t mentioned in Kosmos-1 paper, but Whisper is already an OpenAI API and should be fairly easy to integrate.
Note: the predictions are based on what Andreas Braun, Microsoft Germany CTO, allegedly said. They may or may not be accurate (that’s why I call it “prediction”). But Kosmos-1 is very real and rock solid. It offers a glimpse of either GPT-4 or whatever AI service that Microsoft will provide next. I find it difficult to believe Kosmos-1 will stay in the lab and not become a product.
In any case, prepare yourself for multimodal APIs - they’ll happen sooner or later!
#Communiqué_de_presse 🕹👾Clôture du projet pilote de Memoriav sur la situation de la conservation des jeux vidéo en Suisse. Le rapport final est désormais disponible et sera présenté dans le cadre d’une manifestation en ligne le 25 janvier. https://t.co/sdpLc7Cakc
@jcschwaab @jphwalter @GregoireBarbey @Forum_RTS Pour agir il faut savoir ce qui se passe, non? Qui surveille en Suisse le mode de fonctionnement des algorithmes de recrutement ? On laisse le soin aux américains de réguler ?
@jcschwaab @jphwalter @GregoireBarbey @Forum_RTS Comment savoir qu’il y a discrimination quand ni le candidat ni l’entreprise ne connaissent le fonctionnement de l’algorithme ? Audit obligatoire comme à NY? Qui comme auditeur? Sur quelle bases?
Soccer star Lionel Messi's financial package in his move to French club Paris St Germain includes a payment in crypto currency fan tokens, a source close to the matter told Reuters on Thursday. https://t.co/uqsTSE9N9i
Le droit suisse ne protège pas suffisamment le débat libre dans l'espace public numérique. C'est la conclusion de @smetille. Dans son étude pour @ofcomCH, il recommande des mesures que l'État devrait prendre à l'égard des plateformes https://t.co/kiiZavVIyn