Top Tweets for #speechtech
Sharing this great opportunity co-located with #Interspeech2026 for PhD students in speech science and technology, highly encourage eligible students to attend! #speechtech #speechscience
📢PhD Students in Speech Science and Technology!
Join the 12th ISCA-SAC Doctoral Consortium @ #Interspeech2026 🇦🇺
Get expert mentorship & network with peers and leaders.
📍UNSW, Sydney
🗓️Sep 26, 2026
⏰DDL: June 17
💰FREE registration!
Details: https://t.co/Bn7h0BCmnY

Talking to AI should feel natural, not mechanical—Atomesus Speech brings voice, speed, and human-like interaction together, with the Developer API coming soon to help you build voice-first experiences effortlessly ⚡
#AI #VoiceAI #SpeechTech #Developer #API #BuildInPublic #Startups #IndiaAI

Our #PickOfTheWeek by @sarapapi: "RASST: Fast Cross-modal Retrieval-Augmented #Simultaneous #Speech #Translation" by Jiaxuan Luo, @siqi_ouyang, and @lileics (2026).
#RAG #SpeechTech
Very interesting new paper about combining Simultaneous Speech Translation and RAG to improve translation quality! Check it out: https://t.co/xptPCHHE0Y
#Speech #SpeechTech #Translation #RAG
Our #PickOfTheWeek by @mgaido91: "AZEROS: Extending LLM to Speech with Self-Generated Instruction-Free Tuning" by Yiwen Shao, Wei Liu, Jiahong Li, Tianzi Wang, Kun Wei, Meng Yu, Dong Yu (2025)
https://t.co/JqnYl2W9xU
#Speech #SpeechTech
Cool work showing we can build a generic speech LLM capable of addressing a wide range of tasks by leveraging only ASR data and training a small adapter between the speech encoder and the LLM! https://t.co/yYqD4xLNL8
@fbk_mt
New from Qwen: Open-source multilingual TTS that does voice design, cloning & speech gen—in real time. Built for devs, not demos. #AI #CRO #Startups #SpeechTech #OpenSource
https://t.co/ivhHFmKRcs
Our #PickOfTheWeek by @lina_conti: "#Voice #gender #diversity: expression, perception and acoustics" by Victor Rosi and Carolyn McGettigan (Royal Society Open Science 2025).
#Speech #SpeechTech
💬Pick of the week @fbk_mt: "Voice gender diversity: expression, perception and acoustics."
A comprehensive review of gender-diverse voices. Covers acoustic properties, perception, and the gap between self-expression of gender and how voices are heard.
https://t.co/Barx89hfWC
GenAI Pro speech helps creators turn text into realistic voiceovers. Fast, simple, and creator-friendly.
#AIVoice #SpeechTech #CreatorEconomy
Read more: https://t.co/dnzgH9Av3x

Side will take part in #TheAISummit, held in New York on December 10 and 11! 🏙️
See the article below for details about the event👇
https://t.co/43NVvsBv68
If you will be there in person, feel free to get in touch.
#AI #MachineLearning #Datasets #SpeechTech
AI Voice Generators synthesize realistic speech from text, enabling applications in media, accessibility, and virtual assistants. They enhance personalization and automation in digital communication. #AIVoice #SpeechTech #AIart https://t.co/HZlFjVItz2
Excited to share that I’m joining @MIL_UTokyo at The University of Tokyo as a Project Assistant Professor! 🎉 Working at the cutting edge of Speech × AI. 🇯🇵🔊🤖 #AI #SpeechTech
Our #PickOfTheWeek by @beomseok_lee_: "Can Speech LLMs Think while Listening?" by @yijenshih, @rdesh26, Chunyang Wu, Wei Zhou, SK Bong, @YasheshGaur, Jay Mahadeokar, Ozlem Kalinli, and Mike Seltzer (2025).
#Speech #SpeechLLM #LLM #SpeechTech #AI
Can we make Speech LLMs actually think as they listen? 👂💭
This fascinating work applies CoT inspired by human “thinking while listening”, training models to find the inflection point when reasoning starts.
📄 https://t.co/8sNi8d47gk
🤖 Laid off from Meta AI? @smallest_AI is hiring in San Francisco with $200K–$600K salaries & equity! 🗣️💼 #AIJobs #SmallestAI #MetaLayoffs #SpeechTech #IndianStartups #startupUpdates #Startups #Innovation #IndiaEconomy #viralpost #scoopearth #scoopearth.in #scoopearthmagazine

Just checked out AssemblyAI's speech-to-text API and wow, accuracy really does make a difference. I can finally stop fighting with transcripts in my voice note app projects. Worth a look if you've hit similar headaches. #speechtech #devtools https://t.co/MqxPcE3KA9
@mgaido91 and @RoldanoFBK presenting our SimulStream Demo at the @DI_FBK Demo Day at @FBK_research!
The open-source tool, which is going to be released soon, natively supports any speech-to-text #HuggingFace models! 🤖
#SpeechTech #Translation

🚀 Excited to present FAMA, the first large-scale #OpenScience #Speech foundation model for 🇮🇹 Italian & 🇬🇧 English, at #clicit2025 (17:30–18:45 oral session)!
🔗 Models: https://t.co/CMYObCFV0y
📊 Data: https://t.co/BWFC2lu1QV
💻 Code: https://t.co/MITSYRbH1m
#SpeechTech

Thanks @slatornews for featuring Marco-Voice! 🗣️
Pushing boundaries in TTS with unified voice cloning & emotion control.
Check out our work and join us in advancing expressive speech synthesis!
https://t.co/xTfwBQ8pCJ
#AI #SpeechTech #MarcoVoice
Alibaba unveils Marco-Voice, a new text-to-speech system that combines #voicecloning 🗣️ and emotional #speechsynthesis, 😐😄😠😢😮 delivering more natural and expressive synthetic #speech in Mandarin and English.
@Chenyang_Lyu @wangly0229 @AlibabaGroup
https://t.co/Q2wXtU2dez
I am excited to share my latest project: "Hey Rodea": An AI Speech Coach & Transcriber.
👉Try it live: https://t.co/mNejESWIb6
Upload or record your speech → get transcript + scores (Clarity, Confidence, Engagement) + feedback
#AI #SpeechTech #DataScience
🚀 As @TaliaGold notes, @davidgu & @recallai are scaling fast, making conversations accessible in real time at scale. Powering AI agents, automation & more. And yes, they’re hiring! 👀
#AI #SpeechTech #AIInfrastructure #AgentAI #Hiring #StartupJobs
Excited to back @Recallai's $38M Series B.
Context fuels AI agents and apps. The richest untapped source of context is conversations. This data has been locked away in meetings, calls, and chats.
Until @recallai. Recall makes it easy to capture and use conversation data across every platform. In real time. At scale. Via a single API.
It processes billions of minutes for customers like HubSpot, DataDog, Rippling, and ClickUp.
RecallAI powers AI scribes, recruiters, medical apps, workflow and CRM automation, and more. The results speak for themselves: 40x growth in 2 years, adoption across the ecosystem.
@davidgu, @amandarecall, and team are building the backbone of conversation intelligence. Bessemer is proud to support their journey.
Python’s driving speech AI forward! 🗣️
Tools like AssemblyAI use Python for high-accuracy transcription & call analytics. Voice tech is booming! 🎙️
#Python #AI #SpeechTech
Universal, our async speech-to-text model, now handles 99 languages through a single endpoint with production-grade accuracy.
Here's what's new:
1. Automatically detect all 99 languages (up from 17!)
2. Identify speakers in 95 languages with precision
3. 2-3x faster processing for languages like Spanish, French, and German
4. You can set expected languages and fallback options tailored to your specific use case
Super simple to get started, just set language_detection=True in your POST request.
Check out our blog and documentation below to explore Universal's capabilities today!
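As a rough illustration of the "set language_detection=True in your POST request" step, here is a minimal sketch that only builds the request and does not send it. The endpoint URL, the `authorization` header, and the `audio_url` field are assumptions based on AssemblyAI's public v2 REST API and should be checked against the current documentation; `build_transcript_request` is a hypothetical helper name, and the audio URL and API key are placeholders.

```python
import json

# Assumed endpoint (AssemblyAI v2 REST API); verify against the current docs.
TRANSCRIPT_ENDPOINT = "https://api.assemblyai.com/v2/transcript"


def build_transcript_request(audio_url: str, api_key: str) -> dict:
    """Assemble the URL, headers, and JSON body for a transcription request.

    Hypothetical helper: it only prepares the request, it does not send it.
    """
    payload = {
        "audio_url": audio_url,       # assumed field name for the audio source
        "language_detection": True,   # the flag mentioned in the announcement
    }
    return {
        "url": TRANSCRIPT_ENDPOINT,
        "headers": {
            "authorization": api_key,  # assumed header name for the API key
            "content-type": "application/json",
        },
        "body": json.dumps(payload),
    }


# Placeholder values for illustration only.
request = build_transcript_request(
    "https://example.com/audio.mp3", "YOUR_API_KEY"
)
print(request["body"])
```

Note that Python's `True` is serialized as lowercase `true` in the JSON body. The prepared request can then be sent with any HTTP client.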