MOSS-TTS-v1.5 just reached #1 on Hugging Face Trending for Text-to-Speech, with 20.6K downloads.
A multilingual, controllable TTS model with stable voice cloning, long-form generation, and precise pause control.
MOSS-TTS-v1.5 is now officially supported by vLLM-Omni and SGLang-Omni.
Built by OpenMOSS-Team.
Try it:
GitHub: https://t.co/mSlALD6Fzy
Hugging Face: https://t.co/qTv7xu1MZ5
ModelScope: https://t.co/NzAXgAzagL
ElevenLabs just lost its moat 🤯
They charges $5 to $99/month for AI voice cloning. Their Business plan costs $1,320/month.
Someone open-sourced a Voice AI that clones any voice from just a 3-second audio clip, running 100% locally on your machine.
→ 646 languages - ElevenLabs supports 32
→ Voice design: gender, age, accent, pitch, emotion, dialect
→ Paste a YouTube URL
→ transcribes → translates → re-voices → MP4
→ Global dictation widget: ⌘+⇧+Space from any app
→ Demucs vocal isolation - keeps the background music
→ Pyannote diarization - auto-tags who said what
→ Batch queue: drop 50 videos, walk away
→ MCP server - call it from Claude or Cursor
→ AudioSeal watermarking (by Meta) baked in
100% Open Source. Already 3.6k stars.
Microsoft just dropped Trellis.2 — a 4B model that converts any image into a 3D asset in 3 seconds.
Textured mesh under 100ms on CUDA, outputs a GLB file ready for Blender, Unity, and Unreal.
Open source. Any image. 3 seconds.
THE 3D ASSET PIPELINE JUST CHANGED FOREVER.
🔥 We release Gamma-World from @nvidia — a generative multi-agent world model that finally goes beyond 2 players.
⚡ 24 FPS real-time streaming
🧩 Simplex Rotary Agent Encoding — permutation-symmetric
🌐 Sparse Hub Attention: O(N²) → O(N)
🎯 2 → more players, zero-shot
🤖 Games → real multi-robot worlds
━━━━━━━━━━━━━━━
💥 THE SINGLE-AGENT ERA IS OVER. 💥
━━━━━━━━━━━━━━━
🔗 https://t.co/TuyX2d2XuT
AutoCAD costs $2,030 a year.
SolidWorks costs $4,195 a year just for the standard license.
Someone open sourced a 3D parametric modeler that does most of what both of them do, and you can install it on Windows, Mac, and Linux for $0.
It's called FreeCAD.
Sketch geometry-constrained 2D shapes, extrude them into parametric 3D models, and edit any parameter in the history tree to update the entire design instantly.
Reads and writes STEP, IGES, STL, DXF, OBJ, and IFC out of the box.
28.9K stars. LGPL-2.1. 100% Opensource.
https://t.co/wNhWxN9dAM
Stable Audio 3 by @StabilityAI is just out
It mainly comes with 3 open source variants:
- Stable Audio 3 Medium (2B)
- Stable Audio 3 Small (0.6B) - Music
- Stable Audio 3 Small (0.6B) - VFX
(and a "large" closed variant)
The open models are really fast and high quality
Holy sh*t... a company that raised $32M just open sourced their entire product for free.
It's called cal .diy.
The Cal .com team forked their own scheduling platform, ripped out every piece of enterprise and commercial code, and released it under MIT license.
43.6K GitHub stars. And counting.
Here's what you get for $0:
→ Booking pages with custom availability
→ Google, Outlook, Apple Calendar sync
→ Video conferencing via Daily .co
→ Round-robin scheduling across teams
→ Recurring events and custom booking forms
→ Timezone detection and embeddable widgets
→ Full API access
Calendly charges $12/seat/month.
SavvyCal charges $12/seat/month.
Cal .com's hosted version starts at $15/month.
cal .diy does the same thing for nothing.
No license key. No feature gates. No user limits. No seat pricing. Self-hosted on your own server.
Your scheduling data never leaves your machine.
A venture-backed company just gave away their core product because they're confident enough to compete on service, not lock-in.
That's the most dangerous kind of open source.
100% Open Source. MIT License.
( Link in comments )
there's no catch; SAM3 is open source and really good
one of the things it does really well is object tracking, even in crazy complex scenes like basketball
probably my favorite computer vision model ever
open sourcing Marlin-2B 🐟
a tiny VLM to extract structured information from videos
Marlin is finetuned for two questions devs want to ask in their videos: what is happening, and when?
Best open model in its weight class, competitive with Gemini-2.5-flash at only 2B params 🧵
MIT open-sourced an AI model that converts photos into fully editable CAD programs and it quietly kills the $150/hour CAD modeling industry.
Just upload a sketch or photo and it generates the full parametric 3D model. exportable as STL. ready for manufacturing.
→ no SolidWorks license
→ no weeks of modeling
→ no CAD engineer needed
100% Open Source
Check out Ariticraft 🦾 - a highly efficient agentic system that generates articulated 3D assets fully automatically at scale!
🚀 https://t.co/anSM87Li49
Awesome Robotics! 💾
Building your own robot just got a lot easier.
MathWorks have released an open-source GitHub repository packed with robotics resources for anyone interested in getting hands-on.
The repo includes examples for robot arms, ground vehicles, and drones, with projects that show how to connect with ROS and ROS2 or even deploy Simulink models directly as ROS nodes.
There are also more advanced demos, like modeling off-road environments and testing navigation algorithms in photorealistic simulations.
Everything is well-documented, with tutorials and links that make it easy to go from concept to prototype. 📑
Whether you’re a student, researcher, or just curious, there’s material here for every level. And since it’s an open community project, you can not only explore but also share your own contributions.
For anyone looking to learn robotics by doing, this is a solid place to start!
Here’s the link: https://t.co/2KyP6YMFny
~~
♻️ Join the weekly robotics newsletter, and never miss any news → https://t.co/GoA3ZuwoPB
This python library scrapes any website while bypassing every bot protection on the internet.
It rotates fingerprints, mimics real browser headers, and handles CAPTCHAs automatically.
→ Bypasses Cloudflare, Akamai, DataDome
→ Headless Chromium + Playwright built-in
→ Auto-retries, proxy rotation, session pools
→ Same engine that powers Apify
100% Open Source.
"You need a 24 GB GPU for serious local LLMs in 2026."
Everyone repeats this. It's not true anymore.
Just ran a 35B-parameter model on an RTX 4060 Ti 8 GB: • 41 tok/s at 16k context • 24 tok/s at 200k context
Recipe + benchmarks below 🧵