Show HN: Audio AI had a wild day – 5 major open-source / real-time TTS drops | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		Show HN: Audio AI had a wild day – 5 major open-source / real-time TTS drops (github.com/flashlabs-ai-corp)
		5 points by pratik227 3 months ago \| hide \| past \| favorite \| 2 comments
		The audio/TTS space just moved fast. In the last week alone: NVIDIA – PersonaPlex-7B Open-source, full-duplex conversational speech model. Inworld AI – TTS-1.5 Realtime TTS (<250ms), $0.005/min, currently #1 on Artificial Analysis. Flash Labs – Chroma 1.0 First open-source, end-to-end, real-time speech-to-speech model. Alibaba Qwen – Qwen3-TTS Fully open-sourced TTS family: Base, CustomVoice, VoiceDesign. Kyutai Labs – Pocket TTS Runs locally on a laptop. No GPU required. Feels like TTS is hitting the same acceleration moment LLMs had last year. Realtime, open-source, and local is becoming the default. Curious what people here are building with this

knowitnone3 3 months ago [–]

https://github.com/FlashLabs-Corp/FlashLabs-Chroma is gone

toolhouseAI 3 months ago | [–]

https://github.com/FlashLabs-AI-Corp/FlashLabs-Chroma

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact