qwen3-TTS-studio: ElevenLabs-style voice cloning + NotebookLM-style podcast generation, but local
(github.com)
from yogthos@lemmy.ml to technology@lemmy.ml on 03 Feb 08:56
https://lemmy.ml/post/42616848
- Clone any voice with just a 3-second audio sample
- Fine-tune parameters (temperature, top-k, top-p) with quality presets
- Generate complete podcasts from just a topic – AI writes the script, assigns voices, and synthesizes everything
- 10 languages supported (Korean, English, Chinese, Japanese, etc.)
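
For anyone wondering what the temperature, top-k, and top-p knobs actually do during generation, here is a rough sketch in plain Python. It is not taken from the project's code: the `sample_token` function and the `PRESETS` values are made up for illustration, and the real presets may look quite different.

```python
import math
import random

# Hypothetical presets for illustration only; not the project's actual values.
PRESETS = {
    "stable": {"temperature": 0.6, "top_k": 20, "top_p": 0.8},
    "expressive": {"temperature": 1.0, "top_k": 50, "top_p": 0.95},
}

def sample_token(logits, temperature=1.0, top_k=0, top_p=1.0, rng=None):
    """Pick one token index from raw logits using temperature, top-k and top-p."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    rng = rng or random.Random()

    # Temperature: lower values sharpen the distribution, higher values flatten it.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]  # subtract max for numerical stability
    total = sum(exps)
    probs = [e / total for e in exps]

    # Rank candidates from most to least likely.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)

    # Top-k: keep only the k most likely candidates (0 disables the filter).
    if top_k > 0:
        order = order[:top_k]

    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Sample among the survivors, weighted by their probabilities.
    return rng.choices(kept, weights=[probs[i] for i in kept], k=1)[0]
```

Roughly speaking, a conservative preset (low temperature, small top-k) should give more consistent output, while a looser one allows more variation at the cost of occasional glitches.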
Currently uses GPT-5.2 for script generation, but the architecture is modular – you can swap in any local LLM (Qwen, Llama, etc.) if you want a fully local setup.
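
To give an idea of what a swappable script backend could look like, here is a minimal sketch. It is an assumption about the general shape, not the project's actual interface: `ScriptBackend`, `CannedBackend`, and `write_script` are hypothetical names, and the canned backend just returns fixed text where a real one would call a hosted or local model.

```python
from typing import List, Protocol, Tuple

class ScriptBackend(Protocol):
    """Hypothetical interface: anything that turns a prompt into text."""
    def generate(self, prompt: str) -> str: ...

class CannedBackend:
    """Stand-in backend that returns fixed text; a real one would call an LLM."""
    def generate(self, prompt: str) -> str:
        return (
            "HOST: Welcome to the show.\n"
            "GUEST: Thanks for having me.\n"
            "HOST: Let's get started."
        )

def write_script(topic: str, backend: ScriptBackend) -> List[Tuple[str, str]]:
    """Ask the backend for a script and split it into (speaker, line) pairs."""
    prompt = (
        f"Write a two-speaker podcast script about: {topic}\n"
        "Format every line as 'SPEAKER: text'."
    )
    raw = backend.generate(prompt)
    turns = []
    for line in raw.splitlines():
        if ":" not in line:
            continue  # skip anything that is not a speaker turn
        speaker, text = line.split(":", 1)
        turns.append((speaker.strip(), text.strip()))
    return turns

# Each (speaker, text) pair could then be handed to the TTS step with that speaker's voice.
turns = write_script("local voice cloning", CannedBackend())
```

Because `write_script` only depends on a `generate` method, switching from a hosted model to a local one would mean writing one new backend class and leaving the rest alone.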