Local11Labs: High-Quality Text-to-Speech & Podcast Generator

Local11Labs is a powerful text-to-speech and podcast generation tool powered by the lightweight Kokoro-82M model. Generate natural-sounding speech and multi-speaker podcasts locally on your machine.

Key Changes

Gradio WebUI Integration
- Integrated Gradio WebUI into the application to enhance user interaction and accessibility.
Text-to-Speech (TTS) Tab
- Input Options: Users can provide input via a text field or upload a .txt file.
- Speech Rhythm Control: Adjust the rhythm and pacing of generated speech.
- Voices Dropdown Menu: Select from a variety of available voice profiles.
- Device Selection: Automatic device detection with the option to specify CUDA or CPU.
Podcast Dialogue Generation Tab (Powered by Gemini)
- Enables dynamic generation of podcast dialogues.
- Input customizable host names to create unique and personalized conversations.
Podcast Audio Generation
- Script Input Options:
  - Use the JSON output from the Dialogue Generation tab.
  - Upload a custom JSON file.
  - Directly edit or input text within the interface.
- Dynamic Host Mapping:
  - Assign available voices to host names.
  - Add as many hosts as needed without any limitations.
- Speech Rhythm Control: Fine-tune the speech rhythm for each host to enhance the audio experience.

Key Features

⚡ Fast and efficient text-to-speech generation
🎙️ Multi-speaker podcast creation with distinct voices
📝 Smart text chunking for handling long content
🎛️ Customizable voice profiles with caching
🔊 Professional audio quality with natural pauses and transitions
🚀 Easy to use with minimal setup required

Podcast demo

podcast_compressed.mp4

Quick Start

demo_podcast_compressed.mp4

Text-to-Speech Demo

Try out basic text-to-speech generation in our interactive Colab notebook:

Podcast Generation Demo

Create multi-speaker podcasts using our podcast generation notebook:

Webui demo :

Notebook will be added soon for not you can clone it and run it.

TODO List

Add REST API endpoints for remote TTS generation
Implement streaming audio support
Create web interface for easy usage
Add Docker support for easy deployment

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
src		src
.gitignore		.gitignore
README.md		README.md
app.py		app.py
podcast_generator.py		podcast_generator.py
podcast_tab.py		podcast_tab.py
requirements.txt		requirements.txt
text_to_speech_tab.py		text_to_speech_tab.py
ui-sceenshot1.png		ui-sceenshot1.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Local11Labs: High-Quality Text-to-Speech & Podcast Generator

Key Changes

Key Features

Podcast demo

Quick Start

Text-to-Speech Demo

Podcast Generation Demo

Webui demo :

TODO List

Screenshots :

About

Releases

Packages

Contributors 2

Languages

nhaouari/local11labs

Folders and files

Latest commit

History

Repository files navigation

Local11Labs: High-Quality Text-to-Speech & Podcast Generator

Key Changes

Key Features

Podcast demo

Quick Start

Text-to-Speech Demo

Podcast Generation Demo

Webui demo :

TODO List

Screenshots :

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages