A newer version of the Gradio SDK is available:
5.49.1
π§ Chatterbox TTS - Audiobook Edition Features
π New Voice Management System
The Audiobook Edition adds powerful voice management capabilities perfect for creating consistent character voices across your audiobook projects.
β¨ Key Features
π Voice Library Tab
- Organized Voice Storage: Keep all your character voices in one place
- Custom Voice Profiles: Save voice settings with names, descriptions, and reference audio
- Easy Voice Selection: Quick dropdown to switch between saved voices
- Voice Testing: Test voices before saving or using them
π Character Voice Management
- Voice Profiles: Each voice includes:
- Voice name (for file organization)
- Display name (human-readable)
- Description (character notes)
- Reference audio file
- Optimized settings (exaggeration, CFG/pace, temperature)
ποΈ Voice Testing & Configuration
- Live Testing: Test voice settings with custom text
- Parameter Tuning: Fine-tune exaggeration, CFG/pace, and temperature
- Instant Feedback: Hear changes immediately
- Save Optimized Settings: Store perfect settings for each character
π οΈ How to Use
1. Launch the Audiobook Edition
# Use the audiobook launcher
launch_audiobook.bat
2. Set Up Your Voice Library
- Go to the "π Voice Library" tab
- Set your voice library folder path (default:
voice_library) - Click "π Update Library Path"
3. Create a Voice Profile
- Upload Reference Audio: Upload 10-30 seconds of clear speech
- Configure Settings:
- Exaggeration: 0.3-0.7 for most voices
- CFG/Pace: Lower = slower, more deliberate
- Temperature: Higher = more variation
- Test the Voice: Use the test text to hear how it sounds
- Save Profile: Give it a name and description, then save
4. Use Saved Voices
- Select Voice: Choose from dropdown in Voice Library
- Load Voice: Click "π₯ Load Voice" to load settings
- Generate Speech: Switch to TTS tab and generate with loaded voice
π Voice Library Structure
voice_library/
βββ narrator_male_deep/
β βββ config.json # Voice settings
β βββ reference.wav # Reference audio
βββ character_female_young/
β βββ config.json
β βββ reference.mp3
βββ villain_gravelly/
βββ config.json
βββ reference.wav
π― Audiobook Workflow
Step 1: Character Planning
- List all characters in your audiobook
- Gather reference audio for each (record or find samples)
- Plan voice characteristics (age, personality, accent)
Step 2: Voice Creation
- Create a voice profile for each character
- Test and refine settings for consistency
- Save with descriptive names (e.g., "Harry_confident", "Hermione_intelligent")
Step 3: Production
- Load character voice before generating their dialogue
- Use consistent settings throughout the book
- Test voice regularly to maintain quality
Step 4: Quality Control
- Use the same test phrase for all characters
- Ensure voices are distinguishable
- Adjust settings if characters sound too similar
π‘ Pro Tips
Voice Creation
- Reference Audio: Use clean, noise-free recordings
- Length: 10-30 seconds is optimal
- Content: Natural speech, not overly dramatic
- Quality: Higher quality audio = better cloning
Settings Optimization
Exaggeration:
- 0.3-0.5: Subtle, natural voices
- 0.5-0.7: Standard character voices
- 0.7-1.0: Dramatic or distinctive voices
CFG/Pace:
- 0.3-0.4: Slow, deliberate (elderly, wise characters)
- 0.5: Standard pace
- 0.6-0.8: Faster pace (young, energetic characters)
Temperature:
- 0.5-0.8: Consistent delivery
- 0.8-1.2: More natural variation
- 1.2+: Creative but less predictable
Organization
- Naming Convention: Use descriptive names (character_trait_type)
- Descriptions: Include character details and usage notes
- Backup: Keep your voice_library folder backed up
- Version Control: Save multiple versions for different emotions
π§ Advanced Features
Voice Library Management
- Import/Export: Copy voice_library folder between projects
- Sharing: Share voice profiles with other audiobook creators
- Backup: Regular backups of your voice library
- Organization: Folder structure for different projects
Batch Processing (Future)
- Process entire chapters with character voice switching
- Automatic voice detection based on speaker tags
- Export management for audiobook production
π΅ Example Character Voices
Narrator
- Settings: Exaggeration 0.4, CFG 0.5, Temp 0.7
- Description: Clear, neutral, professional tone
- Use: Chapter narration, scene descriptions
Hero Character
- Settings: Exaggeration 0.6, CFG 0.6, Temp 0.8
- Description: Confident, determined, slightly higher energy
- Use: Main character dialogue
Wise Mentor
- Settings: Exaggeration 0.3, CFG 0.3, Temp 0.6
- Description: Slow, deliberate, thoughtful delivery
- Use: Advisor character, important wisdom
Comic Relief
- Settings: Exaggeration 0.8, CFG 0.7, Temp 1.0
- Description: Energetic, expressive, variable delivery
- Use: Funny sidekick, lighthearted moments
π‘οΈ Best Practices
- Consistency: Always use the same voice profile for each character
- Testing: Test voices regularly during production
- Backup: Keep voice profiles backed up
- Documentation: Maintain character voice notes
- Quality: Use high-quality reference audio
- Organization: Use clear naming conventions
Ready to create amazing audiobooks with consistent character voices? Launch the Audiobook Edition and start building your voice library! π§β¨