Hey everyone! Just dropped a major update to ChatterBox Voice that transforms how you create multi-character audio content.
Also, as people asked for in the last update, I updated the workflows examples with the new F5 nodes and The Audio Wave Analyzer used for the F5 speech precise editing. Check them on GitHub or if already installed Menu>Workflows>Browse Templates
P.S.: very recently I found a bug on Chatterbox when you generate small segments in sequence you have a high chance of having a CUDA error with a ComfyUI crash. So I added a crash_protection_template system that will increase small segments to avoid this. Not ideal, but it’s not something I can fix as far as I know.
Stay updated with the my latest workflows development and community discussions:
💬 Discord: Join the server 🛠️ GitHub: Get the latest releases
LLM text (I reviewed, of course):
🌟 What’s New in 3.1?
Character Switching System
Create audiobook-style content with different voices for each character using simple tags:
Hello! This is the narrator speaking. [Alice] Hi there! I’m Alice with my unique voice. [Bob] And I’m Bob! Great to meet you both. Back to the narrator for the conclusion.
Key Features:
Works across all TTS nodes (F5-TTS or ChatterBox and on the SRT nodes) Character aliases - map simple names to complex voice files for eady of use Full voice folder discovery - supports folder structure and flat files Robust fallback - unknown characters gracefully use narrator voice Performance optimized with character-aware caching
Overlapping Subtitles Support
Create natural conversation patterns with overlapping dialogue! Perfect for:
Realistic conversations with interruptions Background chatter during main dialogue Multi-speaker scenarios
🎯 Use Cases
Audiobooks with multiple character voices Game dialogue systems Educational content with different speakers Podcast-style conversations Accessibility - voice distinction for better comprehension
📺 New Workflows Added (by popular request!)
🌊 Audio Wave Analyzer - Visual waveform analysis with interactive controls 🎤 F5-TTS SRT Generation - Complete SRT-to-speech workflow 📺 Advanced SRT workflows - Enhanced subtitle processing
🔧 Technical Highlights
Fully backward compatible - existing workflows unchanged Enhanced SRT parser with overlap support Improved voice discovery system Character-aware caching maintains performance
📖 Get Started
Download v3.1.0 Complete Character Switching Guide Example workflows included!
Perfect for creators wanting to add rich, multi-character audio to their ComfyUI workflows. The character switching works seamlessly with both F5-TTS and ChatterBox engines.
💬 Discussion r/StableDiffusion (99 points, 16 commentaires) 🔗 Source