Skip to content

Roadmap & Features

We aim to provide a comprehensive voice-over solution for Unity.

FeatureDescriptionStatusProvider
Multi-Provider SupportToggle between ElevenLabs and Sarvam AI seamlessly.ImplementedAll
Text-to-Speech (TTS)Generate lifelike speech from text using standard models.ImplementedAll
Voice SelectionBrowse and select voices from your provider library.ImplementedAll
Batch GenerationGenerate audio for multiple lines/steps at once.ImplementedAll
Voice HistoryView and retrieve past generations.ImplementedVoiceover
ZIP ExportExport generated audio as a ZIP archive.ImplementedAll
Speech-to-SpeechTransform input audio into a different voice.🚧 PlannedVoiceover
Multilingual Voice OverGenerate voiceovers in multiple languages of the user's choice.🚧 PlannedAll
Sound Effects (SFX)Generate sound effects from text descriptions.🚧 PlannedVoiceover
Runtime APIGenerate voiceovers dynamically in a built game.🚧 PlannedAll
Timeline IntegrationNative integration with Unity's Timeline for cutscenes.BacklogAll
Lip-Sync (Viseme) GenerationAuto-generate blendshape data alongside audio for character speaking animations.BacklogAll
Local / Offline TTSSupport for local models (e.g., Piper, Coqui) for fully offline generation.BacklogLocal
Narrative Tools IntegrationImporters for Yarn Spinner, Ink, and Dialogue System for Unity.BacklogAll

Legend: ✅ Implemented | 🚧 Planned (Next Up) | ⏳ Backlog (Later)

Released under the MIT License.