Language Availability Per Character

Characters have reference audio in some or all of five language options: Auto, Chinese, English, Japanese, Korean. "Auto" picks whichever language is available for that character. If a character lacks a reference recording in your chosen language, the submit button stays disabled until you switch to a supported language or character.

Single sentences or short paragraphs with proper punctuation
Under 1000 characters tends to sound more natural
Clear emotional context helps delivery

Up to 2000 characters supported
Split at natural paragraph breaks, generate in segments
Tone description can guide overall pacing

What the Tone Description Field Does

The tone description is optional, up to 500 characters. Use it to pass delivery instructions like "slightly slower pace, gentle tone" or "cold, low-pitched, slightly weary." This text never appears in the generated audio — it only influences how the model interprets the voice. If left empty, the model infers tone from the text itself.

History and Results

After submitting, stay on the page and wait — generation typically takes 20–60 seconds. The audio appears directly on the page once ready, where you can preview and download it. The last 7 days of generations are available in the history panel, showing character name, language, character count, and credits used.

Tips for Better Results

Include complete punctuation in the text — pauses and intonation are more accurate when the model can read sentence boundaries

Avoid mixing multiple languages in a single submission (e.g. Chinese and English interleaved); this can reduce voice fidelity

For very long scripts, split at dialogue paragraph breaks and generate separately, then combine

Character names and place names are best written in the character's native language

Genshin Voice Synthesis

Language Availability Per Character

Short lines work best

Longer narration

What the Tone Description Field Does

History and Results

Tips for Better Results