Overview
Chinese phoneme control uses pinyin with tone numbers, also known as tone3 pinyin. Wrap each syllable in its own <|phoneme_start|> and <|phoneme_end|> tag pair.
Tone Numbers
Put the tone number at the end of each pinyin syllable:

| Tone | Example | Description |
|---|---|---|
| 1 | ma1 | High level |
| 2 | ma2 | Rising |
| 3 | ma3 | Dipping |
| 4 | ma4 | Falling |
| 5 | ma5 | Neutral |
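
The table can be turned into a quick format check. The sketch below is illustrative only: the function names `is_tone3` and `split_tone` are hypothetical, and the pattern assumes a syllable is lowercase letters (including ü) followed by exactly one tone digit from 1 to 5.

```python
import re

# Assumption: lowercase letters (a-z or ü) followed by a single tone digit 1-5.
_TONE3_PATTERN = re.compile(r"[a-zü]+[1-5]")


def is_tone3(syllable: str) -> bool:
    """Return True if the syllable looks like tone3 pinyin, e.g. 'ma3'."""
    return _TONE3_PATTERN.fullmatch(syllable) is not None


def split_tone(syllable: str):
    """Split a tone3 syllable into (letters, tone number)."""
    if not is_tone3(syllable):
        raise ValueError(f"not a tone3 syllable: {syllable!r}")
    return syllable[:-1], int(syllable[-1])
```

Running a check like this before wrapping syllables in tags catches a missing or out-of-range tone number early.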
Multi-character Words
For a multi-character word, place adjacent phoneme tags in the same order as the original characters. For example, 你好 becomes <|phoneme_start|>ni3<|phoneme_end|><|phoneme_start|>hao3<|phoneme_end|>.
Polyphonic Characters
For polyphonic characters, choose the pinyin that matches the phrase meaning.
Generate Pinyin
The training pipeline uses the pypinyin dictionary and converts entries to tone3 pinyin. The helper below mirrors that behavior for single characters:
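
A minimal sketch of such a helper is shown here; it is not the pipeline's actual code. It assumes the dictionary maps a character's code point to a comma-separated string of tone-marked readings, and it uses a tiny hand-written sample table so that it runs with the standard library alone. The names `marked_to_tone3`, `char_readings`, and `SAMPLE_TABLE` are hypothetical.

```python
import unicodedata
from typing import List, Mapping

# Combining diacritics that carry tones 1-4 after NFD decomposition.
_TONE_MARKS = {"\u0304": "1", "\u0301": "2", "\u030c": "3", "\u0300": "4"}


def marked_to_tone3(marked: str) -> str:
    """Convert one tone-marked syllable (e.g. 'zhòng') to tone3 ('zhong4')."""
    tone = "5"  # a syllable without a tone mark is treated as neutral
    letters = []
    for ch in unicodedata.normalize("NFD", marked):
        if ch in _TONE_MARKS:
            tone = _TONE_MARKS[ch]
        else:
            letters.append(ch)
    # Recompose so that ü (u + combining diaeresis) stays one character.
    return unicodedata.normalize("NFC", "".join(letters)) + tone


def char_readings(char: str, table: Mapping[int, str]) -> List[str]:
    """Return every tone3 reading listed for a single character."""
    entry = table.get(ord(char))
    if entry is None:
        return []
    return [marked_to_tone3(s) for s in entry.split(",")]


# Hand-written illustrative sample, not a copy of any real dictionary.
SAMPLE_TABLE = {ord("重"): "zhòng,chóng", ord("吗"): "ma"}

print(char_readings("重", SAMPLE_TABLE))
```

If pypinyin's dictionary follows the layout assumed here, it could be passed in place of `SAMPLE_TABLE`. The helper returns all listed readings, so the caller still has to pick one.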
A single-character lookup cannot see context: 重 should be chong2 in 重庆 but zhong4 in 重要.
Practical Tips
- Use one phoneme tag per Chinese character or syllable.
- Keep Chinese punctuation, brackets, and spaces outside the tag.
- Choose readings manually for names and polyphonic characters.
- Use ma5-style tone 5 when you need to mark a neutral tone explicitly.
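
The tips above can be sketched as a small tagging function. This is an illustration under stated assumptions: the names `tag_syllable` and `tag_text` are hypothetical, readings are chosen by hand and passed in by character index, and each tag is assumed to replace the character it pronounces.

```python
from typing import Mapping

PHONEME_START = "<|phoneme_start|>"
PHONEME_END = "<|phoneme_end|>"


def tag_syllable(syllable: str) -> str:
    """Wrap exactly one tone3 syllable in a phoneme tag pair."""
    return f"{PHONEME_START}{syllable}{PHONEME_END}"


def tag_text(text: str, readings: Mapping[int, str]) -> str:
    """Replace selected characters with tagged syllables.

    `readings` maps a character index in `text` to the tone3 syllable
    chosen for it. Every other character, including punctuation,
    brackets, and spaces, is copied through unchanged, so it stays
    outside the tags.
    """
    parts = []
    for index, char in enumerate(text):
        if index in readings:
            parts.append(tag_syllable(readings[index]))
        else:
            parts.append(char)
    return "".join(parts)


# 重 is read zhong4 in 重要; the full stop is left outside the tags.
print(tag_text("重要。", {0: "zhong4", 1: "yao4"}))
```

Passing readings by index keeps the manual choice for names and polyphonic characters explicit instead of relying on an automatic lookup.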

