SSML-Tags
Last updated
Last updated
The TTS providers Microsoft Azure Speech, Amazon Polly, and Google Wavenet support Speech Synthesis Markup Language (SSML) tags.
SSML allows you to specify how speech is pronounced, enabling you to fine-tune text-to-speech output. You can use SSML in your custom dictionary with phonemes to improve pronunciation, as well as define how specific words, mathematical expressions, or words in different languages are pronounced.
For Microsoft the most important SSML-Tags are:
Phonemes: < phoneme alphabet="ipa" ph=" insert phoneme "> insert original word < /phoneme >
English pronunciation: < lang xml:lang="en-US"> insert orginal word < /lang >
Further information you can read by your selected TTS Provider: