Speech Synthesis Markup Language (SSML)

Use SSML to humanize text-to-speech (TTS) prompts by applying speech parameters such as pitch, volume, emphasis, and timing.

TTS Prompts

TTS prompts apply to the Cloud STT, Prompt, and Record nodes, and Prompts content items.

SSML and TTS Prompts

Use SSML to humanize TTS prompts by applying speech parameters such as pitch, volume, emphasis, and timing. Apply SSML parameters using the Add Pause, Volume, Pitch, Emphasis, Say As, and Rate menus above the TTS prompt. Select the text and select the parameter. The SSML code is added to the text in red as seen in the illustration.

Use the icons above the top-right corner of the TTS prompt to redo or undo the last SSML tag.

Studio supports all SSML tags supported by Google. For a list of tags, see Speech Synthesis Markup Language (SSML). To use SSML tags other than those available on the menus, paste or type SSML in the text field. When you type SSML in the text field, you might need to reopen the window to see SSML formatting applied. Note that Studio does not validate SSML, so ensure your code is accurate.

Use the toggle switch above the top-right corner of the TTS prompt to show or hide SSML tags. When the SSML tags are hidden, the text cannot be edited.

Note:
  • SSML is not supported when the selected TTS voice is from Lumenvox.

  • Only specific IBM voices allow SSML editing and are labeled as transformable.

  • The TTS voice en-US_AllisonVoice supports ExpressiveSSML syntax. ExpressiveSSML can produce expressive synthesized speech in speaking styles such as upbeat or apologetic. Learn more about expressive SSML.

Preview the TTS Prompt

Preview the TTS prompt as the caller will hear it.

From the area beneath the TTS prompt, click preview to generate the audio.

Once the audio is generated, download or play it.

When the content of the TTS prompt changes, regenerate the audio.

Preview Action

 

Description

Download

Download a recording of the prompt as the caller will hear it.

The recording is in wav format.

Play

Play the prompt as the caller will hear it.

Regenerate

Regenerate the audio when the content of the TTS prompt changes.

Note:

Preview is not supported when the selected TTS voice is from Lumenvox.

Preview plays static text and prompts. Preview does not play variable values.