Text to Speech pronunciation problems

Apr 12, 2024

Hello,

I am having troubles getting the correct pronunciation of the word Analysis using the text to speech. I tried to use some SSML to change how it comes out sounding.  

My text is: <speak>Welcome to the Strategic Discipline course introducing the Quartile and Quad <say-as>A nal i sis</speak>

 

I am getting an error message saying it cant convert. Tells me to verify the tags are correct and the voice supports the command. 

I am using the Neural voice Danielle, which should support the command. 

Help please?

Thank you 

Danielle

2 Replies
Jose Tansengco

Hello Danielle, 

Happy to help! 

I went through the documentation explaining how to use the <say-as> tag and its use might be different from the effect you're trying to achieve. In any case, here's the correct syntax if you want to play around with the tag: 

<speak>Welcome to the Strategic Discipline course introducing the Quartile and Quad <say-as interpret-as="value">A nal i sis</say-as></speak>

You'll need to replace the word "value" in the script with any one of these that are best suited to your design: 

  • characters or spell-out: Spells out each letter of the text, as in a-b-c.

     
  • cardinal or number: Interprets the numerical text as a cardinal number, as in 1,234.

  • ordinal: Interprets the numerical text as an ordinal number, as in 1,234th.

  • digits: Spells out each digit individually, as in 1-2-3-4.

  • fraction: Interprets the numerical text as a fraction. This works for both common fractions such as 3/20, and mixed fractions, such as 2 ½. See below for more information.

  • unit: Interprets a numerical text as a measurement. The value should be either a number or a fraction followed by a unit with no space in between as in 1/2inch, or by just a unit, as in 1meter.

  • date: Interprets the text as a date. The format of the date must be specified with the format attribute. See below for more information.

  • time: Interprets the numerical text as duration, in minutes and seconds, as in 1'21".

  • address: Interprets the text as part of a street address.

  • expletive: "Beeps out" the content included within the tag.

  • telephone: Interprets the numerical text as a 7-digit or 10-digit telephone number, as in 2025551212. You can also use this value for handle telephone extensions, as in 2025551212x345. See below for more information.

Hope this helps!