Forum Discussion
Text to Speech SSML tag
Hi,
I would like to know whether it is possible to tune Articulate's TTS voices using the markup tags defined by the Speech Synthesis Markup Language (SSML) Version 1.1 W3C Recommendation, or any other tags?
pascal
Hi, everyone!
I have some great news to share. We just released another update for Storyline 360. In Update 83, we’ve included important fixes and new features!
One enhanced feature we’ve included:
Unlock new possibilities for text-to-speech audio. Use speech synthesis markup language (SSML) to adjust the speaking rate, modify pronunciation, emphasize words, add pauses, and more.
To take advantage of this update, launch the Articulate 360 desktop app on your computer, and click the Update button next to Storyline 360. You'll find our step-by-step instructions here!
- JakobHofsttter (Community Member)
Hi,
I wanted to ask whether there is any news concerning SSML yet, and how likely it is that SSML support will be implemented. Even just being able to create breaks would already be very helpful.
Hi Jakob,
Thanks for checking in. We do not have any updates to share at this time. I've added your voice to this request. We'll be sure to update this thread with any news in the future!
- JoshNellist-71b (Community Member)
With the recent addition of the Polly neural voices, is there an SSML option? I don't need fancy buttons or sliders, just the ability to enter the SSML tags. This would significantly improve my workflow, because I would no longer need to generate the TTS files elsewhere and then import them into Storyline.
Hi Josh,
It looks like you posted this question in another thread as well. You can check out my response to your inquiry here, but just to recap, SSML support is not yet available. We'll let you know if this changes in the future!
- joelroom-d18e87 (Community Member)
Please add SSML support. The neural voices are a great addition but without SSML it's just not very useable.
- KarenLoftus-586 (Community Member)
Kelly, I'm trying to use the SSML option with today's update (13DEC23).
Once I paste in the text and choose a voice (Danielle), how do I get the SSML tags to be applied? I added a few in manually, but that doesn't seem to work.
Trying it a different way, it seems like I might need to start with <speak> and end with </speak>. When I did that, the other SSML tags looked like they took, but the audio just "said" those things. What am I missing?
These voices are all from Amazon Polly, right?
Hi Karen, I'm not sure how you got into that state, but I was able to get it working by doing the following:
- Deleted the opening and closing speak tags, saved, and then re-added them. When you try to save you'll get an error message.
- Deleted the space so "amazon: effect" is now "amazon:effect"
- Added the closing tag "</amazon:effect>" for the whisper effect at the end of the sentence
- Changed the voice to a standard voice in another language because the whisper effect doesn't work with neural voices
I attached the modified project. Hope this helps!
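The fixes listed above all come down to the SSML being well-formed XML inside a <speak> wrapper. A minimal Python sketch of a local pre-check, run outside Storyline before pasting the text in, might look like the following. The check_ssml helper, the sample sentences, and the placeholder namespace URI are assumptions for illustration. It only tests well-formedness and says nothing about which tags a given voice supports.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace declaration so the XML parser accepts the "amazon:"
# prefix. The URI is an assumption used for this local check only.
_SPEAK_WITH_NS = '<speak xmlns:amazon="urn:example:amazon">'


def check_ssml(ssml: str) -> str:
    """Return "ok" or a short description of the first problem found."""
    text = ssml.strip()
    if not (text.startswith("<speak>") and text.endswith("</speak>")):
        return "missing <speak> wrapper"
    try:
        # Swap in the namespaced <speak> tag, then let the parser look for
        # stray spaces in tag names and missing closing tags.
        ET.fromstring(text.replace("<speak>", _SPEAK_WITH_NS, 1))
    except ET.ParseError as err:
        return f"not well-formed: {err}"
    return "ok"


# Hypothetical sample text: a space inside the tag name and no closing tag.
broken = '<speak>Hello. <amazon: effect name="whispered">This is a secret.</speak>'
# Same text with the space removed and the closing tag added.
fixed = ('<speak>Hello. <amazon:effect name="whispered">This is a secret.'
         '</amazon:effect></speak>')

print(check_ssml(broken))
print(check_ssml(fixed))  # ok
```

A string that passes this check can still be rejected or read aloud if the selected voice does not support one of the tags, as with the whisper effect on neural voices.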
- BirgitteHitz417 (Community Member)
It's really a great feature. However, I've selected a standard voice, but it keeps reading the tags aloud: <start> text <break> more text </break></start>
What am I doing wrong?
Sorry for borrowing an "old" question to ask this new one.
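For reference, the W3C SSML specification uses <speak> as the root element, and <break> is an empty element placed between words, so it does not wrap text. A small Python sketch that builds such a string follows. The with_pause helper and the one-second default are assumptions for illustration, not anything Storyline provides.

```python
import xml.etree.ElementTree as ET


def with_pause(before: str, after: str, seconds: float = 1.0) -> str:
    """Build '<speak>before <break/> after</speak>' with a timed pause."""
    speak = ET.Element("speak")          # SSML root element
    speak.text = before                  # text spoken before the pause
    pause = ET.SubElement(speak, "break", {"time": f"{seconds:g}s"})
    pause.tail = after                   # text spoken after the pause
    return ET.tostring(speak, encoding="unicode")


print(with_pause("text", " more text"))
```

The printed string is what would be pasted into the text-to-speech window in place of the <start> ... </start> version.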
- StephenGilbert (Community Member)
Hi, I've updated and am testing it out. I wanted to change the pitch of a voice.
<speak>
<prosody pitch="x-low">This text has extra low pitch</prosody>
</speak>
When I try the above and click Insert, I get an error message saying I need to verify that the SSML tags are correct and supported for the selected voice. I think the tags are correct for pitch. When I look at the Articulate SSML page, pitch appears to work only for Standard voices. The dropdown in my Insert Text-to-Speech window appears to show only Neural voices. How do I switch to Standard voices?
Also, just to confirm: will you only hear the changes to the voice when you publish, and not in Preview?
Thanks!
Hello Comprehend eLearning,
Happy to help!
You can find a list of the Standard Voices here. You'll want to make sure that you are using a voice from this list when using the "<prosody>" tag. You can also find more information on how to add values to the tag here for your reference.
And to address your follow-up question, you will hear the changes to the voice during preview as well, so no need to publish to hear the differences.
- StephenGilbert (Community Member)
Thanks Jose. So, I now understand that <prosody> used for pitch will only work for Standard voices. However, as far as I can see, there are no English (USA) standard voices?
Hello Comprehend eLearning,
Yes, that is correct. Currently, the "<prosody>" tag can only be used on the standard voices listed in the article that I shared. Unfortunately, English (USA) only exists as a neural voice but you're welcome to raise a feature request here for additional standard voices.
- John_M (Community Member)
I am having the same issue as Richard F above. The voice reads the tags.
When I download the file entitled "ssml_support.story" and run it, the voice reads the tags aloud.
When I download the file entitled "modified_ssml_suppoort.story", the tags appear to be working: they affect the voice performance instead of being read aloud.
Why does one work and not the other? What is the difference between them? I have read through the thread, but I am not sure I understand.
Is there a reference somewhere that we can go and learn about this stuff?
Hello Richard,
I'm sorry to hear that the new SSML feature isn't working properly for you.
I'd like to take a closer look at your project files to see why the tags are being read as text and not as proper SSML markups. Would you be willing to share a copy of your project files here or in private by opening a support case for testing? We'll delete it when we're done!
Hello John
Tags like <emphasis> were being read aloud in the "modified_ssml_suppoort.story" file because this tag is supported only for Standard voices, while the voice used on the slide was a Neural one. You can check out this article for more information on correct usage of supported tags.
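To make the voice restrictions mentioned in this thread easier to spot, here is a rough Python sketch that lists the tags in a piece of SSML that replies here describe as Standard-voice only (<prosody> for pitch, <emphasis>, and the <amazon:effect> whisper). The STANDARD_ONLY set and the standard_only_tags helper are assumptions drawn from this discussion, not an official list. The supported-tags article remains the reference.

```python
import re

# Tags that replies in this thread report as working only with Standard
# voices. This is an assumption based on the discussion, not an official list.
STANDARD_ONLY = {"prosody", "emphasis", "amazon:effect"}


def standard_only_tags(ssml: str) -> list[str]:
    """Return the Standard-only tag names that appear in the SSML text."""
    # Match opening tag names only; closing tags start with "</".
    found = re.findall(r"<([A-Za-z][\w:.-]*)", ssml)
    return sorted({name for name in found if name in STANDARD_ONLY})


sample = ('<speak><prosody pitch="x-low">This text has extra low pitch'
          '</prosody></speak>')
print(standard_only_tags(sample))  # ['prosody']
```

If the list is not empty and a Neural voice is selected, switching to a Standard voice or removing those tags is the first thing to try.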
- MeenuNanwani1 (Community Member)
I have been using SSML tags with standard voices because I would like to apply the "whisper" effect. The standard voice, however, reads the SSML tag. I am using version V3.83.31444.0, if that helps with resolving my issue.
What am I doing wrong to get this to work? I have attached a screen capture of the text to speech window for your reference. Thank you so much!