Forum Discussion
Text to Speech SSML tag
Hi,
I would like to know if it is possible to tune the articulates TTS voices using the SSML markup tags that are defined by Speech Synthesis Markup Language (SSML) Version 1.1, W3C Recommendation ?
or any other tags ?
pascal
Hi, everyone!
I have some great news to share. We just released another update for Storyline 360. In Update 83, we’ve included important fixes and new features!
One enhanced feature we’ve included:
Unlock new possibilities for text-to-speech audio. Use speech synthesis markup language (SSML) to adjust the speaking rate, modify pronunciation, emphasize words, add pauses, and more.
To take advantage of this update, launch the Articulate 360 desktop app on your computer, and click the Update button next to Storyline 360. You'll find our step-by-step instructions here!
Hi John,
Thanks for reaching out and sharing what you would like to see in the Text to Speech options.
The ability to use SSML has been included in the feature request and this conversation has been tagged so that we can share any updates with you here.
- TomRadachCommunity Member
Hello,
Do you have an expected time for this feature update?THX,
Tom
- RichardFouchauxCommunity Member
I was very excited to try it, but very disappointed that it simply doesn't work here. The voices read the SSML code, which has no effect on the output. The syntax highlighting is there, so that part of the update took place. Based on your stellar record I install AS360 updates almost immediately without fear, but the neural voices have never impressed me — I sense now that they possibly aren't properly updated. I didn't know about a workaround until today, but I'm creating new audio and changing old with more than just a space at the end, and I've had to use/avoid the same voices as ever.
So summarize: SSML is not working after installing the latest update. Despite syntax highlighting, Standard or Neural voices read the tags out loud. In light of information gleaned here today I suspect my Neural voices have not updated properly, perhaps across this and several previous updates.
- ThorMelicher-b5Community Member
I see lots of different requests going on here, but I might have a solution for you, but it requires going to the source that Storyline uses, Amazon Polly. To make things a bit simpler, I’ve created an application that addresses many of the things here:
- Adjust the overall speed of your files with one setting
- Adjust the overall pause duration for commas
- Add your own SSML to get more finer nuanced, naturally sounding results and as necessary, correct the pronunciation of words
- Neural voices
- Batch process your files
Here’s what you do:
- Get an Amazon Polly account (yes, there is some cost involved but doesn’t seem that prohibitive) (https://aws.amazon.com/polly/)
- Save your scripts as separate files (MS-Word or Text)
- Download HeroVoice TTS from the Microsoft Windows Store (fully functioning 15-day free trial)
- Encode your files with HeroVoice TTS – apply a global setting for speed and even comma duration so your files are consistent.
- Select the voice you want – these are the same as you’ll find in Storyline today including Neural voices (which aren’t currently available in Storyline)
- Load each audio file into Storyline
- RenGomezStaff
Hi Badrul,
No updates to share on this request, but our team is actively tracking this one. I'll be sure to add your plus to the list of folks who have requested it and update you when there is any news to share!
- joelroom-d18e87Community Member
Please add SSML support. The neural voices are a great addition but without SSML it's just not very useable.
- StephenGilbertCommunity Member
Hi, I've updated and am testing out. I wanted to change the pitch of a voice.
<speak>
<prosody pitch="x-low">This text has extra low pitch</prosody>
</speak>When try the above, and click Insert, get error message that need to verify SSML tags are correct and supported for the selected voice. I think tags are correct for pitch. When I look at the Articulate SSML page, pitch appears to work for only Standard voices. The dropdown in my Insert Text-to-Speech, appears to only show Neural voices. How do I switch to Standard voices?
Also, just to confirm, you will only hear the changes to the voice when you publish? not in Preview?
Thanks!
Hello Comprehend eLearning,
Happy to help!
You can find a list of the Standard Voices here. You'll want to make sure that you are using a voice from this list when using the "<prosody>" tag. You can also find more information on how to add values to the tag here for your reference.
And to address your follow-up question, you will hear the changes to the voice during preview as well, so no need to publish to hear the differences.
- StephenGilbertCommunity Member
Thanks Jose. So, I now understand <prosody> used for pitch will only work for Standard Voices. However, I can see there are no English (USA) standard voices?
Hello Comprehend eLearning,
Yes, that is correct. Currently, the "<prosody>" tag can only be used on the standard voices listed in the article that I shared. Unfortunately, English (USA) only exists as a neural voice but you're welcome to raise a feature request here for additional standard voices.
Hi Pascal,
Thanks for reaching out and sharing what you would like to use for your project.
Additional options and editing tools for text to speech is a feature request that we are tracking.
I'm adding this conversation to the report so that we can update you here with any news we have to share.
- AmazonFoodSafetCommunity Member
@Leslie McKerchie,
It seems that SL360 is using Polly for TTS. Can you apply their SSML coding?
Hi there!
We don't have any noted workaround for this issue, but I see that it's been added as a feature request.
I'll give the community a chance to share their wealth of knowledge!
- John_MCommunity Member
Yes, the voices in Storyline TTS do sound very much like the Polly voices.
I have used both the TTS feature in Storyline and Polly with SSML from the Amazon Polly website. When using Polly, the only option I have found is to save the Polly speech as a file external to Storyline and import it using Insert Audio.
Storyline TTS is so darn convenient, being able to integrate with notes and to make almost instant updates. For me it is not worth the hassle of using files from Polly.
If a client tells me "I don't like the way the voice says that word," I am kind of concerned, but not overly so, because the goal of the training can still be met. If a client tells me "I don't understand that sentence at all," I am much more concerned, because the training is not working. When this happens I will explore other options for making the speech more clear. Starting with rewording or using phonetic spelling in Storyline, but using another speech generator if I absolutely must.
I have had demonstrations with SMEs and stakeholders to show how fast and accurately I can work and make updates using Storyline TTS. I point out that having a strange sounding word every now and then is a small price to pay against those benefits.
If there is a way to use SSML in Storyline, I would love to learn about it.