Forum Discussion
Text to Speech SSML tag
Hi,
I would like to know if it is possible to tune the articulates TTS voices using the SSML markup tags that are defined by Speech Synthesis Markup Language (SSML) Version 1.1, W3C Recommendation ?
or any other tags ?
pascal
Hi, everyone!
I have some great news to share. We just released another update for Storyline 360. In Update 83, we’ve included important fixes and new features!
One enhanced feature we’ve included:
Unlock new possibilities for text-to-speech audio. Use speech synthesis markup language (SSML) to adjust the speaking rate, modify pronunciation, emphasize words, add pauses, and more.
To take advantage of this update, launch the Articulate 360 desktop app on your computer, and click the Update button next to Storyline 360. You'll find our step-by-step instructions here!
Hi Pascal,
Thanks for reaching out and sharing what you would like to use for your project.
Additional options and editing tools for text to speech is a feature request that we are tracking.
I'm adding this conversation to the report so that we can update you here with any news we have to share.
- AmazonFoodSafetCommunity Member
@Leslie McKerchie,
It seems that SL360 is using Polly for TTS. Can you apply their SSML coding?
Hi there!
We don't have any noted workaround for this issue, but I see that it's been added as a feature request.
I'll give the community a chance to share their wealth of knowledge!
- John_MCommunity Member
Yes, the voices in Storyline TTS do sound very much like the Polly voices.
I have used both the TTS feature in Storyline and Polly with SSML from the Amazon Polly website. When using Polly, the only option I have found is to save the Polly speech as a file external to Storyline and import it using Insert Audio.
Storyline TTS is so darn convenient, being able to integrate with notes and to make almost instant updates. For me it is not worth the hassle of using files from Polly.
If a client tells me "I don't like the way the voice says that word," I am kind of concerned, but not overly so, because the goal of the training can still be met. If a client tells me "I don't understand that sentence at all," I am much more concerned, because the training is not working. When this happens I will explore other options for making the speech more clear. Starting with rewording or using phonetic spelling in Storyline, but using another speech generator if I absolutely must.
I have had demonstrations with SMEs and stakeholders to show how fast and accurately I can work and make updates using Storyline TTS. I point out that having a strange sounding word every now and then is a small price to pay against those benefits.
If there is a way to use SSML in Storyline, I would love to learn about it.
Hi John,
Thanks for reaching out and sharing what you would like to see in the Text to Speech options.
The ability to use SSML has been included in the feature request and this conversation has been tagged so that we can share any updates with you here.
- TomRadachCommunity Member
Hello,
Do you have an expected time for this feature update?THX,
Tom
- RichardFouchauxCommunity Member
I was very excited to try it, but very disappointed that it simply doesn't work here. The voices read the SSML code, which has no effect on the output. The syntax highlighting is there, so that part of the update took place. Based on your stellar record I install AS360 updates almost immediately without fear, but the neural voices have never impressed me — I sense now that they possibly aren't properly updated. I didn't know about a workaround until today, but I'm creating new audio and changing old with more than just a space at the end, and I've had to use/avoid the same voices as ever.
So summarize: SSML is not working after installing the latest update. Despite syntax highlighting, Standard or Neural voices read the tags out loud. In light of information gleaned here today I suspect my Neural voices have not updated properly, perhaps across this and several previous updates.
Hi Tom!
This is still a feature request that our team is looking into. We have a process when handling feature requests that you can find here!
If we hear of any updates, we'll keep you informed in this discussion.
- ThorMelicher-b5Community Member
I see lots of different requests going on here, but I might have a solution for you, but it requires going to the source that Storyline uses, Amazon Polly. To make things a bit simpler, I’ve created an application that addresses many of the things here:
- Adjust the overall speed of your files with one setting
- Adjust the overall pause duration for commas
- Add your own SSML to get more finer nuanced, naturally sounding results and as necessary, correct the pronunciation of words
- Neural voices
- Batch process your files
Here’s what you do:
- Get an Amazon Polly account (yes, there is some cost involved but doesn’t seem that prohibitive) (https://aws.amazon.com/polly/)
- Save your scripts as separate files (MS-Word or Text)
- Download HeroVoice TTS from the Microsoft Windows Store (fully functioning 15-day free trial)
- Encode your files with HeroVoice TTS – apply a global setting for speed and even comma duration so your files are consistent.
- Select the voice you want – these are the same as you’ll find in Storyline today including Neural voices (which aren’t currently available in Storyline)
- Load each audio file into Storyline
- DragoIvanovCommunity Member
I noted that this discussion is from 2 years ago, yet no SSML capability in SL360. I also use Lectora and they had SSML option since the feature was introduced (looks like Amazon Polly as well).
For the time being, I will generate the voices in Lectora and import them in SL360, at least I have this option.
- BadrulKhanCommunity Member
SSML capability in SL360 not yet?
- RenGomezStaff
Hi Badrul,
No updates to share on this request, but our team is actively tracking this one. I'll be sure to add your plus to the list of folks who have requested it and update you when there is any news to share!