Text to speech speed
Feb 03, 2018
By
Mohamed Aman
Hello Guys, i think Articulate Storyline needs to improve the text to speech features, so that we are able to adjust the speed of the narrator. because all the narrators (voices), are really too fast, and in order to make it some how slower, we have to add punch of commas; which make the sound very annoying but understandable.
It will also be great , to make the narration speed in a slider (from 1 to 10)not drop-down list (slow, medium, fast).
Thanks
101 Replies
Hi Mohamed,
That's a neat idea, and I can see how the ability to adjust the narration speed with a slider would be very useful for a lot of people. I'm happy to pass along this idea for you!
For now, you can use punctuation, such as commas and semicolons, when you want to insert brief pauses. If you need longer pauses, convert your text to speech, then open the clip in the built-in audio editor and insert silence where it's needed.
Thank you Alyssa.
but the problem with using extra punctuation is that the narration is too distorted , it's like some thing was chancing the narrator :D .
the silence option is good in very little cases .
Hi, I'd like to see this feature installed too. Adding punctuation that's wrong grammatically generate a mono-toned automated narration without the human touch.
We want something that can control the pace in general.
Thanks, Rafik and Mohamed! It's helpful feedback, so as Alyssa mentioned we'd pass it along. 🙂
Having the speed feature in the TTS would be a great addition, as learners complain from the speedy narration, and adding pauses doesn't solve the problem. Please try to make it work in the next update 🙂
The only way that works for me is to break the text into discrete objects and then use the timeline to adjust the timing. This has the added benefit of keeping subtitles in-sync too.
Acronyms are a real problem for me. We've got a system called HBCIS but pronounced "Hibiscus", like the flower. I have to type "Hi-Biscus" to get Russell to pronounce it correctly, then go back through the subtitles and manually change each instance to the acronym.
After a few review cycles this can get quite tedious.It'd be good if there was some markup that could be used to overcome these failings. Here's my current thoughts. I'll give them a bit more thought and then submit a feature request. Any ideas or suggestions from the community would be good.
[[p,10]] inserts a pause of ten seconds.[[s,HBCIS,Hi Biscus]] Uses the first word in subtitles and the second for text-to-speech.[[em,this]] puts extra emphasis on the word "this".
I'd also have a drop-down that allows you to choose whether your text uses this markup or if it's just plain text.
It'd be nice if it would recognize VTML (not HTML) coding like the voice to text in captivate. With VTML coding you can change the pitch, speed, add pauses and much more. I love Articulate 360 but this is one thing I do miss about Captivate.
Thanks, LM! We are tracking this feature request, so I'll add your notes to our documentation!
Is there an update on this topic? Does Storyline plan to add VTML coding to TTS? I currently need to adjust how the narrator is pronouncing a company name and I've tried everything without success. If I was able to use VTML I could have solved the problem quickly.
Hi Corrissa and welcome to E-Learning Heroes :)
I know that accessibility is at the forefront of what our team is focused on right now and other text to speech options, including VTML, have been shared.
You are in the right place to be notified of any updates that we have to share on this topic.
I have the exact same thought, I was just coming on the see if there was a way to adjust speech speed. It's not so much adding silence, it's slowing down the speech. This is a really cool feature, but we have a lot of people where English is their second language, and they speak so quickly it will be difficult to pick up for many.
But I do LOVE this feature, it works a lot better than I expected...you guys did a great job with this one, I've had dedicated programs in the past that aren't as smooth as this one is.
It looks like Articulate are using Amazon Poly for the text-to-speech, and it allows the use of SSML tags.
https://docs.aws.amazon.com/polly/latest/dg/voice-speed-vip.html
It looks like SSML is a W3C standard whereas VTML is a proprietary standard so my preference would be to support the former; especially since it seems to already be supported by AWS.
Could the prosody tag be permitted within Storyline to control speed?
I wanted to try the SSML tags you mentioned so I added the 'prosody' tag just like in the AWS link you shared but the TTS in Storyline just reads it as regular text. But I wasn't aware of SSML or Amazon Polly before now so thanks for sharing that info.
Ive gone in and manually added pauses and spaces but it is limited and time consuming. Any change to the text also wipes out the pauses
Hi all,
I appreciate the added insight into what you'd like to see added to Text to speech! We'll let you know as we expand that feature set or make any changes, and your voice truly helps us prioritize and determine next steps. I’ll be happy to pass your thoughts on to our product team, but you can also feel free to detail them more through a feature request!
Yes, yes, please add me to this request. Love text to speech, but it is cumberson, and I believe it would become a standard for e-learning especially for software. Please add the ability to edit the speed of the speech.
Thank you,
Kimberly
Agree with all the comments above. Another option is to use a non-grammatical character (/) as the pause indicator and allow multiple occurrences of the character. Adding silence to the audio track is, indeed, cumbersome. The character should be machine-read only and not affect the closed captioning. This may be a modest stop-gap fix with out a massive application re-engineering?
Bottom line, put me on the list.
Thank you,
Emerson
I have just started trialling 360 - specifically for the TTS feature
The feedback so far from my stakeholders is that it is too fast and robotic.... I think if we could slow down the speech and add pauses, it would sound less robotlike!!...
This feature defo works better than I thought it would and would be incredibly useful for quickly adding voice overs in different languages (which is currently a very time consuming and expensive process for us!!)
I am keen to know when/if the above suggested feature requests will be added! (soon I hope as I would like to use TTS to convince my boss to give the go ahead for us to upgrade to 360 :) )
Thanks for your feedback Patricia.
You are in the right place to be notified of any updates we have to share regarding this feature.
While this is not the best solution, it does a much better job than trying to add silence.
I am using the portable version of Audacity so nothing need be installed on our firm machines.
I will chime in this petition/discussion. We need a speed control for the text to speech. I especially suffer with the acronyms that are never said correctly.
Hi there.
I used to record voice overs for the eLearning courses I developed, but it became very time consuming particularly as it relates to the editing process.
Fortunately, some companies have really come out with excellent text-to-speech applications. However, the process of using multiple applications does add an extra step to the development process. So, I was quite excited to learn that Storyline 360 has integrated the text-to-speech software! YAYA!
From the applications I have been using (Speech2Go and Text Speaker), I have learned that inserting at least a 0.25 second pause after each sentence helps quite a bit.
Here are some programming commands that might be used within Storyline 360:
Insert 0.25 second pause - <silence msec="250"/>
Insert 0.50 second pause - <silence msec="500"/>
Insert 0.75 second pause - <silence msec="750"/>
Insert 0.80 second pause - <silence msec="800"/>
Insert 1 second pause - <silence msec="1000"/>
Insert 2 second pause - <silence msec="2000"/>
Greetings All,
I recently came across the same issues with the voice being either too fast or robotic. As the staffed has shared in the past using punctuation helps a lot, but also adding spacing between the words. In my company, we were a bit reluctant to use the feature as an immense percentage of our employees have English as their second or even third language. However, utilizing the Microsoft Word Software, to replace the single space to a triple space has reduced significantly the pace of the automated voice in our case (English: US: Mathew) and using commas, periods and semicolons have gone a long way when explaining very technical aspects. Extra Tip: If you have the script in a word format and would like to add the spacing in an efficient and concise way just use the shortcut (CTRL + H), push the space bar on the bar of what you want to look for in the document and replace it with the desired spacing.
I know it is an extra step. However, I thought it may help some of you that need the pace to be slower.
Have a wonderful rest of the year.
If you need additional capability to manipulate speed, pauses, and pronunciation, you can generate your audio with an AWS account and the Amazon Polly console (includes same voices used in Storyline). It works much like the Text to Speech in Storyline, there is just an extra step to export the audio file and insert it into SL. You can switch between Plain Text and SSML which is handy for customizing your audio. See documentation here: https://docs.aws.amazon.com/polly/index.html#lang/en_us
This post was removed by the author