New Text to Speech Feature in Articulate Storyline 360

I am very excited about this new feature. It seems to use the Amazon Polly engine to do its conversion. I would like to know if there is a way to edit the text to speech to slow it down, add emphasis, and customize it like you can do in Polly with SSML tags. The speech is great; it is just a little quick for what I need in my course development.

24 Replies
Veronica Budnikas

This is a great question, Chris! I am not sure, but there doesn't seem to be a way at the moment.

It wouldn't surprise me if this was coming next though. When they released the CC features, the ability to import CC files was released first, and the CC editor came soon after.

You could also request a feature here: https://articulate.com/support/contact/feature-request 

Crystal Horn

Hello SibaPrasad! Check out this tutorial on using the text-to-speech feature. At present, you can make changes to the text or to the voice.

It sounds like more robust editing capabilities for the generated narration would be a popular feature; let us know what works for you in a feature request!

nicola poletti

Just to make sure I got everything right: there is no way to edit text-to-speech in order to insert pauses inside paragraphs or phrases, am I right?

I tried filling the text with dots or commas but nothing changes, and working with Silences is very time consuming, since there is no way to rapidly preview a slide during audio editing.

That's a pity: the tool could be useful but as it is, IMHO, it seems quite incomplete and difficult to use properly.

Alyssa Gomez

Hi Nicola! 

I typically recommend adding extra punctuation within the text, or manually adding silence to the narration in the Audio Editor. It sounds like you've already tried those options and you're looking for something more user-friendly. We're tracking these requests, and we'll definitely let you know if more editing tools are added to the text-to-speech feature in the future!

Chris Dickson

Adding silence and manipulating the audio sounds more unnatural than using the built-in SSML tags that work with Polly. Yes, this may not sound as user-friendly as an audio editor, but the tags are easy to use once you get the hang of them, and a system that allowed you to drop in specific tags would make manipulating the speech much quicker and give you better control of how the speech sounds. This is similar to letting users who can write JavaScript use it: there should be a way to mark up the speech for those who can use tags.
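
For anyone who has not seen the tags being discussed, here is a minimal sketch of that kind of markup, built as a plain Python string. The tags (`<speak>`, `<prosody>`, `<emphasis>`, `<break>`) are standard SSML that Polly documents; the sample sentences and the helper name `build_ssml_sample` are made up for illustration, and nothing here implies that Storyline currently accepts such markup.

```python
from xml.etree import ElementTree


def build_ssml_sample() -> str:
    """Return a small SSML document: slower speech, one pause, one emphasized word."""
    return (
        "<speak>"
        '<prosody rate="slow">'          # slow down everything inside this element
        "Welcome to the course."
        '<break time="700ms"/>'          # explicit pause between the two sentences
        'Please read each slide <emphasis level="strong">carefully</emphasis>.'
        "</prosody>"
        "</speak>"
    )


if __name__ == "__main__":
    ssml = build_ssml_sample()
    # Parsing confirms the markup is well-formed XML before it is sent anywhere.
    ElementTree.fromstring(ssml)
    print(ssml)
```

Tag support can differ between voices (as far as I know, Polly's neural voices do not accept `<emphasis>`), so it is worth checking the Polly SSML documentation for the voice you pick.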

nicola poletti

Hi Alyssa,

First of all, thank you for your kind and quick reply. As you mentioned, I already tried adding extra punctuation but it just didn't work: even if I insert lots of dots or commas, the pauses stay the same length. Should I use something else?

In the end I worked with silences in the Audio Editor and managed to get the narration how I wanted it, but it really takes a lot more time than working with tags or something similar.

I hope you'll add a specific tool in the future since it doesn't seem that difficult to me!

Goetz Schucht

Hi all.
A little bit of 'Sherlock-Holming' brought me here, since I was confident I wasn't the only one with this feature request.

So just for the record, my feature request has number: 01399278 [ref:_00D30Txo._500f31E32Nt:ref] from June 7th.

I'd be pretty happy to hear from Articulate that things are moving toward integrating features similar to those in the above-mentioned Amazon Polly,
or at least a way to vary the talking speed, which I think is the most needed one.

Thor Melicher

I see lots of different requests here, and I might have a solution for you, though it requires going to the source that Storyline uses, Amazon Polly. To make things a bit simpler, I’ve created an application that addresses many of the things raised here:

  • Adjust the overall speed of your files with one setting
  • Adjust the overall pause duration for commas
  • Add your own SSML to get more finely nuanced, natural-sounding results and, as necessary, correct the pronunciation of words
  • Neural voices
  • Batch process your files 
  • Preview a file before you batch process your files
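
The first two bullets can be approximated by hand with SSML. This is a rough sketch under my own assumptions, not how HeroVoice TTS works internally (that is not shown in this thread): wrap the whole script in a single `<prosody rate="...">` element and add a fixed `<break>` after each comma. The function name `apply_global_settings` and its default values are hypothetical.

```python
from xml.sax.saxutils import escape


def apply_global_settings(text: str, rate_percent: int = 85, comma_pause_ms: int = 400) -> str:
    """Wrap plain text in SSML with one speaking rate and a fixed pause after commas."""
    # Escape &, < and > so the script text cannot break the SSML markup.
    safe = escape(text)
    # Insert an explicit break after every comma.
    with_pauses = safe.replace(",", f',<break time="{comma_pause_ms}ms"/>')
    return f'<speak><prosody rate="{rate_percent}%">{with_pauses}</prosody></speak>'


if __name__ == "__main__":
    print(apply_global_settings("First, open the course. Then, press play."))
```

The explicit break is added on top of the comma itself, so treat the numbers as starting points and tune them by ear.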

Here’s what you do:

  1. Get an Amazon Polly account (yes, there is some cost involved, but it doesn’t seem prohibitive) (https://aws.amazon.com/polly/)
  2. Save your scripts as separate files (MS-Word or Text)
  3. Download HeroVoice TTS from the Microsoft Windows Store (Fully functioning 15-day free trial)
  4. Encode your files with HeroVoice TTS – apply a global setting for speed and even comma duration so your files are consistent.
  5. Select the voice you want – these are the same as you’ll find in Storyline today including Neural voices (which aren’t currently available in Storyline)
  6. Load each audio file into Storyline
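
For those comfortable with a little scripting, the encoding steps above can also be done directly against Polly. This is a hedged sketch using the AWS SDK for Python (boto3) and its `synthesize_speech` call; it assumes boto3 is installed, AWS credentials are already configured, and each script is a `.txt` file that already contains SSML. The folder layout, the voice `Joanna`, and the helper names `polly_request` and `synthesize_folder` are assumptions for illustration, not a description of what HeroVoice TTS does.

```python
from pathlib import Path


def polly_request(ssml: str, voice_id: str = "Joanna", engine: str = "standard") -> dict:
    """Build the keyword arguments for Polly's synthesize_speech call."""
    return {
        "Text": ssml,
        "TextType": "ssml",      # tell Polly the text contains SSML tags
        "OutputFormat": "mp3",
        "VoiceId": voice_id,
        "Engine": engine,        # "standard" or "neural"
    }


def synthesize_folder(script_dir: str, out_dir: str, **voice_options) -> list[Path]:
    """Convert every .txt SSML script in script_dir to an .mp3 file in out_dir."""
    import boto3  # imported here so the request builder works without the SDK

    polly = boto3.client("polly")
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    written = []
    for script in sorted(Path(script_dir).glob("*.txt")):
        ssml = script.read_text(encoding="utf-8")
        response = polly.synthesize_speech(**polly_request(ssml, **voice_options))
        target = out_path / f"{script.stem}.mp3"
        # AudioStream is a streaming body; read() returns the audio bytes.
        target.write_bytes(response["AudioStream"].read())
        written.append(target)
    return written
```

A call such as `synthesize_folder("scripts", "audio", voice_id="Joanna", engine="neural")` would then leave one MP3 per script, ready to import into Storyline. A single request has a size limit, so long scripts may need to be split.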

Katie Riggio

Hello Chris,

Thanks for reaching out, and good question!

This is an internal report, where we're tracking requests for more editing capabilities in text-to-speech. I'll surface your thoughts and let you know if this gets added to a future update!
