Text-to-Speech is awesome but has one serious flow.

Feb 08, 2019

I am very impressed with the Text-to-Speech engine now included in Storyline 360. I previously had to use a voice-over artist and this was costly and frustrating when you wanted to make minor changes to the dialogue.

Recording and modifying dialogue is a breeze with Text-to-Speech - except for one major flaw! It is not possible to include any "control" variables in your Text-to-Speech dialogue. By this I mean something like [Pause 5] or [emphasis]. I have read the discussions around getting the best out of Text-to-Speech and can manipulate "emphasis" to a certain degree using comma, etc.

The big frustration comes with inserting pauses. For this, I need to edit the Text-to-Speech sound track and insert the pauses using the Silence function. This is fine until you need to make any modification to the Text-to-Speech (such as adding a comma to improve emphasis), in which case all the silences I have inserted are deleted and I have to start over again.

As much as Text-to-Speech is saving me time and money, it is also wasting a huge amount of time with the rework of the Silences. It would be great if Articulate include the use of control variables i.e. [Pause 5] and [Emphasis] in the Text-to-Speech engine. I could include these variables directly into the Text-to-Speech dialogue.

Pinned Reply

Kelly Auner
Staff

8 months ago12/13/23 at 2:32 pm (UTC)

Hi, everyone!

I have some great news to share. We just released another update for Storyline 360. In Update 83, we’ve included important fixes and new features!

One enhanced feature we’ve included:

Unlock new possibilities for text-to-speech audio. Use speech synthesis markup language (SSML) to adjust the speaking rate, modify pronunciation, emphasize words, add pauses, and more.

To take advantage of this update, launch the Articulate 360 desktop app on your computer, and click the Update button next to Storyline 360. You'll find our step-by-step instructions here!

18 Replies

Alyssa Gomez
Staff

over 5 years ago02/08/19 at 4:45 pm (UTC)

Hello, Gawie!

I'm happy to hear you're impressed with the Text-to-Speech feature, and I appreciate you sharing your feedback and pain points with us!

We're tracking requests for more editing capabilities with text-to-speech, so I'll add this discussion. That way we can keep you updated. 😊

Gawie Bing
Author

over 5 years ago02/10/19 at 8:27 am (UTC)

Hi Alyssa. In my quest to find an alternative text-to-speech engine, that overcomes my frustrations, I came across Amazon Polly.

How interesting, as Amazon Polly appears to be the exact same speech-to-text engine that Articulate is using! What is even more interesting is that Amazon Polly already supports the in-text tags that I am looking for in their SSML editor.

If Articulate is already hooking into Amazon Polly, why not just enable SSML editing and my problems are solved?

over 5 years ago02/16/19 at 1:13 pm (UTC)

I second that proposal.

Robert Cummins

over 5 years ago02/20/19 at 3:00 pm (UTC)

Regardless of SSML editing, at least increase the default pause between sentence breaks. Everything is simply run together no matter what punctuation you add. Maybe Mandarin is using a different symbol, if so, what is that symbol to indicate to text to speech to pause a little between the start of the next sentence?

SSML editing would be nice, but would need to automatically be removed in the CC's generated, otherwise there is just as much work fixing the CC's as adding the SSML.

Alyssa Gomez
Staff

over 5 years ago02/20/19 at 5:28 pm (UTC)

Thanks for sharing this helpful insight, folks! I've passed all of your feedback and ideas along to our Product team. I appreciate you letting us know how we can make the Text to Speech feature even better!

Chris Jamerson

over 5 years ago02/22/19 at 2:34 pm (UTC)

I would like this as well. I'm converting A LOT of PowerPoints with text-to-speech in the notes section. It would save so much time if I could add a pause at the end of each slide without manually adjusting the timeline. Also, can you add the ability to automatically add text-to-speech when importing slides? These two items alone would save me many many hours.

emily gill

over 5 years ago02/28/19 at 8:39 pm (UTC)

A further feature request would be the ability to "teach" the talk to text certain characters; for example, I have a course in which the phrase "complaint/inquiry" is heavily used, as that is the way the process was written and defined by the client. It would be nice to be able to get the text to learn that the / does not need to be spoken, but can be read as "and or" globally, rather than having to go into each text-to-speech to update, since the find/replace feature does not read captions either.

Alyssa Gomez
Staff

over 5 years ago02/28/19 at 9:58 pm (UTC)

Thanks for the idea, Emily! I'll be sure it gets into the right hands.

Lynn Adinolfi

over 4 years ago11/06/19 at 2:49 am (UTC)

So all of this just happened to me and I am very disappointed to find out that this has been an issue for well over 9 months with no resolution.

I am enjoying this product but this is a real downfall. I just did a 5 minute video with pauses and I had to change one word, then LOST the 90 minutes worth of editing.....

If there is any resolution to this please post....

Lauren Connelly
Staff

over 4 years ago11/07/19 at 2:51 pm (UTC)

Hi Lynn!

We have many ideas in this thread that have been shared with our team. When we feel like these changes have reached perfection, we'll update you here!

Secondly, I'm so sorry you've lost 90 minutes worth of editing! I'd like our Support Engineers to take a deeper look into your file! Please use this link to submit a case with them directly.

Melanie Coyle

over 4 years ago11/07/19 at 8:54 pm (UTC)

Has anyone had any issues with the audio visibility in the audio editor after an update to the text to speech? Sometimes mine will disappear and the only way to get it back is to save, close and re-open.

Scarlett Brooks

over 4 years ago11/07/19 at 10:36 pm (UTC)

I have had a lot of success with inserting audio as small snippets, so I can manipulate them easily. For example, each sentence is usually its own file. If I have something like a list and want not only to manipulate the timing, but also coordinate interactions with the narration, using smaller snippets of sound makes that a breeze.

To maximize the benefit, I copy and paste the narration from the story board...

Hope this helps!

over 4 years ago11/22/19 at 10:08 pm (UTC)

Yes I have, Melanie. I/we need to submit a ticket when this happens.

over 4 years ago12/03/19 at 8:27 pm (UTC)

When I encounter this problem, I type a TTS script with adjusted spellings so the TTS engine can record it as it is supposed to sound. What I place in the NOTES field is written as it is intended to be written, but the TTS engine has already recorded the speech as I want it to sound. (Example: "Dr. Smythe" should be pronounced the same as "Dr. Smith", but the TTS engine pronounces it with a "hard Y". So I give the TTS engine the name "Smith" and after it is recorded, I correct the spelling)

Thor Melicher

over 4 years ago04/22/20 at 9:28 pm (UTC)

I see several different challenges going on in this post but I might have a solution for you? It requires going to the source that Storyline uses, Amazon Polly. To make things a bit simpler, I’ve created an application that addresses many of the things here:

Adjust the overall speed of your files with one setting
Adjust the overall pause duration for commas
Add your own SSML tags to get more finer nuanced, naturally sounding results and as necessary, correct the pronunciation of words
Neural voices
Batch process your files

Here’s what you do:

Get an Amazon Polly account (yes, there is some cost involved but doesn’t seem that prohibitive) (https://aws.amazon.com/polly/)
Save your scripts as separate files (MS-Word or Text)
Download HeroVoice TTS from the Microsoft Windows Store (fully functioning 15-day free trial)
Encode your files with HeroVoice TTS – apply a global setting for speed and even comma duration so your files are consistent.
Select the voice you want – these are the same as you’ll find in Storyline today including Neural voices (which aren’t currently available in Storyline)
Load each audio file into Storyline

Sandra Redguard

almost 2 years ago10/25/22 at 2:29 pm (UTC)

Hello: I am wondering if Articulate has added any of these text to speech functions. It seems that Articulate isn't invested in things like updates to text - to - speech or making Storyline available to Mac users.

That the community has to find these lengthy and costly work arounds, seems counter-productive to our continuing to utilize, with so many work arounds. This thread began 3 years ago. Can you please point out which, if any, of the suggestions, have been added?

Thanks

Ivo Cimmino

1 year ago06/21/23 at 2:21 pm (UTC)

This discussion started 4 years ago, and still no updates?

Kelly Auner
Staff