Two languages in one text-to-speech file

Jan 09, 2020

Is it possible to have two languages read properly by the same voice in one text-to-speech file? For example: the entire voiceover is in English however it needs to pronounce several German words correctly. Is there any way of using the same voice for both instead of switching between two different voices?

4 Replies
Ren Gomez

Hi Marek,

Thanks for reaching out! In the text to speech tool, you can only select one language at a time when creating a voiceover.

I went ahead and added a Spanish/English combination just to see what would happen, and the results weren't bad with a small sample. I'd suggest trying it, and if it doesn't work for you, then you may need to consider an actual voiceover. Hope this helps!

Thor Melicher

Currently Storyline 360 doesn’t support using SSML (Synthesized Speech Markup Language).  If it did, this would be easy to solve as you would use a tag to say the word with an English accent which also keeps the same voice so it sounds more natural.

For example, 

The Bundestag is the German federal parliament would be written as

The <lang xml:lang="de-DE">Bundestag</lang> is the German federal parliament.

With that being said and that Storyline 360 doesn’t support SSML tags, I wrote an application to help facilitate the process using Amazon Polly directly (which is what Storyline uses): 

  1. Get an Amazon Polly account (yes, there is some cost involved but doesn’t seem that prohibitive) (https://aws.amazon.com/polly/)
  2. Save your scripts as separate files (MS-Word or Text)
  3. Download HeroVoice TTS from the Microsoft Windows Store (fully functioning 15-day free trial)
  4. Encode your files with HeroVoice TTS – apply a global setting for speed and even comma duration so your files are consistent.
  5. Select the voice you want – these are the same as you’ll find in Storyline today including Neural voices (which aren’t currently available in Storyline)
  6. Load each audio file into Storyline

If you have one file then you probably wouldn’t need to use HeroVoice TTS as you could use the Amazon Polly console to get your file, however, you may also like these other requested features from the Storyline Community:

  • Adjust the overall speed of your files with one setting
  • Adjust the overall pause duration for commas
  • Add your own SSML to get more finer nuanced, naturally sounding results
  • Neural voices
  • Batch process your files

For more information on what else you can do with SSML tags and Amazon Polly, a good reference can be found here: https://docs.aws.amazon.com/polly/latest/dg/supportedtags.html

This discussion is closed. You can start a new discussion or contact Articulate Support.