Forum Discussion

JurgenLepla's avatar
JurgenLepla
Community Member
7 months ago

Why are TTS voices still so bad?

I regularly receive feedback that TTS voices are bad, compared to other TTS voices available on the market

This is a recent example: 

The text to speech sounded like it was from the 90's and it cut off the last word of almost every sentence.

This is even with the supposedly improved new voices that were added recently. 

 

Will this be improving with the AI introduction? 

  • AndrewHanley's avatar
    AndrewHanley
    Community Member

    I must admit Ive never had this problem. While the TTS is not human realism level, its not awful for AI either (imho), and Ive never experienced "cutting off the last word of almost every sentence"

    This last issue sounds more like aproblem with either the course creation, or something in the end users machine.

    I once did have something similar happen with a course (using professional VO narration) where the start and end of audio was cut off. The cause turned out to be bluetooth headphones!

    I had never heard of this before, but now I introduce a 0.5s silence at start and end of all audio clips and Ive never had a repeat of that customer issue.

    Maybe its something similar for you?

    • S-JBirch's avatar
      S-JBirch
      Community Member
      Andrew Hanley

      I had never heard of this before, but now I introduce a 0.5s silence at start and end of all audio clips and Ive never had a repeat of that customer issue.

      Maybe its something similar for you?

      That is exactly what is the problem with the speak. There is ABSOLUTE silence (no data) between each word.

      You can use a sound track with quiteness (hiss at around -30db or lower). I put it for the duration of the course. 

  • JurgenLepla's avatar
    JurgenLepla
    Community Member

    Possibly, but even so, this was just one of the remarks. 

    Other people say the voices are flat and lack intonation. The emphasis is also often wrong. etc. 

     

  • S-JBirch's avatar
    S-JBirch
    Community Member

    Agree. It is also staccato and you can't adjust pace. The quality of Storyline text-to-speech is bottom of the barrel, and I now use a much better provider of this service. Shame, as the insert speak from slidenotes is a big time saver.

  • Agree as well! Better voices! The included voices are not even the best versions of the iterations available. Training Developers at my company are using other software to generate Text to Speech (Vyond, very nice collection of voices and deliveries) and uploading to Storyline.

    Delivery is flat, range is dull, voices are boring.

    I have been using Danielle and a SSML to get a result that is just not as good as we get with other software. For Articulate to remain the industry standard we need to light a fire under this topic.

    Is there a ticket open for this?