Forum Discussion
AI Voice Generation emphasis in SL
- 2 months ago
These are all super creative solutions, Paul–thank you for sharing! I got a chuckle out of the "on bus" -> "on-us" workaround 🤣 🚌
Thanks Noele,
Is there any guidance from Articulate on the emphasis issue I started the discussion with? I found an article which says that
<break time="1.5s" />
can be used to add a pause (which seems to work well), so I'm wondering if there are other similar XML-like tags that can be used for other purposes...
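As a rough illustration of the tag mentioned above, here is how it might sit inside a line of narration text. The sentences are made up for the example; only the break tag itself comes from this thread. Placing a short pause just before a key word is one thing to experiment with, though whether it reads as emphasis is not confirmed here.

```xml
<!-- Hypothetical narration lines; only the <break> tag is taken from the thread. -->
<!-- A long pause between two sentences: -->
Welcome to the course. <break time="1.5s" /> Let's get started.

<!-- A shorter pause just before a key word (assumption: this may or may not sound like emphasis): -->
Submit the form <break time="0.5s" /> today.
```

The comments are for illustration only and would not belong in the text-to-speech field. The W3C SSML specification also defines tags such as `<emphasis>` and `<prosody>`, but nothing in this thread confirms that the Storyline voices honor them.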
- Noele_Flowers, 2 months ago, Staff
Sorry for the delay here. I took this back to the team to see if anyone had thoughts or suggestions similar to what you shared about the break time markup. The consensus seems to be that emphasis in particular is hard to achieve. When I think about it, that makes sense: emphasis isn't quite a pronunciation difference, so I can see why the speech models would have trouble with it! The feeling on the team is that some experimentation is needed to get the voice to flow correctly, and that experimenting with pronunciation can sometimes get close to the emphasis you want.
I think you've probably already seen this based on what you referenced, but for anyone else following this thread who may be curious, here is an article the team put together that talks about some of the limitations and options with SSML models and AI speech.
I'm curious to keep following this and see whether folks have landed on any specific practices that work really well for achieving emphasis.
- Paul_Atleos, 2 months ago, Community Member
Thanks Noele. As you say, trial and error and lots of experimentation seem to be the key.