Generating closed captions from TTS could be better

Jan 18, 2023

What annoys me about the generated closed captions is that I write full phrases with punctuation, and then the captions chop them up into balloons containing single words and sentence fragments.

Is there any way to make the captions follow the punctuation I add?

Now I have to go and edit every caption on every slide, which is really annoying. 

4 Replies
Luciana Piazza

Hi Jurgen, 

Thanks so much for reaching out. Sorry to hear you're having trouble with your Text to Speech within your course. Happy to help!

The closed captions editor will automatically analyze the audio track to detect units of speech and add caption placeholders to the timeline. Timing adjustments may be needed.

I look forward to hearing if anyone in the community has nailed down a good workflow for this to help you.

Liz Stephens

Yes! Totally agree. Thank you for posting about this, folks.

I also constantly have this issue. The SL guidance provided around auto-generated closed captioning says that the captions are split according to pauses in the spoken audio. In my experience this is absolutely NOT the case. The splits seem to be completely random, and ignore the very obvious audio pauses and text punctuation.

As Jurgen and Sarah have mentioned, I too spend not just hours but DAYS in long courses moving the splits so that they follow the flow of the spoken phrases and sentences. This is especially an issue in closed captions for the inbuilt Text-to-Speech feature.

It's so important for the visual presentation of the closed captions to model logical phrasing in spoken language, not just from an accessibility POV re. equitable access to information, but also for course participants whose first language may not be what they are reading on the screen. 

SL, is there a simple way to ensure that closed caption balloons are split according to the natural pauses in the audio and punctuation in the text provided for TTS? 

Your help would be much appreciated. This is such a time-consuming design flaw that I feel like I'm missing something really obvious!


Jose Tansengco

Hello Liz, 

I'm sorry to hear about your experience with the closed captions editor.

Getting the caption placeholders to match the corresponding audio each time is tricky since each audio file differs. What might sound like pauses to us might be interpreted differently by a machine. This is why we recommend tweaking the timing of captions, adding more caption placeholders, or splitting captions into smaller chunks as a way of fine-tuning the captions.

We also understand that the function might not be working as intended for you, so if you'd like to share a copy of your project file here or in private by opening a support case, we'll be glad to take a closer look to test the behavior you're experiencing. We'll delete it when we're done testing!