Dan Myers

Separate layers would work, but if you want it all on one layer, you could also split your audio into two files and place them right next to each other so the audio still sounds continuous to the learner.

Then just build triggers that change the caption when the first audio file ends.
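If it helps to see the timing logic written out, here is a rough sketch in Python. It is not the authoring tool's trigger system, just a model of the idea: the clips play back to back, and the caption switches at the moment the first clip ends. The function name `caption_at`, the caption text, and the durations are all made up for illustration.

```python
from bisect import bisect_right
from itertools import accumulate


def caption_at(segments, t):
    """Return the caption showing at time t (seconds) when clips play back to back.

    segments is a list of (caption, duration_in_seconds) pairs, one per audio file.
    """
    # End time of each clip on the shared timeline, e.g. [4.0, 10.0]
    ends = list(accumulate(duration for _, duration in segments))
    # Count the clips that have already ended at time t; clamp to the last clip
    index = min(bisect_right(ends, t), len(segments) - 1)
    return segments[index][0]


# Hypothetical example: a 4-second clip followed by a 6-second clip
segments = [("Caption for part one", 4.0), ("Caption for part two", 6.0)]
```

With these example values, the caption changes at the 4-second mark, which is when the first audio file ends.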

Hope that helps,