Forum Discussion
AdamPeterson-5b
2 days agoCommunity Member
AI Audio Consistency
There's a lot to love about the AI Audio, however, the frustration with consistency is a major problem. I don't just mean the consistency of how the voices change their pronunciation between each gen...
EricSantos
14 hours agoStaff
Hello AdamPeterson-5b,
Thanks for sharing the video and the details of what you are experiencing. I understand you are seeing inconsistencies in the quality of the generated audio.
For comparison, I ran a test on my end using the settings shown in the screenshot below (Stability—1.00, Similarity—1.00, Style exaggeration—0.50). I did not see the issue with those settings. Each generation produced the same text-to-speech audio.
As outlined in this article: Create Content with AI Assistant
- Stability: Controls how stable the voice is and how much randomness appears between each generation. Lower values can sound more emotional, while higher values sound more professional and formal.
- Similarity: Controls how closely the AI should match the original voice when replicating it.
- Style exaggeration: Adjusts how much the style of the original voice is amplified. Higher values can increase generation time.
I’m curious if you can try using the same settings I used in the screenshot and let us know if that makes a difference for you.
Related Content
- 3 months ago