VALL-E can be used to synthesize high-quality personalized speech with only a three-second enrollment recording of a speaker as an acoustic prompt. The model of the voice can then be used for text-to-speech applications.

The post "Microsoft’s New AI Can Simulate Anyone’s Voice From a 3-Second Sample" appeared first on TechNewsWorld.
from TechNewsWorld https://ift.tt/S1b8QAG