...
CAMNet: A controllable acoustic model for efficient, expressive, high-quality text-to-speech
Samsung Res UK, Commun House, South St, Staines Upon Thames TW18 4QE, England;
Samsung Res, Speech Proc Lab, 56 Seongchon Gil, Seoul, South Korea;
Text-to-speech; Expressive TTS; Acoustic model; VAE; Disentanglement; Speech synthesis;