Oral
Fri Jul 23 09:00 AM -- 09:20 AM (KST)
Global Prosody Style Transfer Without Text Transcriptions
Spotlight
Fri Jul 23 09:20 AM -- 09:25 AM (KST)
SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform
Spotlight
Fri Jul 23 09:25 AM -- 09:30 AM (KST)
EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
Spotlight
Fri Jul 23 09:30 AM -- 09:35 AM (KST)
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Spotlight
Fri Jul 23 09:35 AM -- 09:40 AM (KST)
Learning de-identified representations of prosody from raw audio
Spotlight
Fri Jul 23 09:40 AM -- 09:45 AM (KST)
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
Spotlight
Fri Jul 23 09:45 AM -- 09:50 AM (KST)
You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling