Toggle Poster Visibility
Oral
Thu Jul 22 05:00 PM -- 05:20 PM (PDT)
Global Prosody Style Transfer Without Text Transcriptions
[ Paper ]
Spotlight
Thu Jul 22 05:20 PM -- 05:25 PM (PDT)
SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform
[ Paper ]
Spotlight
Thu Jul 22 05:25 PM -- 05:30 PM (PDT)
EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
[ Paper ]
Spotlight
Thu Jul 22 05:30 PM -- 05:35 PM (PDT)
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
[ Paper ]
Spotlight
Thu Jul 22 05:35 PM -- 05:40 PM (PDT)
Learning de-identified representations of prosody from raw audio
[ Paper ]
Spotlight
Thu Jul 22 05:40 PM -- 05:45 PM (PDT)
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
[ Paper ]
Spotlight
Thu Jul 22 05:45 PM -- 05:50 PM (PDT)
You Only Sample (Almost) Once: Linear Cost Self-Attention Via Bernoulli Sampling
[ Paper ]