Workshop
The ICML Expressive Vocalizations (ExVo) Workshop and Competition 2022
Alice Baird · Panagiotis Tzirakis · Kory Mathewson · Gauthier Gidel · Eilif Muller · Bjoern Schuller · Erik Cambria · Dacher Keltner · Alan Cowen
Room 301 - 303
Sat 23 Jul, 6 a.m. PDT
The ICML Expressive Vocalizations (ExVo) Workshop and Competition 2022 introduces, for the first time in a competition setting, the machine learning problem of understanding and generating vocal bursts – a wide range of emotional non-linguistic utterances. Participants of ExVo are presented with three tasks that utilize a single dataset. The dataset and three tasks draw attention to new innovations in emotion science and capture 10 dimensions of emotion reliably perceived in distinct vocal bursts: Awe, Excitement, Amusement, Awkwardness, Fear, Horror, Distress, Triumph, Sadness and Surprise. Of particular interest to the ICML community, these tasks highlight the need for advanced machine learning techniques for multi-task learning, audio generation, and personalized few-shot learning of nonverbal expressive style.
With studies of vocal emotional expression often relying on significantly smaller datasets insufficient to apply the latest machine learning innovations, the ExVo competition and workshop provides an unprecedented platform for the development and discussion of novel strategies for understanding vocal bursts and will enable unique forms of collaborations by leading researchers from diverse disciplines. Organized by leading researchers in emotion science and machine learning, the following three tasks are proposed: the Multi-task High-Dimensional Emotion, Age & Country Task (ExVo Multi-Task); the Generative Emotional Vocal Burst Task (ExVo Generate); and the Few-Shot Emotion Recognition task (ExVo Few-Shot).
Important dates (AoE)
- Challenge Opening (data available): April 1, 2022.
- Baselines and paper released: April 8, 2022.
- ExVo MultiTask submission deadline: May 12, 2022.
- ExVo Few-Shot (test-labels): May 13, 2022.
- Workshop paper submission: ~~May 20, 2022~~ Extended June 6 2022.
For those interested in submitting research to the ExVo workshop outside of the competition, we encourage contributions covering the following topics:
- Detecting and Understanding Vocal Emotional Behavior
- Multi-Task Learning in Affective Computing
- Generating Nonverbal Vocalizations or Speech Prosody
- Personalized Machine Learning for Affective Computing
- Other topics related to Affective Verbal and Nonverbal Vocalization
Schedule
Sat 6:00 a.m. - 6:15 a.m.
|
ExVo Welcome
(
Opening Remarks
)
>
SlidesLive Video |
Alice Baird 🔗 |
Sat 6:15 a.m. - 6:25 a.m.
|
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts
(
Spotlight
)
>
SlidesLive Video |
Alice Baird · Panagiotis Tzirakis · Alan Cowen · Gauthier Gidel · Marco Jiralerspong · Eilif Muller · Kory Mathewson · Bjoern Schuller · Erik Cambria · Dacher Keltner 🔗 |
Sat 6:25 a.m. - 6:30 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 6:30 a.m. - 6:40 a.m.
|
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction
(
Spotlight
)
>
link
SlidesLive Video |
Andreas Triantafyllopoulos · Meishu Song · Zijiang Yang · Xin Jing · Björn Schuller 🔗 |
Sat 6:40 a.m. - 6:45 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 6:45 a.m. - 6:55 a.m.
|
Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression
(
Spotlight
)
>
link
SlidesLive Video |
Xin Jing · Andreas Triantafyllopoulos · Zijiang Yang · Björn Schuller · Meishu Song 🔗 |
Sat 6:55 a.m. - 7:00 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 7:00 a.m. - 7:30 a.m.
|
Tea/Coffee Break
|
🔗 |
Sat 7:30 a.m. - 8:15 a.m.
|
"Using WaveNet to reunite speech-impaired users with their original voices" (invited talk)
(
Keynote
)
>
SlidesLive Video |
Yutian Chen 🔗 |
Sat 8:15 a.m. - 8:30 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 8:30 a.m. - 8:40 a.m.
|
Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations
(
Spotlight
)
>
link
SlidesLive Video |
Chin-Cheng Hsu 🔗 |
Sat 8:40 a.m. - 8:45 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 8:45 a.m. - 8:55 a.m.
|
Generating Diverse Vocal Bursts with StyleGAN2 and MEL-Spectrograms
(
Spotlight
)
>
link
SlidesLive Video |
Marco Jiralerspong · Gauthier Gidel 🔗 |
Sat 8:55 a.m. - 9:00 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 9:00 a.m. - 10:30 a.m.
|
Lunch
|
🔗 |
Sat 10:30 a.m. - 11:00 a.m.
|
"Fundamental advances in understanding nonverbal behavior" (invited talk)
(
Keynote
)
>
SlidesLive Video |
Alan Cowen 🔗 |
Sat 11:00 a.m. - 11:15 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 11:15 a.m. - 11:25 a.m.
|
Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression
(
Spotlight
)
>
link
SlidesLive Video |
Meishu Song · Zijiang Yang · Andreas Triantafyllopoulos · Xin Jing · Vincent Karas · Jiangjian Xie · Zixing Zhang · Yamamoto Yoshiharu · Björn Schuller 🔗 |
Sat 11:25 a.m. - 11:30 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 11:30 a.m. - 11:40 a.m.
|
Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers
(
Spotlight
)
>
link
SlidesLive Video |
Josh Belanich · Krishna Somandepalli · Brian Eoff · Brendan Jou 🔗 |
Sat 11:40 a.m. - 11:45 a.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 11:45 a.m. - 11:55 a.m.
|
Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations
(
Spotlight
)
>
link
SlidesLive Video |
Detai Xin · Shinnosuke Takamichi · Hiroshi Saruwatari 🔗 |
Sat 11:55 a.m. - 12:00 p.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 12:00 p.m. - 12:30 p.m.
|
Tea/Coffee Break
|
🔗 |
Sat 12:30 p.m. - 12:50 p.m.
|
"Neurosymbolic AI for Sentiment Analysis" (invited talk)
(
Keynote
)
>
SlidesLive Video |
Erik Cambria 🔗 |
Sat 12:50 p.m. - 1:00 p.m.
|
Self-supervision and Learnable STRFs for Age, Emotion and Country Prediction
(
Spotlight
)
>
link
SlidesLive Video |
Roshan Sharma · Tyler Vuong · Mark Lindsey · Hira Dhamyal · Bhiksha Raj · Rita Singh 🔗 |
Sat 1:00 p.m. - 1:05 p.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 1:05 p.m. - 1:15 p.m.
|
Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track
(
Spotlight
)
>
link
SlidesLive Video |
Tilak Purohit · Imen Ben Mahmoud · Bogdan Vlasenko · Mathew Magimai.-Doss 🔗 |
Sat 1:15 p.m. - 1:20 p.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 1:20 p.m. - 1:30 p.m.
|
Burst2Vec: An Adversarial Multi-Task Approach for Predicting Emotion, Age, and Origin from Vocal Bursts
(
Spotlight
)
>
link
SlidesLive Video |
Atijit Anuchitanukul · Lucia Specia 🔗 |
Sat 1:30 p.m. - 1:35 p.m.
|
Questions
(
Questions
)
>
|
🔗 |
Sat 1:35 p.m. - 2:00 p.m.
|
Winner Announcements
(
Closing Remarks
)
>
SlidesLive Video |
Alice Baird 🔗 |