The ICML Expressive Vocalizations (ExVo) Workshop and Competition 2022

Workshop

The ICML Expressive Vocalizations (ExVo) Workshop and Competition 2022

Alice Baird · Panagiotis Tzirakis · Kory Mathewson · Gauthier Gidel · Eilif Muller · Bjoern Schuller · Erik Cambria · Dacher Keltner · Alan Cowen

Room 301 - 303

Sat 23 Jul, 6 a.m. PDT

[ Abstract ] Workshop Website

The ICML Expressive Vocalizations (ExVo) Workshop and Competition 2022 introduces, for the first time in a competition setting, the machine learning problem of understanding and generating vocal bursts – a wide range of emotional non-linguistic utterances. Participants of ExVo are presented with three tasks that utilize a single dataset. The dataset and three tasks draw attention to new innovations in emotion science and capture 10 dimensions of emotion reliably perceived in distinct vocal bursts: Awe, Excitement, Amusement, Awkwardness, Fear, Horror, Distress, Triumph, Sadness and Surprise. Of particular interest to the ICML community, these tasks highlight the need for advanced machine learning techniques for multi-task learning, audio generation, and personalized few-shot learning of nonverbal expressive style.

With studies of vocal emotional expression often relying on significantly smaller datasets insufficient to apply the latest machine learning innovations, the ExVo competition and workshop provides an unprecedented platform for the development and discussion of novel strategies for understanding vocal bursts and will enable unique forms of collaborations by leading researchers from diverse disciplines. Organized by leading researchers in emotion science and machine learning, the following three tasks are proposed: the Multi-task High-Dimensional Emotion, Age & Country Task (ExVo Multi-Task); the Generative Emotional Vocal Burst Task (ExVo Generate); and the Few-Shot Emotion Recognition task (ExVo Few-Shot).

Important dates (AoE)
- Challenge Opening (data available): April 1, 2022.
- Baselines and paper released: April 8, 2022.
- ExVo MultiTask submission deadline: May 12, 2022.
- ExVo Few-Shot (test-labels): May 13, 2022.
- Workshop paper submission: ~~May 20, 2022~~ Extended June 6 2022.

For those interested in submitting research to the ExVo workshop outside of the competition, we encourage contributions covering the following topics:
- Detecting and Understanding Vocal Emotional Behavior
- Multi-Task Learning in Affective Computing
- Generating Nonverbal Vocalizations or Speech Prosody
- Personalized Machine Learning for Affective Computing
- Other topics related to Affective Verbal and Nonverbal Vocalization

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Sat 6:00 a.m. - 6:15 a.m.	ExVo Welcome ( Opening Remarks ) > SlidesLive Video	Alice Baird 🔗
Sat 6:15 a.m. - 6:25 a.m.	The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts ( Spotlight ) > SlidesLive Video	Alice Baird · Panagiotis Tzirakis · Alan Cowen · Gauthier Gidel · Marco Jiralerspong · Eilif Muller · Kory Mathewson · Bjoern Schuller · Erik Cambria · Dacher Keltner 🔗
Sat 6:25 a.m. - 6:30 a.m.	Questions ( Questions ) >	🔗
Sat 6:30 a.m. - 6:40 a.m.	Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction ( Spotlight ) > link SlidesLive Video Link	Andreas Triantafyllopoulos · Meishu Song · Zijiang Yang · Xin Jing · Björn Schuller 🔗
Sat 6:40 a.m. - 6:45 a.m.	Questions ( Questions ) >	🔗
Sat 6:45 a.m. - 6:55 a.m.	Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression ( Spotlight ) > link SlidesLive Video Link	Xin Jing · Andreas Triantafyllopoulos · Zijiang Yang · Björn Schuller · Meishu Song 🔗
Sat 6:55 a.m. - 7:00 a.m.	Questions ( Questions ) >	🔗
Sat 7:00 a.m. - 7:30 a.m.	Tea/Coffee Break	🔗
Sat 7:30 a.m. - 8:15 a.m.	"Using WaveNet to reunite speech-impaired users with their original voices" (invited talk) ( Keynote ) > SlidesLive Video	Yutian Chen 🔗
Sat 8:15 a.m. - 8:30 a.m.	Questions ( Questions ) >	🔗
Sat 8:30 a.m. - 8:40 a.m.	Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations ( Spotlight ) > link SlidesLive Video Link	Chin-Cheng Hsu 🔗
Sat 8:40 a.m. - 8:45 a.m.	Questions ( Questions ) >	🔗
Sat 8:45 a.m. - 8:55 a.m.	Generating Diverse Vocal Bursts with StyleGAN2 and MEL-Spectrograms ( Spotlight ) > link SlidesLive Video Link	Marco Jiralerspong · Gauthier Gidel 🔗
Sat 8:55 a.m. - 9:00 a.m.	Questions ( Questions ) >	🔗
Sat 9:00 a.m. - 10:30 a.m.	Lunch	🔗
Sat 10:30 a.m. - 11:00 a.m.	"Fundamental advances in understanding nonverbal behavior" (invited talk) ( Keynote ) > SlidesLive Video	Alan Cowen 🔗
Sat 11:00 a.m. - 11:15 a.m.	Questions ( Questions ) >	🔗
Sat 11:15 a.m. - 11:25 a.m.	Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression ( Spotlight ) > link SlidesLive Video Link	Meishu Song · Zijiang Yang · Andreas Triantafyllopoulos · Xin Jing · Vincent Karas · Jiangjian Xie · Zixing Zhang · Yamamoto Yoshiharu · Björn Schuller 🔗
Sat 11:25 a.m. - 11:30 a.m.	Questions ( Questions ) >	🔗
Sat 11:30 a.m. - 11:40 a.m.	Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers ( Spotlight ) > link SlidesLive Video Link	Josh Belanich · Krishna Somandepalli · Brian Eoff · Brendan Jou 🔗
Sat 11:40 a.m. - 11:45 a.m.	Questions ( Questions ) >	🔗
Sat 11:45 a.m. - 11:55 a.m.	Exploring the Effectiveness of Self-supervised Learning and Classifier Chains in Emotion Recognition of Nonverbal Vocalizations ( Spotlight ) > link SlidesLive Video Link	Detai Xin · Shinnosuke Takamichi · Hiroshi Saruwatari 🔗
Sat 11:55 a.m. - 12:00 p.m.	Questions ( Questions ) >	🔗
Sat 12:00 p.m. - 12:30 p.m.	Tea/Coffee Break	🔗
Sat 12:30 p.m. - 12:50 p.m.	"Neurosymbolic AI for Sentiment Analysis" (invited talk) ( Keynote ) > SlidesLive Video	Erik Cambria 🔗
Sat 12:50 p.m. - 1:00 p.m.	Self-supervision and Learnable STRFs for Age, Emotion and Country Prediction ( Spotlight ) > link SlidesLive Video Link	Roshan Sharma · Tyler Vuong · Mark Lindsey · Hira Dhamyal · Bhiksha Raj · Rita Singh 🔗
Sat 1:00 p.m. - 1:05 p.m.	Questions ( Questions ) >	🔗
Sat 1:05 p.m. - 1:15 p.m.	Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track ( Spotlight ) > link SlidesLive Video Link	Tilak Purohit · Imen Ben Mahmoud · Bogdan Vlasenko · Mathew Magimai.-Doss 🔗
Sat 1:15 p.m. - 1:20 p.m.	Questions ( Questions ) >	🔗
Sat 1:20 p.m. - 1:30 p.m.	Burst2Vec: An Adversarial Multi-Task Approach for Predicting Emotion, Age, and Origin from Vocal Bursts ( Spotlight ) > link SlidesLive Video Link	Atijit Anuchitanukul · Lucia Specia 🔗
Sat 1:30 p.m. - 1:35 p.m.	Questions ( Questions ) >	🔗
Sat 1:35 p.m. - 2:00 p.m.	Winner Announcements ( Closing Remarks ) > SlidesLive Video	Alice Baird 🔗