Talk in Workshop: Self-supervision in Audio and Speech
Invited Talk: Self-Supervised Video Models from Sound and Speech, Lorenzo Torresani
Abstract:
Existing manually-annotated datasets for video understanding differ substantially in their label spaces. Coupled with the limited sizes of these collections, this causes fully-supervised video models to transfer poorly across datasets and tasks.
Link to the video: https://slideslive.com/38930742/selfsupervised-video-models-from-sound-and-speech