Skip to yearly menu bar Skip to main content


Talk
in
Workshop: Self-supervision in Audio and Speech

Invited Talk: Self-Supervised Video Models from Sound and Speech, Lorenzo Torresani

Lorenzo Torresani


Abstract:

Existing manually-annotated datasets for video understanding differ substantially in their label spaces. Coupled with the limited sizes of these collections, this causes fully-supervised video models to transfer poorly across datasets and tasks.

Link to the video: https://slideslive.com/38930742/selfsupervised-video-models-from-sound-and-speech

Chat is not available.