Machine Learning for Media Discovery

Workshop

Erik Schmidt · Oriol Nieto · Fabien Gouyon · Yves Raimond · Katherine Kinnaird · Gert Lanckriet

The ever-increasing size and accessibility of vast media libraries have created greater demand than ever for AI-based systems capable of organizing, recommending, and understanding such complex data.

While this topic has received only limited attention within the core machine learning community, it has been an area of intense focus within applied communities such as the Recommender Systems (RecSys), Music Information Retrieval (MIR), and Computer Vision communities. At the same time, these domains have surfaced nebulous problem spaces and rich datasets that are of tremendous potential value to the machine learning and AI communities at large.

This year's Machine Learning for Media Discovery (ML4MD) workshop aims to build upon the five previous Machine Learning for Music Discovery editions at ICML, broadening the topic area from music discovery to media discovery. The added topic diversity is intended to foster a broader conversation with the machine learning community and to encourage cross-pollination across the various media domains.

One of the largest areas of focus in the media discovery space is content understanding. The recommender systems community has made great advances in collaborative feedback recommenders, but these approaches suffer strongly from the cold-start problem. As such, recommendation techniques often fall back on content-based machine learning systems, but defining the similarity of media items is extremely challenging, as myriad features all play some role (e.g., cultural, emotional, or content features). While significant progress has been made, these problems remain far from solved.

In addition, these complex data present many challenges beyond the development of machine learning systems to model and understand them. One of the largest challenges is scale. Commercial music libraries, for example, span into the tens of millions of items. User-generated content platforms such as YouTube and Pinterest, however, have libraries stretching into the billions, a scale at which many of the traditional approaches discussed in the literature simply cannot perform.

On the other side of this problem sits the recent explosion of work in the area of Creative AI. Relevant examples include Google Magenta and Amazon's DeepComposer, which seek to develop algorithms capable of composing and performing completely original (and compelling) works of music. The same is happening in the world of visual media creation (e.g., DeepDream, Deep Fakes). Work in this area adds an interesting dimension to the conversation, as understanding how content is created is a prerequisite to generating it.

This workshop proposal is timely in that it will bridge these separate pockets of otherwise closely related research. In addition to making progress on the challenges above, we hope to engage the wider AI and machine learning community with our rich problem space, and connect them with the many available datasets the community has to offer.

Timezone: America/Los_Angeles

Schedule

Sat 9:00 a.m. - 9:10 a.m.	Welcome Remarks (Welcome)
Sat 9:10 a.m. - 9:40 a.m.	Graph Neural Networks for Reasoning over Multimodal Content (Invited Talk)	Jure Leskovec
Sat 9:40 a.m. - 10:00 a.m.	Novel Audio Embeddings for Personalized Recommendations on Newly Released Tracks (Accepted Talk)	Beici Liang
Sat 10:00 a.m. - 10:20 a.m.	Musical Word Embedding: Bridging the Gap between Listening Contexts and Music (Accepted Talk)	Seungheon Doh
Sat 10:20 a.m. - 11:00 a.m.	Poster Session #1 (Posters)
Sat 11:00 a.m. - 11:30 a.m.	Graphs for music analysis (Invited Talk)	Delia Fano Yela
Sat 11:30 a.m. - 11:50 a.m.	Deep Active Learning Toward Crisis-related Tweets Classification (Accepted Talk)	Shiva Ebrahimi
Sat 11:50 a.m. - 12:20 p.m.	The Unsung Heroes of Music Recommendation: an Essay (Invited Talk)	Matthias Mauch
Sat 12:20 p.m. - 1:00 p.m.	Lunch
Sat 1:00 p.m. - 1:30 p.m.	Beyond Being Accurate: Solving Real-World Recommendation Problems with Neural Modeling (Invited Talk)	Ed Chi
Sat 1:30 p.m. - 1:50 p.m.	Character-focused Video Thumbnail Retrieval (Accepted Talk)	Shervin Ardeshir
Sat 1:50 p.m. - 2:10 p.m.	HitPredict: Using Spotify Data to Predict Billboard Hits (Accepted Talk)	Elena Georgieva
Sat 2:10 p.m. - 2:50 p.m.	Poster Session #2 (Posters)
Sat 2:50 p.m. - 3:20 p.m.	Hit Song Prediction (Invited Talk)	Eva Zangerle
Sat 3:20 p.m. - 3:40 p.m.	I know why you like this movie: Interpretable Efficient Multimodal Recommender (Accepted Talk)	Barbara Rychalska
Sat 3:40 p.m. - 4:00 p.m.	Content-based Music Similarity with Siamese Networks (Accepted Talk)	Joseph O Cleveland