Learning Affective Correspondence between Music and Image

Verma, Gaurav; Dhekane, Eeshan Gunesh; Guha, Tanaya

arXiv:1904.00150 (cs)

[Submitted on 30 Mar 2019 (v1), last revised 17 Apr 2019 (this version, v2)]

Abstract: We introduce the problem of learning affective correspondence between audio (music) and visual data (images). For this task, a music clip and an image are considered similar (having true correspondence) if they have similar emotion content. To estimate this crossmodal, emotion-centric similarity, we propose a deep neural network architecture that learns to project the data from the two modalities to a common representation space and performs a binary classification task of predicting the affective correspondence (true or false). To facilitate the current study, we construct a large-scale database containing more than 3,500 music clips and 85,000 images with three emotion classes (positive, neutral, negative). The proposed approach achieves 61.67% accuracy for the affective correspondence prediction task on this database, outperforming two relevant and competitive baselines. We also demonstrate that our network learns modality-specific representations of emotion (without explicitly being trained with emotion labels), which are useful for emotion recognition in individual modalities.
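
As an illustration of the task definition in the abstract, the following is a minimal sketch of how binary correspondence labels could be derived from per-item emotion classes. It assumes that "similar emotion content" means the two items share the same one of the three classes, and that every music clip is paired with every image; both are assumptions made here for illustration, not details confirmed by the abstract. The names `correspondence_label`, `build_pairs`, and the item identifiers are hypothetical.

```python
from itertools import product

# The three emotion classes named in the abstract.
EMOTIONS = ("positive", "neutral", "negative")


def correspondence_label(music_emotion, image_emotion):
    """Return 1 (true correspondence) if both emotion classes match, else 0.

    Assumption: "similar emotion content" is read as "same emotion class".
    """
    for emotion in (music_emotion, image_emotion):
        if emotion not in EMOTIONS:
            raise ValueError(f"unknown emotion class: {emotion!r}")
    return int(music_emotion == image_emotion)


def build_pairs(music_items, image_items):
    """Pair every music clip with every image and attach a binary label.

    Each item is an (identifier, emotion_class) tuple. Exhaustive pairing
    is an assumption; a real pipeline might sample pairs instead.
    """
    return [
        (music_id, image_id, correspondence_label(music_emo, image_emo))
        for (music_id, music_emo), (image_id, image_emo)
        in product(music_items, image_items)
    ]


# Hypothetical toy data, not drawn from the authors' database.
music = [("clip_a", "positive"), ("clip_b", "negative")]
images = [("img_1", "positive"), ("img_2", "neutral"), ("img_3", "negative")]

pairs = build_pairs(music, images)
for music_id, image_id, label in pairs:
    print(music_id, image_id, label)
```

A classifier for the task would take the two items of each pair as input and predict the label in the third position; the emotion classes themselves are used only to construct that label.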

Comments:	5 pages, International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
Subjects:	Multimedia (cs.MM); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1904.00150 [cs.MM]
	(or arXiv:1904.00150v2 [cs.MM] for this version)
	https://doi.org/10.48550/arXiv.1904.00150

Submission history

From: Gaurav Verma
[v1] Sat, 30 Mar 2019 05:17:27 UTC (4,014 KB)
[v2] Wed, 17 Apr 2019 03:27:35 UTC (4,014 KB)
