OATAO - Open Archive Toulouse Archive Ouverte

Time-continuous Estimation of Emotion in Music with Recurrent Neural Networks

Pellegrini, Thomas and Barrière, Valentin. Time-continuous Estimation of Emotion in Music with Recurrent Neural Networks. (2016) In: MediaEval 2015 Multimedia Benchmark Workshop (MediaEval 2015), 14-15 September 2015 (Wurzen, Germany).

(Document in English)

Official URL: http://ceur-ws.org/Vol-1436/Paper60.pdf

Abstract

In this paper, we describe IRIT's approach to the MediaEval 2015 "Emotion in Music" task. The goal was to predict two real-valued emotion dimensions, namely valence and arousal, in a time-continuous fashion. We chose recurrent neural networks (RNN) for their sequence modeling capabilities. Hyperparameter tuning was performed through a 10-fold cross-validation setup on the 431 songs of the development subset. With the baseline set of 260 acoustic features, our best system achieved averaged root mean squared errors (RMSE) of 0.250 and 0.238, and Pearson's correlation coefficients (r) of 0.703 and 0.692, for valence and arousal, respectively. These results were obtained by first making predictions with an RNN comprising only 10 hidden units, smoothing them with a moving average filter, and feeding the smoothed predictions as input to a second RNN that generates the final predictions. This system gave our best results on the official test data subset for arousal (RMSE=0.247, r=0.588), but not for valence, for which predictions were much worse (RMSE=0.365, r=0.029). This may be explained by the fact that valence and arousal values were strongly correlated in the development subset (r=0.626), which was not the case in the test data. Finally, slight improvements over these figures were obtained by adding spectral flatness and spectral valley features to the baseline set.
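
As an illustration of the smoothing and evaluation steps mentioned in the abstract, the following is a minimal sketch and not the authors' code. It assumes that scores are computed per song and then averaged across songs, that the moving average is centered with a shrinking window at the sequence edges, and that the window length is 5; none of these details are given in the abstract. The function names (moving_average, rmse, pearson_r, averaged_scores) are hypothetical.

```python
import math


def moving_average(values, window=5):
    """Centered moving average; the window shrinks at the sequence edges.

    The window length is an assumed value, not taken from the paper.
    """
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed


def rmse(reference, predicted):
    """Root mean squared error between two equal-length sequences."""
    squared = [(r - p) ** 2 for r, p in zip(reference, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def pearson_r(reference, predicted):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(reference)
    mean_r = sum(reference) / n
    mean_p = sum(predicted) / n
    cov = sum((r - mean_r) * (p - mean_p) for r, p in zip(reference, predicted))
    var_r = sum((r - mean_r) ** 2 for r in reference)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_r * var_p)


def averaged_scores(songs, window=5):
    """Smooth each song's raw predictions, score them, and average over songs.

    songs: list of (reference, raw_prediction) pairs, one pair per song.
    Per-song averaging is an assumption about how "averaged" is meant.
    """
    rmses, rs = [], []
    for reference, raw in songs:
        smoothed = moving_average(raw, window)
        rmses.append(rmse(reference, smoothed))
        rs.append(pearson_r(reference, smoothed))
    return sum(rmses) / len(rmses), sum(rs) / len(rs)
```

In the two-stage system described above, the smoothed output of the first RNN would be passed to a second RNN rather than scored directly; the sketch only covers the filtering and the two metrics.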

Item Type: Conference or Workshop Item (Paper)
Additional Information: Thanks to CEUR-WS. This paper appears in volume 1436: http://ceur-ws.org/Vol-1436/Paper60.pdf
HAL Id: hal-01327121
Audience (conference): International conference proceedings
Institution: Université de Toulouse > Institut National Polytechnique de Toulouse - INPT (FRANCE)
French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UPS (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Deposited By: IRIT IRIT
Deposited On: 17 May 2016 09:39