Professional Documents
Culture Documents
discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/221573849
CITATIONS READS
20 75
4 authors, including:
Homer Chen
National Taiwan University
204 PUBLICATIONS 2,913 CITATIONS
SEE PROFILE
All content following this page was uploaded by Yu-Ching Lin on 16 May 2014.
The user has requested enhancement of the downloaded file. All in-text references underlined in blue are added to the original document
and are linked to publications on ResearchGate, letting you access and read them immediately.
Mr. Emo: Music Retrieval in the Emotion Plane
Yi-Hsuan Yang, Yu-Ching Lin, Heng-Tze Cheng, and Homer Chen
National Taiwan University
1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan
{affige, vagante, mikejdionline}@gmail.com, homer@cc.ee.ntu.edu.tw
ABSTRACT
This technical demo presents a novel emotion-based music
retrieval platform, called Mr. Emo, for organizing and browsing
music collections. Unlike conventional approaches which
quantize emotions into classes, Mr. Emo defines emotions by two
continuous variables arousal and valence and employs regression
algorithms to predict them. Associated with arousal and valence
values (AV values), each music sample becomes a point in the
arousal-valence emotion plane, so a user can easily retrieve music
samples of certain emotion(s) by specifying a point or a trajectory
in the emotion plane. Being content centric and functionally
powerful, such emotion-based retrieval complements traditional
keyword- or artist-based retrieval. The demo shows the
effectiveness and novelty of music retrieval in the emotion plane.
Keywords
Instead, we view emotions from a continuous perspective and
Music information retrieval, emotion recognition, emotion plane
define emotions in a 2-D plane in terms of arousal (how exciting
or calming) and valence (how positive or negative). Therefore,
1. INTRODUCTION MER becomes the prediction of the arousal and valence values
Due to the fast growth of digital music collection, effective (AV values) corresponding to a point in the emotion plane. A user
retrieval and management of music is needed in the digital era. can then retrieve music samples of certain emotions by specifying
Music classification and retrieval by emotion is a plausible a point or drawing a trajectory in the emotion plane, as shown in
approach, for it is content-centric and functionally powerful. Fig. 1. In this way, the granularity and ambiguity issues
associated with emotion classes or adjectives can be successfully
Various research results have been reported in the field of music
resolved since no categorical classes are needed, and hence
emotion recognition (MER) for recognizing the affective content
numerous novel emotion-based music organization, browsing, and
(or evoking emotion) of music signals [1]. A typical approach is
retrieval methods can be easily realized.
to categorize emotions into a number of classes (e.g., happy,
angry, sad and relaxing) and apply machine learning techniques to This demo illustrates an emotion-based music retrieval platform,
train a classifier. This approach, though widely adopted, faces the called Mr. Emo. The critical task of predicting the AV values is
granularity issue when it comes to practical usage. Obviously, accomplished by regression, which has sound theoretical basis
classifying emotions into only a handful of classes cannot meet and yields satisfactory prediction accuracy. We apply the trained
the user demand for effective information access. Using a finer regression models to a mildly large scale music database and
granularity for emotion description does not necessarily address design numerous emotion-plane-based retrieval methods.
the issue since language is ambiguous and the description for the
same emotion varies from person to person.
2. SYSTEM ARCHITECTURE
The system consists of two main parts as shown in Fig. 2: 1) the
Copyright is held by the author/owner(s). prediction of AV values using regression models, and 2) the
MM'08, October 2327, 2008, Vancouver, Canada. emotion-based visualization and retrieval of music samples.
ACM 1-59593-447-2/06/0010
Fig. 3. Distributions of the music samples of three
Fig. 2. System architecture of Mr. Emo. famous artists in the emotion plane.