Modified perceptual linear prediction liftered cepstrum (MPLPLC) model for pop cover song recognition

Ning Chen, J. Stephen Downie, Haidong Xiao, Yu Zhu, Jie Zhu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Most of the features of Cover Song Identification (CSI), for example, Pitch Class Profile (PCP) related features, are based on the musical facets shared among cover versions: melody evolution and harmonic progression. In this work, the perceptual feature was studied for CSI. Our idea was to modify the Perceptual Linear Prediction (PLP) model in the field of Automatic Speech Recognition (ASR) by (a) introducing new research achievements in psychophysics, and (b) considering the difference between speech and music signals to make it consistent with human hearing and more suitable for music signal analysis. Furthermore, the obtained Linear Prediction Coefficients (LPCs) were mapped to LPC cepstrum coefficients, on which liftering was applied, to boost the timbre invariance of the resultant feature: Modified Perceptual Linear Prediction Liftered Cepstrum (MPLPLC). Experimental results showed that both LPC cepstrum coefficients mapping and cepstrum liftering were crucial in ensuring the identification power of the MPLPLC feature. The MPLPLC feature outperformed state-of-the-art features in the context of CSI and in resisting instrumental accompaniment variation. This study verifies that the mature techniques in the ASR or Computational Auditory Scene Analysis (CASA) fields may be modified and included to enhance the performance of the Music Information Retrieval (MIR) scheme.

Original languageEnglish (US)
Title of host publicationProceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015
EditorsMeinard Muller, Frans Wiering
PublisherInternational Society for Music Information Retrieval
Pages598-604
Number of pages7
ISBN (Electronic)9788460688532
StatePublished - Jan 1 2015
Externally publishedYes
Event16th International Society for Music Information Retrieval Conference, ISMIR 2015 - Malaga, Spain
Duration: Oct 26 2015Oct 30 2015

Publication series

NameProceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015

Conference

Conference16th International Society for Music Information Retrieval Conference, ISMIR 2015
CountrySpain
CityMalaga
Period10/26/1510/30/15

ASJC Scopus subject areas

  • Music
  • Information Systems

Fingerprint Dive into the research topics of 'Modified perceptual linear prediction liftered cepstrum (MPLPLC) model for pop cover song recognition'. Together they form a unique fingerprint.

  • Cite this

    Chen, N., Downie, J. S., Xiao, H., Zhu, Y., & Zhu, J. (2015). Modified perceptual linear prediction liftered cepstrum (MPLPLC) model for pop cover song recognition. In M. Muller, & F. Wiering (Eds.), Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015 (pp. 598-604). (Proceedings of the 16th International Society for Music Information Retrieval Conference, ISMIR 2015). International Society for Music Information Retrieval.