Cochlear pitch class profile for cover song identification

Ning Chen, J. Stephen Downie, Hai Dong Xiao, Yu Zhu

Research output: Contribution to journalArticle

Abstract

Abstract Pitch class profile (PCP), which can represent the harmonic progression of a piece of music very well, is one of the widely used audio features for cover version identification. In this letter, we describe a novel procedure that enhances PCP by substantially boosting the degree of instrumental accompaniment invariance without degrading the feature's discriminative power. Our idea is based on the assumption that human ear can identify a cover of a pop song based on their singing voice quickly and easily. So, we combine two concepts from psychoacoustics: (i) time-varying loudness contour and (ii) critical band, which have been used in speech recognition field successfully, with the conventional PCP descriptor to enhance its discriminative power. Since the CPCPs aim at a representation of singing voice, they may also obtain improved performance (as compared to conventional PCPs) when applied to a cappella singing recordings. Experimental results demonstrate that the resulting PCP feature, called cochlear pitch class profile (CPCP), outperforms conventional PCP feature in the context of pop cover song identification application.

Original languageEnglish (US)
Article number5592
Pages (from-to)92-96
Number of pages5
JournalApplied Acoustics
Volume99
DOIs
StatePublished - Jul 10 2015

Keywords

  • Audio matching
  • Cochlear pitch class profile (CPCP)
  • Pop cover song identification

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

Fingerprint Dive into the research topics of 'Cochlear pitch class profile for cover song identification'. Together they form a unique fingerprint.

  • Cite this