Abstract
We address a problem of separating drum sources from monaural mixtures of polyphonic music containing various pitched instruments as well as drums. We consider a spectrogram of music, described by a matrix where each row is associated with intensities of a frequency over time. We employ a joint decomposition to several spectrogram matrices that include two or more column-blocks of the mixture spectrograms (columns of mixture spectrograms are partitioned into 2 or more blocks) and a drum-only (drum solo playing) matrix constructed from various drums a priori. To this end, we apply nonnegative matrix partial co-factorization (NMPCF) to these target matrices, in which column-blocks of mixture spectrograms and the drum-only matrix are jointly decomposed, sharing a factor matrix partially, in order to determine common basis vectors that capture the spectral and temporal characteristics of drum sources. Common basis vectors learned by NMPCF capture spectral patterns of drums since they are shared in the decomposition of the drum-only matrix and accommodate temporal patterns of drums because repetitive characteristics are captured by factorizing column-blocks of mixture spectrograms (each of which is associated with different time periods). Experimental results on real-world commercial music signal demonstrate the performance of the proposed method.
Original language | English (US) |
---|---|
Article number | 5784291 |
Pages (from-to) | 1192-1204 |
Number of pages | 13 |
Journal | IEEE Journal on Selected Topics in Signal Processing |
Volume | 5 |
Issue number | 6 |
DOIs | |
State | Published - Oct 2011 |
Externally published | Yes |
Keywords
- Blind source separation
- Music source separation (MSS)
- Nonnegative matrix factorization (NMF)
- Nonnegative matrix partial co-factorization (NMPCF)
ASJC Scopus subject areas
- Signal Processing
- Electrical and Electronic Engineering