Musical Instrument Recognition in Polyphonic Audio Using Source-Filter Model for Sound Separation
Abstract
This paper proposes a novel approach to musical instrument recognition in polyphonic audio signals by using a source-filter model and an augmented non-negative matrix factorization algorithm for sound separation. The mixture signal is decomposed into a sum of spectral bases modeled as a product of excitations and filters. The excitations are restricted to harmonic spectra and their fundamental frequencies are estimated in advance using a multipitch estimator, whereas the filters are restricted to have smooth frequency responses by modeling them as a sum of elementary functions on the Mel-frequency scale. The pitch and timbre information are used in organizing individual notes into sound sources. In the recognition, Mel-frequency cepstral coefficients are used to represent the coarse shape of the power spectrum of sound sources and Gaussian mixture models are used to model instrument-conditional densities of the extracted features. The method is evaluated with polyphonic signals, randomly generated from 19 instrument classes. The recognition rate for signals having six note polyphony reaches 59%.
Keywordsinstruments; separation; source-filter model
Research areas- Year:
- 2009
- Book title:
- in Proc. 10th Int. Society for Music Information Retrieval Conf. (ISMIR 2009)
- Pages:
- 327-332
- Address:
- Kobe, Japan
- Organization:
- International Society for Music Information Retrieval (ISMIR)