Group Delay Function from All-Pole Models for Musical Instrument Recognition
Abstract
In this work, a feature based on the group delay function from all-pole models (APGD) is proposed for pitched musical instrument recognition. Conventional spectrum-related features take into account only the magnitude information, whereas the phase is often overlooked because it is difficult to interpret. However, the phase often carries additional information that could be beneficial for recognition. The APGD is an elegant approach to inferring phase information: it avoids the issues related to interpreting the phase and does not require extensive parameter adjustment. Having proven applicable to speech-related problems, it is explored here for instrument recognition. The evaluation is performed with various instrument sets and shows noteworthy absolute accuracy gains of up to 7% compared to the baseline mel-frequency cepstral coefficients (MFCCs). Combined with the MFCCs and with feature selection, APGD outperforms the baseline on all the evaluated sets.
Keywords
Musical instrument recognition; Music information retrieval; All-pole group delay feature; Phase spectrum
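The all-pole group delay idea named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the autocorrelation method for fitting the all-pole model, the Hamming window, the model order, and the FFT size `n_fft` are all assumptions chosen for illustration, and the function names `lpc_autocorr` and `allpole_group_delay` are hypothetical. For a model H(z) = 1/A(z), the group delay is the negative of the group delay of the FIR polynomial A(z), which can be evaluated from two FFTs.

```python
import numpy as np

def lpc_autocorr(frame, order):
    """Fit an all-pole model; returns [1, a_1, ..., a_p] (autocorrelation method, assumed here)."""
    x = frame * np.hamming(len(frame))          # window choice is an assumption
    # Autocorrelation at lags 0..order (zero lag sits at index len(x) - 1).
    r = np.correlate(x, x, mode="full")[len(x) - 1 : len(x) + order]
    # Toeplitz autocorrelation matrix R[i, j] = r[|i - j|].
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    a = np.linalg.solve(r[idx], -r[1 : order + 1])
    return np.concatenate(([1.0], a))

def allpole_group_delay(a, n_fft=512):
    """Group delay (in samples) of H(z) = 1/A(z) on the rfft frequency grid."""
    n = np.arange(len(a))
    A = np.fft.rfft(a, n_fft)                   # A(e^{jw})
    dA = np.fft.rfft(n * a, n_fft)              # transform of n * a[n]
    # Re{dA / A} is the group delay of A(z); inverting the filter flips the sign.
    return -np.real(dA / A)
```

A frame of audio would be passed through `lpc_autocorr` and the resulting coefficients through `allpole_group_delay`; peaks in the output mark the resonances of the fitted model. How the group delay is then turned into a fixed-length feature vector is not stated in the abstract and is left out of this sketch.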
- Year:
- 2014
- Book title:
- Sound, Music, and Motion
- Publisher:
- Springer International Publishing
- Editor:
- Aramaki, Mitsuko and Derrien, Olivier and Kronland-Martinet, Richard and Ystad, Sølvi
- Pages:
- 606-618
- ISBN:
- 978-3-319-12975-4
- Note:
- The final publication is available at link.springer.com: https://link.springer.com/chapter/10.1007/978-3-319-12976-1_37
- DOI:
- 10.1007/978-3-319-12976-1_37