Group Delay Function from All-Pole Models for Musical Instrument Recognition

Diment, Aleksandr; Rajan, Padmanabhan; Heittola, Toni; Virtanen, Tuomas
Abstract

In this work, the feature based on the group delay function from all-pole models (APGD) is proposed for pitched musical instrument recognition. Conventionally, the spectrum-related features take into account merely the magnitude information, whereas the phase is often overlooked due to the complications related to its interpretation. However, there is often additional information concealed in the phase, which could be beneficial for recognition. The APGD is an elegant approach to inferring phase information, which lacks of the issues related to interpreting the phase and does not require extensive parameter adjustment. Having shown applicability for speech-related problems, it is now explored in terms of instrument recognition. The evaluation is performed with various instrument sets and shows noteworthy absolute accuracy gains of up to 7% compared to the baseline mel-frequency cepstral coefficients (MFCCs) case. Combined with the MFCCs and with feature selection, APGD demonstrates superiority over the baseline with all the evaluated sets.

Keywords

Musical instrument recognition; Music information retrieval; All-pole group delay feature; Phase spectrum

Year:
2014
JRESEARCH_BOOK_TITLE:
Sound, Music, and Motion
Publisher:
Springer International Publishing
Editor:
Aramaki, Mitsuko and Derrien, Olivier and Kronland-Martinet, Richard and Ystad, Sølvi
Pages:
606-618
ISBN:
978-3-319-12975-4
Note:
The final publication is available at link.springer.com: https://link.springer.com/chapter/10.1007/978-3-319-12976-1_37
DOI:
10.1007/978-3-319-12976-1_37