Acoustic Scene Classification Using Higher-Order Ambisonic Features
Abstract
This paper investigates the potential of using higher-order Ambisonic features to perform acoustic scene classification. We compare the performance of systems trained using first-order and fourth-order spatial features extracted from the EigenScape database. Using both Gaussian mixture model and convolutional neural network classifiers, we show that features extracted from higher-order Ambisonics can yield increased classification accuracies relative to first-order features. Diffuseness-based features seem to describe scenes particularly well relative to direction-of-arrival based features. With specific feature subsets, however, differences in classification accuracy between first and fourth-order features become negligible.
Keywordsacoustic scene classification; ambisonics; spatial audio; convolutional neural networks; gaussian mixture models
- Year:
- 2019
- Book title:
- 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
- Series:
- IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
- Pages:
- 328-332
- Month:
- 10
- ISBN:
- 978-1-7281-1124-7
- DOI:
- 10.1109/WASPAA.2019.8937282