Datasets and Evaluation
Mesaros, Annamaria; Heittola, Toni; Ellis, Dan
Abstract
Developing computational systems requires methods for evaluating their performance to guide development and compare alternate approaches. A reliable evaluation procedure for a classification or recognition system will involve a standard dataset of example input data along with the intended target output, and well-defined metrics to compare the systems' outputs with this ground truth. This chapter examines the important factors in the design and construction of evaluation datasets and goes through the metrics commonly used in system evaluation, comparing their properties. We include a survey of currently available datasets for environmental sound scene and event recognition and conclude with advice for designing evaluation protocols.
- Year:
- 2018
- Editor:
- Tuomas Virtanen, Mark D. Plumbley, and Dan Ellis
- Pages:
- 147-179
- Publisher:
- Springer
- Month:
- 9
- ISBN:
- 978-3-319-63449-4
- DOI:
- 10.1007/978-3-319-63450-0_6