Datasets and Evaluation

Mesaros, Annamaria; Heittola, Toni; Ellis, Dan
Abstract

Developing computational systems requires methods for evaluating their performance to guide development and compare alternate approaches. A reliable evaluation procedure for a classification or recognition system will involve a standard dataset of example input data along with the intended target output, and well-defined metrics to compare the systems' outputs with this ground truth. This chapter examines the important factors in the design and construction of evaluation datasets and goes through the metrics commonly used in system evaluation, comparing their properties. We include a survey of currently available datasets for environmental sound scene and event recognition and conclude with advice for designing evaluation protocols.

Research areas

Year:
2018
Editor:
Tuomas Virtanen and Mark D. Plumbley and Dan Ellis
Pages:
147-179
Publisher:
Springer
Month:
9
ISBN:
978-3-319-63449-4
DOI:
10.1007/978-3-319-63450-0_6