Deprecated: Array and string offset access syntax with curly braces is deprecated in /var/www/html/arg/administrator/components/com_jresearch/tables/publication.php on line 150

Deprecated: Array and string offset access syntax with curly braces is deprecated in /var/www/html/arg/administrator/components/com_jresearch/tables/publication.php on line 166

Acquiring Variable Length Speech Bases for Factorisation-Based Noise Robust Speech Recognition

Bibtex

Hurmalainen, Antti; Virtanen, Tuomas
Abstract

Studies from multiple disciplines show that spectro-temporal units of natural languages and human speech perception are longer than short-time frames commonly employed in automatic speech recognition. Extended temporal context is also beneﬁcial for separation of concurrent sound sources such as speech and noise. However, the length of patterns in speech varies greatly, making it difﬁcult to model with ﬁxed-length units. We propose methods for acquiring variable length speech atom bases for accurate yet compact representation of speech with a large temporal context. Bases are generated from spectral features, from assigned state labels, and as a combination of both. Results for factorisation-based speech recognition in noisy conditions show equal or better separation and recognition quality in comparison to ﬁxed length units, while model sizes are reduced by up to 40%.

Keywords

spectral factorization; speech recognition; noise robustness

Research areas

speech recognition

Year:: 2013
Book title:: Proceedings of the 21st European Signal Processing Conference (EUSIPCO)
Organization:: The European Association for Signal Processing (EURASIP)
Month:: September

Back