Publication details

Bibliographic description

On the extraction of early reflection signals for automatic speech recognition / Konrad KOWALCZYK, Stanisław KACPRZAK, Mariusz ZIÓŁKO // In: 2017 IEEE 2nd International Conference on Signal and Image Processing (ICSIP) : August 4–6, 2017, Singapore / IEEE. — Piscataway, NJ : IEEE, cop. 2017. — ISBN: 978-1-5386-0968-2; e-ISBN: 978-1-5386-0969-9. — pp. 351–355. — Bibliography: p. 355, abstract. — Publication available online since: 2017-12-01


Authors (3)

Konrad Kowalczyk, Stanisław Kacprzak, Mariusz Ziółko


Keywords

speech enhancement, microphone array processing, automatic speech recognition, reverberation suppression

Bibliometric data

BaDAP ID: 111583
Date added to BaDAP: 2018-02-05
Source text: URL
DOI: 10.1109/SIPROCESS.2017.8124563
Publication year: 2017
Publication type: conference proceedings (author)
Open access: yes
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Conference: 2017 IEEE 2nd International Conference on Signal and Image Processing

Abstract

Room reverberation caused by multipath sound wave propagation in acoustic enclosures constitutes an unwanted distortion for automatic speech recognition systems. Multichannel speech enhancement methods often aim to enhance the signal impinging at the microphone array from the source direction while reducing late reverberation. In this paper, we investigate the applicability of spatial filters which constructively combine the direct-path signal with distinct early room reflection signals to increase the direct-to-reverberation ratio and to reduce the word error rate (WER) of automatic speech recognition systems. We present suitable filters and compare them with existing approaches. Results for the simulated acoustic environments indicate that an improvement in WER can indeed be achieved by the spatial filters which account for strong early reflections.

Publications that may interest you

book chapter
Measures on wavelet segmentation of speech / Michał DYREK, Jakub GAŁKA, Bartosz ZIÓŁKO // In: Multimedia systems and signal processing : proceedings of the 8th WSEAS international conference on Multimedia systems and Signal processing (MUSP'08) : Hangzhou, China, April 6–8, 2008 / eds. Qing Li, [et al.] ; World Scientific and Engineering Academy and Society. — [China] : WSEAS Press, cop. 2008. — (Electrical and Computer Engineering Series : A Series of Reference Books and Textbooks ; ISSN 1790-5117). — ISBN: 978-960-6766-52-7. — pp. 23–26. — Bibliography: p. 26, abstract.
book chapter
A comparison of Polish taggers in the application for automatic speech recognition / Aleksander POHL, Bartosz ZIÓŁKO // In: Human language technologies as a challenge for computer science and linguistics : 6th language & technology conference : December 7–9, 2013, Poznań : proceedings / eds. Zygmunt Vetulani, Hans Uszkoreit. — Poznań : Fundacja Uniwersytetu im. A. Mickiewicza, 2013 + CD. — ISBN: 978-83-932640-3-2; e-ISBN: 978-83-932640-4-9. — pp. 294–298. — Bibliography: p. 298, abstract.