Uncertainty Decoding for Reverberation-Robust Automatic Speech Recognition

Av:

Maas, Roland [edt]

Medverkande:

Maas, Roland [auth]

Materialtyp:

ArtikelSerie: Utgivningsinformation: Erlangen FAU University Press 2016Beskrivning: 1 electronic resource (191 p.)Innehållstyp:

text

Medietyp:

computer

Bärartyp:

online resource

ISBN:

9783944057613
9783944057620

Ämnen:

Onlineresurser:

Sammanfattning: The major problem in distant-talking speech recognition is the corruption of speech signals by both interfering sounds and reverberation. While a range of successful techniques has been developed since the beginnings of speech recognition research to combat additive and short convolutive noise, compensating for long-term distortion caused by reverberation has not gained wide attention until recently. This thesis further develops an uncertainty decoding approach, named REverberation MOdeling for Speech recognition (REMOS), to adapt the acoustic model of a conventional Hidden Markov Model-based recognizer to reverberant environments. By incorporating a convolutive observation model, the Viterbi decoder is extended in order to implicitly provide a state-wise late reverberation estimate leading to a relaxation of the hidden Markov models' conditional independence assumption. The experimental evaluation confirms that REMOS yields strong speech recognition performance under noisy and reverberant conditions and furthermore allows for a rapid adaptation to changing acoustic conditions.

Bestånd ( 0 )
Titelanmärkningar ( 6 )

Inga fysiska exemplar för denna post

Open Access Unrestricted online access star

The major problem in distant-talking speech recognition is the corruption of speech signals by both interfering sounds and reverberation. While a range of successful techniques has been developed since the beginnings of speech recognition research to combat additive and short convolutive noise, compensating for long-term distortion caused by reverberation has not gained wide attention until recently. This thesis further develops an uncertainty decoding approach, named REverberation MOdeling for Speech recognition (REMOS), to adapt the acoustic model of a conventional Hidden Markov Model-based recognizer to reverberant environments. By incorporating a convolutive observation model, the Viterbi decoder is extended in order to implicitly provide a state-wise late reverberation estimate leading to a relaxation of the hidden Markov models' conditional independence assumption. The experimental evaluation confirms that REMOS yields strong speech recognition performance under noisy and reverberant conditions and furthermore allows for a rapid adaptation to changing acoustic conditions.

Accessibility options of PDF file not available

Creative Commons Licence cc by-nc-nd cc https://creativecommons.org/licenses/by-nc-nd/3.0

eng

Freely available e-book

Utskrift
Citera
Spara posten
BIBTEX Dublin Core MARCXML MARC (icke Unicode/MARC-8) MARC (Unicode/UTF-8) MARC (Unicode/UTF-8, Standard) MODS (XML) RIS ISBD
Fler sökningar

Sök efter denna titel i:
LIBRIS
Google Books