OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

Semantic Context Model for Efficient Speech Recognition

Level, Stephen and Illina, Irina and Fohr, Dominique Semantic Context Model for Efficient Speech Recognition. (2020) In: 1st International Conference on Cognitive Aircraft Systems - ICCAS 2020, 18 March 2020 - 19 March 2020 (Toulouse, France). (Unpublished)

(Document in English)

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader


Automatic speech recognition system (ASR) contains three main parts: an acoustic model, a lexicon and a language model. ASR in noisy environments is still a challenging goal because the acoustic information is not reliable and decreases the recognition accuracy. Better language model gives limited performance improvement, modeling mainly local syntactic information. In this paper, we propose a new semantic model to take into account the long-term semantic context information and thus to remove the acoustic ambiguities of noisy ASR. Recent developments in natural language processing have led to renewed interest in the field of distributional semantics. Word embeddings (WE) (T.Mikolov [Mikolov2013] or BERT model [Devlin2018]) take into account the semantic contexts of words and have been shown to be effective for several natural language processing tasks. The efficiency and the semantic properties of these representations motivate us to explore these WE for our task. Thus, our ASR is supplemented by a semantic context analysis module in order to detect the poorly recognized words and to propose new words of similar pronunciation corresponding better to the context. This semantic analysis re-evaluates (rescoring) the N-best transcription hypotheses and can be seen as a form of dynamic adaptation in the specific context of noisy data.

Item Type:Invited Conference
Audience (conference):International conference without published proceedings
Uncontrolled Keywords:
Institution:French research institutions > Institut National de la Recherche en Informatique et en Automatique - INRIA (FRANCE)
Deposited On:11 May 2021 16:30

Repository Staff Only: item control page