UNCERTAINTY-AWARE REPRESENTATIONS FOR SPOKEN QUESTION ANSWERING


Unlu M., Arisoy E.

IEEE Spoken Language Technology Workshop (SLT), ELECTR NETWORK, 19 - 22 January 2021, pp.943-949 identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Volume:
  • Doi Number: 10.1109/slt48900.2021.9383547
  • Country: ELECTR NETWORK
  • Page Numbers: pp.943-949

Abstract

This paper describes a spoken question answering system that utilizes the uncertainty in automatic speech recognition (ASR) to mitigate the effect of ASR errors on question answering. Spoken question answering is typically performed by transcribing spoken content with an ASR system and then applying text-based question answering methods to the ASR transcriptions. Question answering on spoken documents is more challenging than question answering on text documents since ASR transcriptions can be erroneous and this degrades the system performance. In this paper, we propose integrating confusion networks with word confidence scores into an end-to-end neural network-based question answering system that works on ASR transcriptions. Integration is performed by generating uncertainty-aware embedding representations from confusion networks. The proposed approach improves F1 score in a question answering task developed for spoken lectures by providing tighter integration of ASR and question answering.