Oznur Alkan, Massimilliano Mattetti, et al.
INFORMS 2020
This paper focuses on the use of both text and phonetic information in a speech translation system in order to make translation results more robust to speech recognition errors. Conventional statistical speech translation formulas are extended to exploit both text-form and phonetic speech recognition results. A novel data-driven word/text tying algorithm is then proposed to group words based on both pronunciation similarity and meaning equivalency. In our speech-to-text translation experiments, significant improvement was achieved by using phonetic information and the proposed word tying algorithm. ©2006 IEEE.
Oznur Alkan, Massimilliano Mattetti, et al.
INFORMS 2020
Rajesh Balchandran, Leonid Rachevsky, et al.
INTERSPEECH 2009
Casey Dugan, Werner Geyer, et al.
CHI 2010
Seyed Omid Sadjadi, Jason W. Pelecanos, et al.
INTERSPEECH 2014