buchspektrum Online Bookstore

New Releases 2010

As of: 2020-01-07

Matthias Paulik

Learning Speech Translation from Interpretation


Rapid Development of Automatic Speech Translation Systems using Audio Recordings of Interpreter-Mediated Communication
2010. 148 pp. 220 mm
Publisher/Year: SÜDWESTDEUTSCHER VERLAG FÜR HOCHSCHULSCHRIFTEN 2010
ISBN: 3-8381-1878-2 (3838118782)
New ISBN: 978-3-8381-1878-9 (9783838118789)



Deployable speech translation (ST) systems typically need to be trained on (1) hundreds of hours of manually transcribed speech audio; (2) bilingual text corpora of manual translations, often comprising tens of millions of words; and (3) monolingual text corpora, often comprising hundreds of millions of words. ST system development is therefore very costly and requires months or even years of effort. Such a delay is unacceptable in many situations that call for rapid development of automatic ST solutions, such as disaster relief operations or military operations. Urgency, combined with the absence of automatic ST solutions, consequently necessitates the deployment of interpreters in these situations. In this work, we develop methods to train ST systems directly on audio recordings of interpreter-mediated communication. By employing unsupervised and lightly supervised training techniques, the proposed methods allow us to omit most of the manual transcription effort and all of the manual translation effort that have typically characterized ST system development. Thus, the amount of costly and time-consuming human supervision is substantially reduced.
Matthias Paulik received his Ph.D. (Dr.-Ing., "summa cum laude") and Master's (Dipl.-Inform.) in Computer Science from Universität Karlsruhe (TH) in 2010 and 2005, respectively. He joined Cisco Systems in 2010. His research focuses on automatic speech translation, automatic speech recognition and statistical machine translation.