 Neuerscheinungen 2010Stand: 2020-01-07 |
Schnellsuche
ISBN/Stichwort/Autor
|
Herderstraße 10 10625 Berlin Tel.: 030 315 714 16 Fax 030 315 714 14 info@buchspektrum.de |

Matthias Paulik
Learning Speech Translation from Interpretation
Rapid Development of Automatic Speech Translation Systems using Audio Recordings of Interpreter-Mediated Communication
2010. 148 S. 220 mm
Verlag/Jahr: SÜDWESTDEUTSCHER VERLAG FÜR HOCHSCHULSCHRIFTEN 2010
ISBN: 3-8381-1878-2 (3838118782)
Neue ISBN: 978-3-8381-1878-9 (9783838118789)
Preis und Lieferzeit: Bitte klicken
Deployable speech translation (ST) systems typically need to be trained on (1) hundreds of hours of manually transcribed speech audio; (2) bi-lingual text corpora of manual translations, often comprising tens of millions of words; and (3) monolingual text corpora, often comprising hundreds of millions of words. Therefore, ST system development is very costly and requires months or even years of effort. Such a delay is unacceptable for many situations that call for rapid development of automatic ST solutions, as given by disaster relief operations or military operations. Urgency, combined with the absence of automatic ST solutions, consequently necessitates the deployment of interpreters in these situations. In this work, we develop methods to directly train ST systems on audio recordings of interpreter-mediated communication. By employing unsupervised and lightly supervised training techniques, the proposed methods allow us to omit most of the manual transcription effort and all of the manual translation effort that has typically characterized ST system development. Thus, the amount of costly and time-consuming human supervision is substantially reduced.
Matthias Paulik received his Ph.D. (Dr.-Ing., "summa cum laude")and Masters (Dipl.-Inform.) in Computer Science from Universität Karlsruhe (TH) in2010 and 2005, respectively. He joined Cisco Systems in 2010. His researchfocuses on automatic speech translation, automatic speech recognition and statisticalmachine translation.