Transcript [poster]
Using Monolingual Crowds to Improve Translation 1 Kronrod , 2 Hu , 2 Buzek , 2 Quinn Yakov Chang Olivia Alex University of Maryland (1) Dept. of Linguistics (2) Dept. of Computer Science ParaTrans: Targeted Paraphrasing Space of Available Translation Options Affordability Monolingual Human Participation Monolingual speakers improving translation quality Human participants perform the following tasks: 1. Identify mistranslated spans in target language 2. Create paraphrases for spans in source language responsible for the target error spans. • Produces gains of 0.82 and 1.32 for adequacy and fluency on a 5-point scale • Improves 65.3% of sentences examined • Yields increases of +0.9 BLEU to +3.8 BLEU, depending on size of error span, in an oracle study. Quality Paraphrases from the Crowd Translations and Human Judgments Original Google Translate output (GT), with translations produced by the targeted paraphrase translation process (TP), selected to show a range from strong to weak improvements in fluency. Fluency and adequacy (preservation of meaning) were rated on a 5-point scale by independent judges not familiar with this research. The Bigger Picture: MonoTrans Source Language Target Language MT Sentence Sentence Crowd Tasks: Features Results •Online system for collaborative monolingual translation •Asynchronous edits, short tasks, minimal context 1 Vote 2 Identify error spans 3 Create new translation candidates Crowd Tasks: 1 Vote 2 Paraphrase 3 Explain errors New candidate repeat … … 1 2 3 68% of sentences translated perfectly with no bilingual humans UMD HCI Lab (cs.umd.edu/hcil) Comp. Ling. and Info. Processing Lab (umiacs.umd.edu/labs/clip) MonoTrans Project Page: (http://www.cs.umd.edu/hcil/monotrans)