LIMSI @ WMT'15: Translation task

  • Benjamin Marie
  • Alexandre Allauzen
  • Franck Burlot
  • Quoc Khanh Do
  • Julia Ive
  • Elena Knyazeva
  • Matthieu Labeau
  • Thomas Lavergne
  • Kevin Löser
  • Nicolas Pécheux
  • François Yvon

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

This paper describes LIMSI's submissions to the shared WMT'15 translation task. We report results for French-English and Russian-English in both directions, as well as for Finnish-into-English. Our submissions use NCODE and MOSES along with continuous space translation models in a post-processing step. The main novelties of this year's participation are the following. For Russian-English, we investigate a tailored normalization of Russian when translating into English, and a two-step process that first translates into simplified Russian and then converts the output into inflected Russian. For French-English, the challenge is domain adaptation, for which only monolingual corpora are available. Finally, for the Finnish-to-English task, we explore unsupervised morphological segmentation to reduce the data sparsity induced by the rich morphology on the Finnish side.

Original language: English
Title of host publication: 10th Workshop on Statistical Machine Translation, WMT 2015 at the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 - Proceedings
Editors: Ondrej Bojar, Rajan Chatterjee, Christian Federmann, Barry Haddow, Chris Hokamp, Matthias Huck, Varvara Logacheva, Pavel Pecina
Publisher: Association for Computational Linguistics (ACL)
Pages: 145-151
Number of pages: 7
ISBN (Electronic): 9781941643327
Publication status: Published - 1 Jan 2015
Event: 10th Workshop on Statistical Machine Translation, WMT 2015 at the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 - Lisbon, Portugal
Duration: 17 Sept 2015 - 18 Sept 2015

Publication series

Name: 10th Workshop on Statistical Machine Translation, WMT 2015 at the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015 - Proceedings

Conference

Conference: 10th Workshop on Statistical Machine Translation, WMT 2015 at the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015
Country/Territory: Portugal
City: Lisbon
Period: 17/09/15 - 18/09/15
