Not Logged In

Modeling past and future for neural machine translation

Full Text: Q18-1011.pdf PDF

Existing neural machine translation systems do not explicitly model what has been translated and what has not during the decoding phase. To address this problem, we propose a novel mechanism that separates the source information into two parts: translated Past contents and untranslated Future contents, which are modeled by two additional recurrent layers. The Past and Future contents are fed to both the attention model and the decoder states, which provides Neural Machine Translation (NMT) systems with the knowledge of translated and untranslated contents. Experimental results show that the proposed approach significantly improves the performance in Chinese-English, German-English, and English-German translation tasks. Specifically, the proposed model outperforms the conventional coverage model in terms of both the translation quality and the alignment error rate.

Citation

Z. Zheng, H. Zhou, S. Huang, L. Mou, X. Dai, J. Chen, Z. Tu. "Modeling past and future for neural machine translation". Transactions of the Association for Computational Linguistics, 6, pp 145–157, January 2018.

Keywords:  
Category: In Journal
Web Links: ACL

BibTeX

@article{Zheng+al:TACL18,
  author = {Zaixiang Zheng and Hao Zhou and Shujian Huang and Lili Mou and
    Xin-Yu Dai and Jiajun Chen and Zhaopeng Tu},
  title = {Modeling past and future for neural machine translation},
  Volume = "6",
  Pages = {145–157},
  journal = {Transactions of the Association for Computational Linguistics},
  year = 2018,
}

Last Updated: February 02, 2021
Submitted by Sabina P

University of Alberta Logo AICML Logo