基于增量训练的维汉神经机器翻译系统

设为首页

收藏本站

网站地图 | English | 公务邮箱

读者指南

学术客户端

NSTL服务站

科技查新

基于增量训练的维汉神经机器翻译系统

详细信息查看全文 | 推荐本文 |

英文篇名：Uyghur-to-Chinese neural machine translation based on incremental training
作者：杨郑鑫 ; 李京谕 ; 胡镓伟 ; 冯洋
英文作者：YANG Zhengxin;LI Jingyu;HU Jiawei;FENG Yang;Key Laboratory of Intelligent Information Processing,Institute of Computing Technology,Chinese Academy of Science;School of Computer Science and Technology,University of Chinese Academy of Science;
关键词：自然语言处理 ; 神经机器翻译 ; 维吾尔语
英文关键词：natural language processing;;neural machine translation;;Uyghur
中文刊名：XDZK
英文刊名：Journal of Xiamen University(Natural Science)
机构：中国科学院计算技术研究所智能信息处理重点实验室;中国科学院大学计算机科学与技术学院;
出版日期：2019-03-28
出版单位：厦门大学学报(自然科学版)
年：2019
期：v.58;No.269
基金：国家自然科学基金(61662077)
语种：中文;
页：XDZK201902009
页数：5
CN：02
ISSN：35-1070/N
分类号：53-57

摘要

目前,基于深度学习的神经机器翻译已经成为机器翻译领域的主流方法.神经机器翻译模型相较于统计机器翻译模型具有更庞大的参数规模,因此其翻译质量取决于训练数据是否充足.由于与维吾尔语相关的平行语料资源严重匮乏,神经机器翻译模型在维汉翻译任务上表现不佳,为此提出了一种利用伪语料对神经机器翻译模型进行增量训练的方法,可有效提升神经机器翻译在维汉翻译任务上的质量.
At present,the neural machine translation based on deep learning has become the mainstream method in the field of machine translation.The neural machine translation model requires a larger parameter size than the statistical machine translation model does.Therefore,its translation quality depends on the sufficiency of the training data.Due to the serious lack of parallel corpus resources related to Uyghur,the neural machine translation model performs poorly on Uyghur-to-Chinese translation tasks.This paper proposes a method of incremental training of neural machine translation models using pseudo-corpus,which effectively improves the quality of neural machine translation in Uyghur-to-Chinese translation tasks.

引文

[1]CHO K,VAN MERRIENBOER B,GULCEHRE C,et al.Learning phrase representations using RNN encoderdecoder for statistical machine translation[C]∥Proceedings of EMNLP 2014.Baltimore:Association for Computational Linguistics,2014:1724-1734.
    [2]BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[C]∥Proceedings of ICLR 2015.San Diego:International Conference on Learning Representations,2015:1409.0473.
    [3]GEHRING J,AULI M,GRANGIER D,et al.Convolutional sequence to sequence learning[C]∥Proceedings of ICML2017.Sydney:International Conference on Machine Learning,2017:1705.03122.
    [4]VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[C]∥Proceedings of NIPS 2017.Long Beach:Conference on Neural Information Processing Systems,2017:1706.03762.
    [5]SENNRICH R,HADDOW B,BIRCH A.Improving neural machine translation models with monolingual data[C]∥Proceedings of ACL 2016.Berlin:Association for Computational Linguistics,2016:86-96.
    [6]LUONG M,PHAM H,MANNING C D.Effective approaches to attention-based neural machine translation[C]∥Proceedings of EMNLP 2015.Lisbon:Association for Computational Linguistics,2015:1412-1421.
    [7]WU Y,SCHUSTER M,CHEN Z,et al.Google's neural machine translation system:bridging the gap between human and machine translation[EB/OL].[2018-11-27].https:∥arxiv.org/pdf/1609.08144.
    [8]SUTSKEVER I,VINYALS V,LE Q V.2014.Sequence to sequence learning with neural networks[EB/OL].[2018-11-27].https:∥arxiv.org/pdf/1409.3215.
    [9]SENNRICH R,HADDOW B,BIRCH A.Neural machine translation of rare words with subword units[C]∥Proceedings of ACL 2016.Berlin:Association for Computational Linguistics,2016:1715-1725.
    [10]JOHNSON M,SCHUSTER M,LE Q V,et al.2016Google's multilingual neural machine translation system:enabling zero-shot translation[EB/OL].[2018-11-27].https:∥arxiv.org/pdf/1611.04558.
    [11]CHIANG D,DENEEFE S,CHAN Y S,et al.Decomposability of translation metrics for improved evaluation and efficient algorithms[EB/OL].[2018-11-27].https:∥www3.nd.edu/~dchiang/papers/bleu.pdf.
    [12]DODDINGTON G.Automatic evaluation of machine translation quality using N-gram co-occurrence statistics[EB/OL].[2018-11-27].http:∥www.mt-archive.info/HLT-2002-Doddington.pdf.
    [13]MELAMED I D,GREEN R,TURIAN J P.Precision and recall of machine translation[C]∥Proceedings of HLT-NAACL 2003.Edmonton:Association for Computational Linguistics,2003:61-63.
    [14]KLAKOW D,PETERS J.Testing the correlation of word error rate and perplexity[J].Speech Communication,2002,38(1/2):19-28.
    [15]LEUSCH G,UEFFING N,NEY H.A novel string-tostring distance measure with applications to machine translation evaluation[C]∥Proceedings of MT SummitⅨ2003.New Orleans:[s.n.],2003:240-247.
    [16]刘群,刘洋.一种机器翻译自动评测方法及其系统:中国,ZL200410000628.8[P].2009-10-28.
    [17]BANERJEE S,LAVIE A.METEOR:an automatic metric for MT evaluation with improved correlation with human judgments[C]∥Proceedings of ACL 2005.Ann Arbor:Association for Computational Linguistics,2005:65-72.
    [18]SNOVER M,DORR B,SCHWARTZ R,et al.A study of translation edit rate with targeted human annotation[C]∥Proceedings of AMTA 2006.Cambridge:Association for Machine Translation in the Americas,2006:223-231.

常见问题　|　交通位置　|　联系我们　|　OA远程办公

地址：北京市海淀区学院路29号邮编：100083

电话：办公室：(+86 10)66554848；文献借阅、咨询服务、科技查新：66554700