JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2016, Vol. 51 ›› Issue (7): 43-50.doi: 10.6040/j.issn.1671-9352.1.2015.116

Previous Articles     Next Articles

Translation model adaptation based on semantic distribution similarity

YAO Liang, HONG Yu*, LIU Hao, LIU Le, YAO Jian-min   

  1. Provincial Key Laboratory of Computer Information Processing Technology, Soochow University, Suzhou 215006, Jiangsu, China
  • Received:2015-11-14 Online:2016-07-20 Published:2016-07-27

Abstract: Statistical machine translation(SMT)system is trained with large-scale and domain-mixed parallel corpus, when the data for training and testing are not belonged to the same domain, the translation quality usually drops dramatically. To solve this problem, we proposed a novel approach to adapt the translation model based on semantic distribution similarity of translation pair. The approach firstly obtained word representations both in source and target language, and then built mapping to link the different vector space. With the mapping function the semantic k-nearest neighbors of source language in the target vector space can be easily obtained. Based on the semantic distribution of k neighbors in the general domain space, we computed phrases translation similarity in the domain of interest. The similarities are then integrated into the decoder engine, in order to enhance the adaption ability of common translation model. Experiments on English to Chinese translation tasks show that the optimized translation systems build on our method outperform the baseline system by 0.67 and 0.56 BLUE points on news and science-technology test sets respectively.

Key words: translation model, word representation, semantic distribution, domain adaptation

CLC Number: 

  • TP393
[1] FOSTER G, GOUTTE C, KUHN R. Discriminative instance weighting for domain adaptation in statistical machine translation[C] //Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing. New York: ACM, 2010: 451-459.
[2] SU J, WU H, WANG H, et al. Translation model adaptation for statistical machine translation with monolingual topic information[C] //Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics. New York: ACM, 2012: 459-468.
[3] SENNRICH R. Perplexity minimization for translation model domain adaptation in statistical machine translation[C] //Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. USA: Wiley, 2012: 539-549.
[4] 曹杰, 吕雅娟, 苏劲松,等. 利用上下文信息的统计机器翻译领域自适应[J]. 中文信息学报, 2010, 24(6): 50-56. CAO Jie, L(¨overU)Yajuan, SU Jinsong, et al. Using contextual information to the statistical machine translation domain adaptive[J]. Chinese Journal of Information, 2010, 24(6): 50-56.
[5] HEWAVITHARANA S, MEHAY D, ANANTHAKRI-SHNAN S, et al. Incremental topic-based translation model adaptation for conversational spoken language translation[C] //Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Somerset: ACL, 2013: 697-701.
[6] HASLER E, BLUNSOM P, KOEHN P, et al. Dynamic topic adaptation for phrase-based MT[C] //Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics. Somerset: ACL, 2014.
[7] 黄瑾, 吕雅娟, 刘群. 基于信息检索方法的统计翻译系统训练数据选择与优化[J]. 中文信息学报, 2008, 22(2): 40-46. HUANG Jin, LÜ Yajuan, LIU Qun. Selection and optimization of training data for statistical translation system based on information retrieval[J]. Chinese Journal of Information, 2008, 22(2):40-46.
[8] MOORE R C, LEWIS W. Intelligent selection of language model training data[C] //Proceedings of the ACL 2010 Conference Short Papers, Association for Computational Linguistics. New York: ACM, 2010:220-224.
[9] AXELROD A, HE X, GAO J. Domain adaptation via pseudo in-domain data selection[C] //Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing. New York: ACM, 2011: 355-362.
[10] DUH K, NEUBIG G, SUDOH K, et al. Adaptation data selection using neural language models: experiments in machine translation[C] //Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Somerset: ACL, 2013: 678-683.
[11] LE L, YU H, HAO L, et al. Effective selection of translation model training data[C] //Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics(Short Papers). Somerset: ACL, 2014: 569-573.
[12] MIKOLOV T, SUTSKEVER I, CHEN K, et al. Distributed representations of words and phrases and their compositionality[J]. Advances in Neural Information Processing Systems, 2013, 26: 3111-3119.
[13] MIKOLOV T, LE Q V, SUTSKEVER I. Exploiting similarities among languages for machine translation[J/OL]. Computer Science, 2013.[2015-06-12].http://arxin.prg/pdf/1309.4168v/pdf
[14] 王超超, 熊德意, 张民. 基于双语合成语义的翻译相似度模型[J]. 北京大学学报(自然科学版), 2015, 51(2): 335-341. WANG Chaochao, XIONG Deyi, ZHANG Min. Translation similarity model based on the semantic of bilingual synthesis[J]. Journal of Peking University(Natural Science Edition), 2015, 51(2):335-341.
[15] KOEHN P, OCH F J, MARCU D. Statistical phrase-based translation[C] //Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology. New York: ACM, 2003: 48-54.
[16] XIAO T, ZHU J, ZHANG H, et al. NiuTrans: an open source toolkit for phrase-based and syntax-based machine translation[C] //Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics. New York: ACM, 2012: 19-24.
[17] OCH F J, NEY H. A systematic comparison of various statistical alignment models[J]. Computational Linguistics, 2003, 29(1): 19-51.
[18] STOLCKE A. SRILM-an extensible language modeling toolkit[J]. International Conference on Spoken Language Processing, 2004: 901-904.
[19] OCH F J. Minimum error rate training in statistical machine translation[C] //Proceedings of the 41st Annual Meeting on Association for Computational Linguistics. New York: ACM, 2003: 160-167.
[20] PAPINENI K, ROUKOS S, WARD T, et al. BLEU: a method for automatic evaluation of machine translation[C] //Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. New York: ACM, 2002: 311-318.
[1] LI Yan-ping, QI Yan-jiao, ZHANG Kai, WEI Xu-guang. Multi-authority and revocable attribute-based encryption scheme [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(7): 75-84.
[2] ZHANG Guang-zhi, CAI Shao-bin, MA Chun-hua, ZHANG Dong-qiu. Application of maximum distance separable codes in the error correction of the network coding [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(1): 75-82.
[3] LI Yang, CHENG Xiong, TONG Yan, CHEN Wei, QIN Tao, ZHANG Jian, XU Ming-di. Method for threaten users mining based on traffic statistic characteristics [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(1): 83-88.
[4] ZHAO Guang-yuan, QIN Feng-lin, GUO Xiao-dong. Design and implementation of P2P-based network measurement cloud platform [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(12): 104-110.
[5] HUANG Shu-qin, XU Yong, WANG Ping-shui. User similarity calculation method based on probabilistic matrix factorization and its recommendation application [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(11): 37-43.
[6] WANG Ya-qi, WANG Jing. Rumor spreading on dynamic complex networks with curious psychological mechanism [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(6): 99-104.
[7] CHEN Guang-rui, CHEN Xing-shu, WANG Yi-tong, GE Long. A software update mechanism for virtual machines in IaaS multi-tenant environment [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(3): 60-67.
[8] ZHUANG Zheng-mao, CHEN Xing-shu, SHAO Guo-lin, YE Xiao-ming. A time-relevant network traffic anomaly detection approach [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(3): 68-73.
[9] SONG Yuan-zhang, LI Hong-yu, CHEN Yuan, WANG Jun-jie. P2P botnet detection method based on fractal and adaptive data fusion [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(3): 74-81.
[10] ZHU Sheng, ZHOU Bin, ZHU Xiang. EIP: discovering influential bloggers by user similarity and topic timeliness [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(9): 113-120.
[11] . An approach of detecting LDoS attacks based on the euclidean distance of available bandwidth in cloud computing [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(9): 92-100.
[12] LI Yu-xi, WANG Kai-xuan, LIN Mu-qing, ZHOU Fu-cai. A P2P network privacy protection system based on anonymous broadcast encryption scheme [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(9): 84-91.
[13] SU Bin-ting, XU Li, FANG He, WANG Feng. Fast authentication mechanism based on Diffie-Hellman for wireless mesh networks [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(9): 101-105.
[14] LIN Li. News event extraction based on kernel dependency graph [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(9): 121-126.
[15] . Construction of expert relationship network based on random walk strategy [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(7): 30-34.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!