JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2016, Vol. 51 ›› Issue (7): 66-73.doi: 10.6040/j.issn.1671-9352.1.2015.007

Previous Articles     Next Articles

Online shopping customer service dialogue annotation and analysis

  

  1. Intelligent Computing Research Center, Shenzhen Graduate School of Harbin Institute of Technology, Shenzhen 518055, Guangdong, China
  • Received:2015-11-14 Online:2016-07-20 Published:2016-07-27

Abstract: There is lack of research data on real application environment for interactive question-answering research. This paper collected a large number of online shopping customer service dialogue records as real application environment interactive question-answering corpus. First, the online customer service dialogue records were statistics and analysis. Then 174 groups service dialogues were randomly selected. Those dialogues were annotated and statistics on unnormal language, question relevance and question answer matching phenomena. The annotation and statistics results show that: high frequent dialogue sentences reached to large proportion, 15% of high frequent customer dialogue sentences covered 45% of all data customer sent out; 50% of dialogue sentences contained unnormal language phenomena; Anaphora relevance, omission relevance and common word sequences are the three most important features for judging relevance of client questions; more than 60% of service dialogue sentences are cross matching question answers pairs, and more than 50% of matching question answers pairs are recessive matching.

Key words: corpus annotation, interactive question answering, corpus analysis, customer service dialogue

CLC Number: 

  • TP391
[1] MOLLÁ D, VICEDO J L. Question answering in restricted domains: an overview[J]. Computational Linguistics, 2007, 33(1):41-61.
[2] 延霞,范士喜. 面向问答社区的粗粒度问句分类算法[J]. 计算机应用与软件, 2013, 30(1):219-286. YAN Xia, FAN Shixi. Coarse grain question classification method for question answering community[J]. Computer Applications and Software, 2013, 30(1):219-286.
[3] 孔维泽,刘奕群,张敏,等. 问答社区中回答质量的评价方法研究[J]. 中文信息学报, 2011, 25(001): 3-8. KONG Weize, LIU Yiqun, ZHANG Min, et al. Research on the evaluation method of quality in the question and answer community[J]. Chinese Journal of Information, 2011, 25(001):3-8.
[4] 李晨,巢文涵,陈小明,等. 中文社区问答中问题答案质量评价和预测[J]. 计算机科学, 2011, 38(6):230-236. LI Chen, CHAO Wenhan, CHEN Xiaoming, et al. Quality evaluation and prediction of question answers in Chinese community question answering[J]. Computer Science, 2011, 38(6):230-236.
[5] 王宝勋,刘秉权,孙承杰,等. 基于论坛话题段落划分的答案识别[J]. 自动化学报, 2013, 39(1):11-20. WANG Baoxun, LIU Bingquan, SUN Chengjie, et al. Answer recognition based on forum topic paragraph[J]. Journal of Automation, 2013, 39(1):11-20.
[6] WANG B, WANG X, SUN C, et al. Modeling semantic relevance for question-answer pairs in web social communities[C] //Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Uppsala: ACL, 2010:1230-1238.
[7] UTHUS D C, AHA D W. Detecting bot-answerable questions in Ubuntu chat[C] //International Joint Conference on Natural Language Processing. Japan: Nagoya, 2013:747-752.
[8] DANG H T, LIN J, KELLY D. Overview of the TREC 2007 question answering track[C] //Proceedings of the 16th Text Retrieval Conference(TREC 2007). [S.l.] : DBLP, 2007:115-123.
[9] KATO T, FUKUMOTO J, MASUI F, et al. Handling information access dialogue through QA technologies—a novel challenge for open-domain question answering[C] //Proceedings of the HLT-NAACL 2004 Workshop on Pragmatics of Question Answering. USA: Boston, 2004:70-77.
[10] BERTOMEU N, USZKOREIT H, FRANK A, et al. Contextual phenomena and thematic relations in database QA dialogues: results from a Wizard-of-Oz Experiment[C] //Proceedings of the Interactive Question Answering Workshop at HLT-NAACL 2006. New York: ACM, 2006:1-8.
[11] SCHOOTEN B, AKKER R. Follow-up utterances in QA dialogue[J]. Traitement Automatique Des Langues, 2005, 46(3):181-206.
[12] 伍大勇,张宇,刘挺. 中文交互式问答用户问题相关检测研究[J]. 中文信息学报, 2010, 24(3):11-18. WU Dayong, ZHANG Yu, LIU Ting. Research on the detection of Chinese interactive question answering user questions [J]. Chinese Journal of Information, 2010, 24(3):11-18.
[13] 张耀允,王晓龙,王轩,等. 面向开放的限定领域的交互式问答语料分析[J]. 中国计算语言学研究前沿进展(2009-2011).北京:中国中文信息学会, 2011:480-486. ZHANG Yaoyun, WANG Xiaolong, WANG Xuan, et al. An analysis of interactive question and answer corpus in the field of open and restricted domain[J]. Frontiers of Computational Linguistics in China(2009-2011). Beijing: Chinese Information Processing Society of China, 2011:480-486.
[14] YANG F, FENG J, FABBRIZIO G D. A data driven approach to relevancy recognition for contextual question answering[C] //Proceedings of the Interactive Question Answering Workshop at HLT-NAACL 2006. New York: ACM, 2006:33-40.
[1] GONG Shuang-shuang, CHEN Yu-feng, XU Jin-an, ZHANG Yu-jie. Extraction of Chinese multiword expressions based on Web text [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(9): 40-48.
[2] YU Chuan-ming, ZUO Yu-heng, GUO Ya-jing, AN Lu. Dynamic discovery of authors research interest based on the combined topic evolutional model [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(9): 23-34.
[3] . Reader emotion classification with news and comments [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(9): 35-39.
[4] . Design and implementation of topic detection in Russian news based on ontology [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(9): 49-54.
[5] LIAO Xiang-wen, ZHANG Ling-ying, WEI Jing-jing, GUI Lin, CHENG Xue-qi, CHEN Guo-long. User influence analysis of social media with temporal characteristics [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 1-12.
[6] YU Chuan-ming, FENG Bo-lin, TIAN Xin, AN Lu. Deep representative learning based sentiment analysis in the cross-lingual environment [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 13-23.
[7] ZHANG Jun, LI Jing-fei, ZHANG Rui, RUAN Xing-mao, ZHANG Shuo. Community detection algorithm based on effective resistance of network [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 24-29.
[8] PANG Bo, LIU Yuan-chao. Fusion of pointwise and deep learning methods for passage ranking [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 30-35.
[9] CHEN Xin, XUE Yun, LU Xin, LI Wan-li, ZHAO Hong-ya, HU Xiao-hui. Text feature extraction method for sentiment analysis based on order-preserving submatrix and frequent sequential pattern mining [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 36-45.
[10] WANG Tong, MA Yan-zhou, YI Mian-zhu. Speech recognition of Russian short instructions based on DTW [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(11): 29-36.
[11] ZHANG Xiao-dong, DONG Wei-guang, TANG Min-an, GUO Jun-feng, LIANG Jin-ping. gOMP reconstruction algorithm based on generalized Jaccard coefficient for compressed sensing [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(11): 23-28.
[12] SUN Jian-dong, GU Xiu-sen, LI Yan, XU Wei-ran. Chinese entity relation extraction algorithms based on COAE2016 datasets [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(9): 7-12.
[13] WANG Kai, HONG Yu, QIU Ying-ying, WANG Jian, YAO Jian-min, ZHOU Guo-dong. Study on boundary detection of users query intents [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(9): 13-18.
[14] ZHANG Fan, LUO Cheng, LIU Yi-qun, ZHANG Min, MA Shao-ping. User preference prediction in heterogeneous search environment [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(9): 26-34.
[15] YANG Yan, XU Bing, YANG Mu-yun, ZHAO Jing-jing. An emotional classification method based on joint deep learning model [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(9): 19-25.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!