融合分段编码与仿射机制的相似案例匹配方法

doi:10.6040/j.issn.1671-9352.1.2021.048

Abstract

Abstract: Similarity case matching(SCM)task is to judge whether the cases described in two judgment documents are similar. SCM is usually regarded as the text matching problem of judgment documents and has important applications in the judicial trial. Existing deep learning models mostly encode long texts of cases into a single vector, and it is difficult for the model to learn the subtle differences between the cases from long texts. Considering that the content of each part of the case text is relatively fixed, this paper proposes to split the long case text into multiple pieces and encode them separately to obtain the subtle features of different parts. At the same time, learnable affine-transformation is used to improve the similarity scoring module, so that the model learn more subtle differences, which further improves the performance of case matching. The experimental results on the CAIL2019-SCM data set show that compared to another model, the accuracy of the method proposed in this paper have increased by 1.89%.

CLC Number:

TP391

LAI Hua, ZHANG Heng-tao, XIAN Yan-tuan, HUANG Yu-xin. A similarity case matching method combining segment encoding and affine-mechanism[J].JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2023, 58(1): 40-47.

References

[1] HALL P A V, DOWLING G R. Approximate string matching[J]. ACM Computing Surveys(CSUR), 1980, 12(4):381-402.
[2] SALTON G, BUCKLEY C. Term-weighting approaches in automatic text retrieval[J]. Information Processing & Management, 1988, 24(5):513-523.
[3] HUANG C H, YIN J, HOU F. A text similarity measurement combining word semantic information with TF-IDF method[J]. Chinese Journal of Computers, 2011, 34(5):856-864.
[4] NIRAULA N, BANJADE R, ??塁TEFANESCU D, et al. Experiments with semantic similarity measures based on LDA and LSA[C] //Proceedings of the First International Conference on Statistical Language and Speech Processing. Berlin: Springer, 2013: 188-199.
[5] WANG Z Z, HE M, DU Y P. Text similarity computing based on topic model LDA[J]. Computer Science, 2013, 40(12):229-232.
[6] MIKOLOV T, CHEN K, CORRADO G, et al. Efficient estimation of word representations in vector space[EB/OL].(2013-09-07)[2021-07-01]. https://arxiv. org/abs/1301.3781.pdf.
[7] LE Q, MIKOLOV T. Distributed representations of sentences and documents[C] //Proceedings of the 31st International Conference on Machine Learning. Beijing: JMLR, 2014: 1188-1196.
[8] MUELLER J, THYAGARAJAN A. Siamese recurrent architectures for learning sentence similarity[C] //Proceedings of the AAAI Conference on Artificial Intelligence. Arizona: AAAI Press, 2016, 30(1):2786-2792.
[9] REIMERS N, GUREVYCH I. Sentence-BERT: sentence embeddings using siamese BERT-networks[C] //Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing(EMNLP-IJCNLP). Hong Kong:Association for Computational Linguistics, 2019(1):3980-3990.
[10] DEVLIN Jacob, CHANG Ming-wei, LEE Kenton, et al. BERT: pre-training of deep bidirectional transformers for language understanding.[C] //Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Long and Short Papers. Minneapolis: Association for Computational Linguistics, 2018: 4171-4186.
[11] WANG Z, HAMZA W, FLORIAN R. Bilateral multi-perspective matching for natural language sentences[C] //Proceedings of Twenty-sixth International Joint Conference on Artificial Intelligence. Melbourne: IJCAI, 2017: 4144-4150.
[12] CHEN Z, ZHANG H, ZHANG X, et al. Quora question pairs[EB/OL].(2018-05-25)[2021-07-01]. http://static.hongbozhang.me/doc/STAT_441_Report.pdf.
[13] YANG Y, YIH W, MEEK C. Wikiqa: a challenge dataset for open-domain question answering[C] //Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Lisbon: The Association for Computational Linguistics, 2015: 2013-2018.
[14] XIAO C, ZHONG H, GUO Z. CAIL2019-SCM: a dataset of similar case matching in legal domain[EB/OL].(2019-09-25)[2021-07-01]. https://arxiv.org/abs/1911.08962.pdf.
[15] HUANG P S, HE X, GAO J, et al. Learning deep structured semantic models for web search using clickthrough data[C] //Proceedings of the 22nd ACM International Conference on Information & Knowledge Management. San Francisco: ACM, 2013: 2333-2338.
[16] CHOPRA S, HADSELL R, LECUN Y. Learning a similarity metric discriminatively, with application to face verification[C] //Proceedings of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR'05). San Diego: IEEE, 2005: 539-546.
[17] SHEN Y, HE X, GAO J, et al. A latent semantic model with convolutional-pooling structured for information retrieval[C] //Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. Shanghai: CIKM, 2014: 101-110.
[18] CHEN Q, ZHU X, LING Z, et al. Enhanced LSTM for natural language inference[C] //Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Vancouver: ACL, 2017: 1657-1668.
[19] ROCKTÄSCHEL T, GREFENSTETTE E, HERMANN K M, et al. Reasoning about entailment with neural attention[C] //Proceedings of 2016 International Conference on Learning Representations. San Juan: ICLR, 2016.
[20] SHAO Y, MAO J, LIU Y, et al. BERT-PLI: modeling paragraph-level interactions for legal case retrieval[C] //Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20. Yokohama: The Association for Computational Linguistics, 2020: 3501-3507.
[21] VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[C] //Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach: NIPS, 2017: 5998-6008.
[22] DING S, SHANG J, WANG S, et al. ERNIE-DOC: the retrospective long-document modeling transformer[J/OL]. arXiv, 2020, https://arxiv.org/abs/2012.15688.pdf.
[23] HONG Z, ZHOU Q, ZHANG R, et al. Legal feature enhanced semantic matching network for similar case matching[C] //Proceedings of 2020 International Joint Conference on Neural Networks(IJCNN). Glasgow: IEEE, 2020: 1-8.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

A similarity case matching method combining segment encoding and affine-mechanism

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 7

Metrics

Comments

Recommended 0

[1]	YIN Ai-ying, LIN Jian-zhou, WU Yun-bing, LIAO Xiang-wen. Sentiment classification combining graph convolution neural network [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2021, 56(11): 15-23.
[2]	Wen-she YIN,Jian-feng HE. Detection method of hemorrhages of fundus image based on deep learning [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2020, 55(9): 62-71.
[3]	Wen-qing WANG,Ao-yang HAN,Li-tao YU,Zhi-sheng ZHANG. Short-term load forecasting model based on autoencoder and PSOA-CNN [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2019, 54(7): 50-56.
[4]	LIU Ming-ming, ZHANG Min-qing, LIU Jia, GAO Pei-xian. Steganalysis method based on shallow convolution neural network [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 63-70.
[5]	ZHANFG Fang-fang, CAO Xing-chao. Lexical and semantic relevance matching based neural document ranking [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 46-53.
[6]	QIN Jing, LIN Hong-fei, XU Bo. Music retrieval model based on semantic descriptions [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(6): 40-48.
[7]	. Optimal white noise estimatorsfor linear systems with  delayed measurements [J]. J4, 2009, 44(6): 63-68.