您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(理学版)》

J4 ›› 2012, Vol. 47 ›› Issue (3): 33-37.

• 电子技术与信息 • 上一篇    下一篇

基于人物相关社区的重名消解研究

李琦1,2,马军1,2*   

  1. 1.山东大学计算机科学与技术学院, 山东 济南 250101; 2.山东省软件工程重点实验室, 山东 济南 250101
  • 收稿日期:2012-01-11 出版日期:2012-03-20 发布日期:2012-04-01
  • 通讯作者: 马军(1956- ),男,博士,教授,博士生导师,主要研究领域为算法、信息检索和并行计算. Email: majun@sdu.edu.cn
  • 作者简介:李琦(1985- ),男,硕士研究生,主要研究方向为信息抽取、共指消解. Email: liqii@yahoo.com.cn
  • 基金资助:

    国家自然科学基金资助项目(60970047,61103151,61173068); 教育部博士点基金项目(20110131110028)

Person′s name disambiguation based on person  related social communities

LI Qi1,2, MA Jun1,2*   

  1. 1. School of Computer Science and Technology Shandong University, Jinan 250101, Shandong, China;
    2. Shandong Provincial Key Laboratory of Seftware Engineering, Jinan 250101, Shandong, China
  • Received:2012-01-11 Online:2012-03-20 Published:2012-04-01

摘要:

由于人的重名现象,人名检索的结果往往是同名的不同人物实体相关网页的混合。重名消解是根据上下文来区分同名的不同人物实体的过程。本文提出了基于相关社区的重名消解方法,采用改进的Espresso算法进行相关社区发现。将每个网页发现的社区应用到两阶段重名消解算法中,并且在WePS-2测试集上进行试验。实验结果表明了该方法的有效性。

关键词: 社会网络;社团;重名消解;人名检索;聚类

Abstract:

Person′s names are so ambiguous that the results of searching for a person′s name are usually a mixture of pages about namesakes. Person′s name disambiguation is a course of distinguishing different person′s entities with the same name. The method of person′s name disambiguation based on the relevant community was proposed and the modified Espresso algorithm was used to find relevant community for each Web page. The enlarged name sets were applied in the two-stage person′s name disambiguation algorithm, and then the algorithm was tested it on the WePS-2 test dataset. The experimental results show the effectiveness of our method.

Key words: social network; community; person′s name disambiguation; Web people search; clustering

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!