您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(理学版)》

J4 ›› 2011, Vol. 46 ›› Issue (5): 44-48.

• SEWM 2011 会议 • 上一篇    下一篇

基于HITS算法的查询结果多样化方法

陈飞,张敏,刘奕群,马少平   

  1. 智能技术与系统国家重点实验室, 清华大学信息科学与技术国家实验室, 清华大学计算机系, 北京  100084
  • 收稿日期:2010-12-06 发布日期:2011-05-25
  • 作者简介:陈飞(1987- ),男,博士研究生,主要研究方向为信息检索.Email:chenfei27@gmail.com
  • 基金资助:

    国家自然科学基金资助项目(60736044,60903107);高等学校博士学科点专项科研基金资助项目(20090002120005)

The search result diversification approach based on the HITS algorithm

CHEN Fei, ZHANG Min, LIU Yi-qun, MA Shao-ping   

  1. State Key Lab of Intelligent Technology and Systems,Tsinghua National Laboratory for Information Science and Technology,
    Department of Computer Science and Technology,Tsinghua University, Beijing 100084, China
  • Received:2010-12-06 Published:2011-05-25

摘要:

现有的查询结果多样化研究很难准确得到用户多样性需求并提供与用户查询各个方面需求相关的文档。针对这个问题,本文基于HITS算法的网页间链接分析特性,根据网页链接图直接计算查询结果列表中的文档可能满足用户多样性需求的程度,并将其应用到结果列表的重排序中以实现搜索结果多样性。在TREC大规模数据集合上的实验结果表明了该方法的有效性。

关键词: 多样性;HITS;PageRank;权威性;中心性

Abstract:

To avoid the problem that users′ diversity needs cannot be precisely obtained or documents provided cannot concern all aspects of the needs in a specific query,a new method was proposed based on the linkparsing feature of the HITS algorithm, in where the possibility was directly calculated according to the diversity of documents in the search result list for a query, and then the result list was reranked based on this value. Experimental results on the TREC′s largescale data collections verified that this method was effective.

Key words:  diversity; HITS; PageRank; authority; Hub

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!