您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(理学版)》

J4

• 论文 •    下一篇

基于链接聚类的Shark-Search算法

苏 祺,项 锟,孙 斌   

  1. 北京大学计算语言学研究所,北京 100871
  • 收稿日期:2006-03-09 修回日期:1900-01-01 出版日期:2006-10-24 发布日期:2006-10-24
  • 通讯作者: 苏 祺

The Shark-Search algorithm based on clustering links

SU Qi,XIANG Kun and SUN Bin   

  1. Institute of Computational Linguistics, Peking Univ., Beijing 100871, China
  • Received:2006-03-09 Revised:1900-01-01 Online:2006-10-24 Published:2006-10-24
  • Contact: SU Qi

摘要: 根据对Shark-Search主题爬取算法的分析,提出了一种基于链接聚类的改进Shark-Search算法. 并通过几个对比实验对该算法进行了验证. 实验结果表明,新算法能够更有效地识别链接与主题的相关性.

关键词: Shark-Search算法, 主题爬取, 链接聚类

Abstract: Based on the analysis of the focused-crawling algorithm Shark-Search, an improved Shark-Search algorithm with link clustering is proposed. The new algorithm by several comparable experiments is validated. The results show that it could identify the relevance between link and focused topic more effectively.

Key words: link clustering , focused crawling, Shark-Search algorithm

[1] 陈 军,陈竹敏 . 基于网页分块的Shark-Search算法[J]. J4, 2007, 42(9): 62-66 .
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!