J4 ›› 2012, Vol. 47 ›› Issue (5): 25-31.

• Articles • Previous Articles     Next Articles

Automatic extracting topic page links from Hub page

XIA  Tian1,2   

  1. 1. Key Laboratory of Data Engineering and Knowledge Engineering, MOE, Beijing 100872, China;
    2. School of Information Resource Management, Renmin University of China, Beijing 100872, China
  • Received:2011-11-30 Online:2012-05-20 Published:2012-06-01

Abstract:

A topic link extraction method from Hub page based on extended label tree was proposed. Firs, a topic link sorted list was build and deny rules were learned by prefix tree, then, the link type was pre-determined. Second, by group splitting and re-merging, each candidate link was classified into different groups. The group type and the group which represented the hub page’s core region were identified, and finally all links were put into three different collections. Experimental results show that this method can achieve high-precision for topic link extraction without training.

Key words: link extraction; extended label tree; link prefix tree

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!