J4 ›› 2009, Vol. 44 ›› Issue (11): 48-51.

• Articles •

Research on an improved information gain method using distribution information of terms

YANG Yu-zhen, LIU Pei-yu*, ZHU Zhen-fang, QIU Ye

  1. Department of Information Science and Engineering, Shandong Normal University, Jinan 250014, Shandong, China
  • Received: 2009-07-07 Online: 2009-11-16 Published: 2009-11-25

Abstract:

The classification performance of the traditional information gain algorithm declines rapidly when feature items are unevenly distributed. This paper proposes an improved information gain formula that uses the distribution information of feature items. The distribution information is computed to determine whether the feature items are imbalanced and to offset the effect on classification accuracy of the case where a feature item does not appear. Experiments show that the improved method achieves better performance.
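For orientation, the traditional information gain score that the abstract takes as its baseline can be sketched as follows. This is only the textbook formula, IG(t) = H(C) - [P(t) H(C|t) + P(not t) H(C|not t)]; the improved formula proposed in the paper is not given in the abstract and is not reproduced here. The function names `entropy` and `information_gain`, and the representation of each document as a set of terms, are assumptions made for illustration.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy (base 2) of a class distribution given as raw counts."""
    counts = list(counts)
    total = sum(counts)
    if total == 0:
        return 0.0  # an empty partition contributes no uncertainty
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def information_gain(docs, labels, term):
    """Traditional information gain of `term` for text classification.

    docs   -- list of sets of terms (hypothetical representation)
    labels -- class label of each document, same order as docs
    """
    n = len(docs)
    class_counts = Counter(labels)
    # Class distribution among documents that contain the term ...
    present = Counter(lab for doc, lab in zip(docs, labels) if term in doc)
    # ... and among documents where the term does not appear.
    absent = Counter(lab for doc, lab in zip(docs, labels) if term not in doc)
    n_present = sum(present.values())
    n_absent = n - n_present
    h_class = entropy(class_counts.values())
    h_conditional = (
        (n_present / n) * entropy(present.values())
        + (n_absent / n) * entropy(absent.values())  # the "term absent" part
    )
    return h_class - h_conditional
```

The second term of `h_conditional` is the contribution of documents in which the feature item does not appear, which is the part the abstract says the improved method aims to balance.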

Key words: feature selection; information gain; distribution information inside a class; distribution information among classes

CLC Number: 

  • TP301