JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2024, Vol. 59 ›› Issue (5): 100-113.doi: 10.6040/j.issn.1671-9352.7.2023.4204

Previous Articles     Next Articles

Feature selection for partial label learning based on neighborhood rough sets

GAO Hefei1, LI Yan2*, WANG Shuo1   

  1. 1. College of Mathematics and Information Science, Hebei University, Baoding 071002, Hebei, China;
    2. School of Applied Mathematics, Beijing Normal University at Zhuhai, Zhuhai 519000, Guangdong, China
  • Published:2024-05-09

Abstract: A feature selection method for partial label learning based on neighborhood rough sets is proposed. A partial label neighborhood decision system is constructed, and the concepts of lower approximation and dependency of neighborhood rough sets are then defined in partial label learning. On this basis, a feature selection algorithm suitable to partial label classification is developed. This method can measure the similarity between labels in the set of candidate labels while granulating the feature space in the neighborhood, and select a subset of features with strong relevance to the label information. Two generation mechanisms for false positive candidate labels are used which are different from the most often used random method, and their impact on the results are compared and analyzed in the experiments. Finally, extensive experimental results on six real-world and six controlled synthetic partial label data sets are presented to demonstrate the effectiveness of the proposed feature selection method.

Key words: partial label learning, feature selection, partial label neighborhood decision system, neighborhood rough sets

CLC Number: 

  • TP181
[1] WANG Dengbao, ZHANG Minling, LI Li. Adaptive graph guided disambiguation for partial label learning[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, 44(12):8796-8811.
[2] NGUYEN N, CARUANA R. Classification with partial labels[C] //Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2008:551-559.
[3] HÜLLERMEIER E, BERINGER J. Learning from ambiguously labeled examples[J]. Intelligent Data Analysis, 2006, 10(5):419-439.
[4] COUR T, SAPP B, TASKAR B. Learning from partial labels[J]. The Journal of Machine Learning Research, 2011, 12(5):1501-1536.
[5] YU Fei, ZHANG Minling. Maximum margin partial label learning [J].Machine Learning, 2017, 106(4):1-21.
[6] ZHANG Minling, YU Fei. Solving the partial label learning problem: an instance-based approach[C] //International Joint Conference on Artificial Intelligence. Buenos Aires: Morgan Kaufmann, 2015:4048-4054.
[7] ZHANG Minling, WU Xuan. Disambiguation-free partial label learning[J]. Scientia Sinica Informationis, 2019, 49(9):1083-1096.
[8] WU Xuan, ZHANG MinLing. Towards enabling binary decomposition for partial label learning[C] //International Joint Conference on Artificial Intelligence. Stockholm: Morgan Kaufmann, 2018:2868-2874.
[9] ZHANG Minling, WU Jinghan, BAO Weixuan. Disambiguation enabled linear discriminant analysis for partial label dimensionality reduction[J]. ACM Transactions on Knowledge Discovery From Data, 2022, 16(4):1-18.
[10] BAO Weixuan, HANG Junyi, ZHANG Minling. Partial label dimensionality reduction via confidence-based dependence maximization[C] //Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. New York: ACM, 2021:46-54.
[11] LI Haikun, FANG Min, GE Lingchi, et al. Partial label dimensional reduction via semantic difference information and manifold regularization[J]. International Journal on Artificial Intelligence Tools, 2022, 31(2):1-13.
[12] BAO Weixuan, HANG Junyi, ZHANG Minling. Submodular feature selection for partial label learning[C] //Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. New York: ACM, 2022:26-34.
[13] XIA Shuyin, ZHANG Zhao, LI Wenhua, et al. GBNRS: a novel rough set algorithm for fast adaptive attribute reduction in classification[J]. IEEE Transactions on Knowledge and Data Engineering, 2022, 34(3):1231-1242.
[14] WANG Changzhong, WANG Yan, SHAO Mingwen, et al. Fuzzy rough attribute reduction for categorical data[J]. IEEE Transactions on Fuzzy Systems, 2020, 28(5):818-830.
[15] WANG Changzhong, HU Qinghua, WANG Xizhao, et al. Feature selection based on neighborhood discrimination index[J]. IEEE Transactions on Neural Networks, 2018, 29(7):2986-2999.
[16] FAN Jing, JIANG Yunliang, LIU Yong. Quick attribute reduction with generalized indiscernibility models[J]. Information Sciences, 2017, 397:15-36.
[17] CAMPAGNER A, CIUCCI D, HÜLLERMEIER E. Rough set-based feature selection for weakly labeled data[J]. International Journal of Approximate Reasoning, 2021, 136(1):150-167.
[18] QIAN Wenbin, LI Yihui, YE Qianzhi, et al. Disambiguation-based partial label feature selection via feature dependency and label consistency[J]. Information Fusion, 2023, 94:152-168.
[19] 胡清华,赵辉,于达仁. 基于邻域粗糙集的符号与数值属性快速约简算法 [J]. 模式识别与人工智能,2008,21(6):732-738. HU Qinghua, ZHAO Hui, YU Daren. Efficient symbolic and numerical attribute reduction with rough sets[J]. Pattern Recognition and Artificial Intelligence, 2008, 21(6):732-738.
[20] HU Qinghua, YU Daren, LIU Jinfu, et al. Neighborhood rough set based heterogeneous feature subset selection[J]. Information Sciences, 2008, 178(18):3577-3594.
[21] PANIS G, LANITIS A. An overview of research activities in facial age estimation using the FG-NET aging database[C] //European Conference on Computer Vision. Zürich: Springer, 2015.
[22] ZENG Zinan, XIAO Shijie, JIA Kui, et al. Learning by associating maximum margin images[C] //Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition. Portland: IEEE, 2013:708-715.
[23] LIU Liping, THOMAS G. DIETTERICH. A conditional multinomial mixture model for superset label learning[J]. Advances in Neural Information Processing Systems, 2012, 25:557-565.
[24] BRIGGS F, FERN X Z, RAICH R. Rank-loss support instance machines for MIML instance annotation[C] //Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: ACM, 2012:534-542.
[25] HUISKES M J, LEW M S. The MIR flickr retrieval evaluation[C] //Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval. New York: ACM, 2008:39-43.
[1] CHENG Yuxuan, MAO Yu, ZHANG Xiaoqing, ZENG Yixiang, LIN Yaojin. Online multi-label feature selection based on sub-correlation features and neighborhood mutual information [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2024, 59(5): 70-81.
[2] ZHU Liquan, LIN Yaojin, MAO Yu, CHENG Yuxuan. Multi-label online stream feature selection based on high-dimensional correlation [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2024, 59(5): 90-99.
[3] Chunyu SHI,Yu MAO,Haoyang LIU,Yaojin LIN. Hierarchical feature selection algorithm based on instance correlations [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2024, 59(3): 61-70.
[4] ZHANG Zhi-hao, LIN Yao-jin, LU Shun, WU Yi-lin, WANG Chen-xi. Multi-label feature selection with streaming and missing labels [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2022, 57(8): 39-52.
[5] SUN Lin, CHEN Yu-sheng, XU Jiu-cheng. Multilabel feature selection algorithm based on improved ReliefF [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2022, 57(4): 1-11.
[6] SUN Lin, LIANG Na, XU Jiu-cheng. Feature selection using adaptive neighborhood mutual information and spectral clustering [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2022, 57(12): 13-24.
[7] ZHANG Yao, MA Ying-cang, YAND Xiao-fei, ZHU Heng-dong, YANG Ting. Multi-label feature selection based on manifold structure and flexible embedding [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2021, 56(7): 91-102.
[8] HUANG Tian-yi, ZHU William. Cost-sensitive feature selection via manifold learning [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(3): 91-96.
[9] WAN Zhong-ying, WANG Ming-wen, ZUO Jia-li, WAN Jian-yi. Feature selection combined with the global and local information(GLFS) [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(5): 87-93.
[10] LI Zhao,SUN Zhan-,LI Xiao,LI Cheng,. Study on feature selection method based on information loss [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(11): 7-12.
[11] XIA Meng-nan, DU Yong-ping, ZUO Ben-xin. Micro-blog opinion analysis based on syntactic dependency and feature combination [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 22-30.
[12] ZHENG Yan, PANG Lin, BI Hui, LIU Wei, CHENG Gong. Feature selection algorithm based on sentiment topic model [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 74-81.
[13] PAN Qing-qing, ZHOU Feng, YU Zheng-tao, GUO Jian-yi, XIAN Yan-tuan. Recognition method of Vietnamese named entity based on#br# conditional random fields [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(1): 76-79.
[14] YU Ran 1,2, LIU Chun-yang3*, JIN Xiao-long 1, WANG Yuan-zhuo 1, CHENG Xue-qi 1. Chinese spam microblog filtering based on the fusion of
multi-angle features
[J]. J4, 2013, 48(11): 53-58.
[15] YI Chao-qun, LI Jian-ping, ZHU Cheng-wen. A kind of feature selection based on classification accuracy of SVM [J]. J4, 2010, 45(7): 119-121.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] WANG Gang, XU Xin-shun*. A new Multi-instance learning method for scene classification[J]. J4, 2010, 45(7): 108 -113 .
[2] LU Wei-jie,ZHU Chen-fu,SONG Cui and YANG Yan-li . Determination of inorganic cations in the Chinese traditional drug Yujin by capillary electrophoresis[J]. J4, 2007, 42(7): 13 -18 .
[3] LI Hui-juan,YIN Hai-ying,ZHANG Xue-cheng and YANG Ai-fang* . Enhancement of drought resistance in transgenic tobacco expressing sucrose: Sucrose 1-fructosyltransferase gene from Lactuca sativa[J]. J4, 2007, 42(1): 89 -94 .
[4] HU Chuan-Ke, CHEN Ru-Hui, DIAO E-Ou. Prediction of proteinprotein interaction based on improved pseudo amino acid composition[J]. J4, 2009, 44(9): 17 -21 .
[5] ZHAO Jun1, ZHAO Jing2, FAN Ting-jun1*, YUAN Wen-peng1,3, ZHANG Zheng1, CONG Ri-shan1. Purification and anti-tumor activity examination of water-soluble asterosaponin from Asterias rollestoni Bell[J]. J4, 2013, 48(1): 30 -35 .
[6] YANG Yong-wei1, 2, HE Peng-fei2, LI Yi-jun2,3. On strict filters of BL-algebras#br#[J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(03): 63 -67 .
[7] HAN Ya-fei, YI Wen-hui, WANG Wen-bo, WANG Yan-ping, WANG Hua-tian*. Soil bacteria diversity in continuous cropping poplar plantation#br# by high throughput sequencing[J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(05): 1 -6 .
[8] CHEN Shunmin 1,2, CHEN Guiyun 3. Groups of order paqbrc having exactly three conjugacy classes of nonnormal subgroups[J]. J4, 2009, 44(4): 5 -7 .
[9] YANG Zhe . Backward stochastic differential advanced equation withconstant advanced time of functional type[J]. J4, 2006, 41(5): 39 -43 .
[10] XIE Juan-ying1, 2, ZHANG Yan1, XIE Wei-xin2, 3, GAO Xin-bo2. A novel rough K-means clustering algorithm based on the weight of density[J]. J4, 2010, 45(7): 1 -6 .