您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(理学版)》

《山东大学学报(理学版)》 ›› 2019, Vol. 54 ›› Issue (2): 41-50.doi: 10.6040/j.issn.1671-9352.0.2018.037

• • 上一篇    下一篇

基于粗糙集的协同过滤算法研究

郑荔平,胡敏杰,杨红和,林耀进   

  1. 闽南师范大学计算机学院, 福建 漳州 363000
  • 发布日期:2019-02-25
  • 作者简介:郑荔平(1977— ),女,硕士,讲师,研究方向为协同过滤与特征选择. E-mail:9306188@qq.com
  • 基金资助:
    国家自然科学基金资助项目(61303131);福建省教育厅科技资助项目(JAT170350);福建省教育厅科技资助项目(JAT170347)

Research on collaborative filtering algorithm based on rough set

ZHENG Li-ping, HU Min-jie, YANG Hong-he, LIN Yao-jin   

  1. School of Computer Science, Minnan Normal University, Zhangzhou 363000, Fujian, China
  • Published:2019-02-25

摘要: 协同过滤的推荐性能受限于评分矩阵中数据的稀疏性。针对这个问题,提出一种基于粗糙集的协同过滤方法,能在一定程度上缓解数据稀疏性的影响。首先进行数据预处理,即利用用户评分数量以及用户的评分值,作为用户进行分类的特征值,对用户进行分类;其次,利用粗糙集属性约简的方法剔除对用户分类影响较小的项目,生成更小的用户-项目评分矩阵,以降低数据的稀疏性和规模;最后基于约简后的数据集进行用户相似度的计算,获得目标用户真正近邻。实验结果表明,所提算法在常用评价推荐性能的指标MAE,COVERAGE,PRECISION和RECALL中显著优于COS,PCC,ADCOS,NHSM算法中的指标。

关键词: 协同过滤, 属性约简, 数据预处理, 分类标记, 粗糙集

Abstract: In collaborative filtering, the recommendation performance is limited by the data sparsity of rating matrix. To alleviate this problem, a rough set based collaborative filtering method is proposed, which is helpful to reduce the influence of the data sparsity to some extent. First, the procedure of data pre-processing is executed. As the eigenvalue for user classification, users ratings with the rating number are employed. Then, an attribute reduction approach of rough set is introduced to eliminate the item which has little effect with respect to user classification, and a smaller scale user-item rating matrix is generated to lower the data sparsity. Finally, the similarity between users is calculated based on the reducing attributes to obtain real neighbors of active users. The experimental results demonstrate that the proposed algorithm in MAE, COVERAGE, PRECISION and RECALL are significantly better than the ones of COS, PCC, ADCOS, NHSM.

Key words: collaborative filtering, attribute reduction, data preprocess, classification label, rough set

中图分类号: 

  • TP311
[1] GOLDBERG D, NICHOLS D, OKI B M, et al. Using collaborative filtering to weave an information tapestry[J]. Communications of the ACM, 1992, 35(12):61-70.
[2] RESNICK P, IACOVOU N, SUCHAK M, et al. GroupLens: an open architecture for collaborative filtering of netnews[C] // Proc of the ACM Conference on Computer Supported Cooperative Work. Chapel Hill:[s.n.] , 1994: 175-186.
[3] 杨恒宇,李慧宗,林耀进,等.协同过滤中有影响力近邻的选择[J].北京邮电大学学报,2016,39(1):29-35. YANG Hengyu, LI Huizong, LIN Yaojin, et al. Influential neighbor selection in collaborative filtering[J]. Journal of Beijing University of Posts and Telecommunications, 2016, 39(1):29-35.
[4] 张佳,林耀进,林梦雷,等.基于目标用户近邻修正的协同过滤算法[J].模式识别与人工智能,2015,28(9):802-810. ZHANG Jia, LIN Yaojin, LIN Menglei, et al. Target users neighbors modification based collaborative filtering[J]. Pattern Recognition and Artificial Intelligence, 2015, 28(9):802-810.
[5] 郭兰杰,梁吉业,赵兴旺.融合社交网络信息的协同过滤推荐算法[J].模式识别与人工智能,2016,29(3):281-288. GUO Lanjie, LIANG Jiye, ZHAO Xingwang. Collaborative filtering recommendation algorithm incorporating social network information[J]. Pattern Recognition and Artificial Intelligence, 2016, 29(3):281-288.
[6] 康熠华.面向数据稀疏问题的协同过滤推荐算法研究[D].内蒙古:内蒙古师范大学,2016. KANG Yihua. Research on collaborative filtering recommendation algorithms for data sparsity[D]. Inner Mongolia: Inner Mongolia Normal University, 2016.
[7] BREESE J S,HECKERMAN D,KADIE C. Empirical analysis of predictive algorithms for collaborative filtering[C] // Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence. Madison: UAI, 1998: 43-52.
[8] SARWAR B, KARYPIS G, KONSTAN J, et al. Item based collaborative filtering recommendation algorighms.[C] // Proceedings of the 10th Intel World Wide WEB Conference, New York: [s.n.] , 2001: 285-295.
[9] PAWLAK Z. Rough sets[J]. International Journal of Computer and Information Science,1982, 11(5):341-356.
[10] 胡清华,于达仁.应用粗糙集计算[M].北京:科学出版社,2012: 39-61. HU Qinghua, YU Daren. Applied rough sets[M]. Beijing: Science Press, 2012: 39-61.
[11] 张佳,林耀进,林梦雷,等.基于信息熵的协同过滤算法[J].山东大学学报(工学版),2016,46(2):43-50. ZHANG Jia, LIN Yaojin, LIN Menglei, et al. Entropy-based collaborative filtering algorithm[J]. Journal of Shandong University(Engineering Science), 2016, 46(2):43-50.
[12] BOBADILLA J, ORTEGA F, HERNANDO A, et al. Recommender systems survey[J]. Knowledge-Based Systems, 2013, 46:109-132.
[13] LIU Haifeng, HU Zheng, AHMAD Mian, et al. A new user similarity model to improve the accuracy of collaborative filtering[J]. Knowledge-Based Systems, 2014, 56(3):156-166.
[1] 李文焱,李丽红,王洪欣. 基于知识度量的模糊粗糙c-均值算法[J]. 《山东大学学报(理学版)》, 2026, 61(1): 49-64.
[2] 张光旭,姚卫. 基于二型模糊预序的模糊粗糙集模型[J]. 《山东大学学报(理学版)》, 2026, 61(1): 85-93.
[3] 周缪娟,黄韩亮,张纪平,李进金. 基于FT-粗糙集构建知识结构与寻找学习路径方法[J]. 《山东大学学报(理学版)》, 2025, 60(7): 116-130.
[4] 李心如,李令强,贾成昭. 新型多粒度变精度(*,·)-模糊粗糙集[J]. 《山东大学学报(理学版)》, 2025, 60(7): 131-142.
[5] 杨志强,冯山,尹伊,吴慧佳. 一种多因素融合的高效离群点检测方法[J]. 《山东大学学报(理学版)》, 2024, 59(8): 118-126.
[6] 宋苏洋,叶军,曾广财,孙清. 基于优化可辨识矩阵的多粒度粗糙集属性约简算法[J]. 《山东大学学报(理学版)》, 2024, 59(5): 52-62.
[7] 高贺飞,李艳,王硕. 基于邻域粗糙集的偏标记特征选择[J]. 《山东大学学报(理学版)》, 2024, 59(5): 100-113.
[8] 温欣,李德玉. 基于属性加权的ML-KNN方法[J]. 《山东大学学报(理学版)》, 2024, 59(3): 107-117.
[9] 王茜,张贤勇. 不完备邻域加权多粒度决策理论粗糙集及三支决策[J]. 《山东大学学报(理学版)》, 2023, 58(9): 94-104.
[10] 胡成祥,张莉,黄晓玲,王汇彬. 面向属性变化的动态邻域粗糙集知识更新方法[J]. 《山东大学学报(理学版)》, 2023, 58(7): 37-51.
[11] 胡玉文,徐久成,张倩倩. 决策演化集的李雅普诺夫稳定性[J]. 《山东大学学报(理学版)》, 2023, 58(7): 52-59.
[12] 吴凡,孔祥智. 基于模糊信息系统的模糊β-覆盖粗糙集模型[J]. 《山东大学学报(理学版)》, 2023, 58(5): 10-16.
[13] 时俊鹏,张燕兰. 面向对象删除的局部邻域粗糙集动态更新算法[J]. 《山东大学学报(理学版)》, 2023, 58(5): 17-25.
[14] 刘长顺,刘炎,宋晶晶,徐泰华. 基于论域离散度的属性约简算法[J]. 《山东大学学报(理学版)》, 2023, 58(5): 26-35.
[15] 林天泰,杨斌. 基于q-正交模糊集的冲突分析模型[J]. 《山东大学学报(理学版)》, 2023, 58(12): 77-90.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!