JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, Vol. 51, Issue (1): 95-100. doi: 10.6040/j.issn.1671-9352.1.2015.C03


Text topic mining of archives research based on SVD

FENG Guo-he, WANG Dan-di, LI Mei-chan   

  1. College of Economic & Management, South China Normal University, Guangzhou 510006, Guangdong, China
  • Received: 2015-05-27  Online: 2016-01-16  Published: 2016-11-29

Abstract: Data on National Social Science Fund projects in the archives field from 2010 to 2014 were collected, the project titles were segmented into words, and a term-document matrix was built. Local and global weights were designed according to the importance of each term and then combined to give the entries of the weighted term-document matrix. Singular value decomposition (SVD) was used for feature dimension reduction, and the themes of recent National Social Science Fund archives projects were examined in different dimensions. Finally, visual analysis yielded seven research topics in social science archives: intangible cultural heritage protection; electronic document management; digital resource construction; value and research of archival information resources; archival information protection systems; research on archives; and security of archival information.
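
The pipeline described in the abstract (term-document matrix, local weight combined with global weight, then SVD) can be illustrated with a minimal sketch. The abstract does not give the weighting formulas, so the log local weight and entropy global weight used here are an assumption, chosen only because they are a common pairing in text mining. The titles, function names, and the choice of k = 3 are likewise hypothetical and are not taken from the paper's data.

```python
import numpy as np

# Hypothetical, already-segmented project titles (illustrative only).
titles = [
    ["electronic", "document", "management"],
    ["electronic", "records", "management", "system"],
    ["digital", "archives", "resource", "construction"],
    ["digital", "resource", "sharing"],
    ["intangible", "cultural", "heritage", "protection"],
    ["cultural", "heritage", "archives", "protection"],
]


def term_document_matrix(docs):
    """Return (vocab, counts), where counts[i, j] is the frequency of term i in doc j."""
    vocab = sorted({term for doc in docs for term in doc})
    index = {term: i for i, term in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(docs)))
    for j, doc in enumerate(docs):
        for term in doc:
            counts[index[term], j] += 1
    return vocab, counts


def log_entropy_weight(counts):
    """Assumed weighting: log local weight times entropy global weight."""
    n_docs = counts.shape[1]
    local = np.log2(1.0 + counts)
    # Share of each term's total frequency that falls in each document.
    p = counts / counts.sum(axis=1, keepdims=True)
    with np.errstate(divide="ignore", invalid="ignore"):
        plogp = np.where(p > 0, p * np.log2(p), 0.0)  # treat 0 * log 0 as 0
    # 1 for a term confined to one document, 0 for a term spread evenly over all.
    global_w = 1.0 + plogp.sum(axis=1, keepdims=True) / np.log2(n_docs)
    return local * global_w


def truncated_svd(matrix, k):
    """Keep the k largest singular triplets of the matrix."""
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    return u[:, :k], s[:k], vt[:k, :]


vocab, counts = term_document_matrix(titles)
weighted = log_entropy_weight(counts)
u_k, s_k, vt_k = truncated_svd(weighted, k=3)

# Coordinates of each title in the reduced k-dimensional space (one row per title).
doc_coords = (np.diag(s_k) @ vt_k).T
```

Titles that share vocabulary end up close together in `doc_coords`, which is the kind of low-dimensional representation that can then be plotted or clustered to look for topics.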

Key words: term-document matrix, singular value decomposition, topic mining, weight design, archives project

CLC Number: TP391.1