JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2024, Vol. 59 ›› Issue (1): 1-10, 45.doi: 10.6040/j.issn.1671-9352.0.2023.512

• Invited Review •     Next Articles

Strategic limit theory and strategic statistical learning

Xiaodong YAN()   

  1. Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan 250100, Shandong, China
  • Received:2023-12-05 Online:2024-01-20 Published:2024-01-19

Abstract:

The nonlinear expectation is an original research direction pioneered by Academician Peng Shige of Shandong University, which is becoming increasingly important in various fields of scientific research. The rise of big data and artificial intelligence has provided stronger impetus for innovative theoretical and applied research in nonlinear expectation. Recently, Shandong University's Nonlinear Probability Team has developed the "Strategy Limit Theory" based on the strategic game process of multi-armed bandits, representing a significant breakthrough in the intersection of nonlinear probability theory and reinforcement learning. This has tran-sformed the research paradigm of traditional statistical methods. Based on the proposed 10 basic mathematical problems of artificial intelligence by Academician Xu Zongben, the declaration guide of 2022 major research plan projects issued by the National Natural Science Foundation of China for the research about universal and interpretable artificial intelligence technologies, and the application guide for basic mathematical theory research of artificial intelligence in 2021 and 2022 the key projects of "Mathematics and Applied Research" issued by the Ministry of Science and Technology, this article adopts the concept of "strategy" to reveal the nature of artificial intelligence and explore and the motivation source and theoretical basis for initiating and promoting the innovation of artificial intelligence technology. Different from the applications of the traditional law of large numbers and the central limit theorem in the field of artificial intelligence, we propose novel theory about the strategic law of large numbers and the central limit theorem in the new generation of artificial intelligence. The discussed topics in this work include but not limited to: (1) strategic sampling of massive data; (2) online learning of streaming data; (3) the central limit theorem of reinforcement learning; (4) differential privacy protection of data; (5) strategic integration of federal learning; (6) information reconstruction of transfer learning and meta learning; (7) the fusion of knowledge reasoning and data driving.

Key words: artificial intelligence, strategic limit theory, mathematical foundation, big data analysis, reinforcement learning, online learning, transfer learning, federated learning, data privacy protection, knowledge reasoning and data driving

CLC Number: 

  • TP18

Fig.1

Different probability density functions of the peak distribution f(z) and the binormal distribution g(z) as the parameters k, h change, where the green line is the probability density function plot of the standard normal distribution"

Fig.2

Coverage probability of the 3 peak distribution probability density functions Xt(blue), Yt (green), and R0, Π* (red) in the interval [a, b], where the expectations of Xt and Xt are set to μL=-1, μR=1 and the variances are set to σL=σR=1, respectively, and consider the three intervals[0, 1], [0.5, 1.5], [1, 2]"

1 徐宗本. 用好大数据须有大智慧: 准确把握、科学应对大数据带来的机遇和挑战[N]. 人民日报, 2016-03-15 (07).
XU Zongben. To make good use of big data requires great wisdom: accurately grasp and scientifically respond to the opportunities and challenges brought by big data[N]. People's Daily, 2016-03-15 (07).
2 徐宗本. 把握新一代信息技术的聚焦点: 数字化、网络化、智能化[N]. 人民日报, 2019-03-01 (09).
XU Zongben. Grasp the focus of the new generation of information technology: digitalization, networking, and intelligence[N]. People's Daily, 2019-03-01 (09).
3 徐宗本, 唐年胜, 程学旗. 数据科学: 它的内涵、方法、意义与发展[M]. 北京: 科学出版社, 2021.
XU Zongben , TANG Niansheng , CHENG Xueqi . Data science: its connotation, method, significance and development[M]. Beijing: Science Publishing, 2021.
4 徐宗本. 人工智能的10个重大数理基础问题[J]. 中国科学: 信息科学, 2021, 51, 1967- 1978.
XU Zongben . Ten fundamental problems for artificial intelligence: mathematical and physical aspects[J]. Science China: Information Sciences, 2021, 51, 1967- 1978.
5 PENG S . Nonlinear expectations and stochastic calculus under uncertainty: with robust CLT and G-Brownian motion[M]. Berlin: Springer Nature, 2019.
6 CHEN Z , EPSTEIN L G . A central limit theorem for sets of probability measures[J]. Stochastic Processes and Their Applications, 2022, 152, 424- 451.
doi: 10.1016/j.spa.2022.07.003
7 CHEN Z, FENG S, ZHANG G. Strategy-driven limit theorems associated bandit problems[EB/OL]. 2022-04-09[2023-12-05]. https://arxiv.org/abs/2204.04442.
8 CHEN Z , EPSTEIN L , ZHANG G . A central limit theorem, loss aversion and multi-armed bandits[J]. Journal of Economic Theory, 2023, 209, 105645.
doi: 10.1016/j.jet.2023.105645
9 CHEN Z , YAN X , ZHANG G . Strategic two-sample test via two-armed bandit process[J]. Journal of the Royal Statistical Society Series B: Statistical Methodology, 2023, 85, 1271- 1298.
doi: 10.1093/jrsssb/qkad061
10 CHEN Z , FENG X , LIU S , et al. Optimal distributions of rewards for a two-armed slot machine[J]. Neurocomputing, 2023, 518, 401- 407.
doi: 10.1016/j.neucom.2022.11.019
11 ZHAO T , CHENG G , LIU H . A partially linear framework for massive heterogeneous data[J]. Annals of Statistics, 2016, 44 (4): 1400- 1437.
12 KLEINER A , TALWALKAR A , SARKAR P , et al. A scalable bootstrap for massive data[J]. Journal of the Royal Statistical Society Series B: Statistical Methodology, 2014, 76 (4): 795- 816.
doi: 10.1111/rssb.12050
13 AI Mingyao , YU Jun , ZHANG Huiming , et al. Optimal subsampling algorithms for big data regressions[J]. Statistica Sinica, Forthcoming, 2021, 31, 749- 772.
14 ETIKAN I , ALKASSIM R , ABUBAKAR S . Comparision of snowball sampling and sequential sampling technique[J]. Biometrics and Biostatistics International Journal, 2016, 3 (1): 55.
15 LIN N , XI R . Aggregated estimating equation estimation[J]. Statistics and its Interface, 2011, 4, 73- 83.
doi: 10.4310/SII.2011.v4.n1.a8
16 SCHIFANO E D , WU J , WANG C , et al. Online updating of statistical inference in the big data setting[J]. Technometrics, 2016, 58 (3): 393- 403.
doi: 10.1080/00401706.2016.1142900
17 CHEN X , LEE J D , TONG X T , et al. Statistical inference for model parameters in stochastic gradient descent[J]. The Annals of Statistics, 2020, 48, 251- 273.
18 ZHU W , CHEN X , WU B . Online covariance matrix estimation in stochastic gradient descent[J]. Journal of the American Statistical Association, 2021, 118 (154): 393- 404.
19 LUO L , SONG P X K . Renewable estimation and incremental inference in generalized linear models with streaming data sets[J]. Journal of the Royal Statistical Society Series B: Statistical Methodology, 2020, 82, 69- 97.
doi: 10.1111/rssb.12352
20 CUI W, JI X, KONG L, et al. Opposite online learning via sequentially integrated stochastic gradient descent estimators[C]//Proceedings of the AAAI Conference on Artificial Intelligence. Washington: AAAI Press, 2023, 37(6): 7270-7278.
21 SUTTON R S, BARTO A G. Reinforcement learning: an introduction[M]. [S. l. ]: MIT Press, 2018.
22 WILLIAMS R J . Simple statistical gradient-following algorithms for connectionist reinforcement learning[J]. Machine Learning, 1992, 8 (3): 229- 256.
23 LAI T L , ROBBINS H . Asymptotically efficient adaptive allocation rules[J]. Advances in Applied Mathematics, 1985, 6 (1): 4- 22.
doi: 10.1016/0196-8858(85)90002-8
24 DWORK C. Differential privacy: a survey of results[C]//International Conference on Theory and Applications of Models of Computation. Heidelberg: Springer, 2008: 1-19.
25 方滨兴. 释放数据使用权将成为未来技术发展取向[N/OL]. 中国新闻网, 2022-05-19[2023-12-05], https://news.sciencenet.cn/htmlnews/2022/5/479297.shtm.
26 WASSERMAN L , ZHOU S . A statistical framework for differential privacy[J]. Journal of the American Statistical Association, 2010, 105 (489): 375- 389.
doi: 10.1198/jasa.2009.tm08651
27 DUCHI J C , JORDAN M I , WAINWRIGHT M J . Privacy aware learning[J]. Journal of the ACM, 2014, 61 (6): 1- 57.
28 LI T , SAHU A K , TALWALKAR A , et al. Federated learning: challenges, methods, and future directions[J]. IEEE Signal Processing Magazine, 2020, 37 (3): 50- 60.
doi: 10.1109/MSP.2020.2975749
29 TAN C, SUN F, KONG T, et al. A survey on deep transfer learning[C]//International Conference on Artificial Neural Networks. Cham: Springer, 2018: 270-279.
30 FINN C, XU, K, LEVINE S. Probabilistic model-agnostic meta-learning[C]//NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing Systems. Montréal: Curran Associates Inc., 2018: 9537-9548.
31 VILALTA R , DRISSI Y . A perspective view and survey of meta-learning[J]. Artificial Intelligence Review, 2002, 18 (2): 77- 95.
doi: 10.1023/A:1019956318069
32 张钹. 人工智能进入后深度学习时代[J]. 智能科学与技术学报, 2019, 1 (1): 4- 6.
ZHANG Ba . Artificial intelligence is entering the post deep-learning era[J]. Chinese Journal of Intelligent Science and Technology, 2019, 1 (1): 4- 6.
33 张钹, 朱军, 苏航. 迈向第三代人工智能[J]. 中国科学: 信息科学, 2020, 50, 1281- 1302.
ZHANG Ba , ZHU Jun , SU Hang . Toward the third generation of artificial intelligence[J]. Science China: Information Sciences, 2020, 50, 1281- 1302.
[1] KANG Hai-yan, DENG Jie. Survey on blockchain data privacy protection [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2021, 56(5): 92-110.
[2] YU Chuan-ming, FENG Bo-lin, TIAN Xin, AN Lu. Deep representative learning based sentiment analysis in the cross-lingual environment [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 13-23.
[3] SUN Shi-chang, LIN Hong-fei, MENG Jia-na, LIU Hong-bo. Model selection with likelihood ratio for sequence transfer learning [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(6): 24-31.
[4] . Computational humor researches and applications [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(7): 1-10.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] ZHAO Tong-xin1, LIU Lin-de1*, ZHANG Li1, PAN Cheng-chen2, JIA Xing-jun1. Pollinators and pollen polymorphism of  Wisteria sinensis (Sims) Sweet[J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(03): 1 -5 .
[2] MA Jian-ling . Spectral characterization of the rhomb-type achromatic retarder[J]. J4, 2007, 42(7): 27 -29 .
[3] CHENG Zhi1,2, SUN Cui-fang2, WANG Ning1, DU Xian-neng1. On the fibre product of Zn and its property[J]. J4, 2013, 48(2): 15 -19 .
[4] ZHANG Ai-ping,LI Gang . LRquasinormalEhresmann semigroups[J]. J4, 2006, 41(5): 44 -47 .
[5] HUO Yu-hong, JI Quan-bao. Synchronization analysis of oscillatory activities in a biological cell system[J]. J4, 2010, 45(6): 105 -110 .
[6] SHI Chang-guang . Multi-soliton solution of the Faddeev model[J]. J4, 2007, 42(7): 38 -40 .
[7] MA Jie-xiong,JIANG Li,QI Yu-yu,XIANG FEng-ning,XIA Guang-min . The growth of calli and regenerated plantlets of Gentiana Przewalskii Maxim. and the constituents analysis of its two effective components[J]. J4, 2006, 41(6): 157 -160 .
[8] WANG Kang, LI Hua. Analysis of the compound Haqing injection with hyphenated chromatography and chemometric resolution[J]. J4, 2009, 44(11): 16 -20 .
[9] CHEN Li, . Singular LQ suboptimal control problem with disturbance rejection[J]. J4, 2006, 41(2): 74 -77 .
[10] XIE Tao, ZUO Ke-zheng. [J]. J4, 2013, 48(4): 95 -103 .