正倒向随机最优控制问题的最大值原理:完全信息和部分信息

doi:10.6040/j.issn.1671-9352.9.2021.012

摘要/Abstract

摘要： 本文是一篇关于正倒向随机最优控制问题研究进展的综述论文。近30年来,正倒向随机控制系统的各种理论与应用研究得到了迅猛发展,取得了大量原创性科研成果,吸引了大批国际同行跟进研究。限于论文篇幅和作者意图,本文仅仅聚焦于正倒向随机最优控制问题的最大值原理这一主题,概述其最新研究进展及其在求解线性二次最优控制问题中的简单应用。

关键词: 正倒向随机系统, 一般最大值原理, 最优滤波, 倒向分离方法, 线性二次控制

Abstract: This paper reviews some progresses on forward-backward stochastic control system. In the past 30 years, various theories and applications of forward-backward stochastic control system have been developed rapidly and a large number of original scientific research results have been obtained, which have attracted many international peers to follow-up research. Limited to the length of this paper and the authors emphasis, this paper only focuses on maximum principle for optimal control of forward-backward stochastic system, and summarizes its latest research progress and application in solving linear quadratic optimal control problem.

Key words: forward-backward stochastic system, general maximum principle, optimal filtering, backward separation approach, linear-quadratic control

中图分类号:

O211

吴臻,王光臣,李敏. 正倒向随机最优控制问题的最大值原理:完全信息和部分信息[J]. 《山东大学学报(理学版)》, 2021, 56(10): 1-10.

WU Zhen, WANG Guang-chen, LI Min. Maximum principle for optimal control of forward-backward stochastic system: full information and partial information[J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2021, 56(10): 1-10.

参考文献

[1] BISMUT J M. Linear quadratic optimal stochastic control with random coefficients[J]. SIAM Journal on Control and Optimization, 1976, 14(3):419-444.
[2] PARDOUX E, PENG S G. Adapted solution of a backward stochastic differential equation[J]. Systems & Control Letters, 1990, 14(1):55-61.
[3] ANTONELLI F. Backward-forward stochastic differential equations[J]. The Annals of Applied Probability, 1993, 3(3):777-793.
[4] PARDOUX E, TANG S J. Forward-backward stochastic differential equations and quasilinear parabolic PDEs[J]. Probability Theory and Related Fields, 1999, 114(2):123-150.
[5] MA J, PROTTER P, YONG J M. Solving forward-backward stochastic differential equations explicitly: a four step scheme[J]. Probability Theory and Related Fields, 1994, 98(3):339-359.
[6] HU Y, PENG S G. Solution of forward-backward stochastic differential equations[J]. Probability Theory and Related Fields, 1995, 103(2):273-283.
[7] PENG S G, WU Z. Fully coupled forward-backward stochastic differential equations and applications to optimal control[J]. SIAM Journal on Control and Optimization, 1999, 37(3):825-843.
[8] MA J, YONG J M. Forward-backward stochastic differential equations and their applications[M] //Lecture Notes in Math 1702. New York: Springer-Verlag, 1999.
[9] YONG J M, ZHOU X Y. Stochastic controls: Hamiltonian systems and HJB euqations[M]. New York: Springer-Verlag, 1999.
[10] MA J, WU Z, ZHANG D T, et al. On well-posedness of forward-backward SDEs: a unified approach[J]. The Annals of Applied Probability, 2015, 25(4):2168-2214.
[11] WANG G C, WU Z. The maximum principles for stochastic recursive optimal control problems under partial information[J]. IEEE Transactions on Automatic Control, 2009, 54(6):1230-1242.
[12] PENG S G. A general stochastic maximum principle for optimal control problems[J]. SIAM Journal on Control and Optimization, 1990, 28(4):966-979.
[13] PENG S G. Backward stochastic differential equations and applications to optimal control[J]. Applied Mathematics and Optimization, 1993, 27(2):125-144.
[14] XU W S. Stochastic maximum principle for optimal control problem of forward and backward system[J]. The ANZIAM Journal, 1995, 37(2):172-185.
[15] WU Z. Maximum principle for optimal control problem of fully coupled forward-backward stochastic systems[J]. Journal of Mathematics and System Science. 1998, 11(3):249-259.
[16] PENG S G. Open problems on backward stochastic differential equations[M] // Control of Distributed Parameter and Stochastic Systems. Boston: Springer, 1999: 265-273.
[17] WU Z. A general maximum principle for optimal control of forward-backward stochastic systems[J]. Automatica, 2013, 49(5):1473-1480.
[18] HU M S. Stochastic global maximum principle for optimization with recursive utilities[J]. Probability, Uncertainty and Quantitative Risk, 2017, 2(1):1-20.
[19] WU Z, YU Z Y. Dynamic programming principle for one kind of stochastic recursive optimal control problem and Hamilton-Jacobi-Bellman equation[J]. SIAM Journal on Control and Optimization, 2008, 47(5):2616-2641.
[20] NIE T Y, SHI J T, WU Z. Connection between MP and DPP for stochastic recursive optimal control problems: viscosity solution framework in the general case[J]. SIAM Journal on Control and Optimization, 2017, 55(5):3258-3294.
[21] WONHAM W M. On the separation theorem of stochastic control[J]. SIAM Journal on Control, 1968, 6(2):312-326.
[22] LI X J, TANG S J. General necessary conditions for partially observed optimal stochastic controls[J]. Journal of Applied Probability, 1995, 32(4):1118-1137.
[23] TANG S J. The maximum principle for partially observed optimal control of stochastic differential equations[J]. SIAM Journal on Control and Optimization, 1998, 36(5):1596-1617.
[24] WANG G C, WU Z. Kalman-Bucy filtering equations of forward and backward stochastic systems and applications to recursive optimal control problems[J]. Journal of Mathematical Analysis and Applications, 2008, 342(2):1280-1296.
[25] WANG G C, ZHANG C H, ZHANG W H. Stochastic maximum principle for mean-field type optimal control under partial information[J]. IEEE Transactions on Automatic Control, 2014, 59(2):522-528.
[26] WANG G C, WU Z, XIONG J. An introduction to optimal control of FBSDE with incomplete information[M]. Cham: Springer, 2018.
[27] WU Z. A maximum principle for partially observed optimal control of forward-backward stochastic control systems[J]. Science China Information Sciences, 2010, 53(11):2205-2214.
[28] WANG G C, WU Z, XIONG J. Maximum principles for forward-backward stochastic control systems with correlated state and observation noises[J]. SIAM Journal on Control and Optimization, 2013, 51(1):491-524.
[29] WANG G C, WU Z, XIONG J. A linear-quadratic optimal control problem of forward-backward stochastic differential equations with partial information[J]. IEEE Transactions on Automatic Control, 2015, 60(11):2904-2916.
[30] KOHLMANN M, ZHOU X Y. Relationship between backward stochastic differential equations and stochastic controls: a linear-quadratic approach[J]. SIAM Journal on Control and Optimization, 2000, 38(5):1392-1407.
[31] LIM A E B, ZHOU X Y. Linear-quadratic control of backward stochastic differential equations[J]. SIAM Journal on Control and Optimization, 2001, 40(2):450-474.
[32] HU M S, JI S L, XUE X L. A global stochastic maximum principle for fully coupled forward-backward stochastic systems[J]. SIAM Journal on Control and Optimization, 2018, 56(6):4309-4335.
[33] MOON J. The risk-sensitive maximum principle for controlled forward-backward stochastic differential equations[J]. Automatica, 2020, 120:109069.
[34] JI S L, LIU H. Maximum principle for stochastic optimal control problem of forward-backward stochastic difference systems[J/OL].(2021-12-29). International Journal of Control. https://arxiv.org/abs/1812.11283.
[35] LI R J, FU F Y. The maximum principle for partially observed optimal control problems of mean-field FBSDEs[J]. International Journal of Control, 2019, 92(10):2463-2472. doi:10.1080/00207179.2018.1441555.
[36] ZHANG S Q, LI X, XIONG J. A stochastic maximum principle for partially observed stochastic control systems with delay[J]. Systems & Control Letters, 2020, 146:1-7. doi:10.1016/j.sysconle.2020.104812.
[37] BENSOUSSAN A. Stochastic control of partially observable systems[M]. Cambridge: Cambridge University Press, 1992.
[38] HUANG J H, WANG G C, XIONG J. A maximum principle for partial information backward stochastic control problems with applications[J]. SIAM Journal on Control and Optimization, 2009, 48(4):2106-2117.
[39] LI N, WANG G C, WU Z. Linear-quadratic optimal control for time-delay stochastic system with recursive utility under full and partial information[J]. Automatica, 2020, 121:109169.
[40] BENSOUSSAN A, FENG X W, HUANG J H. Linear-quadratic-Gaussian mean-field-game with partial observation and common noise[J]. Mathematical Control & Related Fields, 2021, 11(1):23-46.
[41] WANG G C, WANG W C, YAN Z G. Linear quadratic control of backward stochastic differential equation with partial information[J]. Applied Mathematics and Computation, 2021, 403:126164.

多维度评价

Viewed

Full text

Abstract

Cited

Shared

Discussed