[2] Devroye L., Gyorfi L.:
Nonparametric Density Estimation the $L_{1}$ View. Wiley, New York 1985
MR 0780746
[3] Dynkin E. B., Yushkevich A. A.:
Controlled Markov Processes. Springer–Verlag, New York 1979
MR 0554083
[4] Gordienko E. I.:
Adaptive strategies for certain classes of controlled Markov processes. Theory Probab. Appl. 29 (1985), 504–518
Zbl 0577.93067
[5] Gordienko E. I., Minjárez-Sosa J. A.:
Adaptive control for discrete-time Markov processes with unbounded costs: discounted criterion. Kybernetika 34 (1998), 217–234
MR 1621512
[7] Hernández-Lerma O.:
Adaptive Markov Control Processes. Springer–Verlag, New York 1989
MR 0995463
[8] Hernández-Lerma O., Cavazos-Cadena R.:
Density estimation and adaptive control of Markov processes: average and discounted criteria. Acta Appl. Math. 20 (1990), 285–307
DOI 10.1007/BF00049572 |
MR 1081591
[9] Hernández-Lerma O., Lasserre J. B.:
Discrete-Time Markov Control Processes: Basic Optimality Criteria. Springer–Verlag, New York 1996
MR 1363487 |
Zbl 0840.93001
[10] Hernández-Lerma O., Lasserre J. B.:
Further Topics on Discrete-Time Markov Control Processes. Springer–Verlag, New York 1999
MR 1697198 |
Zbl 0928.93002
[13] Schäl M.:
Conditions for optimality and for the limit of $n$-stage optimal policies to be optimal. Z. Wahrs. Verw. Gerb. 32 (1975), 179–196
DOI 10.1007/BF00532612 |
MR 0378841