[1] Arapostathis, A., et al.:
Discrete time controlled Markov processes with average cost criterion: a survey. SIAM J. Control Optim. 31 (1993), 282-344.
DOI 10.1137/0331018 |
MR 1205981
[2] Casella, G., Berger, R. L.: Statistical Inference. Second edition. Duxbury Thomson Learning 2002.
[3] Dynkin, E. B., Yushkevich, A. A.:
Controlled Markov Processes. Springer, New York 1979.
MR 0554083
[4] Gordienko, E., Hernández-Lerma, O.:
Average cost Markov control processes with weighted norms: existence of canonical policies. Appl. Math. (Warsaw) 23 (1995), 2, 199-218.
MR 1341223 |
Zbl 0829.93067
[9] Kakumanu, M.:
Nondiscounted continuous time Markov decision process with countable state space. SIAM J. Control Optim. 10 (1972), 1, 210-220.
DOI 10.1137/0310016 |
MR 0307785
[13] Sennott, L. I.:
Average reward optimization theory for denumerable state spaces. In: Handbook of Markov Decision Processes (Int. Ser. Operat. Res. Manag. Sci. 40) (E. A. Feinberg and A. Shwartz, eds.), Kluwer, Boston 2002, pp. 153-172.
DOI 10.1007/978-1-4615-0805-2_5 |
MR 1887202 |
Zbl 1008.90068
[15] Zhu, Q. X.:
Average optimality for continuous-time jump Markov decision processes with a policy iteration approach. J. Math. Anal. Appl. 339 (2008), 1, 691-704.
DOI 10.1016/j.jmaa.2007.06.071 |
MR 2370686