VECTOR-VALUED MARKOV DECISION PROCESSES WITH AVERAGE REWARD CRITERION: THE MULTICHAIN CASE

Kazuyoshi Wakuta

doi:10.1017/S0269964800144092

VECTOR-VALUED MARKOV DECISION PROCESSES WITH AVERAGE REWARD CRITERION: THE MULTICHAIN CASE

Published online by Cambridge University Press: 31 October 2000

Kazuyoshi Wakuta

Show author details

Kazuyoshi Wakuta: Affiliation:
Nagaoka Technical College, Nagaoka, Niigata 940-8532, Japan, E-mail: wakuta@nagaoka-ct.ac.jp

Article contents

Abstract

Get access

Rights & Permissions

Abstract

We study the multichain case of a vector-valued Markov decision process with average reward criterion. We characterize optimal deterministic stationary policies via systems of linear inequalities and discuss a policy iteration algorithm for finding all optimal deterministic stationary policies.

Type: Research Article
Information: Probability in the Engineering and Informational Sciences , Volume 14 , Issue 4 , October 2000 , pp. 533 - 548

DOI: https://doi.org/10.1017/S0269964800144092 [Opens in a new window]

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article contents

VECTOR-VALUED MARKOV DECISION PROCESSES WITH AVERAGE REWARD CRITERION: THE MULTICHAIN CASE

Abstract

Access options

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests