On constrained Markov decision processes

Moshe Haviv*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

25 Scopus citations

Abstract

A multichain Markov decision process with constraints on the expected state-action frequencies may lead to a unique optimal policy which does not satisfy Bellman's principle of optimality. The model with sample-path constraints does not suffer from this drawback.

Original languageEnglish
Pages (from-to)25-28
Number of pages4
JournalOperations Research Letters
Volume19
Issue number1
DOIs
StatePublished - Jul 1996

Keywords

  • Constrained optimization
  • Markov processes
  • Sample path

Fingerprint

Dive into the research topics of 'On constrained Markov decision processes'. Together they form a unique fingerprint.

Cite this