TY - CONF
T1 - Generalized prioritized sweeping
AU - Andre, David
AU - Friedman, Nir
AU - Parr, Ronald
PY - 1998
Y1 - 1998
N2 - Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent's limited computational resources to achieve a good estimate of the value of environment states. To choose effectively where to spend a costly planning step, classic prioritized sweeping uses a simple heuristic to focus computation on the states that are likely to have the largest errors. In this paper, we introduce generalized prioritized sweeping, a principled method for generating such estimates in a representation-specific manner. This allows us to extend prioritized sweeping beyond an explicit, state-based representation to deal with compact representations that are necessary for dealing with large state spaces. We apply this method for generalized model approximators (such as Bayesian networks), and describe preliminary experiments that compare our approach with classical prioritized sweeping.
AB - Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent's limited computational resources to achieve a good estimate of the value of environment states. To choose effectively where to spend a costly planning step, classic prioritized sweeping uses a simple heuristic to focus computation on the states that are likely to have the largest errors. In this paper, we introduce generalized prioritized sweeping, a principled method for generating such estimates in a representation-specific manner. This allows us to extend prioritized sweeping beyond an explicit, state-based representation to deal with compact representations that are necessary for dealing with large state spaces. We apply this method for generalized model approximators (such as Bayesian networks), and describe preliminary experiments that compare our approach with classical prioritized sweeping.
UR - http://www.scopus.com/inward/record.url?scp=21844480297&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:21844480297
SN - 0262100762
SN - 9780262100762
T3 - Advances in Neural Information Processing Systems
SP - 1001
EP - 1007
BT - Advances in Neural Information Processing Systems 10 - Proceedings of the 1997 Conference, NIPS 1997
PB - Neural Information Processing Systems Foundation
T2 - 11th Annual Conference on Neural Information Processing Systems, NIPS 1997
Y2 - 1 December 1997 through 6 December 1997
ER -