Non-Markovian policies in sequential decision problems

In this article we prove the validity of the Bellman Optimality Equation and related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that also nonMarkovian policies are taken into account. The theory is motivated by some expe...

Teljes leírás

Elmentve itt :

Bibliográfiai részletek
Szerző:	Szepesvári Csaba
Dokumentumtípus:	Cikk
Megjelent:	1998
Sorozat:	Acta cybernetica 13 No. 3
Kulcsszavak:	Számítástechnika, Kibernetika
Tárgyszavak:	Természettudományok Számítás- és információtudomány
Online Access:	http://acta.bibl.u-szeged.hu/12592


LEADER	01080nab a2200217 i 4500
001	acta12592
005	20220613155617.0
008	161015s1998 hu o 0\|\| eng d
022			\|a 0324-721X
040			\|a SZTE Egyetemi Kiadványok Repozitórium \|b hun
041			\|a eng
100	1		\|a Szepesvári Csaba
245	1	0	\|a Non-Markovian policies in sequential decision problems \|h [elektronikus dokumentum] / \|c Szepesvári Csaba
260			\|c 1998
300			\|a 305-318
490	0		\|a Acta cybernetica \|v 13 No. 3
520	3		\|a In this article we prove the validity of the Bellman Optimality Equation and related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that also nonMarkovian policies are taken into account. The theory is motivated by some experiments with a learning robot.
650		4	\|a Természettudományok
650		4	\|a Számítás- és információtudomány
695			\|a Számítástechnika, Kibernetika
856	4	0	\|u http://acta.bibl.u-szeged.hu/12592/1/cybernetica_013_numb_003_305-318.pdf \|z Dokumentum-elérés

Non-Markovian policies in sequential decision problems

Hasonló tételek