Factored temporal difference learning in the new ties environment

QR kód

Factored temporal difference learning in the new ties environment

Although reinforcement learning is a popular method for training an agent for decision making based on rewards, well studied tabular methods are not applicable for large, realistic problems. In this paper, we experiment with a factored version of temporal difference learning, which boils down to a l...

Teljes leírás

Elmentve itt :

Bibliográfiai részletek
Szerzők:	Gyenes Viktor Bontovics Ákos Lőrincz András
Testületi szerző:	Symposium of Young Scientists on Intelligent Systems (2.) (2007) (Budapest)
Dokumentumtípus:	Cikk
Megjelent:	2008
Sorozat:	Acta cybernetica 18 No. 4
Kulcsszavak:	Számítástechnika, Kibernetika
Tárgyszavak:	Természettudományok Számítás- és információtudomány
Online Access:	http://acta.bibl.u-szeged.hu/12840

Hasonló tételek

Learning in a virtual environment
Szerző: Hampel György
Megjelent: (2014)

Creating a Virtual Learning Environment
Szerző: Hampel György, et al.
Megjelent: (2014)

Gossip-Based Machine Learning in Fully Distributed Environments
Szerző: Hegedűs István
Megjelent: (2017)

Factored value iteration converges
Szerző: Szita István, et al.
Megjelent: (2008)

A new concept of effective regression test generation in a C++ specific environment
Szerző: Biczó Mihály, et al.
Megjelent: (2008)

Modular reinforcement learning a case study in a robot domain /
Szerző: Kalmár Zsolt, et al.
Megjelent: (2000)

InfoMax Bayesian learning of the Furuta pendulum
Szerző: Jeni László A., et al.
Megjelent: (2008)

Building Context-Dependent DNN Acoustic Models using Kullback-Leibler Divergence-Based State Tying
Szerző: Gosztolya Gábor, et al.
Megjelent: (2015)

Module based reinforcement learning for a real robot [abstract] /
Szerző: Kalmár Zsolt, et al.
Megjelent: (1998)

Temporal speech parameters detect mild cognitive impairment in different languages validation and comparison of the Speech-GAP Test® in English and Hungarian /
Szerző: Kálmán János, et al.
Megjelent: (2022)

DRM systems in wireless environment [abstract] /
Szerző: Móga Rita, et al.
Megjelent: (2006)

Temporal logic with cyclic counting and the degree of aperiodicity of finite automata
Szerző: Ésik Zoltán, et al.
Megjelent: (2003)

Learning decision trees in continuous space [abstract] /
Szerző: Zsiros Ákos
Megjelent: (2000)

About the axiomatization of first- and second-order spatio-temporal logics [abstract] /
Szerző: Vályi Sándor
Megjelent: (2000)

Learning decision trees in continuous space
Szerző: Dombi József, et al.
Megjelent: (2001)

Task allocation possibilities in simulated Fog environments
Szerző: Márkus András
Megjelent: (2020)

Application of learning methods in MCDA models overview and experimental comparison : [abstract] /
Szerző: Zsiros Ákos
Megjelent: (2002)

Prototype environment for refactoring clean programs [abstract] /
Szerző: Szabó-Nacsa Rozália, et al.
Megjelent: (2004)

Graphical web application development environment [abstract] /
Szerző: Székely István
Megjelent: (2004)

Global/nonlinear optimization in modeling environments [abstract] /
Szerző: Pintér János D.
Megjelent: (2002)

Different aspects in the quantification of the Sky View Factor in complex environments
Szerző: Hämmerle Martin, et al.
Megjelent: (2014)

Navigation of simulated mobile robots in the Webots environment [abstract] /
Szerző: Szabó Richárd
Megjelent: (2002)

LL frame system of learning methods [abstract] /
Szerző: Hócza András, et al.
Megjelent: (2002)

Rewarding misclassifications in oblique decision tree learning [abstract] /
Szerző: Salamon András
Megjelent: (2002)

Factorizations of languages and commutativity conditions
Szerző: Mateescu Alexandru, et al.
Megjelent: (2002)

IPv6 macromobility simulation using OMNeT++ environment [abstract] /
Szerző: Imre Sándor, et al.
Megjelent: (2002)

Joint optimization of spectro-temporal features and deep neural nets for robust automatic speech recognition
Szerző: Kovács György, et al.
Megjelent: (2015)

Mooc vs. traditional learning Possibilities and weaknesses of e-learning sites /
Szerző: Kőrösi Gábor
Megjelent: (2016)

A programming environment for a transputer-based multiprocessor system
Szerző: Aspnäs Mats, et al.
Megjelent: (1990)

Modelling of a communication system evolving in a random environment
Szerző: Sztrik János, et al.
Megjelent: (1991)

tiéd minden [vers] /
Szerző: Tóth Erzsébet
Megjelent: (2012)

Test component assignment and scheduling in a load testing environment [abstract] /
Szerző: Bozóki Ferenc, et al.
Megjelent: (2008)

Developing applications for testing left-handed people in virtual environments [abstract] /
Szerző: Umenhoffer Tamás, et al.
Megjelent: (2004)

Parameter learning algorithms in online scheduling [abstract] /
Szerző: Németh Tibor, et al.
Megjelent: (2008)

Pedagogical considerations in an e-learning framework [abstract] /
Szerző: Muhi Dániel
Megjelent: (2004)

New methods in tele-cardiology [abstract] /
Szerző: Balázs Gábor, et al.
Megjelent: (2002)

Location-aware Task Allocation Strategies for IoT-Fog-Cloud Environments
Szerző: Márkus András, et al.
Megjelent: (2021)

Development of a communication environment between IPv6 and IPv4
Szerző: Fóris Gábor, et al.
Megjelent: (2002)

Optimizing Branching Strategies in Mono- and Multi-Repository Environments A Comprehensive Analysis /
Szerző: Shakikhanli Ulvi, et al.
Megjelent: (2024)

Joint Optimization of Spectro-Temporal Features and Deep Neural Nets for Robust Automatic Speech Recognition
Szerző: Kovács György, et al.
Megjelent: (2015)