Direct Optimal Control using TD(λ) Mixtures of Experts

Chatwin, C R, Paraskevopoulos, V and Heywood, M I (2001) Direct Optimal Control using TD(λ) Mixtures of Experts. International Journal of Knowledge-Based Intelligent Engineering Systems, 5 (2). pp. 83-91. ISSN 1327-2314

Full text not available from this repository.

Abstract

Real-time control of continuous-valued plants using TD(λ) reinforcement learning is detailed. This problem is significantly more difficult than the case of a discrete control space, as in bang-bang control or Q-learning. The methodology employs a combination of Stochastic Real-Valued units, Mixtures of Experts and RBF partitioning. To do so, the significance of both Maximum-Likelihood and Square Error cost functions is emphasised, as is provision for RBF covariances during training. The resulting architecture is demonstrated on benchmark problems.
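
Since the full text is not available here, the following is only a minimal illustrative sketch of the TD(λ) ingredient named in the abstract, not the authors' architecture. It estimates state values on a toy one-dimensional random walk using normalised Gaussian RBF features and accumulating eligibility traces. The task, the constants (`CENTRES`, `WIDTH`, `N_POS`) and the function names are all assumptions introduced for illustration; the Stochastic Real-Valued units and Mixtures of Experts are deliberately left out.

```python
import math
import random

# All constants below are illustrative assumptions, not values from the paper.
CENTRES = [0.0, 0.25, 0.5, 0.75, 1.0]  # RBF centres over the state range [0, 1]
WIDTH = 0.2                             # shared RBF width
N_POS = 10                              # walk positions 0..N_POS; both ends terminal


def rbf_features(x):
    """Normalised Gaussian RBF activations for a scalar state x."""
    acts = [math.exp(-((x - c) ** 2) / (2.0 * WIDTH ** 2)) for c in CENTRES]
    total = sum(acts)
    return [a / total for a in acts]


def value(x, weights):
    """Linear value estimate: weighted sum of RBF activations."""
    return sum(w * p for w, p in zip(weights, rbf_features(x)))


def train(episodes=500, alpha=0.05, gamma=1.0, lam=0.8, seed=0):
    """TD(lambda) with accumulating eligibility traces on a toy random walk."""
    rng = random.Random(seed)
    weights = [0.0] * len(CENTRES)
    for _ in range(episodes):
        traces = [0.0] * len(CENTRES)
        pos = N_POS // 2
        while 0 < pos < N_POS:
            x = pos / N_POS
            phi = rbf_features(x)
            next_pos = pos + rng.choice((-1, 1))
            terminal = next_pos in (0, N_POS)
            # Reward 1 only when the walk exits at the right-hand end.
            reward = 1.0 if next_pos == N_POS else 0.0
            v_next = 0.0 if terminal else value(next_pos / N_POS, weights)
            delta = reward + gamma * v_next - value(x, weights)
            # Decay old traces, then add the gradient of the current estimate.
            traces = [gamma * lam * e + p for e, p in zip(traces, phi)]
            weights = [w + alpha * delta * e for w, e in zip(weights, traces)]
            pos = next_pos
    return weights
```

Calling `train()` and then `value(x, weights)` for a few states in [0, 1] gives estimates that rise towards the rewarded end of the walk. In a continuous-action controller of the kind the abstract describes, a TD error like `delta` would also drive the action-generating units, which this sketch does not attempt.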

Item Type: Article
Schools and Departments: School of Engineering and Informatics > Engineering and Design
Depositing User: Chris Chatwin
Date Deposited: 06 Feb 2012 19:56
Last Modified: 10 Jul 2012 11:59
URI: http://sro.sussex.ac.uk/id/eprint/23068