National Open Science Portal

Number of results: 3

A Unifying Perspective of Parametric Policy Search Methods for Markov Decision Processes

Thomas Furmston

Video and other learning materials

Tags: computer science; machine learning; markov processes; reinforcement learning

Parametric policy search algorithms are one of the methods of choice for the optimisation of Markov Decision Processes, with Expectation Maximisation and natural gradient ascent being considered the current state of the art in the field. In this article we provide a unifying perspective of these two ...

Year: 2012 Source: videolectures.net

Solving Deterministic Policy (PO)MDPs using

Thomas Furmston

Video and other learning materials

Tags:

The viewpoint of solving Markov Decision Processes and their partially observable extension refers to finding policies that maximise the expected reward. We follow the rephrasing of this problem as learning in a related probabilistic model. Our trans-dimensional distribution formulation obtain ...

Year: 2009 Source: videolectures.net

Lagrange Dual Decomposition for Finite Horizon Markov Decision Processes

Data & Web Mining Lab, Thomas Furmston

Video and other learning materials

Tags: computer science; machine learning; markov processes

Solving finite-horizon Markov Decision Processes with stationary policies is a computationally difficult problem. Our dynamic dual decomposition approach uses Lagrange duality to decouple this hard problem into a sequence of tractable sub-problems. The resulting procedure is a straightforward modifi ...

Year: 2011 Source: videolectures.net

Access to the knowledge of Slovenian research organisations