The Theoretical Foundation for Incremental Least-Squares Temporal Difference Learning

Loading...
Thumbnail Image

Date

Citation for Previous Publication

Link to Related Item

Abstract

Description

Technical report TR06-25. In this paper we present a mathematical foundation for Incremental Least-Squares Temporal Difference Learning (iLSTD) for policy evaluation in reinforcement learning with linear function approximation. iLSTD is an incremental method for achieving results similar to LSTD, the data-efficient, least-squares version of temporal difference learning, without incurring the full cost of the LSTD computation. Here, we give a technical foundation for the asymptotic properties of iLSTD. | TRID-ID TR06-25

Item Type

http://purl.org/coar/resource_type/c_93fc

Alternative

Other License Text / Link

Language

en

Location

Time Period

Source