- Finite-Sample Analysis of Off-Protection TD-Learning by means of Generalized Bellman Operators(arXiv)
Creator : Zaiwei Chen, Siva Theja Maguluri, Sanjay Shakkottai, Karthikeyan Shanmugam
Abstract : In temporal distinction (TD) learning, off-policy sampling is considered additional smart than on-policy sampling, and by decoupling learning from data assortment, it permits data reuse. It’s acknowledged that protection evaluation (along with multi-step off-policy significance sampling) has the interpretation of fixing a generalized Bellman equation. On this paper, we derive finite-sample bounds for any primary off-policy TD-like stochastic approximation algorithm that solves for the fixed-point of this generalized Bellman operator. Our key step is to point that the generalized Bellman operator is concurrently a contraction mapping with respect to a weighted ℓp-norm for each p in [1,∞), with a common contraction factor. Off-policy TD-learning is known to suffer from high variance due to the product of importance sampling ratios. A number of algorithms (e.g. Qπ(λ), Tree-Backup(λ), Retrace(λ), and Q-trace) have been proposed in the literature to address this issue. Our results immediately imply finite-sample bounds of these algorithms. In particular, we provide first-known finite-sample guarantees for Qπ(λ), Tree-Backup(λ), and Retrace(λ), and improve the best known bounds of Q-trace in [19]. Moreover, we current the bias-variance trade-offs in each of these algorithms.
Thanks for being a valued member of the Nirantara household! We admire your continued help and belief in our apps.
If you have not already, we encourage you to obtain and expertise these incredible apps. Keep linked, knowledgeable, fashionable, and discover superb journey gives with the Nirantara household!
Thank you for being a valued member of the Nirantara family! We appreciate your continued support and trust in our apps.
- Nirantara Social - Stay connected with friends and loved ones. Download now: Nirantara Social
- Nirantara News - Get the latest news and updates on the go. Install the Nirantara News app: Nirantara News
- Nirantara Fashion - Discover the latest fashion trends and styles. Get the Nirantara Fashion app: Nirantara Fashion
- Nirantara TechBuzz - Stay up-to-date with the latest technology trends and news. Install the Nirantara TechBuzz app: Nirantara Fashion
- InfiniteTravelDeals24 - Find incredible travel deals and discounts. Install the InfiniteTravelDeals24 app: InfiniteTravelDeals24
If you haven't already, we encourage you to download and experience these fantastic apps. Stay connected, informed, stylish, and explore amazing travel offers with the Nirantara family!
Source link