Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Reinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.
The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.
Les informations fournies dans la section « Synopsis » peuvent faire référence à une autre édition de ce titre.
Richard S. Sutton is Professor of Computing Science and AITF Chair in Reinforcement Learning and Artificial Intelligence at the University of Alberta, and also Distinguished Research Scientist at DeepMind.
Andrew G. Barto is Professor Emeritus in the College of Computer and Information Sciences at the University of Massachusetts Amherst.
Les informations fournies dans la section « A propos du livre » peuvent faire référence à une autre édition de ce titre.
Vendeur : Anybook.com, Lincoln, Royaume-Uni
Etat : Good. This is an ex-library book and may have the usual library/used-book markings inside.This book has hardback covers. In good all round condition. Dust jacket in fair condition. Please note the Image in this listing is a stock photo and may not match the covers of the actual item,900grams, ISBN:9780262193986. N° de réf. du vendeur 4315703
Quantité disponible : 1 disponible(s)
Vendeur : ReviBlio, Barcelona, B, Espagne
Condition: 15 pages with some highlighted text, the rest excellent. The book provides a clear and simple account of the key ideas and algorithms in this area of artificial intelligence, where an agent learns to maximize a cumulative reward by interacting with a complex, uncertain environment. It covers the history of the field's intellectual foundations and proceeds to the core algorithms and concepts, including: The Reinforcement Learning Problem framed in terms of Markov Decision Processes (MDPs). Basic Solution Methods like Dynamic Programming, Monte Carlo methods, and the influential Temporal-Difference (TD) learning (e.g., Q-learning and SARSA). Function Approximation for handling large state spaces, including the use of artificial neural networks. More advanced topics like policy-gradient methods and a discussion of RL's relationships to psychology and neuroscience. Often referred to as the "bible" of the field, it is a foundational text suitable for students, researchers, and practitioners with a basic understanding of probability. N° de réf. du vendeur ABE-1760107744142
Quantité disponible : 1 disponible(s)
Vendeur : YESIBOOKSTORE, MIAMI, FL, Etats-Unis
hardcover. Etat : As New. N° de réf. du vendeur 0262193981-VB
Quantité disponible : 1 disponible(s)
Vendeur : Buchpark, Trebbin, Allemagne
Etat : Gut. Zustand: Gut | Seiten: 344 | Sprache: Englisch | Produktart: Bücher | Keine Beschreibung verfügbar. N° de réf. du vendeur 1509267/203
Quantité disponible : 4 disponible(s)
Vendeur : GoldBooks, Denver, CO, Etats-Unis
Etat : new. N° de réf. du vendeur 12W50_51_0262193981
Quantité disponible : 1 disponible(s)