Exploiting Environment Configurability In Reinforcement Learning eBook
Language: English
Publisher: SAGE PUBLICATIONS, December 2022
128,53€
10% CARD DISCOUNT
IMMEDIATE AVAILABILITY
eBook for Adobe Digital Editions (ADE)
SYNOPSIS
In recent decades, Reinforcement Learning (RL) has emerged as an effective approach to address complex control tasks. In a Markov Decision Process (MDP), the framework typically used, the environment is assumed to be a fixed entity that cannot be altered externally. There are, however, several real-world scenarios in which the environment can be modified to a limited extent.
This book, Exploiting Environment Configurability in Reinforcement Learning, aims to formalize and study diverse aspects of environment configuration. In a traditional MDP, the agent perceives the state of the environment and performs actions. As a consequence, the environment transitions to a new state and generates a reward signal. The agent's goal is to learn a policy, i.e., a prescription of actions that maximizes the long-term reward. Although environment configuration arises quite often in real applications, the topic has received very little attention in the literature. The contributions in the book are theoretical, algorithmic, and experimental and can be broadly subdivided into three parts. The first part introduces the novel formalism of Configurable Markov Decision Processes (Conf-MDPs) to model the configuration opportunities offered by the environment. The second part of the book focuses on the cooperative Conf-MDP setting and investigates the problem of finding an agent policy and an environment configuration that jointly optimize the long-term reward. The third part addresses two specific applications of the Conf-MDP framework: policy space identification and control frequency adaptation.
The book will be of interest to all those using RL as part of their work.
DETAILS
| Property | Description |
|---|---|
| ISBN: | 9781643683638 |
| Publisher: | SAGE PUBLICATIONS |
| Release Date: | December 2022 |
| Language: | English |
| Pages: | 376 |
| Product Type: | eBook |
| Format and Compatibility: | PDF for ADE |
| Series: | Frontiers In Artificial Intelligence And Applications (Ios Press) |
| Subject Classification: | eBooks in English > Exact and Natural Sciences > Mathematics; eBooks in English > Computing > Other Applications |
| EAN: | 9781643683638 |
BOOKS IN THE SAME SERIES
- New Trends In Intelligent Software Methodologies, Tools And Techniques, eBook, SAGE PUBLICATIONS, 162,18€ (10% off 180,20€)
- New Trends In Intelligent Software Methodologies, Tools And Techniques, eBook, SAGE PUBLICATIONS, 148,40€ (10% card discount)