Author: Patra, Sunandita
ProQuest Information and Learning Co
University of Maryland, College Park. Computer Science
Title: Acting, Planning, and Learning Using Hierarchical Operational Models
Publication: 2020
Description: 1 online resource (253 pages)
Content type: text
Media type: computer
Carrier type: online resource
Notes: Source: Dissertations Abstracts International, Volume: 82-05, Section: B
Advisor: Nau, Dana
Thesis (Ph.D.)--University of Maryland, College Park, 2020
Includes bibliographical references
The most common representation formalisms for planning are descriptive models that abstractly describe what the actions do and are tailored for efficiently computing the next state(s) in a state-transition system. However, real-world acting requires operational models that describe how to do things, with rich control structures for closed-loop online decision-making in a dynamic environment. Use of a different action model for planning than the one used for acting causes problems with combining acting and planning, in particular for the development and consistency verification of the different models. As an alternative, this dissertation defines and implements an integrated acting-and-planning system in which both planning and acting use the same operational models, which are written in a general-purpose hierarchical task-oriented language offering rich control structures. The acting component, called Reactive Acting Engine (RAE), is inspired by the well-known PRS system, except that instead of being purely reactive, it can get advice from a planner. The dissertation also describes three planning algorithms which plan by doing several Monte Carlo rollouts in the space of operational models. The best of these three planners, Plan-with-UPOM, uses a UCT-like Monte Carlo Tree Search procedure called UPOM (UCT Procedure for Operational Models), whose rollouts are simulated executions of the actor's operational models. The dissertation also presents learning strategies for use with RAE and UPOM that acquire, from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. The experimental results show that Plan-with-UPOM and the learning strategies significantly improve the acting efficiency and robustness of RAE. It can be proved, by mapping UPOM's search space to an MDP, that UPOM converges asymptotically.
The dissertation also describes a real-world prototype of RAE and Plan-with-UPOM to defend software-defined networks, a relatively new network management architecture, against incoming attacks.
Electronic reproduction. Ann Arbor, Mich. : ProQuest, 2021
Mode of access: World Wide Web
Subjects: Artificial intelligence
Computer science
Systems science
Information technology
Instructional design
Acting and planning
Dynamic environments
Hierarchical operational models
Online planning
Supervised learning
Descriptive models
State-transition system
Operational models
Closed-loop online decision-making
Reactive Acting Engine
Network management architecture
Electronic books.
0800
0984
0489
0447
0790
ISBN/ISSN: 9798678124920