館藏資源查詢 | 國立臺灣師範大學圖書館

檢索點：關鍵字： 限制可取得館藏

排序選項：

上一筆下一筆

作者	Patra, Sunandita
	ProQuest Information and Learning Co
	University of Maryland, College Park. Computer Science
書名	Acting, Planning, and Learning Using Hierarchical Operational Models
出版項	2020


說明	1 online resource (253 pages)
文字	text
無媒介	computer
成冊	online resource
附註	Source: Dissertations Abstracts International, Volume: 82-05, Section: B
	Advisor: Nau, Dana
	Thesis (Ph.D.)--University of Maryland, College Park, 2020
	Includes bibliographical references
	The most common representation formalisms for planning are descriptive models that abstractly describe what the actions do and are tailored for efficiently computing the next state(s) in a state-transition system. However, real-world acting requires operational models that describe how to do things, with rich control structures for closed-loop online decision-making in a dynamic environment. Use of a different action model for planning than the one used for acting causes problems with combining acting and planning, in particular for the development and consistency verification of the different models.As an alternative, this dissertation defines and implements an integrated acting-and-planning system in which both planning and acting use the same operational models, which are written in a general-purpose hierarchical task-oriented language offering rich control structures.The acting component, called Reactive Acting Engine (RAE), is inspired by the well-known PRS system, except that instead of being purely reactive, it can get advice from a planner. The dissertation also describes three planning algorithms which plan by doing several Monte Carlo rollouts in the space of operational models. The best of these three planners, Plan-with-UPOM uses a UCT-like Monte Carlo Tree Search procedure called UPOM (UCT Procedure for Operational Models), whose rollouts are simulated executions of the actor's operational models. The dissertation also presents learning strategies for use with RAE and UPOM that acquire from online acting experiences and/or simulated planning results, a mapping from decision contexts to method instances as well as a heuristic function to guide UPOM. The experimental results show that Plan-with-UPOM and the learning strategies significantly improve the acting efficiency and robustness of RAE. It can be proved that UPOM converges asymptotically by mapping its search space to an MDP. The dissertation also describes a real-world prototype of RAE and Plan-with-UPOM to defend software-defined networks, a relatively new network management architecture, against incoming attacks
	Electronic reproduction. Ann Arbor, Mich. : ProQuest, 2021
	Mode of access: World Wide Web
主題	Artificial intelligence
	Computer science
	Systems science
	Information technology
	Instructional design
	Acting and planning
	Dynamic environments
	Hierarchical operational models
	Online planning
	Supervised learning
	Descriptive models
	State-transition system
	Operational models
	Closed-loop online decision-making
	Reactive Acting Engine
	Network management architecture
	Electronic books.
	0800
	0984
	0489
	0447
	0790
ISBN/ISSN	9798678124920

上一筆下一筆

加入個人書庫回報書目問題轉入EndNote

館藏地	索書號	條碼	處理狀態