Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires

Machine learning algorithms have increased tremendously in power in recent years but have yet to be fully utilized in many ecology and sustainable resource management domains such as wildlife reserve design, forest fire management and invasive species spread. One thing these domains have in common i...

Full description

Bibliographic Details
Main Author:	Ganapathi Subramanian, Sriram
Format:	Master Thesis
Language:	English
Published:	University of Waterloo 2018
Subjects:	Reinforcement Learning Machine Learning Deep Learning Spatially Spreading Processes Sustainability Forest Wildfire Management A3C Fort McMurray
Online Access:	http://hdl.handle.net/10012/13148

id	ftunivwaterloo:oai:uwspace.uwaterloo.ca:10012/13148
record_format	openpolar
spelling	ftunivwaterloo:oai:uwspace.uwaterloo.ca:10012/13148 2023-05-15T16:17:41+02:00 Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires Ganapathi Subramanian, Sriram 2018-04-19 http://hdl.handle.net/10012/13148 en eng University of Waterloo http://hdl.handle.net/10012/13148 Reinforcement Learning Machine Learning Deep Learning Spatially Spreading Processes Sustainability Forest Wildfire Management A3C Master Thesis 2018 ftunivwaterloo 2022-06-18T23:01:47Z Machine learning algorithms have increased tremendously in power in recent years but have yet to be fully utilized in many ecology and sustainable resource management domains such as wildlife reserve design, forest fire management and invasive species spread. One thing these domains have in common is that they contain dynamics that can be characterized as a Spatially Spreading Process (SSP) which requires many parameters to be set precisely to model the dynamics, spread rates and directional biases of the elements which are spreading. We introduce a novel approach for learning in SSP domains such as wild fires using Reinforcement Learning (RL) where fire is the agent at any cell in the landscape and the set of actions the fire can take from a location at any point in time includes spreading into any point in the 3 $\times$ 3 grid around it (including not spreading). This approach inverts the usual RL setup since the dynamics of the corresponding Markov Decision Process (MDP) is a known function for immediate wildfire spread. Meanwhile, we learn an agent policy for a predictive model of the dynamics of a complex spatially-spreading process. Rewards are provided for correctly classifying which cells are on fire or not compared to satellite and other related data. We use 3 demonstrative domains to prove the ability of our approach. The first one is a popular online simulator of a wildfire, the second domain involves a pair of forest fires in Northern Alberta which are the Fort McMurray fire of 2016 that led to an unprecedented evacuation of almost 90,000 people and the Richardson fire of 2011, and the third domain deals with historical Saskatchewan fires previously compared by others to a physics-based simulator. The standard RL algorithms considered on all the domains include Monte Carlo Tree Search, Asynchronous Advantage Actor-Critic (A3C), Deep Q Learning (DQN) and Deep Q Learning with prioritized experience replay. We also introduce a novel combination of Monte-Carlo Tree Search (MCTS) and A3C algorithms that ... Master Thesis Fort McMurray University of Waterloo, Canada: Institutional Repository Fort McMurray
institution	Open Polar
collection	University of Waterloo, Canada: Institutional Repository
op_collection_id	ftunivwaterloo
language	English
topic	Reinforcement Learning Machine Learning Deep Learning Spatially Spreading Processes Sustainability Forest Wildfire Management A3C
spellingShingle	Reinforcement Learning Machine Learning Deep Learning Spatially Spreading Processes Sustainability Forest Wildfire Management A3C Ganapathi Subramanian, Sriram Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires
topic_facet	Reinforcement Learning Machine Learning Deep Learning Spatially Spreading Processes Sustainability Forest Wildfire Management A3C
description	Machine learning algorithms have increased tremendously in power in recent years but have yet to be fully utilized in many ecology and sustainable resource management domains such as wildlife reserve design, forest fire management and invasive species spread. One thing these domains have in common is that they contain dynamics that can be characterized as a Spatially Spreading Process (SSP) which requires many parameters to be set precisely to model the dynamics, spread rates and directional biases of the elements which are spreading. We introduce a novel approach for learning in SSP domains such as wild fires using Reinforcement Learning (RL) where fire is the agent at any cell in the landscape and the set of actions the fire can take from a location at any point in time includes spreading into any point in the 3 $\times$ 3 grid around it (including not spreading). This approach inverts the usual RL setup since the dynamics of the corresponding Markov Decision Process (MDP) is a known function for immediate wildfire spread. Meanwhile, we learn an agent policy for a predictive model of the dynamics of a complex spatially-spreading process. Rewards are provided for correctly classifying which cells are on fire or not compared to satellite and other related data. We use 3 demonstrative domains to prove the ability of our approach. The first one is a popular online simulator of a wildfire, the second domain involves a pair of forest fires in Northern Alberta which are the Fort McMurray fire of 2016 that led to an unprecedented evacuation of almost 90,000 people and the Richardson fire of 2011, and the third domain deals with historical Saskatchewan fires previously compared by others to a physics-based simulator. The standard RL algorithms considered on all the domains include Monte Carlo Tree Search, Asynchronous Advantage Actor-Critic (A3C), Deep Q Learning (DQN) and Deep Q Learning with prioritized experience replay. We also introduce a novel combination of Monte-Carlo Tree Search (MCTS) and A3C algorithms that ...
format	Master Thesis
author	Ganapathi Subramanian, Sriram
author_facet	Ganapathi Subramanian, Sriram
author_sort	Ganapathi Subramanian, Sriram
title	Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires
title_short	Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires
title_full	Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires
title_fullStr	Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires
title_full_unstemmed	Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires
title_sort	reinforcement learning for determining spread dynamics of spatially spreading processes with emphasis on forest fires
publisher	University of Waterloo
publishDate	2018
url	http://hdl.handle.net/10012/13148
geographic	Fort McMurray
geographic_facet	Fort McMurray
genre	Fort McMurray
genre_facet	Fort McMurray
op_relation	http://hdl.handle.net/10012/13148
_version_	1766003576736317440

Reinforcement Learning for Determining Spread Dynamics of Spatially Spreading Processes with Emphasis on Forest Fires

Similar Items