Structural integrity management via hierarchical resource allocation and continuous-control reinforcement learning

Metwally, Ziead; ICASP14; Andriotis, Charalampos

dc.contributor.author	Metwally, Ziead
dc.contributor.author	ICASP14
dc.contributor.author	Andriotis, Charalampos
dc.date.accessioned	2023-08-03T14:02:08Z
dc.date.available	2023-08-03T14:02:08Z
dc.date.issued	2023
dc.identifier.citation	Charalampos Andriotis, Ziead Metwally, Structural integrity management via hierarchical resource allocation and continuous-control reinforcement learning, 14th International Conference on Applications of Statistics and Probability in Civil Engineering (ICASP14), Dublin, Ireland, 2023.
dc.description	PUBLISHED
dc.description.abstract	Maintenance planning of engineering systems is typically posed as a discrete stochastic optimal control problem, as it refers to determining a series of distinct interventions that upkeep structural integrity. Advanced algorithmic schemes within the joint framework of Partially Observable Markov Decision Processes (POMDPs) and multi-agent Deep Reinforcement Learning (DRL) have been recently able to approximate well global optima for this complex problem, outperforming existing time- and condition-based decision strategies. Integral to their success is the hypothesis that system components represent individual agents who form cooperative policies to minimize a central life-cycle cost. Thereby, the policy output scales linearly with the number of components, alleviating the curse of dimensionality related to combinatorial choices. State complexity and long-term optimality are handled efficiently via deep learning and POMDP principles, respectively. However, the efficiency of multi-agent coordination can fade as the number of agents increases. To this end, we propose a new formulation: we pose the problem as a continuous-control dynamic resource allocation one, combining hierarchical DRL and mixed-integer programming. Moving from flat decentralized to hierarchical multi-agent decompositions allows us to improve further the policy output scalability. The new Adaptive Knapsack Hierarchical Resource Allocator (AK-HRA) DRL architecture distributes available resources within the system, creating local, independently solvable, multi-choice knapsack optimization problems. By design, AK HRA allows decision-makers to inscribe known hierarchical structures and local decision rules in their architectures, thereby enhancing control and interpretability over the solution space. The efficacy of the new approach is demonstrated in a multi-component reliability system subject to stochastic deterioration.
dc.language.iso	en
dc.relation.ispartofseries	14th International Conference on Applications of Statistics and Probability in Civil Engineering(ICASP14)
dc.rights	Y
dc.title	Structural integrity management via hierarchical resource allocation and continuous-control reinforcement learning
dc.title.alternative	14th International Conference on Applications of Statistics and Probability in Civil Engineering(ICASP14)
dc.type	Conference Paper
dc.type.supercollection	scholarly_publications
dc.type.supercollection	refereed_publications
dc.rights.ecaccessrights	openAccess
dc.identifier.uri	http://hdl.handle.net/2262/103609

Files in this item

Name:: submission_476.pdf
Size:: 1.083Mb
Format:: PDF

View/Open

Name:: license.txt
Size:: 3.534Kb
Format:: Text file

View/Open

This item appears in the following Collection(s)

ICASP14
14th International Conference on Application of Statistics and Probability in Civil Engineering

Show simple item record

Browse

My Account

Structural integrity management via hierarchical resource allocation and continuous-control reinforcement learning

Files in this item

This item appears in the following Collection(s)