Reinforcement learning hyper-heuristics for optimisation

Alanazi, Fawaz

Reinforcement learning hyper-heuristics for optimisation

Tools

Alanazi, Fawaz (2017) Reinforcement learning hyper-heuristics for optimisation. PhD thesis, University of Nottingham.

Preview

PDF (Thesis - as examined) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Available under Licence All Rights Reserved.
Download (1MB) | Preview

Abstract

Hyper-heuristics are search algorithms which operate on a set of heuristics with the goal of solving a wide range of optimisation problems. It has been observed that different heuristics perform differently between different optimisation problems. A hyper-heuristic combines a set of predefined heuristics, and applies a machine learning technique to predict which heuristic is the most suitable to apply at a given point in time while solving a given problem. A variety of machine learning techniques have been proposed in the literature. Most of the existing machine learning techniques are reinforcement learning mechanisms interacting with the search environment with the goal of adapting the selection of heuristics during the search process. The literature on the theoretical foundation of reinforcement learning hyper-heuristics is almost nonexisting. This work provides theoretical analyses of reinforcement learning hyper-heuristics. The goal is to shed light on the learning capabilities and limitations of reinforcement learning hyper-heuristics. This improves our understanding of these hyper-heuristics, and aid the design of better reinforcement learning hyper-heuristics. It is revealed that the commonly used additive reinforcement learning mechanism, under a mild assumption, chooses asymptotically heuristics uniformly at random. This thesis also proposes the problem of identifying the most suitable heuristic with a given error probability. We show a general lower bound on the time that "every" reinforcement learning hyper-heuristic needs to identify the most suitable heuristic with a given error probability. The results reveal a general limitation to learning achieved by this computational approach. Following our theoretical analysis, different reusable and easyto-implement reinforcement learning hyper-heuristics are proposed in this thesis. The proposed hyper-heuristics are evaluated on well-known combinatorial optimisation problems. One of the proposed reinforcement learning hyper-heuristics outperformed a state-of-the-art algorithm on several benchmark problems of the well-known CHeSC 2011.

Item Type:	Thesis (University of Nottingham only) (PhD)
Supervisors:	Lehre, Per Kristian Ozcan, Ender
Keywords:	Hyper-heuristics, Reinforcement Learning, Runtime Analysis
Subjects:	Q Science > QA Mathematics > QA 75 Electronic computers. Computer science
Faculties/Schools:	UK Campuses > Faculty of Science > School of Computer Science
Item ID:	42204
Depositing User:	Alanazi, Fawaz
Date Deposited:	18 Jul 2017 04:40
Last Modified:	28 Feb 2025 13:45
URI:	https://eprints.nottingham.ac.uk/id/eprint/42204

Actions (Archive Staff Only)

Edit View

LoginAdmin