Krein Space Methods for Structured Data

Redshaw, Joseph (2024) Krein Space Methods for Structured Data. PhD thesis, University of Nottingham.

[img]
Preview
PDF (corrected version) (Thesis - as examined) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Available under Licence Creative Commons Attribution.
Download (1MB) | Preview

Abstract

Learning from structured data, including sequences and graphs, is a significant and central challenge in machine learning that has far-reaching applications across numerous disciplines, including chemistry and biochemistry. Kernel methods are undoubtedly an essential tool in this challenge. Their use of a similarity function, known as a kernel function, facilitates learning complex relationships from data of arbitrary structure. However, many expressive notions of similarity are not valid kernel functions, meaning they are not applicable to standard kernel methods. Krein space methods are a potential solution to this problem, as they generalise kernel methods to a much larger class of similarity functions. In this thesis, we explore the application of Krein space methods to structured data. Focusing on problems in chemistry and biochemistry, in which structured data and domain-specific similarity measures are commonplace, we investigate to what extent Krein space methods can be utilised to develop supervised learning models for structured data. In particular, we develop models to identify translation initiation sites in nucleic acid sequences, predict the yield of a carbon-nitrogen cross coupling reaction and identify peptides exhibiting antimicrobial properties. We find that the resulting performance of the models is highly dependent on the choice of similarity function and that Krein space methods outperform standard kernel methods in some, but not all, of the domains considered.

Item Type: Thesis (University of Nottingham only) (PhD)
Supervisors: Hirst, Jonathan
Keywords: structured data, kernel methods, Kreın space methods, antimicrobial peptides
Subjects: Q Science > QA Mathematics > QA299 Analysis
Q Science > QD Chemistry > QD241 Organic chemistry > QD415 Biochemistry
Faculties/Schools: UK Campuses > Faculty of Science > School of Chemistry
Item ID: 78390
Depositing User: Redshaw, Joseph
Date Deposited: 24 Jul 2024 04:44
Last Modified: 24 Jul 2024 04:44
URI: https://eprints.nottingham.ac.uk/id/eprint/78390

Actions (Archive Staff Only)

Edit View Edit View