Competitive Fragmentation Modeling of Mass Spectra for Metabolite Identification

dc.contributor.advisorGreiner, Russell (Computing Science)
dc.contributor.authorAllen, Felicity R
dc.contributor.otherSchuurmans, Dale (Computing Science)
dc.contributor.otherNeumann, Steffen (Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry – IPB Halle)
dc.contributor.otherWishart, David (Computing Science)
dc.contributor.otherGreiner, Russell (Computing Science)
dc.contributor.otherHarynuk, James (Chemistry)
dc.date.accessioned2025-05-29T08:19:08Z
dc.date.available2025-05-29T08:19:08Z
dc.date.issued2016-06
dc.description.abstractOne of the key obstacles to the effective use of mass spectrometry (MS) in high throughput metabolomics is the difficulty in interpreting measured spectra to accurately and efficiently identify metabolites. Traditional methods for automated metabolite identification compare the target MS spectrum to spectra of known molecules in a reference database, ranking candidate molecules based on the closeness of the spectral match. However the limited coverage of available databases has led to interest in computational methods for generating accurate reference MS spectra from chemical structures. This is the target application for this work. My main research contribution is to propose a method for spectrum prediction, which we call Competitive Fragmentation Modeling (CFM). I demonstrate that this method works effectively for both electron ionization (EI)-MS and electrospray tandem MS (ESI-MS/MS). It uses a probabilistic generative model for the fragmentation processes occurring in a mass spectrometer, and a machine learning approach to learn parameters for this model from data. CFM has been used in both a spectrum prediction task (ie, predicting the mass spectrum from a chemical structure), and in a putative metabolite identification task (ranking possible structures for a target spectrum). In the spectrum prediction task, CFM showed improved performance when compared to a full enumeration of all peaks corresponding to all substructures of the molecule. In the metabolite identification task, CFM obtained substantially better rankings for the correct candidate than existing methods. As further validation, this method won the structure identification category of the international Critical Assessment of Small Molecule Identification (CASMI) 2014 competition. The method is also available for general use via a web interface.
dc.identifier.doihttps://doi.org/10.7939/R3SF2MH4B
dc.language.isoen
dc.rightsThis thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.
dc.subjectMetabolite identification
dc.subjectMachine learning
dc.subjectMass-spectrometry
dc.titleCompetitive Fragmentation Modeling of Mass Spectra for Metabolite Identification
dc.typehttp://purl.org/coar/resource_type/c_46ec
thesis.degree.grantorhttp://id.loc.gov/authorities/names/n79058482
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy
ual.date.graduationSpring 2016
ual.departmentDepartment of Computing Science
ual.jupiterAccesshttp://terms.library.ualberta.ca/public

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Allen_Felicity_R_201601_PhD.pdf
Size:
2.18 MB
Format:
Adobe Portable Document Format