Fall 2025 theses and dissertations (non-restricted) will be available in ERA on November 17, 2025.

Deep Interpretable Modelling

Loading...
Thumbnail Image

Institution

http://id.loc.gov/authorities/names/n79058482

Degree Level

Doctoral

Degree

Doctor of Philosophy

Department

Department of Computing Science

Supervisor / Co-Supervisor and Their Department(s)

Citation for Previous Publication

Link to Related Item

Abstract

The recent success of deep neural networks has exposed the problem of model transparency. The need for explainability is particularly critical in sensitive domains. In addition, regulatory frameworks for the “responsible” deployment of AI are emerging, creating legal requirements for transparent, explainable models. There are many approaches to explainability, including the distinction between top-down methods. Such as adapting existing logical models of explainability to deep learning and bottom-up methods (e.g., augmenting the “semantics-free” networks with fragments of semantic components). However, there is the challenge of how a deep network can learn multi-level representations or create explanation support on demand when requested. Here we describe our development and experiments with building interpretable deep neural networks for Natural Language Processing (NLP). We focus on learning interpretable representations to generate reliable explanations that give users a deeper understanding of the model’s behavior. These representations offer feature attribution, contrastive, and hierarchical explanations. We also show the effectiveness of our approach to model distillation and rationale extraction.

Item Type

http://purl.org/coar/resource_type/c_46ec

Alternative

License

Other License Text / Link

This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.

Language

en

Location

Time Period

Source