Data Quality Assurance in Autonomous Driving Systems

Institution

University of Alberta

Degree Level

Master's

Degree

Master of Science

Department

Department of Electrical and Computer Engineering

Specialization

Software Engineering and Intelligent Systems

Abstract

In recent years, autonomous driving systems (ADSs) built on deep learning-based modules have attracted significant attention from several research communities, such as computer vision. These intelligent systems require a precise and accurate training process before they are deployed in real-life situations. The performance and reliability of ADSs depend on two important factors, the training dataset and the model components, each of which must be carefully considered. In most realistic cases, ADS models are released in black-box form, and access to their components (e.g., loss functions and hyper-parameters) is not granted; ensuring the quality of the samples in ADS training datasets is therefore of paramount importance. In this work, we focus on developing an efficient scheme for cleaning the training datasets of ADSs that employ deep image object detectors, by identifying the samples in the dataset with erroneous bounding boxes. To this end, we leverage the visual signals associated with the bounding boxes, in addition to their spatial coordinates, to accurately predict whether each bounding box is erroneous. Moreover, we incorporate confident learning into the proposed scheme to prune the predictions of the erroneous statuses of the bounding boxes, which further contributes to developing secure and reliable ADSs. The results of extensive experiments demonstrate the effectiveness of the various ideas employed in the design of the proposed erroneous bounding box detection scheme for ADS datasets. Further, the proposed scheme is shown to significantly outperform other state-of-the-art data selection methods in cleaning the training datasets of ADSs.
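
The abstract does not spell out how confident learning is applied, so the following is only a minimal sketch of the generic confident-learning idea, not the thesis's actual scheme. It assumes each bounding box has a given status label (0 = correct, 1 = erroneous, a convention chosen here for illustration) and a predicted probability per status. The function name `confident_label_issues` and the toy inputs are hypothetical.

```python
def confident_label_issues(labels, probs):
    """Flag samples whose given label disagrees with a confidently predicted class.

    labels: list of int class ids (the given, possibly noisy, labels)
    probs:  list of per-class probability lists, one per sample
    Returns a list of bools, True where a label issue is suspected.
    """
    n_classes = len(probs[0])

    # Per-class threshold: mean predicted probability of class j over the
    # samples whose given label is j. Classes with no samples get an
    # unreachable threshold (an assumption of this sketch).
    thresholds = []
    for j in range(n_classes):
        own = [p[j] for y, p in zip(labels, probs) if y == j]
        thresholds.append(sum(own) / len(own) if own else float("inf"))

    flags = []
    for y, p in zip(labels, probs):
        # Classes whose probability reaches that class's threshold.
        candidates = [j for j in range(n_classes) if p[j] >= thresholds[j]]
        if not candidates:
            # No confident class: leave the sample unflagged.
            flags.append(False)
            continue
        # Most probable class among the confident candidates.
        confident = max(candidates, key=lambda j: p[j])
        flags.append(confident != y)
    return flags


# Toy example: the third box is labelled "correct" (0) but is
# confidently predicted as "erroneous" (1).
labels = [0, 0, 0, 1, 1, 1]
probs = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9],
         [0.2, 0.8], [0.1, 0.9], [0.3, 0.7]]
flags = confident_label_issues(labels, probs)
```

Using per-class thresholds instead of a fixed 0.5 cut-off lets the flagging adapt to how confident the model typically is for each status, which is the main design point of confident learning.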

Item Type

http://purl.org/coar/resource_type/c_46ec

Other License Text / Link

This thesis is made available by the University of Alberta Libraries with permission of the copyright owner solely for non-commercial purposes. This thesis, or any portion thereof, may not otherwise be copied or reproduced without the written consent of the copyright owner, except to the extent permitted by Canadian copyright law.

Language

en
