An Efficient and Accurate Numerical Determination of the Cluster Resolution Metric in Two Dimensions

Abstract

Description

Cluster resolution (CR) is a useful metric for guiding automated feature selection of classification models. CR is a measure of class separation in a linear subspace for variable subsets via the determination of maximal, non‐intersecting confidence ellipses. Feature selection by cluster resolution (FS‐CR) is most commonly used to extract panels of useful, discriminating features from sparsely populated chromatographic peak tables, optimizing models from raw signals, or when working with datasets with many more variables than samples. The absence of a numerical method for calculating CR necessitates a great deal of dynamic programming and algorithmic complexity. In this work, we present a numerical determination of the CR metric, which reduces computation time by about 65 times when compared with the dynamic programming approach and simplifies the operating principles of FS‐CR algorithm.

Item Type

http://purl.org/coar/resource_type/c_6501 http://purl.org/coar/version/c_970fb48d4fbd8a85

Alternative

Other License Text / Link

Language

en

Location

Time Period

Source