Algebraic stability indicators for ranked lists in molecular profiling

Jurman, Giuseppe; Merler, Stefano; Barla, A.; Paoli, S.; Galea, A.; Furlanello, Cesare

Motivation: We propose a method for studying the stability of biomarker lists obtained from functional genomics studies. It is common to adopt resampling methods to tune and evaluate markerbased diagnostic and prognostic systems in order to prevent selection bias. Such caution promotes honest estimation of class prediction, but leads to alternative sets of solutions. In microarray studies, the difference in lists may be bewildering, also due to the presence of modules of functionally related genes. Methods for assessing stability understand the dependency of the markers on the data or on the predictor's type and help selecting solutions. Results: A computational framework for comparing sets of ranked biomarker lists is presented. Notions and algorithms are based on concepts from permutation group theory. We introduce several algebraic indicators and metric methods for symmetric groups, including the Canberra distance, a weighted version of Spearman's footrule. We also consider distances between partial lists and an aggregation of sets of lists into an optimal list based on voting theory (Borda count). The stability indicators are applied in practical situations to several synthetic, cancer microarray and proteomics datasets. The addressed issues are predictive classification, presence of modules, comparison of alternative biomarker lists, outlier removal, control of selection bias by randomization techniques, enrichment analysis.

Algebraic stability indicators for ranked lists in molecular profiling

Jurman, Giuseppe;Merler, Stefano;A. Barla;S. Paoli;A. Galea;Furlanello, Cesare

2008-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

Algebraic stability indicators for ranked lists in molecular profiling

Jurman, Giuseppe;Merler, Stefano;A. Barla;S. Paoli;A. Galea;Furlanello, Cesare

2008-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)