TY - GEN
T1 - Gene prioritization via weighted Kendall rank aggregation
AU - Kim, Minji
AU - Raisali, Fardad
AU - Farnoud, Farzad
AU - Milenkovic, Olgica
PY - 2013
Y1 - 2013
N2 - Gene prioritization is a class of methods for discovering genes implicated in the onset and progression of a disease. As candidate genes are ranked based on similarity to known disease genes according to different set of criteria, the overall aggregation of these ranked datasets is a vital step of the prioritization procedure. Aggregation of different lists of ordered genes is accomplished either via classical order statistics analysis or via combinatorial ordinal data fusion. We propose a novel approach to combinatorial gene prioritization via Linear Programming (LP) optimization and use the recently introduced weighted Kendall τ distance to assess similarities between rankings. The weighted Kendall τ distance allows for constructing aggregates that have higher accuracy at the top of the ranking, usually tested experimentally, and it can also accommodate ties in rankings and handle negative outliers. In addition, the Kendall distance does not use quantitative data which in many instances may be unreliable. We illustrate the performance of the prioritization method on a set of test genes pertaining to the Bardet-Biedl syndrome, schizophrenia, and HIV and show that the combinatorial method matches or outperforms state-of-the art algorithms such as ToppGene.
AB - Gene prioritization is a class of methods for discovering genes implicated in the onset and progression of a disease. As candidate genes are ranked based on similarity to known disease genes according to different set of criteria, the overall aggregation of these ranked datasets is a vital step of the prioritization procedure. Aggregation of different lists of ordered genes is accomplished either via classical order statistics analysis or via combinatorial ordinal data fusion. We propose a novel approach to combinatorial gene prioritization via Linear Programming (LP) optimization and use the recently introduced weighted Kendall τ distance to assess similarities between rankings. The weighted Kendall τ distance allows for constructing aggregates that have higher accuracy at the top of the ranking, usually tested experimentally, and it can also accommodate ties in rankings and handle negative outliers. In addition, the Kendall distance does not use quantitative data which in many instances may be unreliable. We illustrate the performance of the prioritization method on a set of test genes pertaining to the Bardet-Biedl syndrome, schizophrenia, and HIV and show that the combinatorial method matches or outperforms state-of-the art algorithms such as ToppGene.
UR - https://www.scopus.com/pages/publications/84894147569
UR - https://www.scopus.com/pages/publications/84894147569#tab=citedBy
U2 - 10.1109/CAMSAP.2013.6714038
DO - 10.1109/CAMSAP.2013.6714038
M3 - Conference contribution
AN - SCOPUS:84894147569
SN - 9781467331463
T3 - 2013 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, CAMSAP 2013
SP - 184
EP - 187
BT - 2013 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, CAMSAP 2013
T2 - 2013 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, CAMSAP 2013
Y2 - 15 December 2013 through 18 December 2013
ER -