The null hypothesis of GSEA, and a novel statistical model for competitive gene set analysis

Research output: Contribution to journalJournal articleResearchpeer-review

Abstract

MOTIVATION: Competitive gene set analysis intends to assess whether a specific set of genes is more associated with a trait than the remaining genes. However, the statistical models assumed to date to underly these methods do not enable a clear cut formulation of the competitive null hypothesis. This is a major handicap to the interpretation of results obtained from a gene set analysis.

RESULTS: This work presents a hierarchical statistical model based on the notion of dependence measures, which overcomes this problem. The two levels of the model naturally reflect the modular structure of many gene set analysis methods. We apply the model to show that the popular GSEA method, which recently has been claimed to test the self-contained null hypothesis, actually tests the competitive null if the weight parameter is zero. However, for this result to hold strictly, the choice of the dependence measures underlying GSEA and the estimators used for it is crucial.
Original languageEnglish
JournalBioinformatics
Volume33
Issue number9
Pages (from-to)1271-1277
ISSN1367-4811
DOIs
Publication statusPublished - 2017

Keywords

  • Gene expression
  • Genome analysis
  • Mathematical modelling
  • Pathways
  • Statistics

Fingerprint

Dive into the research topics of 'The null hypothesis of GSEA, and a novel statistical model for competitive gene set analysis'. Together they form a unique fingerprint.

Cite this