Integration of Differential Gene-combination Search and Gene Set Enrichment Analysis: A General Approach
Loading...
View/Download File
Persistent link to this item
Statistics
View StatisticsJournal Title
Journal ISSN
Volume Title
Title
Integration of Differential Gene-combination Search and Gene Set Enrichment Analysis: A General Approach
Alternative title
Published Date
2009-12-30
Publisher
Type
Report
Abstract
Motivation: Gene Set Enrichment Analysis (GSEA) and its variations aim to discover collections of genes that show moderate but coordinated differences in expression. However, such techniques may be ineffective if many individual genes in a phenotype-related gene set have weak discriminative power. A potential solution is to search for combinations of genes that are highly differentiating even when individual genes are not. Although such techniques have been developed, these approaches have not been used with GSEA to any significant degree because of the large number of potential gene combinations and the heterogeneity of measures that assess the differentiation provided by gene groups of different sizes.
Results: To integrate the search for differentiating gene combinations and GSEA, we propose a general framework with two key components: (A) a procedure that reduces the number of scores to be handled by GSEA to the number of genes by summarizing the scores of the gene combinations involving a particular gene in a single score, and (B) a procedure to integrate the heterogeneous scores from combinations of different sizes and from different gene combination measures by mapping the scores to p-values. Experiments on four gene expression data sets demonstrate that the integration of GSEA and gene combination search can enhance the power of traditional GSEA by discovering gene sets that include genes with weak individual differentiation but strong joint discriminative power. Also, gene sets discovered by the integrative framework share several common biological processes and improve the consistency of the results among three lung cancer data sets.
Availability: Source code and datasets: http://vk.cs.umn.edu/ICG/.
Contact: gangfang@cs.umn.edu
Keywords
Description
Related to
Replaces
License
Series/Report Number
Technical Report; 09-031
Funding information
Isbn identifier
Doi identifier
Previously Published Citation
Other identifiers
Suggested citation
Fang, Gang; Steinbach, Michael; Myers, Chad L.; Kumar, Vipin. (2009). Integration of Differential Gene-combination Search and Gene Set Enrichment Analysis: A General Approach. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/215818.
Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.