Network-based support vector machines for classification of microarray gene expression data.

Published Date

2009-09

Type

Thesis or Dissertation

Abstract

The importance of network-based approaches to identifying biological markers for diagnostic classification and prognostic assessment in the context of microarray data has been increasingly recognized. Standard methods treat all genes independently and identically a priori and ignore the biological observation that genes function together in biological processes. For binary classification, we are motivated to improve predictive accuracy and gene selection by developing novel network-based classification tools that explicitly incorporate the interrelationships of genes as described by gene networks. We propose three network-based support vector machines (SVMs), each obtained by suitably forming the penalty term. The neighboring-gene (NG) penalty groups pairwise gene neighbors and sums the L1-norm of each group over the entire network, leading to NG-SVM, which tends to select pairs of neighboring genes. The disease-gene-centric (DGC) penalty is constructed on groups defined by an upper-lower hierarchy imposed on the undirected network; the resulting DGC-SVM aims to detect collectives of genes clustering together and around some key disease genes. The truncated L1-norm (TL1) penalty is intended to correct the bias induced by penalization, through a threshold parameter C > 0 built into the L1-norm as used in NG-SVM and DGC-SVM. Simulation studies and real data applications demonstrate that the proposed methods capture more disease genes and fewer noise genes than two popular existing methods, the standard SVM and L1-SVM. We conclude that the proposed methods have the potential to be effective classification tools for microarrays and other high-dimensional data.
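As a rough illustration of the NG penalty as the abstract words it, the sketch below sums the L1-norm of each pair of neighboring genes over the edges of a network. It is a minimal sketch under stated assumptions, not the dissertation's implementation: the function names, the edge-list representation, and the absence of any per-gene weights are all hypothetical. In particular, applying the threshold C to each group's norm via `min(norm, C)` is only one plausible reading of "truncated L1-norm"; the abstract does not give the exact form.

```python
from typing import Sequence, Tuple

Edge = Tuple[int, int]  # a pair of gene indices joined in the network


def ng_penalty(beta: Sequence[float], edges: Sequence[Edge]) -> float:
    """Sum, over every edge (j, k), the L1-norm of the pair (beta[j], beta[k]).

    Hypothetical helper: follows the abstract's wording only, with no
    per-gene weighting.
    """
    return sum(abs(beta[j]) + abs(beta[k]) for j, k in edges)


def truncated_ng_penalty(
    beta: Sequence[float], edges: Sequence[Edge], c: float
) -> float:
    """Cap each group's L1-norm at the threshold c > 0.

    Assumption: truncation is applied per group as min(norm, c); the
    abstract does not specify where the threshold enters.
    """
    if c <= 0:
        raise ValueError("threshold c must be positive")
    return sum(min(abs(beta[j]) + abs(beta[k]), c) for j, k in edges)


# Toy example: four genes on a chain network 0 - 1 - 2 - 3.
toy_beta = [1.0, -2.0, 0.0, 0.5]
toy_edges = [(0, 1), (1, 2), (2, 3)]
untruncated = ng_penalty(toy_beta, toy_edges)
truncated = truncated_ng_penalty(toy_beta, toy_edges, c=1.5)
```

In this toy setting, a gene with several neighbors (such as gene 1) contributes to more than one group, so its coefficient is penalized once per edge. With the cap in place, groups whose norm already exceeds C add only C, which is the sense in which truncation limits the penalty on large coefficients.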

Description

University of Minnesota Ph.D. dissertation. September 2009. Major: Biostatistics. Advisor: Wei Pan. 1 computer file (PDF); xii, 98 pages.

Suggested citation

Zhu, Yanni. (2009). Network-based support vector machines for classification of microarray gene expression data. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/57042.
