Network-based support vector machines for classification of microarray gene expression data.
2009-09
Type
Thesis or Dissertation
Abstract
The importance of network-based approaches to identifying biological markers for diagnostic classification and prognostic assessment in the context of microarray data has been increasingly recognized. Standard methods treat all genes independently and identically a priori and ignore the biological observation that genes function together in biological processes. For binary classification, we are motivated to improve predictive accuracy and gene selection by developing novel network-based classification tools that explicitly incorporate the interrelationships of genes as described by gene networks.
We propose three network-based support vector machines (SVMs) by suitably forming the penalty term. The neighboring-gene (NG) penalty groups pairwise gene neighbors and sums the L1-norm of each group over the entire network, leading to NG-SVM, which tends to select pairs of neighboring genes. The disease-gene-centric (DGC) penalty is constructed on groups defined by an upper-lower hierarchy imposed on the undirected network; DGC-SVM aims to detect collectives of genes clustering together and around some key disease genes. The truncated L1-norm (TL1) penalty is intended to correct the bias induced by penalization, through a threshold parameter C > 0 built into the L1-norm as used in NG-SVM and DGC-SVM. Simulation studies and real data applications demonstrate that the proposed methods are able to capture more disease genes and fewer noise genes than the existing popular methods, standard SVM and L1-SVM. We conclude that the proposed methods have the potential to be effective classification tools for microarrays and other high-dimensional data.
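As a rough illustration of the neighboring-gene penalty described in the abstract, the sketch below sums the L1-norm of each pair of neighboring genes over the edges of a network. It is not taken from the dissertation: the function names, the edge-list representation, and in particular the way the threshold C is applied (capping each group's L1-norm at C) are assumptions made only for this example, and the actual TL1 formulation may differ.

```python
from typing import Iterable, Sequence, Tuple

Edge = Tuple[int, int]  # hypothetical representation: indices of two neighboring genes


def ng_penalty(weights: Sequence[float], edges: Iterable[Edge]) -> float:
    """Sum, over every edge (j, k), the L1-norm of the pair (w_j, w_k)."""
    return sum(abs(weights[j]) + abs(weights[k]) for j, k in edges)


def ng_penalty_truncated(
    weights: Sequence[float], edges: Iterable[Edge], c: float
) -> float:
    """Same sum, with each group's L1-norm capped at c.

    ASSUMPTION: truncation is applied per group as min(norm, c). The abstract
    only states that a threshold C > 0 is built into the L1-norm.
    """
    if c <= 0:
        raise ValueError("c must be positive")
    return sum(min(abs(weights[j]) + abs(weights[k]), c) for j, k in edges)


# Toy network: gene 1 is a neighbor of genes 0 and 2.
toy_weights = [0.5, -2.0, 0.0]
toy_edges = [(0, 1), (1, 2)]
plain = ng_penalty(toy_weights, toy_edges)
capped = ng_penalty_truncated(toy_weights, toy_edges, c=1.0)
```

In this toy case a gene with many neighbors contributes to many groups, so its coefficient is penalized once per edge; under the assumed truncation, groups whose norm already exceeds the threshold incur no additional penalty as their coefficients grow, which is one way a cap can reduce shrinkage bias on large coefficients.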
Description
University of Minnesota Ph.D. dissertation. September 2009. Major: Biostatistics. Advisor: Wei Pan. 1 computer file (PDF); xii, 98 pages.
Suggested citation
Zhu, Yanni. (2009). Network-based support vector machines for classification of microarray gene expression data. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/57042.