Gene selection for cancer classification using support vector machines

Abstract

DNA micro-arrays now permit scientists to screen thousands of genes simultaneously and determine whether those genes are active, hyperactive or silent in normal or cancerous tissue. Because these new micro-array devices generate bewildering amounts of raw data, new analytical methods must be developed to sort out whether cancer tissues have distinctive signatures of gene expression over normal tissues or other types of cancer tissues. In this paper, we address the problem of selection of a small subset of genes from broad patterns of gene expression data, recorded on DNA micro-arrays. Using available training examples from cancer and normal patients, we build a classifier suitable for genetic diagnosis, as well as drug discovery. Previous attempts to address this problem select genes with correlation techniques. We propose a new method of gene selection utilizing Support Vector Machine methods based on Recursive Feature Elimination (RFE). We demonstrate experimentally that the genes selected by our techniques yield better classification performance and are biologically relevant to cancer. In contrast with the baseline method, our method eliminates gene redundancy automatically and yields better and more compact gene subsets. In patients with leukemia our method discovered 2 genes that yield zero leave-one-out error, while 64 genes are necessary for the baseline method to get the best result (one leave-one-out error). In the colon cancer database, using only 4 genes our method is 98% accurate, while the baseline method is only 86% accurate.
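The recursive elimination loop described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not give the ranking criterion, so the sketch assumes the squared weight of a linear classifier is used and that one feature is removed per iteration. A plain subgradient-descent hinge-loss trainer stands in for a real SVM solver, and the names `train_linear_svm` and `svm_rfe` and the toy data are hypothetical.

```python
import random


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Stand-in for an SVM solver (assumption): full-batch subgradient
    descent on the L2-regularised hinge loss. Returns weights and bias."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regulariser
        grad_b = 0.0
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            if yi * score < 1.0:  # only margin violators contribute
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def svm_rfe(X, y):
    """Recursive feature elimination: retrain, drop the feature with the
    smallest squared weight (assumed criterion), repeat until none remain.
    Returns feature indices ordered from most to least important."""
    surviving = list(range(len(X[0])))
    eliminated = []
    while surviving:
        X_sub = [[row[j] for j in surviving] for row in X]
        w, _ = train_linear_svm(X_sub, y)
        worst = min(range(len(surviving)), key=lambda k: w[k] ** 2)
        eliminated.append(surviving.pop(worst))
    return eliminated[::-1]  # last eliminated = most important


# Toy data (hypothetical): feature 0 tracks the class label,
# features 1 to 3 are pure noise standing in for irrelevant genes.
rng = random.Random(0)
y = [1 if i % 2 == 0 else -1 for i in range(40)]
X = [[2.0 * yi + rng.gauss(0, 0.3),
      rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)] for yi in y]

ranking = svm_rfe(X, y)
print("features, most to least important:", ranking)
```

Because the classifier is retrained after every removal, the weights of the remaining features are re-estimated each round, which is what separates this from ranking every feature once by an independent score such as correlation with the label.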

Citation (APA)

Guyon, I., Weston, J., Barnhill, S., & Vapnik, V. (2002). Gene selection for cancer classification using support vector machines. Machine Learning, 46(1–3), 389–422. https://doi.org/10.1023/A:1012487302797
