Microarray Gene Expression Ranking With Z Score For Cancer Classification
Microarray Gene Expression Ranking With Z Score For Cancer Classification
Microarray Gene Expression Ranking With Z Score For Cancer Classification
Volume: 2 Issue: 8
ISSN: 2321-8169
2480 2484
_______________________________________________________________________________________________
Dr P Ponmuthuramalingam
Research Scholar
Government Arts College ,
Coimbatore, Tamil Nadu, India
Abstract--Over the past few decade there has been explosion in the amount of genomic data available to biomedical engineer due to the
advantage of biotechnology. For example using microarray it is possible to find out a persons gene expressions profile more than 30000
genomes. Among this one of the most important gene selection problem is gene ranking. Here we will describe Z-score ranking for microarray
gene expression selection. In that technique it choose the gene and then applied the Z-Score Ranking technique and then divides the genes into
subsets with Successive Feature selection and then finally LDA Applied for the result. With this Z-score ranking technique we will get the
accurate results and less effort. The Lymphoma and Leukemia dataset genes are utilized. The proposed technique shows capable classification
accuracy for the whole test data sets.
__________________________________________________*****_________________________________________________
1.
INTRODUCTION
_______________________________________________________________________________________
ISSN: 2321-8169
2480 2484
_______________________________________________________________________________________________
classes. An additional probable model for TS might be a tstatistic between the centroid of an exact class and the
centroid of all the additional classes. To find the minimum
gene subset when after selecting some top genes in the
importance ranking list, this attempt to classify the data set
with only one gene. This work inputs each selected gene
into LDA classifier.
3.2 Successive Feature Selection
Successive Feature Selection SFS method (SFS) a
set of 10 features is procedure single at a time that the
rate of x is taken due to memory constraint and it is
experimentally establish that the fitting values of x is equal
to or lower than 10. The output is the grade of features. In
the successive stage that the feature is reduced once at a
time and a subset of features is achieved. That the
classification accuracy using classifiers calculate, and the
top subset of features is processed to the next level. There
might be further than one top subset of features in a given
stage. A feature is dropped in level 1 that offers four
dissimilar subsets of features. The top set in level 1 {x, x2,
x4} is which is chosen for level 2. In a related way a feature
is dropped from the best set of features of level 1 into level
2, which provides three different subsets of features. The
best sets in level 2 are {x2, x4} and {x1, x2} supposing that
their classification accuracies are the similar and are
elevated than those of other subsets and the best set in level
3 is {x2} .
This procedure is finished when all the features are ranked.
Two ranked sets are achieved in SFS: that is R1= {x2, x4,
x1, x3} and R2= {x2, x1, x4, x3}
2
3
4
1
x1
x3
3
4
2
4
4
2
1 = 2 , 4 , 1 , 3
4
1
4
1
2 = 2 , 1 , 4 , 3
3
Figure 1 Successive Feature Selection
2481
IJRITCC | August 2014, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
ISSN: 2321-8169
2480 2484
_______________________________________________________________________________________________
8.
3.
2.
3.
4.
5.
6.
7.
EXPERIMENTAL RESULTS
Class
Number
of Gene
Training
samples
Test
samples
Lymphoma
4026
43
19
Leukemia
7129
40
19
2482
IJRITCC | August 2014, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
ISSN: 2321-8169
2480 2484
_______________________________________________________________________________________________
Table 2 Accuracy and Execution Time for proposed method
Dataset
Methods
Number of
Selected Genes
Accuracy (%)
Execution Time
(Seconds)
Existing Feature
Selection with LDA
250
89
10
Proposed Feature
Selection with LDA
20
97
Existing Feature
Selection with LDA
150
83
14
Proposed Feature
Selection with LDA
25
95
11
Lymphoma
Leukemia
Table 2 shows the accuracy and execution time for feature selection in gene expression data. Figure 3 shows the
comparison of accuracy for the proposed method of feature selection with LDA classification and the Existing method of feature
selection with LDA, from the table obviously noticed that the planned technique provides improved results by their correctness in
percentage.
100
90
80
Existing Feature
Selection with
LDA
Accuracy (%)
70
60
Proposed Feature
Selection with
LDA
50
40
30
20
10
0
Lymphoma
Leukemia
Datasets
2483
IJRITCC | August 2014, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
ISSN: 2321-8169
2480 2484
_______________________________________________________________________________________________
Execution Time(Seconds)
60
50
Existing Feature
Selection with
LDA
Proposed Feature
Selection with
LDA
40
30
20
10
0
Lymphoma
Leukemia
Datasets
Fig4 shows the execution time for both Existing and Proposed feature selection methods
4.
CONCLUSION
[6]
[7]
[8]
[9]
[10]
[11]
REFERENCES
[1]
[2]
[3]
[4]
[5]
[12]
[13]
[14]
[15]
2484
IJRITCC | August 2014, Available @ http://www.ijritcc.org
_______________________________________________________________________________________