0% found this document useful (0 votes)

154 views6 pages

An Overview On Gene Expression Analysis: Dr. R. Radha, P. Rajendiran

The document provides an overview of gene expression analysis. It discusses how DNA microarray technology allows measuring the expression of thousands of genes in parallel under multiple conditions. Key data mining techniques that help analyze gene expression data include classification, clustering, and prediction. Microarray experiments assess gene expression levels under different conditions, such as across tissue samples. Gene expression profiling is used in various applications including cancer research and molecular disease classification.

Uploaded by

International Organization of Scientific Research (IOSR)

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

154 views6 pages

An Overview On Gene Expression Analysis: Dr. R. Radha, P. Rajendiran

Uploaded by

International Organization of Scientific Research (IOSR)

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661 Volume 4, Issue 1 (Sep-Oct. 2012), PP 31-36 www.iosrjournals.

org

An Overview on Gene Expression Analysis

Dr. R. Radha1, P. Rajendiran2
1

(Department of Computer Science, S. D .N. B. Vaishnave college of Women, chromepet, Chennai, Tamil nadu India.) 2 (Department of Computer Science, Vidyaa Vikas Educational Institutions, Tiruchengode, Namakkal, Tamilnadu - India)

Abstract: Recent advances in DNA microarray technology, also known as gene chips, allow measuring the
expression of thousands of genes in parallel under multiple experimental conditions [1]. This technology is having a significant impact on genomic studies. Disease diagnosis, drug discovery and toxicological research benefit from the microarray technology. Arrays are now widely used in basic biomedical research for mRNA expression profiling and are increasing being used to explore patterns of gene expression in clinical research. Keywords: ANN, Classification, Clustering, Gene expression, Micro Array

Introduction

Various approaches have recently been used in outcome prediction using gene expression data. It has been shown that specific patterns of gene expression occur during different biological states such as cell development and during normal physiological responses in tissues and cells. There are many data mining techniques which help to analyze the gene expression data [2]. The generation of quantitative expression patterns of many genes in parallel can be achieved by using techniques based on complementary DNA micro arrays [3], [4]. II. Gene Expression Data A microarray experiment typically assesses a large number of DNA sequences (genes, CDNA clones, or expressed sequence tag [ESTs] under multiple conditions. These conditions may be a time series during a biological process (e.g: the yeast cell cycle) or a collection of different tissue samples [5]. The original gene expression matrix obtained from a scanning process contains noise, missing values, and systematic variations arising from the experimental procedure. Within a gene expression matrix, there are usually several particular macroscopic phenotypes of samples related to some diseases or drug effects. The remaining genes in the gene expression matrix are irrelevant to the division of samples of interest and thus are regarded as noise in the data set [5]. A recent effort to understand how genes contribute to disease approaches the discovery of sub-classes of diffuse large B-cell lymphoma (DLBCL) by using expression analysis [6]. It has been shown that the discovery of sub-classes in DLBCL has not been successful by relying exclusively on morphological features [3]. Alizadeh et al [6] demonstrate that the molecular profile of a tumor obtained from CDNA microarrays can indeed be interpreted as a robust and clear picture of the tumor biology. In [7], at Patrick Browns lab at Stanford has used microarrays to measure gene expression levels for the entire yeast genome (approximately 6400 distinct CDNA sequences) during the diauxic shift (transition from sugar metabolism to ethanol metabolism) , sporulation and the entire cell cycle. These data sets are publicly available. The Brown lab also has an online guide to build your own arrayed and scanner. These micro arrays have been commercialized by Incyte pharmaceuticals microarray division(formerly Synteni). Incyte Gene expression Microarrays(GEMs) are available with templates from human, rat, mouse, plant and microbial genomes. Different approaches have recently been used on outcome prediction using gene expression profiles. In the Cox proportional hazard regression method [8,9] genes most related to survival are first identified by a univariate Cox analysis, and a risk score is then defined as a linear weighted combination of the expression values of the identified genes[10,11]. Advances in techniques for high throughput data gathering, such as microarray and DNA sequencing machine have opened up new research avenues in genomics. Large-scale biological research such as genome projects are now producing enormous quantities of genomic data using these rapidly growing technologies. Transforming the massive data to useful biological knowledge is the present challenge. Different analysis tools are being developed in order to detect and understand the phenomena of gene regulation and physiological functions and assessing the quality of a genomic sequence [12].

www.iosrjournals.org

31 | Page

An Overview on Gene Expression Analysis

With the wealth of gene expression data from microarray (such as high density oligonucleotide arrays and CDNA arrays) prediction, classification and clustering techniques are used for analysis and interpretation of the data. Some important recent applications are in molecular classification of acute leukemia(Golub et al.,1999,[14]), cluster analysis of tumor and normal colon tissues (Alon et al.,1999,[30]). Clustering and classification of human cancer cell lines (Ross et al.,2000,[66]). Diffuse Large B-cell lymphoma(DLBCL; Alizadesh et al., 2000, [6]), human mammary epithelial cells and breast cancer (Perou et al., 1999 [67], 2000 [68]) and skin cancer melanoma(Bittner et al.,2000[69]). These techniques have also helped to identify previously undetected sub types of cancer (Gloub et al.,1999[14];Alizadeh et al., 2000[6];Bittner et al.,2000[69];Perou et al.,2000[68]). The problem of prediction may come in various forms of applications as well; the prediction of patient survival duration with germinal center B-like DLBCL compared to those with activated B-Like DLBCL using Kaplan Meier survival curves (Ross et al., 2000 [66]). Gene expression data from DNA microarrays are characterized by many measured variables (genes) on only a few observations (experiments) although both the number of experiments and genes per experiments are growing rapidly [28]. Recent technical and analytical advances make it practical to quantitative the expression of thousands of genes in parallel using complementary DNA microarrays [3]. This mode of analysis has been used to observe gene expression variation in a variety of human tumors [29-30]. To apply this method to questions in normal and malignant lymphocyte biology, we designed a specialized microarray the lympho chip- by selecting genes that are preferentially expressed in lympho id cells and genes with known or suspected roles in processes important in immunology or cancer[31]. Due to recent advances in DNA microarray technology, is now feasible to obtain gene expression profiles of tissue samples at relatively low costs. Many scientists around the world use the advantages of this gene profiling to characterize complex biological circumstances and diseases microarray techniques that are used in genome wide gene expression and genome mutation analysis help scientists and physicians in understanding of the patho physiological mechanisms, in diagnoses and prognoses, and choosing treatment plans B. Transcriptional profiling is a tool that provides unique data about disease mechanisms, regulatory pathways, and gene function[3]. This technology not only allows comparison of gene profiles in normal and pathological tissues or cells, but also helps us establish interrelationships among genes, e.g.. Clustering of genes, coincident temporal pattern of expression, identify upstream and downstream targets of genes, understand mechanisms of disease at a molecular level, and define and validate novel drug targets[32]. A Comprehensive review of biological and technological aspects of microarray technology can be found in [33]. Ramaswamy et.al.[34] and Alizadeh et al., provide[35] a detailed discussion of the clinical implications of microarray in oncology. For excellent reviews on many different aspects of microarray technology, the reader is referred to the two special supplements [36-37]. References [38-39] provide an overview of gene expression data analysis. Topics covered include experimental design tissues, normalization, quality control, exploratory analysis (data visualization), and the problem of multiple testing for determining the differentially expressed genes. Aittoakallio et al [40] and Quackembush [41] underlined that the methods used to analyze the gene expression data can have a profound influence on the interpretation of the results and therefore a basic understanding of bio informatics tools is required for optimal experimental design and meaningful data analysis. Availability of gene expression profiles of tissue samples from different diagnostic classes led to the application of many well- established pattern recognition / classification algorithms to these profiles, in an attempt to provide more accurate and automatic class prediction [35,39,6, 42]. Brazma et al.,[43] and Ball et al.,[44] discussed the importance of establishing a standard for recording and reporting microarray-based gene expression data and proposed a minimum information about a Microarray Experiment (MIAME) that describes the minimum information required to ensure that micro array data can be easily interpreted and that results desired from its analysis can be independently verified. Kuo et al.,[45] compared two high- throughput CDNA microarray technologies, stand ford type (i.e.. spotted) CDNA microarrays and Affymetrix oligonucleotide microarrays and showed that corresponding mRNA measurements from the two platforms showed poor correlation. Further their results suggest gene-specific, or more precisely, probe-specific factors influencing measurements differently in the two platforms, implying a poor prognosis for a broad utilization of gene expression measurements across platforms. By measuring transcription levels of genes in an organism under various conditions, at different developmental stages and in different tissues, we can build up gene expression profiles which characterize the dynamic functioning of each gene in the genome. We can imagine the expression data represented in a matrix with rows representing genes, columns representing samples and each cell containing a number characterizing the expression level of the particular gene in the particular sample we will call such a table a gene expression matrix. Building up a database of such matrices will help us to understand gene regulation, metabolic and signaling pathways, the genetic mechanisms of disease, and the response to drug treatments. For instance, if over expression of certain genes are correlated with a certain cancer, we can explore the conditions that affect www.iosrjournals.org 32 | Page

An Overview on Gene Expression Analysis

the expression of these genes and the genes that have similar expression profiles, we can also investigate which compound (particular drugs) lower the expression level of these genes[46].

III.

Data mining classification technique for gene expression data

In data mining classification is one of the most important tasks. It maps the data into predefined targets. It is a supervised learning as targets are predefined. The aim of the classification is to build a classifier based on some cases with some attributes to describe the objects or one attribute to describe the group of the objects. Then the classifier is used to predict the group of new cases from the domain based on the value of other attributes. The systematic classification of types of tumors is crucial to achieve advances in cancer treatment and research. It has been suggested that the specification of therapies according to tumor types differentiated by pathogenetic patterns may maximize the efficiency of the treatment and minimize toxicity on the patients [14, 6]. Several limitations about the conventional classification techniques based on morphological features of the tumor have been reported in the literature [15]. Moreover, by analyzing complex patterns defined by molecular markers, it has been demonstrated that there are subtypes of acute leukemia, prostate cancer and non-HodgkinsLymphomas[14]. There are two useful tasks in cancer classification, prediction of classes and discovery of classes. The prediction task consists of the assignment of particular tumor samples to known types of cancer. The discovery task refers to the unsupervised identification of relevant groups of samples and the characterization of subtypes of cancer. Their research aims to implement a discovery task based on a global expression analysis approach. Most approaches to the computational analysis of gene expression data are functionally significant classification of genes in unsupervised fashion and the discrimination of high risk patients from low risk ones. On the other hand, supervised learning techniques use training set to optimize the discrimination model. Artificial Neural Network (ANN) is one of the supervised methods and a powerful tool for accurately detecting causal relationships [13]. Tamayo et.at, have illustrated the value of Kohonens self-organizing feature maps (SOFM) [16] to interpret gene expression patterns during yeast growth cycle and hematopoietic differentiation [17]. They identify predominant gene expression patterns in those biological processes that suggested, for instance, novel hypotheses about hematopoietic differentiation useful for the treatment of acute promyelocytic leukaemia. Similarly based on a SOFM, Golub et al. [14] approaches the problem of molecular classification of cancer. Classification of biomedical data faces a special challenge because of the characteristics of the data: too few data examples with too many features. How to improve the classification performance or the generalization ability of a classifier in the biomedical domain becomes one of the active research areas. One approach is to build a fusion model to combine multiple classifiers together and result in a combined classifier which can achieve a better performance than any of its composing individual classifiers [12]. [18] proposed a sum classifier fusion model to combine multiple SVMs by applying the knowledge of fuzzy logic and genetic algorithms. The most straight fordward classifier design approach is based on the concept of similarity. In this approach, the distance between the test patterns whose class is to be decided and the known representatives or prototypes of classes are measured. Given a training set and a similarity measure or metric, to decide for the class membership of a test sample, the k-nearest neighbors (k-NN) find the class membership of the k closet samples in the training set and take a majority vote. The k-NN classifier that assigns the test samples to the class of nearest observations in the training set is often used as a benchmark for other classifiers, since it always offers reasonable classification performance [47]. In the nearest mean classifier, the prototypes are the class means / centres or centroids. Tibshirani et al., [48] suggested an enhancement for the nearest centroid classifier, called Nearest Shrunken Centroids(NSC) (The NSC is also referred to as PAM. Prediction Analysis of Microarrays, due to the name of the associated paper and software). In NSC, weak components of the class- centroids are shrunk or deleted via soft-thresholding. The classification accuracy (expressed in terms of training test, and cross validation error rates) and the number of present (or undeleted) genes are plotted against a parameter called delta that adjusts the amount of shrinkage and an optimal value for delta is selected by examining the error rates shrinkage eliminates the information that does not contribute towards class prediction, i.e noise,. The contribution or strength of each class centroid to the classification is measured by a t- statistics, where the numerator is the difference between individual class means and the overall mean and the denominator is the pooled estimate of standard deviation inflated by a fudge factor. Another popular classifier design approach is based on Artificial Neural Networks. NN consists of many interconnected processing elements, called neurons, resembling human brains structure through different structures (varying number of layers and number of neurons per layer) linear or non linear transfer functions that the individual neurons use, and training paradigms during which the weights of the connections are adjusted or tuned, the NN can model / reveal complex relationship among inputs and outputs exemplified or embedded in the training data [32]. www.iosrjournals.org 33 | Page

An Overview on Gene Expression Analysis

Other popular classifier design approaches include Fishers Linear Discriminant Analysis (FLDA). The FLDA is both a class-predictor design and a feature extraction /selection approach, or expressed differently, FLDA is a classifier design approach with built in feature extraction / selection capability. A linear discriminant function is nothing but a special linear combination of the values of all the features that are used in classifier design. In order to introduce other major classifier design approaches such as DLDA and DQDA that are frequently used in gene expression profile classification, we will briefly review the Bayesian decision theory. In this model based setting, the class conditional densities are assumed to have multivariate normal densities typically. In Classification and Regression Trees approach, Breimon et al., [49], used node impurity criteria such as entropy (information content) and Ginis index of diversity [51]. Some important features are selected and binary splits are formed on those features repeatedly. Each terminal features subset is associated with a class label. Dudoit et al.,[50] identified three main aspects for tree construction, selection of splits, decision to declare a node terminal or to continue splitting, and assignment of each terminal node to a class. Depending on how these topics are treated, many variations of tree are possible. Since the decisions / splits at nodes are binary, decision boundaries are parallel to the features axes, as such they are intrinsically suboptimal [ 47]. A common way to represent gene expression measurements does not only allow to directly combine microarray data sets, but also to readily apply the generated classifier on a new data set which is represented in the same manner. To this end [52, 53] proposed the method TSP (Top Scoring Pair) and [54] the generalized version kTSP(k-Top Scoring Pairs), classifiers which directly refer to the relative ranks, i.e the ordering of the actual gene expression value with in a profile kTSP was shown to perform as good as state of- the art algorithms while using a relatively small number of genes for classification. Machine learning techniques such as neural networks are adequate for gene expression patterns and cancer classification analysis for their well-known pattern recognition and data organization capabilities [55],[56]. Advanced neural learning algorithms have not only improved the accuracy, reliability and efficiency of many medical pattern recognition systems, but they also show several advantages for the implementation of decision support systems in physiological genomics [57] [58].

IV.

Data mining clustering technique for gene expression data

Clustering problems arise in many different applications such as data mining and knowledge discovery, data compression, pattern recognition and pattern classification in order to grouping similar genes in one cluster so that genes within the same cluster are similar to each other and different from genes in other cluster [19]. Clustering techniques have proven to be helpful to understand gene function, gene regulation, cellular processes, and sub types of cells. Genes with similar expression patterns (co expressed genes) can be clustered together with similar cellular function [5]. The purpose of clustering gene expression data is to reveal the nature structure inherent in the data. A good clustering algorithm should depend as little as possible on prior knowledge, for example requiring the predetermined number of cluster as an input parameter. Clustering algorithms for gene expression data should be capable of extracting useful information from noisy data. Gene expression data are often high connected and may have intersecting and embedded patterns [20]. Clustering algorithm which also provides some graphical representation of the cluster structure is much favored by biologists. There are numerous clustering techniques presently available to cluster particularly the gene expression data such as hierarchical clustering technique which is a method used commonly by many people in early days. A common problem associated with this method is visualization of clustering results in terms of dendrogram which is difficult when a data set is large [21]. In the popular k-means clustering method, the user was always uncertain to define the precise number of clusters. In hard clustering, data is divided into distinct clusters, where each data element belongs to exactly one cluster. In some situations, the object may belong to more than one cluster, and associated with each element is a set membership level. Clustering may be either crisp (or) fuzzy [22]. Fuzzy clustering of microarray data has an advantage over crisp partitioning because of great amount of imprecision and uncertainty related with gene expression data [23]. Fuzzy c- means [24] and genetic algorithms (GA) [25],[26] have been used effectively in clustering gene expression data. The fuzzy c-means algorithm requires the number of clusters as an input parameter. The GA based algorithms have been found to detect biologically relevant clusters but are dependent on proper tuning of the input parameters. [27] have presented a framework for the unsupervised analysis gene expression data. They developed an interrelated two-way clustering method which they applied on the gene expression matrices transformed from the new microarray data. This approach detects significant patterns within samples while dynamically selecting significant genes which manifest the conditions of actual empirical interest. Through iterative clustering the number of genes are reduced which improves the accuracy of sample class discovery. The method was proved effective by conducting experiment with two multiple sclerosis data sets and a leukemia data set. These www.iosrjournals.org 34 | Page

An Overview on Gene Expression Analysis

experiments indicate that this appears to be a promising approach for unsupervised sample clustering on gene array data sets. The goal of clustering is to group together objects (gens or samples) with similar properties. This can also be viewed as the reduction of the dimensionality of the system. Clustering is not a new technique, many algorithms have been developed for it and many of these algorithms have been applied to analyze expression data. The hierarchical [59] and k-mean clustering algorithms [60] and [61] as well as self-organizing maps [62] have all been used for clustering expression profiles. Even a simple clustering algorithms based on binning (i.e. discrete `zing the expression profile space and clustering together the profiles that map into the same bin) has been shown to be useful for clustering genes and subsequent discovering of transcription factor binding sites[63]. More recently new algorithms have been developed specifically for gene expression profile clustering for instance based on finding approximate cliques in graphs [64]. Gene expression profile clustering does not necessarily require the full genome. For instance Iyer et.al.,[65] studied 8600 genes in human fibroblasts and obtained 10 distinct gene clusters each associated with genes with particular functional roles, such as signal transduction, coagulation, homeostasis, inflammation etc.

Conclusion

Gene expression profiling has great potential for accurate cancer diagnosis. In this paper, we have discussed different types of advances in techniques for high throughput data gathering such as microarrays and DNA sequencing machine that have opened up new research in genomics . Large-scale biological research such as genome projects are now producing enormous quantities of genomic data using these rapidly growing technologies. Different analysis tools are developed in order to detect and understand the phenomena of gene regulation and physiological functions and assessing the quality of a genomic sequence.

References
[1] [2] [3] [4] [5] [6] [7] [8] [9] [10] M.B Eisen, P.T.Spellman,P.O.Brown, and D. Botstein, Cluster analysis and display of genome- wide expression patterns, (Proc,Natl.Acad.Sci. USA), Vol.95,pp.14863-8, (1998). P.J.Russel,Fundamentals of genetics, Second Edition, (San Francisco, Addison Wesly Longman Inc., 2000). M.Schena,D.Shalon, R.W.Davis and P.O.Brown, Quantitative monitoring of gene expression patterns with a complementary DNA Micro Array Science,270, 476-471, 1995. M.B.Eisen and P.O.Brown, DNA arrays for analysis of gene expression, Methods Enzymol.,303,179-205,(1999). Daxin Jiang, Chun Tang and Aidong Zhang,Cluster analysis for Gene Expression Data : A Survey IEEE Transactions on Knowledge and Data Engineering, vol.16, No.11, November 2004, pp 1370-1384 A.A.Alizadeh et al., Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling Nature, 403, 503511,(2000). Dhaeselecr, Shoudan Liang and Roland Somogyi, PsB99 Tutorial Gene Expression Analysis and Modeling. Cox, D.R.Regression models and life-tables (with discussion),J.R.Stat Soc.,B34:184-220,(1972). Lunn,M., & McNeil,D.R.., Applying Cox Regression to Competing Risks, Biometrics 51: 524-532,(1995). Beer D.G.Kardia, S.L.,Huang, C.C.Giodano, T.J..,Levin, A.M.Misek, D.E.,Lin, L.,Chen.G.Gharib,T.G.,Thomas, D.G.,Lizyness, M.L.,Kuick, R.,Hayasaka, S.,Taylor, J.M.,Iannettoni, M.D.,Irringer, M.B&Hanash,S., Gene Expression Profiles predict survival of patients with lung adenocarcinoma, Nat. Med.,8(8): 816-823,2002. Rosenwald,A.et al, The use of molecular profiling to predict survival after chemotherapy for Diffuse large-B cell lymphoma, NEJM,346(25):1937-1947,2002. R. Radha., Gene Expression Analysis, International Journal of Advanced Science and Technology, Vol.33, August 2011. Khan, J. et al., Classification and diagnostic Prediction of Cancers using gene expression profiling and artificial Networks, Nat.Med., 7:673-679,2001. T.R.Golub,D.K.Slonim , P.Tamay, C.Huard, M.Gassembeek, J.P. Mesirov, H.Coller, M.L Loh, J.R.Downing, M.A.Caligiuri, C.D. Bloomfield and E.S. Lander, Molecular classification of Cancer: class discovery and class prediction by Gene expression monitoring, science, 286, 531-537, 1999. F.Azuaje, Interpretation of genome expression patterns: Computational challenges and opportunities, to be published by IEEE Engineering in Medicine and Biology, November 2000. T.Kohonem, Self-organizing Maps, (Heidelberg, Springer, 1995). P.Tamayo, D.Slonim, J.Mesirov, Qzhn, S.Kitareewan, E.Dmistrovsky, E.lander and T.R.Golub, Interpreting Patterns of gene expression with self organizing maps: methods and applications to hematopoietic differentiation, The Proceedings of the National Academy of Sciences of U.S.A.,96,2907-2912,(1999). Xiujuan chen, Yong Li, Robert Harrison, Yan-Qing Zhang, Genetic fuzzy classification fusion of multiple SVMs for biomedical data journal of intelligent & Fuzzy systems. Volume 18, issue 6, December 2007, IOS press Amsterdam. Han, Kamber,Data Mining Concepts and Techniques, (Elsevier Publications, 2006). D.jiang, J.pei, and A.Zhang, DHC: a density based hierarchical clustering method for time series gene expression data. In Proceedings of BIBE2002,:3rd IEEE International Symposium on Bio-informatics and Bio-Engineering. Bethesda Maryland 2003, p.393. Anil K.Jain and Richard C.Dubes, Alogrithms for clustering data, (Prentice Hall,New Jersey, 1988). P. Valarmathie, Dr. MV. Srinath, Dr.T.Ravichandran, K.Dinakaran, Hybrid Fuzzy C-means Clustering Technique for Gene expression data, International Journal of Research and Reviews in Applied Sciences, ISSN:2076-734X, EISSN:2076-7366, Volume 1, issue 1, October 2009. Anirban, Mukhopadhayay, Ujjuval Maulik and sanghamitra bandyopadhyay, Efficient two stage fuzzy clustering of microarray gene expression data, International Conference on information Technology (ICIT06) , 2006 IEEE. J.C. Bezdek, Pattern Recognintion With Fuzzy Objective Function Algorithms, (New York;Plenum Press, 1981). S. Bandyopadhyay, A.Mukhopadhyay, and U.Maulik, An important algorithm for clustering gene expression data,Bioinformatics, vol.23(21),pp. 2859-2865,2007.

[11] [12] [13] [14]

[15] [16] [17]

[18]. [19]. [20].

[21]. [22].

[23]. [24]. [25].

www.iosrjournals.org

35 | Page

An Overview on Gene Expression Analysis

[26]. [27]. [28]. U. Maulik, A.Mukhopadhyay, and S.Bandyopadhyay, Combining Pareto optimal clusters using supervised learning for identifying co-expressed genes, BMC Bioinformatics. Vol.co(27),2009. C. Tang and A.Zhang, Interrelated Two-Way Clustering and its Application on Gene Expression Data, Presented at International Journal on Artificial Intelligence Tools,2005, p.p.577-598. Danh V.Nguyen1 and David M. Rocke2.Tumor Classification by Partial least squares using Microarray Gene expression data., 1. Center for Image Processing and Integrated Computing and 2. Department Applied Science. University of California, Davis, CA 95616, USA, Received on November 23, 2000; revised on March 22, 2001; accepted on June 6, 2001. Bubendrof. L. et al. Hormone Theraphy failure in human prostate Cancer: analysis by complementary DNA and tissue Microarrays. J.Natl Cancer Inst. Al., 1758-1764 (1999). Alon, U et al. Broad Patterns of Gene expression revealed by clustering analysis of tumor and normal colon tissues probed by Olignonucleotide arrays. Proc.Natt.Acad Sci USA 96, 6745-6750(1999). Alizadeh.A.et. al. The Lymphochip: a specialized cDNA Microarray for the genomic-scale analysis of gene expression in normal and malignant lymphocytes. Cold Spring Harbor Symp. Quant-Biol.(in the press). Mush H.Asyali, Dilek Colak, Omer Demirkaya and Mehmet S. Inan. Gene Expression Profile Classification : A Review, Current Bioinformatics, 2006, l, 55-73. Bentham Science Publishers Ltd. Nguyen DV, Arpat AB, Wang N, Carroll RJ. DNA Microarray experiments : biological and technological aspects Biometrics 2002, 58; 701-17. Ramaswamy S. Golub TR, DNA Microarray in clinical oncology, J.Clin Oncel 2002:20:1932-41. Alizadeh AA, Ross DT, Perou CM, Rijin Mud, Towards a novel classification of human malignancies based on gene expression patterns J.pathol 2000:195;41-52. Nature Genetics, The chipping forecast. Vol.(21) Supplement 1999. 1-60. Nature Genetics, The chipping forecast. 2002: 461-552. Loung YF, Cavalieri D. Fundamentals of CDNA Microarray data analysis Trends Genetic 2003;19-649-59. Lu.Y.Han,J. Cancer classification using gene expression data Int Syst 2003;28:243-68. Aittokallio T. Kurki M.Nevalainen O.Nikula T. West, A. Lahesmaa R. Computational strategies for analyzing data in gene expression microarray experiments. J.Bioinforma comput Bio, 2003;1;541-86. Quacknenbush J. Computational analysis of microarray data. Nat. Rev. Genet 2001 : 2: 418-27. Zhu J. Hastie T. Classification of gene microarrays by penalized logistic regression, Biostatistics;2004;5:427-43. Brazma A. Hingamp P. Quackenbush J, et al., Minimum information about a Microarray experiment (MIAME) toward standards for microarray data, NatGenet 2001:29:365-71. Ball CA. Sherlock, G.Parkinson H et al, standards for microarray data minimum information about a microarray data, Science 2002, 298:539. KuoWP. Jenssen Tk. Butte AJ, Ohno-Machado L.Kohane Is. Analysis of matched mRNA measurements from two different micro array technologies, Bioinformatics 2002, 18:405-12. Gianni Cesareni ., Alvis Brazma, Jaak Vilo edited ,the Gene expression data analysis, European Molecular Biology laboratory, Outsation Hinxton the European Bioinformatics Institute, Cambridge CB10, ISD, UK. Received 5 June 2000. Jain A, Duin P, Mao J, statistical Pattern recognition : A review, IEEE Transactions on PAMI 2000;22:4-37. Tibshirani R, Hastc, T. Narasimhan B. Chu G. Diagnosis of multiple cancer types by Shrunken centroids of gene expression ProcNAtl Acad Sci USA 2002: 99:6567-72. Breiman L, Friedman JH, Olshen R, Stone C J. Classification and Regression Trees, Wadsworth, Bel mont, CA 1984. Dudoit S, Fridlyand J, Speed TP Comparison of discrimination methods for the classification of tumors using gene expression data. J AM stat Assoc 2002;77-87. Kulkarni SR, Lugosi G. Venkatesh SS learning Pattern Classification a survey. IEEE Trans in from Theory 1998;44:2178-206. Geman D, dAvignen C, Naiman DQ, Winslow RL; Classifying gene expression Profiles from Pairwise mRNA Comparisons. Stat Appl Genet Mol Biol 2004, 3: Article 19. Xu.L, Tan Ac, Naiman DQ, Geman D, Winslow RL: Robust Prostate Cancer marker genes emerge from direct integration of interstudy microarray data. Bioinformatics 2005, 21(20):3905-11. Tan AC, Naiman DQ, Xu,L, Winslow RL, Geman D; simple decision rules for classifying human cancers from gene expression profiles. Bioinformatics 2005, 21(20): 3896-904. B. Ripley, Pattern Recognition and Neural Networks ,(Cambridge, England, Cambridge university press, 1996). F. Azuaje, W. Dubitzky, P. Lopes, N. Black, K.Adamson, x. Wu and J. White, Predicting Coronary disease risk based on short Term RR Intervals Measurements; A Neural Network Approach, Artificial Intelligence in Medicine, 15,275-298,1999. F. Azuaje, Making Genome Expression Data Meaningful: Prediction and discovery of classes of cancer through a connectionist learning approach, to be published in the proceedings of the IEEE symposium on Bioinformatics and Biomedical Engineering (BIBE 2000). F. Azuaje, W. Dubitzky, N.Black and K.Adamson, discovering Relevance Knowledge in Data: A Growing cell Structure Approach, IEEE Transactions on systems, Man and Cybernetics, Part B, 30, 448-460,2000. M. Eisen, P.T.Spellman, D. Botstein, P.O.Brown Proc. Natl. Acad. Sci. USA, 95(1998),pp. 14863-14867. Hartigan, J.A.(1975) Clustering Algorithms, (John wiley and Sons, New York). S. Tavazoie, D. Hughes, M.J.Campbell,R.J.Cho, G.M.Church, Nature Genet, 22(1999), pp.281 - 285 View Record in Scopus / ci ted By in Scopus(1295). P.Tamayo, D. Slonim , J. Mesirov, G.Zhu, S.Kitareewan, E.Dmitrovsky, E.Lander, T.Golub Proc, Natl.Acad. Sci.USA, 96(1999),pp. 2907-2912 View Record in Scopus / cited By in Scopus (1702). A. Brazma, I. Jonassen, J. Vilo, E. Ukkonen Genome Res., 8(1998), pp. 1202-1215. View Record in Scopus / cited by in Scopus (169). Ben Dor, A and Yakhini, Z. (1999) Proceedings of the Third Annual International Conference on Computational Molecular Biology RECOMB 1999, pp. 33-42,( ACM Press , Lyon.) V.R.Iyer, M.B.Eisen,D.T.Ross, G.Schuler, T. Moore, J.C.F.Lee, J.M.Trent, L.M.Staudt, J. Hudson Jr., M.S. Boguski, D. Lashkari, D. Shalon, D. Botstein, P.O.Brown Science, 283(1999), pp. 83-87. Ross,D.T., (2000) Systematic Variation in gene expression patterns in human cancer cell lines, Nature Genet., 24, 227-235. Perou,C.M., (1999) Distinctive gene expression patterns in human mammary epithelial cells and breast cancer, Proc. Natl Acad. Sci. USA, 96, 9112-9217. Perou,C.M., (2000) Molecular Portrait of human breast tumors. Nature, 406, 747-752. Bittner, M., (2000) Molecular classification of cutaneous malignant melanoma by gene expression profiling. Nature 406, 536-540.

[29]. [30]. [31]. [32]. [33]. [34]. [35]. [36]. [37]. [38]. [39]. [40]. [41]. [42]. [43]. [44]. [45]. [46]. [47]. [48]. [49]. [50]. [51]. [52]. [53]. [54]. [55]. [56]. [57].

[58]. [59]. [60]. [61]. [62]. [63]. [64]. [65]. [66]. [67]. [68]. [69].

www.iosrjournals.org

36 | Page

Introduction To Cancer Genome Analysis
100% (1)
Introduction To Cancer Genome Analysis
41 pages
Essentials of Molecular Biology - David Freifelder, George M. Malacinski - 2, 1993 - Jones and Bartlett Publishers - 9780867201376 - Anna's A
No ratings yet
Essentials of Molecular Biology - David Freifelder, George M. Malacinski - 2, 1993 - Jones and Bartlett Publishers - 9780867201376 - Anna's A
504 pages
BIOINFORMATICS Chapter 1 3rd Sem
100% (1)
BIOINFORMATICS Chapter 1 3rd Sem
44 pages
5991-5213EN GC Catalog LR PDF
100% (1)
5991-5213EN GC Catalog LR PDF
708 pages
Spectroscopy Catalog
100% (1)
Spectroscopy Catalog
184 pages
Corning Micro Array Technology
50% (2)
Corning Micro Array Technology
19 pages
Fundamentals of Genetics
No ratings yet
Fundamentals of Genetics
20 pages
Biotech Companies in India PDF
0% (1)
Biotech Companies in India PDF
1,230 pages
List of Biological Databases
100% (1)
List of Biological Databases
8 pages
DNA Microarray
100% (1)
DNA Microarray
37 pages
MicroRNA in Cancer
No ratings yet
MicroRNA in Cancer
148 pages
Towards Personalized Cancer Care A Report of Crispr Cas9 48duthe4h6
100% (1)
Towards Personalized Cancer Care A Report of Crispr Cas9 48duthe4h6
6 pages
Cytoscape
No ratings yet
Cytoscape
86 pages
PHD Thesis Topics in American Literature
100% (2)
PHD Thesis Topics in American Literature
8 pages
The Application of The Permutation Test in Genome Wide Expression Analysis
No ratings yet
The Application of The Permutation Test in Genome Wide Expression Analysis
115 pages
Microarray Technology and Applications: Purnima Kartha. N
No ratings yet
Microarray Technology and Applications: Purnima Kartha. N
63 pages
Linux and Kernel Component
100% (1)
Linux and Kernel Component
17 pages
I Semester: M.Tech Full Time Scheme (New)
No ratings yet
I Semester: M.Tech Full Time Scheme (New)
53 pages
Limonoids - Biosynthesis, Biochemistry and Analyis
No ratings yet
Limonoids - Biosynthesis, Biochemistry and Analyis
44 pages
Basic Concepts of Genetics, Autosomes, Allosomes, Chromosome Disorders 2023
No ratings yet
Basic Concepts of Genetics, Autosomes, Allosomes, Chromosome Disorders 2023
69 pages
Tumor Marker 12345678
No ratings yet
Tumor Marker 12345678
65 pages
Biochips Market
No ratings yet
Biochips Market
2 pages
Drosophila Models For Human Diseases
100% (1)
Drosophila Models For Human Diseases
314 pages
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
100% (2)
Bioinformatics: Nadiya Akmal Binti Baharum (PHD)
54 pages
Science Magazine - 24 March 2006
No ratings yet
Science Magazine - 24 March 2006
163 pages
Bexfield 2010 Metagenomic - Virus
No ratings yet
Bexfield 2010 Metagenomic - Virus
8 pages
Lecture Notes in Networks and Systems
No ratings yet
Lecture Notes in Networks and Systems
42 pages
Labmanual CS 1
No ratings yet
Labmanual CS 1
52 pages
Youth Entrepreneurship: Opportunities and Challenges in India
100% (1)
Youth Entrepreneurship: Opportunities and Challenges in India
5 pages
Genomics Lectures 9 To 14-2023 PDF
No ratings yet
Genomics Lectures 9 To 14-2023 PDF
65 pages
Aracne Califano2006 Nat Protocol
No ratings yet
Aracne Califano2006 Nat Protocol
10 pages
DNA Microarrays
No ratings yet
DNA Microarrays
29 pages
Seminars in Cancer Biology: Tianzuo Zhan, Niklas Rindtor FF, Johannes Betge, Matthias P. Ebert, Michael Boutros T
100% (1)
Seminars in Cancer Biology: Tianzuo Zhan, Niklas Rindtor FF, Johannes Betge, Matthias P. Ebert, Michael Boutros T
14 pages
1 What Is Bioinformatics
No ratings yet
1 What Is Bioinformatics
34 pages
Necessary Evils of Private Tuition: A Case Study
No ratings yet
Necessary Evils of Private Tuition: A Case Study
6 pages
Factors Affecting Success of Construction Project
No ratings yet
Factors Affecting Success of Construction Project
10 pages
Effects of Formative Assessment On Mathematics Test Anxiety and Performance of Senior Secondary School Students in Jos, Nigeria
100% (1)
Effects of Formative Assessment On Mathematics Test Anxiety and Performance of Senior Secondary School Students in Jos, Nigeria
10 pages
Fatigue Analysis of A Piston Ring by Using Finite Element Analysis
No ratings yet
Fatigue Analysis of A Piston Ring by Using Finite Element Analysis
4 pages
NCBI Part1
100% (2)
NCBI Part1
52 pages
Comparison of Explosive Strength Between Football and Volley Ball Players of Jamboni Block
No ratings yet
Comparison of Explosive Strength Between Football and Volley Ball Players of Jamboni Block
2 pages
An Introduction On Bioinformatics
No ratings yet
An Introduction On Bioinformatics
66 pages
Markov Chain
No ratings yet
Markov Chain
7 pages
Molecualr Basis of Diagnosis PPT Final
No ratings yet
Molecualr Basis of Diagnosis PPT Final
41 pages
Plant Biotechnology
100% (1)
Plant Biotechnology
17 pages
Reviews: Next-Generation Computational Tools For Interrogating Cancer Immunity
100% (1)
Reviews: Next-Generation Computational Tools For Interrogating Cancer Immunity
23 pages
Finding Temporal Pattern in Gene Expression Profiles
No ratings yet
Finding Temporal Pattern in Gene Expression Profiles
1 page
Biotechnology and Pharmaceutical Evolution
100% (1)
Biotechnology and Pharmaceutical Evolution
4 pages
Techniques Used in Molecular Biology-1
No ratings yet
Techniques Used in Molecular Biology-1
72 pages
Experimental Design Considerations: 3 Replicates
No ratings yet
Experimental Design Considerations: 3 Replicates
1 page
Current Diagnostic Methods For Hematological Malignancies: A Mini-Review
No ratings yet
Current Diagnostic Methods For Hematological Malignancies: A Mini-Review
7 pages
Biology-Lab Manual Sep2022
No ratings yet
Biology-Lab Manual Sep2022
32 pages
What Is Molecular Imaging?
No ratings yet
What Is Molecular Imaging?
10 pages
561 Full
No ratings yet
561 Full
7 pages
Zheng Hong 郑红 Department of Medical Genetics & Cell Biology
No ratings yet
Zheng Hong 郑红 Department of Medical Genetics & Cell Biology
63 pages
Bioinformatics
No ratings yet
Bioinformatics
55 pages
Handbook of Analysis of Oligonucleotides and Related Products 1st Edition Jose V. Bonilla
No ratings yet
Handbook of Analysis of Oligonucleotides and Related Products 1st Edition Jose V. Bonilla
48 pages
Ne Mutations and DNA Repair PDF
No ratings yet
Ne Mutations and DNA Repair PDF
35 pages
What Is An SNP Array?: DNA Hybridization-Based Technique
No ratings yet
What Is An SNP Array?: DNA Hybridization-Based Technique
8 pages
Wwwbiracnicin
No ratings yet
Wwwbiracnicin
252 pages
Tutorial R
No ratings yet
Tutorial R
456 pages
(IJCST-V4I3P23) :fadoua Rafii, Badr Dine Rossi Hassani, M'hamed Aït Kbir
No ratings yet
(IJCST-V4I3P23) :fadoua Rafii, Badr Dine Rossi Hassani, M'hamed Aït Kbir
8 pages
1 - Introduction To Computational Biology
No ratings yet
1 - Introduction To Computational Biology
22 pages
Design and Analysis of Ladder Frame Chassis Considering Support at Contact Region of Leaf Spring and Chassis Frame
No ratings yet
Design and Analysis of Ladder Frame Chassis Considering Support at Contact Region of Leaf Spring and Chassis Frame
9 pages
SomaScan v4.0 and v4.1 Data Standardization
No ratings yet
SomaScan v4.0 and v4.1 Data Standardization
3 pages
Pairwise Sequence Alignment
No ratings yet
Pairwise Sequence Alignment
12 pages
Computational Biology and Bioinformatics
100% (1)
Computational Biology and Bioinformatics
11 pages
Genomic Technologies in Clinical Diagnostics - Glossary: Term Alignment Allele
No ratings yet
Genomic Technologies in Clinical Diagnostics - Glossary: Term Alignment Allele
7 pages
Special Networks: "Principles of Soft Computing, 2
No ratings yet
Special Networks: "Principles of Soft Computing, 2
22 pages
Recombinant DNA Safety Guidelines PDF
No ratings yet
Recombinant DNA Safety Guidelines PDF
54 pages
Mouse Models
No ratings yet
Mouse Models
32 pages
Getting Started in Biological Pathway Construction and Analysis
No ratings yet
Getting Started in Biological Pathway Construction and Analysis
5 pages
Bioinformatics Session1
No ratings yet
Bioinformatics Session1
35 pages
Tutorial For Proteome Data Analysis Using The Perseus Software Platform
No ratings yet
Tutorial For Proteome Data Analysis Using The Perseus Software Platform
22 pages
Bioinformatics Assignment
No ratings yet
Bioinformatics Assignment
10 pages
Kinetics Microbial Growth
No ratings yet
Kinetics Microbial Growth
32 pages
ABSTACT .. 02 Chapter - I: Chapter - Ii Chapter - Iii Chapter - Iv Chapter - V Chapter - Vi
No ratings yet
ABSTACT .. 02 Chapter - I: Chapter - Ii Chapter - Iii Chapter - Iv Chapter - V Chapter - Vi
40 pages
GMX - BD Rhapsody Single Cell Analysis System - BR - EN
No ratings yet
GMX - BD Rhapsody Single Cell Analysis System - BR - EN
8 pages
BIOINFORMATICS LAB Report
No ratings yet
BIOINFORMATICS LAB Report
14 pages
Bioinformatics - Group21 - Report - Application of Bioinformatics in Agriculture
No ratings yet
Bioinformatics - Group21 - Report - Application of Bioinformatics in Agriculture
11 pages
Bioinformatics For Health Care: By-Daniyal Jadhav PRN No - 19010143002
No ratings yet
Bioinformatics For Health Care: By-Daniyal Jadhav PRN No - 19010143002
24 pages
Cancer Biomarkers
No ratings yet
Cancer Biomarkers
7 pages
Lecture12 Functional Pathway Analysis
No ratings yet
Lecture12 Functional Pathway Analysis
13 pages
Introduction To BioMEMS, 1st Edition Full Chapter Download
100% (15)
Introduction To BioMEMS, 1st Edition Full Chapter Download
16 pages
Bi0505 Lab
No ratings yet
Bi0505 Lab
102 pages
Bioinformatics Assignment Topic: Phylogenetics Analysis Softwares
No ratings yet
Bioinformatics Assignment Topic: Phylogenetics Analysis Softwares
12 pages
Bioinformatics: Intended Learning Outcomes
No ratings yet
Bioinformatics: Intended Learning Outcomes
9 pages
Indian Company Listxls - Compress
No ratings yet
Indian Company Listxls - Compress
378 pages
APPLICATION OF BIOINFORMATICS IN MOLECULAR BIOLOGY AND CURRENT RESEACRH-Dr. Ruchi Yadav
No ratings yet
APPLICATION OF BIOINFORMATICS IN MOLECULAR BIOLOGY AND CURRENT RESEACRH-Dr. Ruchi Yadav
105 pages
Fish Cytogenetics, Genotoxicity and Mutagenesis. (5-12-1998)
No ratings yet
Fish Cytogenetics, Genotoxicity and Mutagenesis. (5-12-1998)
6 pages
Unit 5-Introduction To Biological Databases
No ratings yet
Unit 5-Introduction To Biological Databases
14 pages
Introduction To Bioinformatics Lab: 10B17BT571 Core Course Credits: 1 L0T0P2
No ratings yet
Introduction To Bioinformatics Lab: 10B17BT571 Core Course Credits: 1 L0T0P2
3 pages
Next Generation
No ratings yet
Next Generation
5 pages
Omics
No ratings yet
Omics
6 pages
Instruction Manual, Iscript Select cDNA Synthesis Kit, Rev B
No ratings yet
Instruction Manual, Iscript Select cDNA Synthesis Kit, Rev B
2 pages
Group # 13
No ratings yet
Group # 13
49 pages
Bioinformatics: Applications: ZOO 4903 Fall 2006, MW 10:30-11:45 Sutton Hall, Room 312 Jonathan Wren
No ratings yet
Bioinformatics: Applications: ZOO 4903 Fall 2006, MW 10:30-11:45 Sutton Hall, Room 312 Jonathan Wren
75 pages
DNA Microarray
100% (1)
DNA Microarray
162 pages

An Overview On Gene Expression Analysis: Dr. R. Radha, P. Rajendiran

Uploaded by

An Overview On Gene Expression Analysis: Dr. R. Radha, P. Rajendiran

Uploaded by

IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661 Volume 4, Issue 1 (Sep-Oct. 2012), PP 31-36 www.iosrjournals.

An Overview on Gene Expression Analysis

An Overview on Gene Expression Analysis

An Overview on Gene Expression Analysis

Data mining classification technique for gene expression data

An Overview on Gene Expression Analysis

Data mining clustering technique for gene expression data

An Overview on Gene Expression Analysis

[11] [12] [13] [14]

[15] [16] [17]

[18]. [19]. [20].

[23]. [24]. [25].

An Overview on Gene Expression Analysis

You might also like