Alignment of gene expression profiles from test samples against a reference database: New method for context-specific interpretation of microarray data

Alignment of gene expression profiles from test samples against a reference database: New method for context-specific interpretation of microarray data Kilpinen, Sami K Ojala, Kalle A Kallioniemi, Olli P England BioData mining BioData Min. 2011 Mar 31;4:5. eng BACKGROUND: Gene expression microarray...

Full description

Bibliographic Details
Published in:BioData Mining
Main Authors: Kilpinen, S. K., Ojala, K. A., Kallioniemi, Olli
Other Authors: Institute for Molecular Medicine Finland
Format: Article in Journal/Newspaper
Language:English
Published: 2016
Subjects:
Online Access:http://hdl.handle.net/10138/162845
Description
Summary:Alignment of gene expression profiles from test samples against a reference database: New method for context-specific interpretation of microarray data Kilpinen, Sami K Ojala, Kalle A Kallioniemi, Olli P England BioData mining BioData Min. 2011 Mar 31;4:5. eng BACKGROUND: Gene expression microarray data have been organized and made available as public databases, but the utilization of such highly heterogeneous reference datasets in the interpretation of data from individual test samples is not as developed as e.g. in the field of nucleotide sequence comparisons. We have created a rapid and powerful approach for the alignment of microarray gene expression profiles (AGEP) from test samples with those contained in a large annotated public reference database and demonstrate here how this can facilitate interpretation of microarray data from individual samples. METHODS: AGEP is based on the calculation of kernel density distributions for the levels of expression of each gene in each reference tissue type and provides a quantitation of the similarity between the test sample and the reference tissue types as well as the identity of the typical and atypical genes in each comparison. As a reference database, we used 1654 samples from 44 normal tissues (extracted from the Genesapiens database). RESULTS: Using leave-one-out validation, AGEP correctly defined the tissue of origin for 1521 (93.6%) of all the 1654 samples in the original database. Independent validation of 195 external normal tissue samples resulted in 87% accuracy for the exact tissue type and 97% accuracy with related tissue types. AGEP analysis of 10 Duchenne muscular dystrophy (DMD) samples provided quantitative description of the key pathogenetic events, such as the extent of inflammation, in individual samples and pinpointed tissue-specific genes whose expression changed (SAMD4A) in DMD. AGEP analysis of microarray data from adipocytic differentiation of mesenchymal stem cells and from normal myeloid cell types and leukemias provided quantitative ...