TY - JOUR
T1 - Development and evaluation of new mask protocols for gene expression profiling in humans and chimpanzees
AU - Toleno, Donna M.
AU - Renaud, Gabriel
AU - Wolfsberg, Tyra G.
AU - Islam, Munirul
AU - Wildman, Derek E.
AU - Siegmund, Kimberly D.
AU - Hacia, Joseph G.
N1 - This research was supported by grants from the National Institutes of Health (GM072447 and DE012711-09S1 to J.G.H.). In addition, this research was supported in part by the Intramural Research Program of the National Human Genome Research Institute, National Institutes of Health. This investigation was conducted in a facility constructed with support from Research Facilities Improvement Program Grant Number C06 (RR10600-01, CA62528-01, RR14514-01) from the National Center for Research Resources, National Institutes of Health.
We thank Yoav Gilad at the University of Chicago, James MacDonald and Monica Uddin at the University of Michigan for thoughtful advice and discussion. We thank Dr. Patricia Dranchak, Mallory Gerace, and Timothy Triche, Jr. at the University of Southern California for critical reading and thoughtful discussion. Computation for the work described in this paper was supported by the University of Southern California Center for High-
PY - 2009/3/5
Y1 - 2009/3/5
N2 - Background: Cross-species gene expression analyses using oligonucleotide microarrays designed to evaluate a single species can provide spurious results due to mismatches between the interrogated transcriptome and arrayed probes. Based on the most recent human and chimpanzee genome assemblies, we developed updated and accessible probe masking methods that allow human Affymetrix oligonucleotide microarrays to be used for robust genome-wide expression analyses in both species. In this process, only data from oligonucleotide probes predicted to have robust hybridization sensitivity and specificity for both transcriptomes are retained for analysis. Results: To characterize the utility of this resource, we applied our mask protocols to existing expression data from brains, livers, hearts, testes, and kidneys derived from both species and determined the effects probe numbers have on expression scores of specific transcripts. In all five tissues, probe sets with decreasing numbers of probes showed non-linear trends towards increased variation in expression scores. The relationships between expression variation and probe number in brain data closely matched those observed in simulated expression data sets subjected to random probe masking. However, there is evidence that additional factors affect the observed relationships between gene expression scores and probe number in tissues such as liver and kidney. In parallel, we observed that decreasing the number of probes within probe sets lead to linear increases in both gained and lost inferences of differential cross-species expression in all five tissues, which will affect the interpretation of expression data subject to masking. Conclusion: We introduce a readily implemented and updated resource for human and chimpanzee transcriptome analysis through a commonly used microarray platform. Based on empirical observations derived from the analysis of five distinct data sets, we provide novel guidelines for the interpretation of masked data that take the number of probes present in a given probe set into consideration. These guidelines are applicable to other customized applications that involve masking data from specific subsets of probes.
AB - Background: Cross-species gene expression analyses using oligonucleotide microarrays designed to evaluate a single species can provide spurious results due to mismatches between the interrogated transcriptome and arrayed probes. Based on the most recent human and chimpanzee genome assemblies, we developed updated and accessible probe masking methods that allow human Affymetrix oligonucleotide microarrays to be used for robust genome-wide expression analyses in both species. In this process, only data from oligonucleotide probes predicted to have robust hybridization sensitivity and specificity for both transcriptomes are retained for analysis. Results: To characterize the utility of this resource, we applied our mask protocols to existing expression data from brains, livers, hearts, testes, and kidneys derived from both species and determined the effects probe numbers have on expression scores of specific transcripts. In all five tissues, probe sets with decreasing numbers of probes showed non-linear trends towards increased variation in expression scores. The relationships between expression variation and probe number in brain data closely matched those observed in simulated expression data sets subjected to random probe masking. However, there is evidence that additional factors affect the observed relationships between gene expression scores and probe number in tissues such as liver and kidney. In parallel, we observed that decreasing the number of probes within probe sets lead to linear increases in both gained and lost inferences of differential cross-species expression in all five tissues, which will affect the interpretation of expression data subject to masking. Conclusion: We introduce a readily implemented and updated resource for human and chimpanzee transcriptome analysis through a commonly used microarray platform. Based on empirical observations derived from the analysis of five distinct data sets, we provide novel guidelines for the interpretation of masked data that take the number of probes present in a given probe set into consideration. These guidelines are applicable to other customized applications that involve masking data from specific subsets of probes.
UR - https://www.scopus.com/pages/publications/63349085347
UR - https://www.scopus.com/pages/publications/63349085347#tab=citedBy
U2 - 10.1186/1471-2105-10-77
DO - 10.1186/1471-2105-10-77
M3 - Article
C2 - 19265541
AN - SCOPUS:63349085347
SN - 1471-2105
VL - 10
JO - BMC bioinformatics
JF - BMC bioinformatics
M1 - 77
ER -