Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data
Manhong Dai, Pinglang Wang, Andrew D Boyd, Georgi Kostov, Brian Athey, Edward G Jones, William E Bunney, Richard M Myers, Terry P Speed, Huda Akil, Stanley J Watson, Fan Meng
Nucleic Acids Res. 2005 Nov 10;33(20):e175.
doi: 10.1093/nar/gni179.
- PMID: 16284200
- PMCID: PMC1283542
Abstract
Genome-wide expression profiling is a powerful tool for implicating novel gene ensembles in cellular mechanisms of health and disease. The most popular platform for genome-wide expression profiling is the Affymetrix GeneChip. However, its probe selection relied on earlier genome and transcriptome annotation, which differs significantly from current knowledge. The resultant informatics problems have a profound impact on the analysis and interpretation of the data. Here, we address these critical issues and offer a solution. We identified several classes of problems at the individual probe level in the existing annotation, under the assumption that current genome and transcriptome databases are more accurate than those used for GeneChip design. We then reorganized probes on more than a dozen popular GeneChips into gene-, transcript- and exon-specific probe sets in light of up-to-date genome, cDNA/EST clustering and single nucleotide polymorphism information. Comparing analysis results between the original and the redefined probe sets reveals approximately 30-50% discrepancy in the genes previously identified as differentially expressed, regardless of analysis method. Our results demonstrate that the original Affymetrix probe set definitions are inaccurate, and many conclusions derived from past GeneChip analyses may be significantly flawed. It will be beneficial to re-analyze existing GeneChip data with updated probe set definitions.
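The two operations the abstract describes, regrouping probes by current annotation and comparing the resulting gene lists, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the input format, the function names `regroup_probes` and `discrepancy`, the `min_probes` threshold, and the particular discrepancy measure (the fraction of originally reported genes that the redefined analysis does not recover) are all assumptions made for illustration.

```python
from collections import defaultdict


def regroup_probes(probe_to_genes, min_probes=3):
    """Group probes into gene-specific probe sets.

    probe_to_genes is a hypothetical input: a dict mapping each probe id
    to the set of gene ids it matches under the current annotation.
    Probes matching no gene or more than one gene are dropped, and any
    gene left with fewer than min_probes probes is discarded.
    The min_probes default is an assumption, not a value from the abstract.
    """
    probe_sets = defaultdict(list)
    for probe, genes in probe_to_genes.items():
        if len(genes) == 1:  # keep only probes specific to a single gene
            (gene,) = genes
            probe_sets[gene].append(probe)
    return {
        gene: sorted(probes)
        for gene, probes in probe_sets.items()
        if len(probes) >= min_probes
    }


def discrepancy(original_hits, redefined_hits):
    """Fraction of genes in the original list that the redefined list lacks.

    This is one plausible way to quantify disagreement between two lists
    of differentially expressed genes; the abstract does not define the
    measure it uses.
    """
    original = set(original_hits)
    if not original:
        return 0.0
    return len(original - set(redefined_hits)) / len(original)


if __name__ == "__main__":
    mapping = {
        "p1": {"A"},
        "p2": {"A"},
        "p3": {"A", "B"},  # cross-hybridizing probe, dropped
        "p4": {"B"},
        "p5": set(),       # matches nothing in current annotation, dropped
        "p6": {"A"},
    }
    print(regroup_probes(mapping))
    print(discrepancy(["A", "B", "C", "D"], ["A", "B", "E"]))
```

In this toy input, only gene A retains enough specific probes to form a probe set, and two of the four originally reported genes are missing from the redefined list.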
Similar articles
- The effect of GeneChip gene definitions on the microarray study of cancers. Lu X, Zhang X. Bioessays. 2006 Jul;28(7):739-46. doi: 10.1002/bies.20433. PMID: 16850407
- Alberts R, Terpstra P, Hardonk M, Bystrykh LV, de Haan G, Breitling R, Nap JP, Jansen RC. BMC Bioinformatics. 2007 Apr 20;8:132. doi: 10.1186/1471-2105-8-132. PMID: 17448222 Free PMC article.
- Yu H, Wang F, Tu K, Xie L, Li YY, Li YX. BMC Bioinformatics. 2007 Jun 11;8:194. doi: 10.1186/1471-2105-8-194. PMID: 17559689 Free PMC article.
- [Transcriptome analyses and transcriptome databases]. Suzuki Y, Yamashita R, Nakai K, Sugano S. Tanpakushitsu Kakusan Koso. 2004 Aug;49(11 Suppl):1859-65. PMID: 15377029 Review. Japanese. No abstract available.
- Normalization of microarray data: single-labeled and dual-labeled arrays. Do JH, Choi DK. Mol Cells. 2006 Dec 31;22(3):254-61. PMID: 17202852 Review.
Cited by
- User-friendly solutions for microarray quality control and pre-processing on ArrayAnalysis.org. Eijssen LM, Jaillard M, Adriaens ME, Gaj S, de Groot PJ, Müller M, Evelo CT. Nucleic Acids Res. 2013 Jul;41(Web Server issue):W71-6. doi: 10.1093/nar/gkt293. Epub 2013 Apr 24. PMID: 23620278 Free PMC article.
- Kerman IA, Bernard R, Bunney WE, Jones EG, Schatzberg AF, Myers RM, Barchas JD, Akil H, Watson SJ, Thompson RC. Front Neurosci. 2012 Oct 18;6:135. doi: 10.3389/fnins.2012.00135. eCollection 2012. PMID: 23087602 Free PMC article.
- Grootswagers P, Smeets E, Oteng AB, Groot L. Aging (Albany NY). 2021 Apr 2;13(7):9398-9418. doi: 10.18632/aging.202912. Epub 2021 Apr 2. PMID: 33799307 Free PMC article. Clinical Trial.
- Kirschner A, Thiede M, Blaeschke F, Richter GH, Gerke JS, Baldauf MC, Grünewald TG, Busch DH, Burdach S, Thiel U. Oncotarget. 2016 Aug 30;7(35):56584-56597. doi: 10.18632/oncotarget.10647. PMID: 27447745 Free PMC article.
- Dougherty JD, Fomchenko EI, Akuffo AA, Schmidt E, Helmy KY, Bazzoli E, Brennan CW, Holland EC, Milosevic A. Cancer Res. 2012 Sep 15;72(18):4856-68. doi: 10.1158/0008-5472.CAN-11-2632. Epub 2012 Aug 3. PMID: 22865458 Free PMC article.