pubmed.ncbi.nlm.nih.gov

Genetic architecture of transcript-level variation in humans - PubMed

Genetic architecture of transcript-level variation in humans

Shiwei Duan et al. Am J Hum Genet. 2008 May.

Abstract

We report here the results of testing the pairwise association of 12,747 transcriptional gene-expression values with more than two million single-nucleotide polymorphisms (SNPs) in samples of European (CEPH from Utah; CEU) and African (Yoruba from Ibadan; YRI) ancestry. We found 4,677 and 5,125 significant associations between expression quantitative nucleotides (eQTNs) and transcript clusters in the CEU and the YRI samples, respectively. The physical distance between an eQTN and its associated transcript cluster was referred to as the intrapair distance. An association with 4 Mb or less intrapair distance was defined as local; otherwise, it was defined as distant. The enrichment analysis of functional categories shows that genes harboring the local eQTNs are enriched in the categories related to nucleosome and chromatin assembly; the genes harboring the distant eQTNs are enriched in the categories related to transmembrane signal transduction, suggesting that these biological pathways are likely to play a significant role in regulation of gene expression. We highlight in the EPHX1 gene a deleterious nonsynonymous SNP that is distantly associated with gene expression of ORMDL3, a susceptibility gene for asthma.

PubMed Disclaimer

Figures

Figure 1
Figure 1

Features of Significant Association in the CEU and the YRI LCLs (A) Histogram of average expression levels of 16,997 TCs in human LCLs (176 individuals). The horizontal blue box-plot provides the mean and 95% confidence intervals of the expression levels of 16,997 TCs. The gray vertical bars represent TCs whose mean log2-transformed expression values in 176 LCLs are less than or equal to 5.405, the 25th quantile of mean TC expression values; the dark blue vertical bars represent the 12,747 TCs selected for the genome-wide association (GWA) study. (B) Relationship between raw p value and false-discovery rate (FDR) of the GWA studies in the CEU and the YRI samples. The FDR values are estimated with Benjamini-Hochberg correction. There are two dashed vertical lines denoting p value of 3 × 10−6 (p value of 0.05 corrected by 12,747 tested TCs) and 2 × 10−8 (p value of 0.05 corrected by the approximately two million SNPs), respectively. p value of 2 × 10−8 corresponds to an FDR < 0.1 in the GWA studies of both CEU and YRI samples. (C and D) The genomic distribution of 4,677 TC-eQTN associations in the CEU (C), and the genomic distribution of 5,125 TC-eQTN associations in the YRI (D). The blue crosses are the distant TC-eQTN associations; the red crosses along the diagonal lines are the local TC-eQTN associations. (E and F) The intrapair distance distribution of 2,102 CEU (red) and 761 YRI (blue) same-chromosome TC-eQTN pairs (E). The intrapair distance distribution of 118 CEU (red) and 40 YRI (blue) same-chromosome TC-eQTN pairs having eQTNs in the conserved region (F). IPD on the x axis refers to intrapair distance.

Figure 2
Figure 2

The genomic Distribution of eQTNs and TCs in the CEU and the YRI Populations The tan and black boxes represent the chromosomes in the CEU (left) and the YRI (right) samples. The green tick marks denote the location of distant TCs. The yellow tick marks denote the location of local TCs. The red tick marks above the chromosome boxes denote the location of the distant eQTNs, and the red tick marks below the chromosome boxes denote the position of the local eQTNs.

Figure 3
Figure 3

The Distantly Associated eQTN_hotspots (A) The genomic distribution of eQTN_hotspots. The x axis represents location on the chromosome of distant eQTN_hotspots, and the y axis is the number of TCs associated with the eQTNs in each bin (bin size equals to 500 Kb). The dashed lines show the maximum number of six associated TCs that would be expected to fall into any one bin by chance with a probability of 0.001 (binomial test, corrected by the bin numbers). Bins with bar heights at or above this line represent eQTN_hotspots. There are 14 and 38 hotspots in CEU and YRI, respectively (see Table S4). Hotspots were numbered sequentially according to their locations in the genome from chromosome 1 to 22. The stars denote two distant eQTN_hotspots in the YRI associated with gene sets enriched in functional-annotation categories (details in Table 2). (B and C) Heat maps of the expression of TCs associated with the eQTN hotspots of YRI_Hs_30 and YRI_Hs_36. The rows are TCs, and columns are the YRI samples. The green and red color in the heat map means expression below and above the average level, respectively. (D) Heat map of the expression patterns of histone genes and their related genes in the CEU and the YRI samples. The rows are TCs, and columns are 176 samples. The green and red colors in the heat map indicate expression below and above the average level, respectively. (E) The expression levels of histones and their related genes in the CEU (red) and YRI (blue) samples. The shading indicates an association between expression level with one or more eQTNs in CEU only (pink), in YRI only (blue), or in both (green).

Figure 4
Figure 4

ORMDL3 Gene Expression Is Associated with SNP rs1051740 in the CEU (A) The whole-genome association of ORMDL3 expression level in the CEU population. (B) SNP rs1051740 and SNP rs7216389 have an additive effect on the expression of ORMDL3. The red allele denotes the allele corresponding to higher gene expression level of ORMDL3, which is implicated with higher risk of asthma susceptibility in the study of Moffatt et al. (C) The location of EPHX1_H113Y (rs1051740) in the conserved protein domain of EPHX1 (epoxide hydrolase N terminus, CDD:69934).

Similar articles

Cited by

References

    1. Huang R.S., Duan S., Bleibel W.K., Kistner E.O., Zhang W., Clark T.A., Chen T.X., Schweitzer A.C., Blume J.E., Cox N.J., Dolan M.E. A genome-wide approach to identify genetic variants that contribute to etoposide-induced cytotoxicity. Proceedings of the National Academy of Sciences of the United States of America. 2007;104:9758–9763. - PMC - PubMed
    1. Huang R.S., Duan S., Shukla S.J., Kistner E.O., Clark T.A., Chen T.X., Schweitzer A.C., Blume J.E., Dolan M.E. Identification of genetic variants contributing to Cisplatin-induced cytotoxicity by use of a genomewide approach. Am. J. Hum. Genet. 2007;81:427–437. - PMC - PubMed
    1. Rockman M.V., Kruglyak L. Genetics of global gene expression. Nat. Rev. Genet. 2006;7:862–872. - PubMed
    1. Brem R.B., Yvert G., Clinton R., Kruglyak L. Genetic dissection of transcriptional regulation in budding yeast. Science. 2002;296:752–755. - PubMed
    1. Jansen R.C., Nap J.P. Genetical genomics: The added value from segregation. Trends Genet. 2001;17:388–391. - PubMed

Publication types

MeSH terms

Substances