|
Latest Paper:
PLoS One. 2012 ;7 (2):e30629
22359548
Jean-Baptiste Veyrieras,
Daniel J Gaffney,
Joseph K Pickrell,
Yoav Gilad,
Matthew Stephens,
Jonathan K Pritchard
Department of Human Genetics, The University of Chicago, Chicago, Illinois, United States of America.
Mapping of expression quantitative trait loci (eQTLs) is an important technique for studying how genetic variation affects gene regulation in natural populations. In a previous study using Illumina expression data from human lymphoblastoid cell lines, we reported that cis-eQTLs are especially enriched around transcription start sites (TSSs) and immediately upstream of transcription end sites (TESs). In this paper, we revisit the distribution of eQTLs using additional data from Affymetrix exon arrays and from RNA sequencing. We confirm that most eQTLs lie close to the target genes; that transcribed regions are generally enriched for eQTLs; that eQTLs are more abundant in exons than introns; and that the peak density of eQTLs occurs at the TSS. However, we find that the intriguing TES peak is greatly reduced or absent in the Affymetrix and RNA-seq data. Instead our data suggest that the TES peak observed in the Illumina data is mainly due to exon-specific QTLs that affect 3' untranslated regions, where most of the Illumina probes are positioned. Nonetheless, we do observe an overall enrichment of eQTLs in exons versus introns in all three data sets, consistent with an important role for exonic sequences in gene regulation.
Jacob F Degner,
Athma A Pai,
Roger Pique-Regi,
Jean-Baptiste Veyrieras,
Daniel J Gaffney,
Joseph K Pickrell,
Sherryl De Leon,
Katelyn Michelini,
Noah Lewellen,
Gregory E Crawford,
Matthew Stephens,
Yoav Gilad,
Jonathan K Pritchard
Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA.
The mapping of expression quantitative trait loci (eQTLs) has emerged as an important tool for linking genetic variation to changes in gene regulation. However, it remains difficult to identify the causal variants underlying eQTLs, and little is known about the regulatory mechanisms by which they act. Here we show that genetic variants that modify chromatin accessibility and transcription factor binding are a major mechanism through which genetic variation leads to gene expression differences among humans. We used DNase I sequencing to measure chromatin accessibility in 70 Yoruba lymphoblastoid cell lines, for which genome-wide genotypes and estimates of gene expression levels are also available. We obtained a total of 2.7 billion uniquely mapped DNase I-sequencing (DNase-seq) reads, which allowed us to produce genome-wide maps of chromatin accessibility for each individual. We identified 8,902 locations at which the DNase-seq read depth correlated significantly with genotype at a nearby single nucleotide polymorphism or insertion/deletion (false discovery rate = 10%). We call such variants 'DNase I sensitivity quantitative trait loci'(dsQTLs). We found that dsQTLs are strongly enriched within inferred transcription factor binding sites and are frequently associated with allele-specific changes in transcription factor binding. A substantial fraction (16%) of dsQTLs are also associated with variation in the expression levels of nearby genes (that is, these loci are also classified as eQTLs). Conversely, we estimate that as many as 55% of eQTL single nucleotide polymorphisms are also dsQTLs. Our observations indicate that dsQTLs are highly abundant in the human genome and are likely to be important contributors to phenotypic variation.
Genome Biol. 2012 Jan 31;13 (1):R7
22293038
Daniel J Gaffney,
Jean-Baptiste Veyrieras,
Jacob F Degner,
Roger Pique-Regi,
Athma A Pai,
Gregory E Crawford,
Matthew Stephens,
Yoav Gilad,
Jonathan K Pritchard
Department of Human Genetics, University of Chicago, 920 E58th Street, Chicago, IL 60637, USA. dg13@sanger.ac.uk.
ABSTRACT: BACKGROUND: Expression quantitative trait loci (eQTLs) are likely to play an important role in the genetics of complex traits; however, their functional basis remains poorly understood. Using the HapMap lymphoblastoid cell lines, we combine 1000 Genomes genotypes and an extensive catalogue of human functional elements to investigate the biological mechanisms that eQTLs perturb. RESULTS: We use a Bayesian hierarchical model to estimate the enrichment of eQTLs in a wide variety of regulatory annotations. We find that approximately 40% of eQTLs occur in open chromatin, and that they are particularly enriched in transcription factor binding sites, suggesting that many directly impact protein-DNA interactions. Analysis of core promoter regions shows that eQTLs also frequently disrupt some known core promoter motifs but, surprisingly, are not enriched in other well-known motifs such as the TATA box. We also show that information from regulatory annotations alone, when weighted by the hierarchical model, can provide a meaningful ranking of the SNPs that are most likely to drive gene expression variation. CONCLUSIONS: Our study demonstrates how regulatory annotation and the association signal derived from eQTL-mapping can be combined into a single framework. We used this approach to further our understanding of the biology that drives human gene expression variation, and of the putatively causal SNPs that underlie it.
Michel Rousset,
Isabelle Bonnin,
Carine Remoué,
Matthieu Falque,
Bénédicte Rhoné,
Jean-Baptiste Veyrieras,
Delphine Madur,
Alain Murigneux,
François Balfourier,
Jacques Le Gouis,
Sylvain Santoni,
Isabelle Goldringer
UMR de Génétique Végétale, INRA/CNRS/AgroParisTech/Univ. Paris XI, Ferme du Moulon, 91190 Gif sur Yvette, France. mrousset@moulon.inra.fr
Earliness is very important for the adaptation of wheat to environmental conditions and the achievement of high grain yield. A detailed knowledge of key genetic components of the life cycle would enable an easier control by the breeders. The objective of the study was to investigate the effect of candidate genes on flowering time. Using a collection of hexaploid wheat composed of 235 lines from diverse geographical origins, we conducted an association study for six candidate genes for flowering time and its components (vernalization sensitivity and earliness per se). The effect on the variation of earliness components of polymorphisms within the copies of each gene was tested in ANOVA models accounting for the underlying genetic structure. The collection was structured in five groups that minimized the residual covariance. Vernalization requirement and lateness tend to increase according to the mean latitude of each group. Heading date for an autumnal sowing was mainly determined by the earliness per se. Except for the Constans (CO) gene orthologous of the barley HvCO3, all gene polymorphisms had a significant impact on earliness components. The three traits used to quantify vernalization requirement were primarily associated with polymorphisms at Vrn-1 and then at Vrn-3 and Luminidependens (LD) genes. We found a good correspondence between spring/winter types and genotypes at the three homeologous copies of Vrn-1. Earliness per se was mainly explained by polymorphisms at Vrn-3 and to a lesser extent at Vrn-1, Hd-1 and Gigantea (GI) genes. Vernalization requirement and earliness as a function of geographical origin, as well as the possible role of the breeding practices in the geographical distribution of the alleles and the hypothetical adaptive value of the candidate genes, are discussed.
BMC Plant Biol. 2011 ;11 :16
21247437
Institut National de la Recherche Agronomique, UR 1052 Génétique et Amélioration des Fruits et Légumes, BP94, 84140 Montfavet, France.
BACKGROUND Integrating QTL results from independent experiments performed on related species helps to survey the genetic diversity of loci/alleles underlying complex traits, and to highlight potential targets for breeding or QTL cloning. Potato (Solanum tuberosum L.) late blight resistance has been thoroughly studied, generating mapping data for many Rpi-genes (R-genes to Phytophthora infestans) and QTLs (quantitative trait loci). Moreover, late blight resistance was often associated with plant maturity. To get insight into the genomic organization of late blight resistance loci as compared to maturity QTLs, a QTL meta-analysis was performed for both traits. RESULTS Nineteen QTL publications for late blight resistance were considered, seven of them reported maturity QTLs. Twenty-one QTL maps and eight reference maps were compiled to construct a 2,141-marker consensus map on which QTLs were projected and clustered into meta-QTLs. The whole-genome QTL meta-analysis reduced by six-fold late blight resistance QTLs (by clustering 144 QTLs into 24 meta-QTLs), by ca. five-fold maturity QTLs (by clustering 42 QTLs into eight meta-QTLs), and by ca. two-fold QTL confidence interval mean. Late blight resistance meta-QTLs were observed on every chromosome and maturity meta-QTLs on only six chromosomes. CONCLUSIONS Meta-analysis helped to refine the genomic regions of interest frequently described, and provided the closest flanking markers. Meta-QTLs of late blight resistance and maturity juxtaposed along chromosomes IV, V and VIII, and overlapped on chromosomes VI and XI. The distribution of late blight resistance meta-QTLs is significantly independent from those of Rpi-genes, resistance gene analogs and defence-related loci. The anchorage of meta-QTLs to the potato genome sequence, recently publicly released, will especially improve the candidate gene selection to determine the genes underlying meta-QTLs. All mapping data are available from the Sol Genomics Network (SGN) database.
PLoS Genet. 2010 ;6 (4):e1000913
20421933
Cit:1
Muniyandi Nagarajan,
Jean-Baptiste Veyrieras,
Maud de Dieuleveult,
Hélène Bottin,
Steffen Fehrmann,
Anne-Laure Abraham,
Séverine Croze,
Lars M Steinmetz,
Xavier Gidrol,
Gaël Yvert
Laboratoire de Biologie Moléculaire de la Cellule, Ecole Normale Supérieure de Lyon, CNRS, Université de Lyon, Lyon, France.
Epigenomes commonly refer to the sequence of presence/absence of specific epigenetic marks along eukaryotic chromatin. Complete histone-borne epigenomes have now been described at single-nucleosome resolution from various organisms, tissues, developmental stages, or diseases, yet their intra-species natural variation has never been investigated. We describe here that the epigenomic sequence of histone H3 acetylation at Lysine 14 (H3K14ac) differs greatly between two unrelated strains of the yeast Saccharomyces cerevisiae. Using single-nucleosome chromatin immunoprecipitation and mapping, we interrogated 58,694 nucleosomes and found that 5,442 of them differed in their level of H3K14 acetylation, at a false discovery rate (FDR) of 0.0001. These Single Nucleosome Epi-Polymorphisms (SNEPs) were enriched at regulatory sites and conserved non-coding DNA sequences. Surprisingly, higher acetylation in one strain did not imply higher expression of the relevant gene. However, SNEPs were enriched in genes of high transcriptional variability and one SNEP was associated with the strength of gene activation upon stimulation. Our observations suggest a high level of inter-individual epigenomic variation in natural populations, with essential questions on the origin of this diversity and its relevance to gene x environment interactions.
Nature. 2010 Mar 10;:
20220758
Cit:60
Joseph K Pickrell,
John C Marioni,
Athma A Pai,
Jacob F Degner,
Barbara E Engelhardt,
Everlyne Nkadori,
Jean-Baptiste Veyrieras,
Matthew Stephens,
Yoav Gilad,
Jonathan K Pritchard
Department of Human Genetics.
Understanding the genetic mechanisms underlying natural variation in gene expression is a central goal of both medical and evolutionary genetics, and studies of expression quantitative trait loci (eQTLs) have become an important tool for achieving this goal. Although all eQTL studies so far have assayed messenger RNA levels using expression microarrays, recent advances in RNA sequencing enable the analysis of transcript variation at unprecedented resolution. We sequenced RNA from 69 lymphoblastoid cell lines derived from unrelated Nigerian individuals that have been extensively genotyped by the International HapMap Project. By pooling data from all individuals, we generated a map of the transcriptional landscape of these cells, identifying extensive use of unannotated untranslated regions and more than 100 new putative protein-coding exons. Using the genotypes from the HapMap project, we identified more than a thousand genes at which genetic variation influences overall expression levels or splicing. We demonstrate that eQTLs near genes generally act by a mechanism involving allele-specific expression, and that variation that influences the inclusion of an exon is enriched within and near the consensus splice sites. Our results illustrate the power of high-throughput sequencing for the joint analysis of variation in transcription, splicing and allele-specific expression across individuals.
Mol Biol Evol. 2008 Dec 17;:
19091723
Cit:11
Sridhar Kudaravalli,
Jean-Baptiste Veyrieras,
Barbara E Stranger,
Emmanouil T Dermitzakis,
Jonathan K Pritchard
Dept of Human Genetics, The University of Chicago, 920 E 58th St CLSC 507, Chicago IL 60637, USA.
Changes in gene expression may represent an important mode of human adaptation. However, to date, there are relatively few known examples in which selection has been shown to act directly on levels or patterns of gene expression. In order to test whether SNPs that affect gene expression in cis are frequently targets of positive natural selection in humans, we analyzed genome-wide SNP and expression data from cell lines associated with the International HapMap Project. Using a haplotype-based test for selection that was designed to detect incomplete selective sweeps, we found that SNPs showing signals of selection are more likely than random SNPs to be associated with gene expression levels in cis. This signal is significant in the Yoruba (which is the population that shows the strongest signals of selection overall), and shows a trend in the same direction in the other HapMap populations. Our results argue that selection on gene expression levels is an important type of human adaptation. Finally, our work provides an analytical framework for tackling a more general problem that will become increasingly important: namely, testing whether selection signals overlap significantly with SNPs that are associated with phenotypes of interest.
Jean-Baptiste Veyrieras,
Sridhar Kudaravalli,
Su Yeon Kim,
Emmanouil T Dermitzakis,
Yoav Gilad,
Matthew Stephens,
Jonathan K Pritchard
Department of Human Genetics, The University of Chicago, Chicago, IL, USA. jb.veyrieras@gmail.com
Recent studies of the HapMap lymphoblastoid cell lines have identified large numbers of quantitative trait loci for gene expression (eQTLs). Reanalyzing these data using a novel Bayesian hierarchical model, we were able to create a surprisingly high-resolution map of the typical locations of sites that affect mRNA levels in cis. Strikingly, we found a strong enrichment of eQTLs in the 250 bp just upstream of the transcription end site (TES), in addition to an enrichment around the transcription start site (TSS). Most eQTLs lie either within genes or close to genes; for example, we estimate that only 5% of eQTLs lie more than 20 kb upstream of the TSS. After controlling for position effects, SNPs in exons are approximately 2-fold more likely than SNPs in introns to be eQTLs. Our results suggest an important role for mRNA stability in determining steady-state mRNA levels, and highlight the potential of eQTL mapping as a high-resolution tool for studying the determinants of gene regulation.
Theor Appl Genet. 2008 Oct 2;:
18830577
Cit:9
Silke Stracke,
Grit Haseneyer,
Jean-Baptiste Veyrieras,
Hartwig Geiger,
Sascha Sauer,
Andreas Graner,
Hans-Peter Piepho
Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), 06466, Gatersleben, Germany, sstrack@gwdg.de.
The interaction between members of a gene network has an important impact on the variation of quantitative traits, and can influence the outcome of phenotype/genotype association studies. Three genes (Ppd-H1, HvCO1, HvFT1) known to play an essential role in the regulation of flowering time under long days in barley were subjected to an analysis of nucleotide diversity in a collection of 220 spring barley accessions. The coding region of Ppd-H1 was highly diverse, while both HvCO1 and HvFT1 showed a rather limited level of diversity. Within all three genes, the extent of linkage disequilibrium was variable, but on average only moderate. Ppd-H1 is strongly associated with flowering time across four environments, showing a difference of five to ten days between the most extreme haplotypes. The association between flowering time and the variation at HvFT1 and HvCO1 was strongly dependent on the haplotype present at Ppd-H1. The interaction between HvCO1 and Ppd-H1 was statistically significant, but this association disappeared when the analysis was corrected for the geographical origin of the accessions. No association existed between flowering time and allelic variation at HvFT1. In contrast to Ppd-H1, functional variation at both HvCO1 and HvFT1 is limited in cultivated barley.
|
Polish News | |||||||||||||||
|
|||||||||||||||||
|
|