Polymorphism, Single Nucleotide :: genetics
Latest Paper:
Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA. reich@genetics.med.harvard.edu
India has been underrepresented in genome-wide surveys of human variation. We analyse 25 diverse groups in India to provide strong evidence for two ancient populations, genetically divergent, that are ancestral to most Indians today. One, the 'Ancestral North Indians'(ANI), is genetically close to Middle Easterners, Central Asians, and Europeans, whereas the other, the 'Ancestral South Indians'(ASI), is as distinct from ANI and East Asians as they are from each other. By introducing methods that can estimate ancestry without accurate ancestral populations, we show that ANI ancestry ranges from 39-71% in most Indian groups, and is higher in traditionally upper caste and Indo-European speakers. Groups with only ASI ancestry may no longer exist in mainland India. However, the indigenous Andaman Islanders are unique in being ASI-related groups without ANI ancestry. Allele frequency differences between groups in India are larger than in Europe, reflecting strong founder effects whose signatures have been maintained for thousands of years owing to endogamy. We therefore predict that there will be an excess of recessive diseases in India, which should be possible to screen and map genetically.
Mesh-terms: Asia :: ethnology; Chromosomes, Human, Y :: genetics; Continental Population Groups :: genetics; DNA, Mitochondrial :: genetics; Ethnic Groups :: genetics; Europe :: ethnology; Female; Founder Effect; Gene Frequency; Genes, Recessive :: genetics; Genetic Variation :: genetics; Genetics, Medical; Genetics, Population; Genome, Human :: genetics; Genomics; Genotype; Geography; Humans; India; Language; Linkage Disequilibrium :: genetics; Male; Middle East :: ethnology; Phylogeny; Polymorphism, Single Nucleotide :: genetics; Principal Component Analysis;
Most cited papers:
Department of Statistics, University of Oxford, United Kingdom. pritch@stats.ox.ac.uk
The use, in association studies, of the forthcoming dense genomewide collection of single-nucleotide polymorphisms (SNPs) has been heralded as a potential breakthrough in the study of the genetic basis of common complex disorders. A serious problem with association mapping is that population structure can lead to spurious associations between a candidate marker and a phenotype. One common solution has been to abandon case-control studies in favor of family-based tests of association, such as the transmission/disequilibrium test (TDT), but this comes at a considerable cost in the need to collect DNA from close relatives of affected individuals. In this article we describe a novel, statistically valid, method for case-control association studies in structured populations. Our method uses a set of unlinked genetic markers to infer details of population structure, and to estimate the ancestry of sampled individuals, before using this information to test for associations within subpopulations. It provides power comparable with the TDT in many settings and may substantially outperform it if there are conflicting associations in different subpopulations.
Mesh-terms: Alleles; Case-Control Studies; Chromosome Mapping :: methods; Chromosome Mapping :: statistics & numerical data; Computer Simulation; Female; Genetic Diseases, Inborn :: genetics; Genetic Markers :: genetics; Genetics, Population; Human; Linkage Disequilibrium :: genetics; Male; Models, Genetic; Nuclear Family; Pedigree; Phenotype; Polymorphism, Single Nucleotide :: genetics; Reproducibility of Results; Sensitivity and Specificity; Statistical Distributions; Support, Non-U.S. Gov't; Support, U.S. Gov't, Non-P.H.S. ;
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA. sherry@ncbi.nlm.nih.gov
In response to a need for a general catalog of genome variation to address the large-scale sampling designs required by association studies, gene mapping and evolutionary biology, the National Center for Biotechnology Information (NCBI) has established the dbSNP database [S.T.Sherry, M.Ward and K.Sirotkin (1999) Genome Res., 9, 677-679]. Submissions to dbSNP will be integrated with other sources of information at NCBI such as GenBank, PubMed, LocusLink and the Human Genome Project data. The complete contents of dbSNP are available to the public at website: http://www.ncbi.nlm.nih.gov/SNP. The complete contents of dbSNP can also be downloaded in multiple formats via anonymous FTP at ftp://ncbi.nlm.nih.gov/snp/.
Mesh-terms: Animals; Biotechnology; Databases, Factual; Humans; Information Services; Internet; National Institutes of Health (U.S.) ; National Library of Medicine (U.S.) ; Polymorphism, Single Nucleotide :: genetics; Research Support, U.S. Gov't, Non-P.H.S. ; Research Support, U.S. Gov't, P.H.S. ; United States; Variation (Genetics) ;
Department of Chemistry and Biochemistry, University of California, 611 Charles E. Young Drive East, Los Angeles, CA 90095-1570, USA.
We have identified 6201 alternative splice relationships in human genes, through a genome-wide analysis of expressed sequence tags (ESTs). Starting with approximately 2.1 million human mRNA and EST sequences, we mapped expressed sequences onto the draft human genome sequence and only accepted splices that obeyed the standard splice site consensus. A large fraction (47%) of these were observed multiple times, indicating that they comprise a substantial fraction of the mRNA species. The vast majority of the detected alternative forms appear to be novel, and produce highly specific, biologically meaningful control of function in both known and novel human genes, e.g. specific removal of the lysosomal targeting signal from HLA-DM beta chain, replacement of the C-terminal transmembrane domain and cytoplasmic tail in an FC receptor beta chain homolog with a different transmembrane domain and cytoplasmic tail, likely modulating its signal transduction activity. Our data indicate that a large proportion of human genes, probably 42% or more, are alternatively spliced, but that this appears to be observed mainly in certain types of molecules (e.g. cell surface receptors) and systemic functions, particularly the immune system and nervous system. These results provide a comprehensive dataset for understanding the role of alternative splicing in the human genome, accessible at http://www.bioinformatics.ucla.edu/HASDB.
Mesh-terms: Alternative Splicing :: genetics; Base Sequence; Computational Biology; Consensus Sequence :: genetics; Conserved Sequence :: genetics; Databases; Exons :: genetics; Expressed Sequence Tags; Genes :: genetics; Genome, Human; Genomics; HLA-D Antigens :: chemistry; HLA-D Antigens :: genetics; Humans; Internet; Introns :: genetics; Multigene Family :: genetics; Polymorphism, Single Nucleotide :: genetics; Protein-Serine-Threonine Kinases :: genetics; RNA Splice Sites :: genetics; RNA, Messenger :: analysis; RNA, Messenger :: genetics; Reproducibility of Results; Research Support, Non-U.S. Gov't; Research Support, U.S. Gov't, Non-P.H.S. ; Sequence Alignment;
Department of Genetics, North Carolina State University, Raleigh, Box 7614, North Carolina 27695, USA. trudy_mackay@ncsu.edu
Phenotypic variation for quantitative traits results from the segregation of alleles at multiple quantitative trait loci (QTL) with effects that are sensitive to the genetic, sexual, and external environments. Major challenges for biology in the post-genome era are to map the molecular polymorphisms responsible for variation in medically, agriculturally, and evolutionarily important complex traits; and to determine their gene frequencies and their homozygous, heterozygous, epistatic, and pleiotropic effects in multiple environments. The ease with which QTL can be mapped to genomic intervals bounded by molecular markers belies the difficulty in matching the QTL to a genetic locus. The latter requires high-resolution recombination or linkage disequilibrium mapping to nominate putative candidate genes, followed by genetic and/or functional complementation and gene expression analyses. Complete genome sequences and improved technologies for polymorphism detection will greatly advance the genetic dissection of quantitative traits in model organisms, which will open avenues for exploration of homologous QTL in related taxa.
Department of Ecology and Evolutionary Biology, University of Arizona, USA. acutter@email.arizona.edu
Natural selection can produce a correlation between local recombination rates and levels of neutral DNA polymorphism as a consequence of genetic hitchhiking and background selection. Theory suggests that selection at linked sites should affect patterns of neutral variation in partially selfing populations more dramatically than in outcrossing populations. However, empirical investigations of selection at linked sites have focused primarily on outcrossing species. To assess the potential role of selection as a determinant of neutral polymorphism in the context of partial self-fertilization, we conducted a multivariate analysis of single-nucleotide polymorphism (SNP) density throughout the genome of the nematode Caenorhabditis elegans. We based the analysis on a published SNP data set and partitioned the genome into windows to calculate SNP densities, recombination rates, and gene densities across all six chromosomes. Our analyses identify a strong, positive correlation between recombination rate and neutral polymorphism (as estimated by noncoding SNP density) across the genome of C. elegans. Furthermore, we find that levels of neutral polymorphism are lower in gene-dense regions than in gene-poor regions in some analyses. Analyses incorporating local estimates of divergence between C. elegans and C. briggsae indicate that a mutational explanation alone is unlikely to explain the observed patterns. Consequently, we interpret these findings as evidence that natural selection shapes genome-wide patterns of neutral polymorphism in C. elegans. Our study provides the first demonstration of such an effect in a partially selfing animal. Explicit models of genetic hitchhiking and background selection can each adequately describe the relationship between recombination rate and SNP density, but only when they incorporate selfing rate. Clarification of the relative roles of genetic hitchhiking and background selection in C. elegans awaits the development of specific theoretical predictions that account for partial self-fertilization and biased sex ratios.
Mesh-terms: Animals; Caenorhabditis elegans :: genetics; Comparative Study; Evolution, Molecular; Inbreeding; Linkage (Genetics):: genetics; Models, Genetic; Polymorphism, Single Nucleotide :: genetics; Recombination, Genetic :: genetics; Regression Analysis; Selection (Genetics) ; Support, Non-U.S. Gov't; Support, U.S. Gov't, Non-P.H.S. ;
Sagiv Shifman,
Michal Bronstein,
Meira Sternfeld,
Anne Pisanté-Shalom,
Efrat Lev-Lehman,
Avraham Weizman,
Ilya Reznik,
Baruch Spivak,
Nimrod Grisaru,
Leon Karp,
Richard Schiffer,
Moshe Kotler,
Rael D Strous,
Marnina Swartz-Vanetik,
Haim Y Knobler,
Eilat Shinar,
Jacques S Beckmann,
Benjamin Yakir,
Neil Risch,
Naomi B Zak,
Ariel Darvasi
Institute of Life Sciences, The Hebrew University of Jerusalem, Israel.
Several lines of evidence have placed the catechol-O-methyltransferase (COMT) gene in the limelight as a candidate gene for schizophrenia. One of these is its biochemical function in metabolism of catecholamine neurotransmitters; another is the microdeletion, on chromosome 22q11, that includes the COMT gene and causes velocardiofacial syndrome, a syndrome associated with a high rate of psychosis, particularly schizophrenia. The interest in the COMT gene as a candidate risk factor for schizophrenia has led to numerous linkage and association analyses. These, however, have failed to produce any conclusive result. Here we report an efficient approach to gene discovery. The approach consists of (i) a large sample size-to our knowledge, the present study is the largest case-control study performed to date in schizophrenia; (ii) the use of Ashkenazi Jews, a well defined homogeneous population; and (iii) a stepwise procedure in which several single nucleotide polymorphisms (SNPs) are scanned in DNA pools, followed by individual genotyping and haplotype analysis of the relevant SNPs. We found a highly significant association between schizophrenia and a COMT haplotype (P=9.5x10-8). The approach presented can be widely implemented for the genetic dissection of other common diseases.
Mesh-terms: Case-Control Studies; Catechol O-Methyltransferase :: genetics; Female; Founder Effect; Gene Frequency; Genetic Predisposition to Disease; Haplotypes :: genetics; Human; Humans; Jews :: genetics; Linkage Disequilibrium; Male; Polymorphism, Single Nucleotide :: genetics; Research Design; Sample Size; Schizophrenia :: genetics; Sex Characteristics;
Roche Molecular Systems, Alameda, California 94501 USA. Soren.Germer@Roche.com
We have developed an accurate, yet inexpensive and high-throughput, method for determining the allele frequency of biallelic polymorphisms in pools of DNA samples. The assay combines kinetic (real-time quantitative) PCR with allele-specific amplification and requires no post-PCR processing. The relative amounts of each allele in a sample are quantified. This is performed by dividing equal aliquots of the pooled DNA between two separate PCR reactions, each of which contains a primer pair specific to one or the other allelic SNP variant. For pools with equal amounts of the two alleles, the two amplifications should reach a detectable level of fluorescence at the same cycle number. For pools that contain unequal ratios of the two alleles, the difference in cycle number between the two amplification reactions can be used to calculate the relative allele amounts. We demonstrate the accuracy and reliability of the assay on samples with known predetermined SNP allele frequencies from 5% to 95%, including pools of both human and mouse DNAs using eight different SNPs altogether. The accuracy of measuring known allele frequencies is very high, with the strength of correlation between measured and known frequencies having an r(2)= .997. The loss of sensitivity as a result of measurement error is typically minimal, compared with that due to sampling error alone, for population samples up to 1000. We believe that by providing a means for SNP genotyping up to thousands of samples simultaneously, inexpensively, and reproducibly, this method is a powerful strategy for detecting meaningful polymorphic differences in candidate gene association studies and genome-wide linkage disequilibrium scans.
Mesh-terms: Animals; DNA :: analysis; Gene Frequency :: genetics; Human; Mathematical Computing; Mice; Mice, Inbred A; Mice, Inbred BALB C; Mice, Inbred C3H; Mice, Inbred C57BL; Mice, Inbred DBA; Mice, Inbred MRL lpr; Mice, Inbred NZB; Polymerase Chain Reaction :: methods; Polymorphism, Single Nucleotide :: genetics; Templates, Genetic;
Hreinn Stefansson,
Jane Sarginson,
Augustine Kong,
Phil Yates,
Valgerdur Steinthorsdottir,
Einar Gudfinnsson,
Steinunn Gunnarsdottir,
Nicholas Walker,
Hannes Petursson,
Caroline Crombie,
Andres Ingason,
Jeffrey R Gulcher,
Kari Stefansson,
David St Clair
deCODE Genetics, Reykjavík, Iceland. hreinn@decode.is
Recently, we identified neuregulin 1 (NRG1) as a susceptibility gene for schizophrenia in the Icelandic population, by a combined linkage and association approach. Here, we report the first study evaluating the relevance of NRG1 to schizophrenia in a population outside Iceland. Markers representing a core at-risk haplotype found in Icelanders at the 5' end of the NRG1 gene were genotyped in 609 unrelated Scottish patients and 618 unrelated Scottish control individuals. This haplotype consisted of five SNP markers and two microsatellites, which all appear to be in strong linkage disequilibrium. For the Scottish patients and control subjects, haplotype frequencies were estimated by maximum likelihood, using the expectation-maximization algorithm. The frequency of the seven-marker haplotype among the Scottish patients was significantly greater than that among the control subjects (10.2% vs. 5.9%, P=.00031). The estimated risk ratio was 1.8, which is in keeping with our report of unrelated Icelandic patients (2.1). Three of the seven markers in the haplotype gave single-point P values ranging from .000064 to .0021 for the allele contributing to the at-risk haplotype. This direct replication of haplotype association in a second population further implicates NRG1 as a factor that contributes to the etiology of schizophrenia.
Mesh-terms: Algorithms; Gene Frequency :: genetics; Genetic Markers :: genetics; Genetic Predisposition to Disease :: genetics; Haplotypes :: genetics; Humans; Linkage Disequilibrium; Microsatellite Repeats :: genetics; Molecular Sequence Data; Neuregulin-1 :: genetics; Polymorphism, Single Nucleotide :: genetics; Reproducibility of Results; Research Support, Non-U.S. Gov't; Schizophrenia :: genetics; Scotland;
Case-control studies are subject to the problem of population stratification, which can occur in ethnically mixed populations and can lead to significant associations being detected at loci that have nothing to do with disease. Here, we describe a way to measure and correct for stratification by genotyping a moderate number of unlinked genetic markers in the same set of cases and controls in which a candidate association was found. The average of association statistics across the markers directly measures stratification. By dividing the candidate association statistic by this average, a P-value can be obtained that corrects for stratification.
Mark T Ross,
Darren V Grafham,
Alison J Coffey,
Steven Scherer,
Kirsten McLay,
Donna Muzny,
Matthias Platzer,
Gareth R Howell,
Christine Burrows,
Christine P Bird,
Adam Frankish,
Frances L Lovell,
Kevin L Howe,
Jennifer L Ashurst,
Robert S Fulton,
Ralf Sudbrak,
Gaiping Wen,
Matthew C Jones,
Matthew E Hurles,
T Daniel Andrews,
Carol E Scott,
Stephen Searle,
Juliane Ramser,
Adam Whittaker,
Rebecca Deadman,
Nigel P Carter,
Sarah E Hunt,
Rui Chen,
Andrew Cree,
Preethi Gunaratne,
Paul Havlak,
Anne Hodgson,
Michael L Metzker,
Stephen Richards,
Graham Scott,
David Steffen,
Erica Sodergren,
David A Wheeler,
Kim C Worley,
Rachael Ainscough,
Kerrie D Ambrose,
M Ali Ansari-Lari,
Swaroop Aradhya,
Robert I S Ashwell,
Anne K Babbage,
Claire L Bagguley,
Andrea Ballabio,
Ruby Banerjee,
Gary E Barker,
Karen F Barlow,
Ian P Barrett,
Karen N Bates,
David M Beare,
Helen Beasley,
Oliver Beasley,
Alfred Beck,
Graeme Bethel,
Karin Blechschmidt,
Nicola Brady,
Sarah Bray-Allen,
Anne M Bridgeman,
Andrew J Brown,
Mary J Brown,
David Bonnin,
Elspeth A Bruford,
Christian Buhay,
Paula Burch,
Deborah Burford,
Joanne Burgess,
Wayne Burrill,
John Burton,
Jackie M Bye,
Carol Carder,
Laura Carrel,
Joseph Chako,
Joanne C Chapman,
Dean Chavez,
Ellson Chen,
Guan Chen,
Yuan Chen,
Zhijian Chen,
Craig Chinault,
Alfredo Ciccodicola,
Sue Y Clark,
Graham Clarke,
Chris M Clee,
Sheila Clegg,
Kerstin Clerc-Blankenburg,
Karen Clifford,
Vicky Cobley,
Charlotte G Cole,
Jen S Conquer,
Nicole Corby,
Richard E Connor,
Robert David,
Joy Davies,
Clay Davis,
John Davis,
Oliver Delgado,
Denise Deshazo,
Pawandeep Dhami,
Yan Ding,
Huyen Dinh,
Steve Dodsworth,
Heather Draper,
Shannon Dugan-Rocha,
Andrew Dunham,
Matthew Dunn,
K James Durbin,
Ireena Dutta,
Tamsin Eades,
Matthew Ellwood,
Alexandra Emery-Cohen,
Helen Errington,
Kathryn L Evans,
Louisa Faulkner,
Fiona Francis,
John Frankland,
Audrey E Fraser,
Petra Galgoczy,
James Gilbert,
Rachel Gill,
Gernot Glöckner,
Simon G Gregory,
Susan Gribble,
Coline Griffiths,
Russell Grocock,
Yanghong Gu,
Rhian Gwilliam,
Cerissa Hamilton,
Elizabeth A Hart,
Alicia Hawes,
Paul D Heath,
Katja Heitmann,
Steffen Hennig,
Judith Hernandez,
Bernd Hinzmann,
Sarah Ho,
Michael Hoffs,
Phillip J Howden,
Elizabeth J Huckle,
Jennifer Hume,
Paul J Hunt,
Adrienne R Hunt,
Judith Isherwood,
Leni Jacob,
David Johnson,
Sally Jones,
Pieter J de Jong,
Shirin S Joseph,
Stephen Keenan,
Susan Kelly,
Joanne K Kershaw,
Ziad Khan,
Petra Kioschis,
Sven Klages,
Andrew J Knights,
Anna Kosiura,
Christie Kovar-Smith,
Gavin K Laird,
Cordelia Langford,
Stephanie Lawlor,
Margaret Leversha,
Lora Lewis,
Wen Liu,
Christine Lloyd,
David M Lloyd,
Hermela Loulseged,
Jane E Loveland,
Jamieson D Lovell,
Ryan Lozado,
Jing Lu,
Rachael Lyne,
Jie Ma,
Manjula Maheshwari,
Lucy H Matthews,
Jennifer McDowall,
Stuart McLaren,
Amanda McMurray,
Patrick Meidl,
Thomas Meitinger,
Sarah Milne,
George Miner,
Shailesh L Mistry,
Margaret Morgan,
Sidney Morris,
Ines Müller,
James C Mullikin,
Ngoc Nguyen,
Gabriele Nordsiek,
Gerald Nyakatura,
Christopher N O'Dell,
Geoffery Okwuonu,
Sophie Palmer,
Richard Pandian,
David Parker,
Julia Parrish,
Shiran Pasternak,
Dina Patel,
Alex V Pearce,
Danita M Pearson,
Sarah E Pelan,
Lesette Perez,
Keith M Porter,
Yvonne Ramsey,
Kathrin Reichwald,
Susan Rhodes,
Kerry A Ridler,
David Schlessinger,
Mary G Schueler,
Harminder K Sehra,
Charles Shaw-Smith,
Hua Shen,
Elizabeth M Sheridan,
Ratna Shownkeen,
Carl D Skuce,
Michelle L Smith,
Elizabeth C Sotheran,
Helen E Steingruber,
Charles A Steward,
Roy Storey,
R Mark Swann,
David Swarbreck,
Paul E Tabor,
Stefan Taudien,
Tineace Taylor,
Brian Teague,
Karen Thomas,
Andrea Thorpe,
Kirsten Timms,
Alan Tracey,
Steve Trevanion,
Anthony C Tromans,
Michele d'Urso,
Daniel Verduzco,
Donna Villasana,
Lenee Waldron,
Melanie Wall,
Qiaoyan Wang,
James Warren,
Georgina L Warry,
Xuehong Wei,
Anthony West,
Siobhan L Whitehead,
Mathew N Whiteley,
Jane E Wilkinson,
David L Willey,
Gabrielle Williams,
Leanne Williams,
Angela Williamson,
Helen Williamson,
Laurens Wilming,
Rebecca L Woodmansey,
Paul W Wray,
Jennifer Yen,
Jingkun Zhang,
Jianling Zhou,
Huda Zoghbi,
Sara Zorilla,
David Buck,
Richard Reinhardt,
Annemarie Poustka,
André Rosenthal,
Hans Lehrach,
Alfons Meindl,
Patrick J Minx,
Ladeana W Hillier,
Huntington F Willard,
Richard K Wilson,
Robert H Waterston,
Catherine M Rice,
Mark Vaudin,
Alan Coulson,
David L Nelson,
George Weinstock,
John E Sulston,
Richard Durbin,
Tim Hubbard,
Richard A Gibbs,
Stephan Beck,
Jane Rogers,
David R Bentley
The human X chromosome has a unique biology that was shaped by its evolution as the sex chromosome shared by males and females. We have determined 99.3% of the euchromatic sequence of the X chromosome. Our analysis illustrates the autosomal origin of the mammalian sex chromosomes, the stepwise process that led to the progressive loss of recombination between X and Y, and the extent of subsequent degradation of the Y chromosome. LINE1 repeat elements cover one-third of the X chromosome, with a distribution that is consistent with their proposed role as way stations in the process of X-chromosome inactivation. We found 1,098 genes in the sequence, of which 99 encode proteins expressed in testis and in various tumour types. A disproportionately high number of mendelian diseases are documented for the X chromosome. Of this number, 168 have been explained by mutations in 113 X-linked genes, which in many cases were characterized with the aid of the DNA sequence.
Mesh-terms: Animals; Antigens, Neoplasm :: genetics; Centromere :: genetics; Chromosomes, Human, X :: genetics; Chromosomes, Human, Y :: genetics; Contig Mapping; Crossing Over, Genetic :: genetics; Dosage Compensation, Genetic; Evolution, Molecular; Female; Genetics, Medical; Genomics; Humans; Linkage (Genetics):: genetics; Male; Polymorphism, Single Nucleotide :: genetics; RNA :: genetics; Repetitive Sequences, Nucleic Acid :: genetics; Research Support, Non-U.S. Gov't; Sequence Analysis, DNA; Sequence Homology, Nucleic Acid; Testis :: metabolism;

