BioInfoBank Library


FP7 Partner
Add BioInfo.PL bioinformatics lab to Your FP7 application
PLoS Genet. 2007 Jan 12;3 (1):e10 17222062 (P,S,G,E,B,D) Cited:12
Neuronal apoptosis inhibitory protein (NAIP, also known as BIRC1) is a member of the conserved inhibitor of apoptosis protein (IAP) family. Lineage-specific rearrangements and expansions of this locus have yielded different copy numbers among primates and rodents, with human retaining a single functional copy and mouse possessing several copies, depending on the strain. Roles for this gene in disease have been documented, but little is known about transcriptional regulation of NAIP. We show here that NAIP has multiple promoters sharing no similarity between human and rodents. Moreover, we demonstrate that multiple, domesticated long terminal repeats (LTRs) of endogenous retroviral elements provide NAIP promoter function in human, mouse, and rat. In human, an LTR serves as a tissue-specific promoter, active primarily in testis. However, in rodents, our evidence indicates that an ancestral LTR common to all rodent genes is the major, constitutive promoter for these genes, and that a second LTR found in two of the mouse genes is a minor promoter. Thus, independently acquired LTRs have assumed regulatory roles for orthologous genes, a remarkable evolutionary scenario. We also demonstrate that 5' flanking regions of IAP family genes as a group, in both human and mouse are enriched for LTR insertions compared to average genes. We propose several potential explanations for these findings, including a hypothesis that recruitment of LTRs near NAIP or other IAP genes may represent a host-cell adaptation to modulate apoptotic responses.

Latest citations:

Cancer Lett. 2009 Dec 29;: 20044205 (P,S,G,E,B,D)
Canada's Michael Smith Genome Sciences Centre, British Columbia Cancer Agency, 675 West 10th Avenue, Vancouver, British Columbia, Canada V5Z 1L3.
A mechanism for survival of prostate cancer cells in an androgen-deprived environment remains elusive. Here, we find that expression of neuronal apoptosis inhibitory protein (NAIP) was significantly increased in vivo and in vitro in response to androgen deprivation therapy (ADT). Increased expression of NAIP corresponded to increased DNA-binding activity of NF-kappaB that physically associated to previously uncharacterized kappaB-like sites in the NAIP locus. Importantly, expression of NAIP was significantly increased (p=0.04) in clinical samples of prostate cancer from patients receiving ADT. Expression of NAIP may be associated with enhanced survival of prostate cancer in response to castration.
PLoS Comput Biol. 2009 Nov ;5 (11):e1000562 19911049 (P,S,G,E,B,D)
The Bioinformatics Centre, Department of Biology and the Biotech Research and Innovation Centre (BRIC), University of Copenhagen, Copenhagen, Denmark. eivind@binf.ku.dk
A major goal in post-genome biology is the complete mapping of the gene regulatory networks for every organism. Identification of regulatory elements is a prerequisite for realizing this ambitious goal. A common problem is finding regulatory patterns in promoters of a group of co-expressed genes, but contemporary methods are challenged by the size and diversity of regulatory regions in higher metazoans. Two key issues are the small amount of information contained in a pattern compared to the large promoter regions and the repetitive characteristics of genomic DNA, which both lead to "pattern drowning". We present a new computational method for identifying transcription factor binding sites in promoters using a discriminatory approach with a large negative set encompassing a significant sample of the promoters from the relevant genome. The sequences are described by a probabilistic model and the most discriminatory motifs are identified by maximizing the probability of the sets given the motif model and prior probabilities of motif occurrences in both sets. Due to the large number of promoters in the negative set, an enhanced suffix array is used to improve speed and performance. Using our method, we demonstrate higher accuracy than the best of contemporary methods, high robustness when extending the length of the input sequences and a strong correlation between our objective function and the correct solution. Using a large background set of real promoters instead of a simplified model leads to higher discriminatory power and markedly reduces the need for repeat masking; a common pre-processing step for other pattern finders.
Cell Mol Life Sci. 2009 Aug 2;: 19649766 (P,S,G,E,B,D)
Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, 16/10 Miklukho-Maklaya st, 117997, Moscow, Russia, e.gogvadze@gmail.com.
Retroelements comprise a considerable fraction of eukaryotic genomes. Since their initial discovery by Barbara McClintock in maize DNA, retroelements have been found in genomes of almost all organisms. First considered as a "junk DNA" or genomic parasites, they were shown to influence genome functioning and to promote genetic innovations. For this reason, they were suggested as an important creative force in the genome evolution and adaptation of an organism to altered environmental conditions. In this review, we summarize the up-to-date knowledge of different ways of retroelement involvement in structural and functional evolution of genes and genomes, as well as the mechanisms generated by cells to control their retrotransposition.
PLoS ONE. 2009 ;4 (6):e5761 19488400 (P,S,G,E,B,D)
Terry Fox Laboratory, BC Cancer Agency, Vancouver, British Columbia, Canada.
The human neuronal apoptosis inhibitory protein (NAIP) gene is no longer principally considered a member of the Inhibitor of Apoptosis Protein (IAP) family, as its domain structure and functions in innate immunity also warrant inclusion in the Nod-Like Receptor (NLR) superfamily. NAIP is located in a region of copy number variation, with one full length and four partly deleted copies in the reference human genome. We demonstrate that several of the NAIP paralogues are expressed, and that novel transcripts arise from both internal and upstream transcription start sites. Remarkably, two internal start sites initiate within Alu short interspersed element (SINE) retrotransposons, and a third novel transcription start site exists within the final intron of the GUSBP1 gene, upstream of only two NAIP copies. One Alu functions alone as a promoter in transient assays, while the other likely combines with upstream L1 sequences to form a composite promoter. The novel transcripts encode shortened open reading frames and we show that corresponding proteins are translated in a number of cell lines and primary tissues, in some cases above the level of full length NAIP. Interestingly, some NAIP isoforms lack their caspase-sequestering motifs, suggesting that they have novel functions. Moreover, given that human and mouse NAIP have previously been shown to employ endogenous retroviral long terminal repeats as promoters, exaptation of Alu repeats as additional promoters provides a fascinating illustration of regulatory innovations adopted by a single gene.
Plant J. 2008 Sep 21;: 18808453 (P,S,G,E,B,D) Cited:1
Keiko Hayashi (National Agricultural Research Center, National Agriculture and Food Research Organization (NARO), 1-2-1 Inada, Jo-etsu, Niigata 943-0193, Japan.
The plant genome contains a large number of disease resistance (R) genes that have evolved through diverse mechanisms. Here, we report that a long terminal repeat (LTR) retrotransposon contributed to the evolution of the rice blast resistance gene Pit. Pit confers race-specific resistance against the fungal pathogen Magnaporthe grisea, and is a member of the nucleotide-binding site-leucine-rich repeat (NBS-LRR) family of R genes. Compared to the nonfunctional allele Pit(Npb), the functional allele Pit(K59) contains four amino acid substitutions and has the LTR retrotransposon Renovator inserted upstream. Pathogenesis assays using chimeric constructs carrying the various regions of Pit(K59) and Pit(Npb) suggest that amino acid substitutions might have a potential effect in Pit resistance; more importantly, the upregulated promoter activity conferred by the Renovator sequence is essential for Pit function. Our data suggest that transposon-mediated transcriptional activation may play an important role in the refunctionalization of additional 'sleeping'R genes in the plant genome.
Annu Rev Genet. 2008 Aug 11;: 18694346 (P,S,G,E,B,D) Cited:1
Department of Molecular Biology and Microbiology, Tufts University School Medicine, Boston, MA 02111; Patric.Jern@tufts.edu.
For millions of years, retroviral infections have challenged vertebrates, occasionally leading to germline integration and inheritance as ERVs, genetic parasites whose remnants today constitute some 7% to 8% of the human genome. Although they have had significant evolutionary side effects, it is useful to view ERVs as fossil representatives of retroviruses extant at the time of their insertion into the germline, not as direct players in the evolutionary process itself. Expression of particular ERVs is associated with several positive physiological functions as well as certain diseases, although their roles in human disease as etiological agents, possible contributing factors, or disease markers-well demonstrated in animal models-remain to be established. Here we discuss ERV contributions to host genome structure and function, including their ability to mediate recombination, and physiological effects on the host transcriptome resulting from their integration, expression, and other events. Expected final online publication date for the Annual Review of Genetics Volume 42 is November 3, 2008. Please see http://www.annualreviews.org/catalog/pubdates.aspx for revised estimates.
Bioinformatics. 2008 Jun 5;: 18535086 (P,S,G,E,B,D)
School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta, GA 30306.
MOTIVATION: Endogenous retrovirus (ERV) elements have been shown to contribute promoter sequences that can initiate transcription of adjacent human genes. However, the extent to which retroviral sequences initiate transcription within the human genome is currently unknown. We analyzed genome sequence and high-throughput expression data to systematically evaluate the presence of retroviral promoters in the human genome. RESULTS: We report the existence of 51,197 ERV-derived promoter sequences that initiate transcription within the human genome, including 1,743 cases where transcription is initiated from ERV sequences that are located in gene proximal promoter or 5' untranslated regions (UTRs). 114 of the ERV-derived transcription start sites can be demonstrated to drive transcription of 97 human genes, producing chimeric transcripts that are initiated within ERV long terminal repeat (LTR) sequences and read-through into known gene sequences. ERV promoters drive tissue-specific and lineage-specific patterns of gene expression and contribute to expression divergence between paralogs. These data illustrate the potential of retroviral sequences to regulate human transcription on a large scale consistent with a substantial effect of ERVs on the function and evolution of the human genome. CONTACT: king.jordan@biology.gatech.edu.
BMC Genomics. 2008 May 22;9 (1):241 18498650 (P,S,G,E,B)
ABSTRACT: BACKGROUND: Retrogenes are processed copies of other genes. This duplication mechanism produces a copy of the parental gene that should not contain introns, and usually does not contain cis-regulatory regions. Here, we computationally address the evolutionary origin of promoter and other cis-regulatory regions in retrogenes using a total of 94 Drosophila retroposition events we recently identified. Previous tissue expression data has revealed that a large fraction of these retrogenes are specifically and/or highly expressed in adult testes of Drosophila. RESULTS: In this work, we infer that retrogenes do not generally carry regulatory regions from aberrant upstream or normal transcripts of their parental genes, and that expression patterns of neighboring genes are not consistently shared by retrogenes. Additionally, transposable elements do not appear to substantially provide regulatory regions to retrogenes. Interestingly, we find that there is an excess of retrogenes in male testis neighborhoods that is not explained by insertional biases of the retroelement machinery used for retroposition. CONCLUSIONS: We conclude that retrogenes' regulatory regions mostly do not represent a random set of existing regulatory regions. On the contrary, our conclusion is that selection is likely to have played an important role in the persistence of autosomal testis biased retrogenes. Selection in favor of retrogenes inserted in male testis neighborhoods and at the sequence level to produce testis expression is postulated to have occurred.
BMC Genomics. 2008 May 17;9 (1):226 18485226 (P,S,G,E,B) Cited:4
ABSTRACT: BACKGROUND: The majority of human non-protein-coding DNA is made up of repetitive sequences, mainly transposable elements (TEs). It is becoming increasingly apparent that many of these repetitive DNA sequence elements encode gene regulatory functions. This fact has important evolutionary implications, since repetitive DNA is the most dynamic part of the genome. We set out to assess the evolutionary rate and pattern of experimentally characterized human transcription factor binding sites (TFBS) that are derived from repetitive versus non-repetitive DNA to test whether repeat-derived TFBS are in fact rapidly evolving. We also evaluated the position-specific patterns of variation among TFBS to look for signs of functional constraint on TFBS derived from repetitive and non-repetitive DNA. RESULTS: We found numerous experimentally characterized TFBS in the human genome, 7-10% of all mapped sites, which are derived from repetitive DNA sequences including simple sequence repeats (SSRs) and TEs. TE-derived TFBS sequences are far less conserved between species than TFBS derived from SSRs and non-repetitive DNA. Despite their rapid evolution, several lines of evidence indicate that TE-derived TFBS are functionally constrained. First of all, ancient TE families, such as MIR and L2, are enriched for TFBS relative to younger families like Alu and L1. Secondly, functionally important positions in TE-derived TFBS, specifically those residues thought to physically interact with their cognate protein binding factors (TF), are more evolutionarily conserved than adjacent TFBS positions. Finally, TE-derived TFBS show position-specific patterns of sequence variation that are highly distinct from random patterns and similar to the variation seen for non-repeat derived sequences of the same TFBS. CONCLUSIONS: The abundance of experimentally characterized human TFBS that are derived from repetitive DNA speaks to the substantial regulatory effects that this class of sequence has on the human genome. The unique evolutionary properties of repeat-derived TFBS are perhaps even more intriguing. TE-derived TFBS in particular, while clearly functionally constrained, evolve extremely rapidly relative to non-repeat derived sites. Such rapidly evolving TFBS are likely to confer species-specific regulatory phenotypes, i.e. divergent expression patterns, on the human evolutionary lineage. This result has practical implications with respect to the widespread use of evolutionary conservation as a surrogate for functionally relevant non-coding DNA. Most TE-derived TFBS would be missed using the kinds of sequence conservation-based screens, such as phylogenetic footprinting, that are used to help characterize non-coding DNA. Thus, the very TFBS that are most likely to yield human-specific characteristics will be neglected by the comparative genomic techniques that are currently de rigeur for the identification of novel regulatory sites.
Nat Rev Genet. 2008 Mar 27;: 18368054 (P,S,G,E,B,D) Cited:6
Cédric Feschotte
Cédric Feschotte is at the Department of Biology, Life Science Building, BOX 19498, University of Texas, Arlington, Texas 76019, USA. cedric@uta.edu.
The control and coordination of eukaryotic gene expression rely on transcriptional and post-transcriptional regulatory networks. Although progress has been made in mapping the components and deciphering the function of these networks, the mechanisms by which such intricate circuits originate and evolve remain poorly understood. Here I revisit and expand earlier models and propose that genomic repeats, and in particular transposable elements, have been a rich source of material for the assembly and tinkering of eukaryotic gene regulatory systems.

Other papers by authors:

PLoS Genet. 2006 Jan 27;2 (1):e2 16440055 (P,S,G,E,B,D) Cited:11
The inbred mouse is an invaluable model for human biology and disease. Nevertheless, when considering genetic mechanisms of variation and disease, it is important to appreciate the significant differences in the spectra of spontaneous mutations that distinguish these species. While insertions of transposable elements are responsible for only ~0.1% of de novo mutations in humans, the figure is 100-fold higher in the laboratory mouse. This striking difference is largely due to the ongoing activity of mouse endogenous retroviral elements. Here we briefly review mouse endogenous retroviruses (ERVs) and their influence on gene expression, analyze mechanisms of interaction between ERVs and the host cell, and summarize the variety of mutations caused by ERV insertions. The prevalence of mouse ERV activity indicates that the genome of the laboratory mouse is presently behind in the "arms race" against invasion.
Gene. 2005 Nov 7;: 16288839 (P,S,G,E,B,D) Cited:8
Terry Fox Laboratory, BC Cancer Agency, 675 West 10th Avenue, Vancouver, BC, Canada V5Z 1L3; Department of Medical Genetics, University of British Columbia, Vancouver, Canada.
Eight percent of the human genome is derived from endogenous retrovirus (ERV) insertions. ERV long terminal repeats (LTRs) contain strong promoters that are known to contribute to the transcriptional regulation of certain human genes. While some LTRs are known to possess bidirectional promoter activity in vitro, only sense orientation LTR promoters have previously been shown to regulate human gene expression. Here we demonstrate that an ERV1 LTR acts as a bidirectional promoter for the human Down syndrome critical region 4 (DSCR4) and DSCR8 genes. We show that while DSCR4 and DSCR8 are essentially co-expressed, their shared LTR promoter is more active in the sense than the antisense orientation. Through deletion analysis of the LTR we have identified positive and negative regulatory elements, and defined a core region of the promoter that is required for transcriptional activity in both orientations. Finally, we show that the ERV LTR also exists in the genomes of several non-human primates, and present evidence that potential transcription factor binding sites in the core region have been maintained throughout primate evolution.
Gene. 2009 Jul 2;: 19577618 (P,S,G,E,B,D)
Terry Fox Laboratory, British Columbia Cancer Agency and Dept. of Medical Genetics, University of British Columbia, Vancouver, BC, Canada.
Gene regulatory changes are thought to be major factors driving species evolution, with creation of new regulatory regions likely being instrumental in contributing to diversity among vertebrates. There is growing appreciation for the role of transposable elements (TEs) in gene regulation and, indeed, laboratory investigations have confirmed many specific examples of mammalian genes regulated by promoters donated by endogenous retroviruses (ERVs) or other TEs. Bioinformatics studies have revealed hundreds of additional instances where this is likely to be the case. Since the long terminal repeats (LTRs) of retroviruses naturally contain abundant transcriptional regulatory signals, roles for ERV LTRs in regulating mammalian genes are eminently plausible. Moreover, it seems reasonable that exaptation of an LTR regulatory module provides opportunities for evolution of new gene regulatory patterns. In this Review we summarize known examples of LTRs that function as human gene alternative promoters, as well as the evidence that LTR exaptation has resulted in a pattern of novel gene expression significantly different from the pattern before LTR insertion or from that of gene orthologs lacking the LTR. Available data suggest that, while new expression patterns can arise as a result of LTR usage, this situation is relatively rare and is largely restricted to the placenta. In many cases, the LTR appears to be a minor, alternative promoter with an expression pattern similar to that of the native promoter(s) and hence likely exerts a subtle overall effect on gene expression. We discuss these findings and offer evolutionary models to explain these trends.
PLoS ONE. 2009 ;4 (6):e5761 19488400 (P,S,G,E,B,D)
Terry Fox Laboratory, BC Cancer Agency, Vancouver, British Columbia, Canada.
The human neuronal apoptosis inhibitory protein (NAIP) gene is no longer principally considered a member of the Inhibitor of Apoptosis Protein (IAP) family, as its domain structure and functions in innate immunity also warrant inclusion in the Nod-Like Receptor (NLR) superfamily. NAIP is located in a region of copy number variation, with one full length and four partly deleted copies in the reference human genome. We demonstrate that several of the NAIP paralogues are expressed, and that novel transcripts arise from both internal and upstream transcription start sites. Remarkably, two internal start sites initiate within Alu short interspersed element (SINE) retrotransposons, and a third novel transcription start site exists within the final intron of the GUSBP1 gene, upstream of only two NAIP copies. One Alu functions alone as a promoter in transient assays, while the other likely combines with upstream L1 sequences to form a composite promoter. The novel transcripts encode shortened open reading frames and we show that corresponding proteins are translated in a number of cell lines and primary tissues, in some cases above the level of full length NAIP. Interestingly, some NAIP isoforms lack their caspase-sequestering motifs, suggesting that they have novel functions. Moreover, given that human and mouse NAIP have previously been shown to employ endogenous retroviral long terminal repeats as promoters, exaptation of Alu repeats as additional promoters provides a fascinating illustration of regulatory innovations adopted by a single gene.
Gene. 2005 Dec 30;364 :2-12 16112824 (P,S,G,E,B,D)
Terry Fox Laboratory, BC Cancer Agency, Vancouver, British Columbia, Canada V5Z 1L3.
Throughout the course of vertebrate evolution, germline retroviral infections have resulted in heritable provirus insertions into host DNA. These endogenous retroviruses (ERVs) contain long terminal repeat (LTR) promoters that can be adopted for use by nearby host genes. It is not known whether the transcription factor (TF) binding sites and tissue-specificities of modern LTR gene promoters have been retained since the time of ERV insertion, or if these features evolved later as the LTR became involved in host gene regulation. To address this issue, we have conducted a case study of the ERV-L LTR promoter of human beta1,3-galactosyltransferase 5 (beta3GAL-T5). We have previously shown that the human beta3GAL-T5 LTR promoter is responsible for the majority of gene transcripts in the colon. The murine beta3gal-t5 gene is also expressed primarily in the colon, despite the absence of an orthologous ERV-L LTR in the mouse genome. We therefore hypothesized that both the ERV-L LTR and the non-retroviral ancestral beta3GAL-T5 promoter were active in the colon at the time of ERV insertion. In support of this hypothesis, we have shown that the orthologous LTRs of four non-human primates are also active in a human colorectal cell line, and that the baboon LTR is active in primary baboon colon tissue. We also present evidence that the functional TF binding sites of the human beta3GAL-T5 LTR promoter were present in the original consensus sequence for this class of LTRs. Upon similar analysis of other ERV sequences, we have concluded that this evolutionary history is shared by certain other LTR gene promoters, and may be a general phenomenon.
BMC Genomics. 2005 Apr 1;6 (1):47 15804358 (P,S,G,E,B) Cited:1
Terry Fox Laboratory, BC Cancer Agency, Vancouver, Canada. cdunn@bccrc.ca
BACKGROUND: Sperm adhesion molecule 1 (SPAM1) is the major mammalian testicular hyaluronidase and is expressed at high levels in sperm cells. SPAM1 protein is important for penetration of the cumulus cell layer surrounding the ovum, and is also involved in zona pellucida binding and sperm intracellular signalling. A previous study had identified SPAM1 as one of the many human genes that initiate within a transposable element. RESULTS: Examination of the human, mouse and rat SPAM1 loci revealed that transcripts initiate within the pol gene of an endogenous retrovirus (ERV) element. This is highly unusual, as all previously identified ERV-initiated cellular gene transcripts initiate within the viral long terminal repeat promoter. The SPAM1 locus therefore represents an example of the evolution of a promoter from protein-coding sequence. We have identified novel alternative promoter and splicing variants of human and murine SPAM1. We show that all transcript variants are expressed primarily in the testis and are predicted to encode identical proteins. CONCLUSION: The testis-specific promoters of the human and mouse SPAM1 genes are derived from sequence that was originally part of an ERV pol gene. This represents the first known example of an ERV-derived promoter acting in a gender-specific manner.
Proc Natl Acad Sci U S A. 2003 Oct 28;100 (22):12841-6 14534330 (P,S,G,E,B) Cited:6
Terry Fox Laboratory, British Columbia Cancer Agency, Vancouver, BC, Canada V5Z 1L3.
LTRs of endogenous retroviruses are known to affect expression of several human genes, typically as a relatively minor alternative promoter. Here, we report that an endogenous retrovirus LTR acts as one of at least two alternative promoters for the human beta1,3-galactosyltransferase 5 gene, involved in type 1 Lewis antigen synthesis, and show that the LTR promoter is most active in the gastrointestinal tract and mammary gland. Indeed, the LTR is the dominant promoter in the colon, indicating that this ancient retroviral element has a major impact on gene expression. Using colorectal cancer cell lines and electrophoretic mobility-shift assays, we found that hepatocyte nuclear factor 1 (HNF-1) binds a site within the retroviral promoter and that expression of HNF-1 and interaction with its binding site correlated with promoter activation. We conclude that HNF-1 is at least partially responsible for the tissue-specific activation of the LTR promoter of human beta 1,3-galactosyltransferase 5. We demonstrate that this tissue-specific transcription factor is implicated in the activation of an LTR gene promoter.
Epigenetics. 2010 Jan 13;5 (1): 20083901 (P,S,G,E,B)
Terry Fox Laboratory, BC Cancer Agency, Vancouver, BC, Canada; Dept. of Medical Genetics, University of British, Columbia.
Phenotypic variation stems from both genetic and epigenetic differences between individuals. In order to elucidate how phenotypes are determined, it is necessary to understand the forces that generate variation in genome sequence as well as its epigenetic state. In both contexts, transposable elements (TEs) may play an important role. It is well established that TE activity is a major generator of genetic variation, but recent research also suggests that TEs contribute to epigenetic variation. Stochastic epigenetic silencing of some TE insertions in mice has been shown to cause phenotypic variability between individuals. However, the prevalence of this phenomenon has never been evaluated. Here, we use 18 insertions of a mouse Endogenous Retrovirus (ERV) family, the Early Transposons (ETns), to detect insertion-dependent determinants of DNA methylation levels and variability between both cells and individuals. We show that the structure and age of insertions influence methylation levels and variability, resulting in a subgroup of loci that displays unexpectedly high variability in methylation and suggesting stochastic events during methylation establishment. Despite variation in methylation according to the age and structure of each locus, homologous CpG sites show similar tendencies in methylation levels across loci, emphasizing the role of the insertion's sequence in methylation determination. Our results show that differences in methylation of ETns between individuals is not a sporadic phenomenon and support the hypothesis that ERVs contribute to phenotypic variability through their stochastic silencing.
J Virol. 2009 Sep 9;: 19741000 (P,S,G,E,B,D)
Helmholtz Zentrum München, German Research Center for Environmental Health, Institute of Virology, Neuherberg, Germany; Medical Clinic III, Medical Faculty Mannheim, University of Heidelberg, Mannheim, Germany; Institute of Virology, University of Veterinary Medicine, Vienna, Austria; Terry Fox Laboratory, British Columbia Cancer Research Centre, Vancouver, Canada; Austrianova Singapore Pte Ltd, Biopolis, Singapore.
The human genome contains more than half a million endogenous retroviral (HERV) LTRs that can be regarded as mobile regulatory modules. Many of these HERV LTRs have been recruited during evolution as transcriptional control elements for cellular gene expression. We have cloned LTR sequences from two HERV families, HERV-H and HERV-L, differing widely in their activity and tissue-specificity into a murine leukemia virus-based promoter conversion vector (ProCon). Various human cell lines were infected with the HERV-MLV hybrid vectors and cell type-specific expression of the reporter gene compared with the promoter specificity of the corresponding HERV LTRs in transient transfection assays. Transcription start site analysis of HERV-MLV hybrid vectors revealed preferential use of the HERV promoter initiation site. Our data show that HERV LTRs function in the context of retroviral vectors in certain cell types and have the potential to be useful as cell type-specific promoters in vector construction.
Nucleic Acids Res. 2009 Jul 15;: 19605564 (P,S,G,E,B,D)
The Terry Fox laboratory, British Columbia Cancer Agency, Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada, Howard Hughes Medical Institute, Rheumatology Division, Washington University School of Medicine, St Louis, MO 63110, USA and Pathology and Laboratory Medicine, University of British Columbia, Vancouver, BC, Canada.
Stochastic expression is a hallmark of the Ly49 family that encode the main MHC class-I-recognizing receptors of mouse natural killer (NK) cells. This highly polygenic and polymorphic family includes both activating and inhibitory receptor genes and is one of genome's fastest evolving loci. The inhibitory Ly49 genes are expressed in a stochastic mono-allelic manner, possibly under the control of an upstream bi-directional early promoter and show mono-allelic DNA methylation patterns. To date, no studies have directly addressed the transcriptional regulation of the activating Ly49 receptors. Our study shows differences in DNA methylation pattern between activating and inhibitory genes in C57BL/6 and F1 hybrid mouse strains. We also show a bias towards bi-allelic expression of the activating receptors based on allele-specific single-cell RT-PCR in F1 hybrid NK cells for Ly49d and Ly49H expression in Ly49h(+/-) mice. Furthermore, we have identified a region of high sequence identity with possible transcriptional regulatory capacity for the activating Ly49 genes. Our results also point to a likely difference between NK and T-cells in their ability to transcribe the activating Ly49 genes. These studies highlight the complex regulation of this rapidly evolving gene family of central importance in mouse NK cell function.

Latest similar papers:

Mol Carcinog. 2009 Dec 18;49 (1):54-67 20025072 (P,S,G,E,B,D)
Department of Safety Assessment, Merck Research Laboratories, West Point, Pennsylvania.
Rodents, mice and rats in particular, are the species of choice for evaluating chemical carcinogenesis. However, different species and strains often respond very differently, undermining the logic of extrapolation of animal results to humans and complicating risk assessment. Intracisternal A particles (IAPs), endogenous retroviral sequences, are an important class of transposable elements that induce genomic mutations and cell transformation by disrupting gene expression. Several lines of evidence support a role of IAPs as mouse-specific genetic factors in responses to toxicity and expression of disease phenotypes. Since multiple subtypes and copies of IAPs are present in the mouse genome, their activity and locations relative to functional genes are of critical importance. This study identified the major "active" subtypes of IAPs (subtype 1/1a) that are responsible for newly transposed IAP insertions described in the literature, and confirmed that (1) polymorphisms for IAP insertions exist among different mouse strains and (2) promoter activity of the LTRs can be modulated by chemicals. This study further identified all the genes in the C57BL/6 mouse genome with IAP subtype 1 and 1a sequences inserted in their proximity, and the major biofunctional categories and cellular signaling networks of those genes. Since many "IAP-associated genes" play important roles in the regulation of cell proliferation, cell cycle, and cell death, the associated IAPs, upon activation, can affect cellular responses to xenobiotics and disease processes, especially carcinogenesis. This systemic analysis provides a solid foundation for further investigations of the role of IAPs as species- and strain-specific disease susceptibility factors.(c) 2009 Wiley-Liss, Inc.
Mol Biol Evol. 2009 Dec 2;: 19955475 (P,S,G,E,B,D)
Graduate University for Advanced Studies, Hayama, Kanagawa 240-0193, Japan.
It has been proposed that the insertion time of an LTR retrotransposon can be estimated by the divergence between the two LTRs at the both ends because their sequences were identical at the insertion event. This method is based on the assumption that the two LTRs accumulate point mutations independently; therefore, the divergence reflects the time since the insertion event. However, if gene conversion occurs between LTRs, the nucleotide divergence will be much smaller than expected with the assumption of the independent accumulation of point mutations. To examine this assumption, we investigated the extent of gene conversion between LTRs by applying a comparative genomic approach to primates (humans and rhesus macaques) and rodents (mice and rats). We found that gene conversion plays a significant role in the molecular evolution of LTRs in primates and rodents, but the extent is quite different. In rodents, most LTRs are subject to extensive gene conversion that reduces the divergence, so that the divergence-based method results in a serious underestimation of the insertion time. In primates, this effect is limited to a small proportion of LTRs. The most likely explanation of the difference involves the minimum length of the interacting sequence (MEPS) for interlocus gene conversion. An empirical estimate of MEPS in human is 300-500 bp, which exceeds the lengths of most of analyzed LTRs. In contrast, MEPS for mice should be much smaller. Thus, MEPS can be an important factor to determine the susceptibility of LTRs to gene conversion, although there are many other factors involved. It is concluded that the divergence method to estimate the insertion time should be applied with special caution because at least some LTRs undergo gene conversion.
PLoS ONE. 2009 ;4 (6):e5761 19488400 (P,S,G,E,B,D)
Terry Fox Laboratory, BC Cancer Agency, Vancouver, British Columbia, Canada.
The human neuronal apoptosis inhibitory protein (NAIP) gene is no longer principally considered a member of the Inhibitor of Apoptosis Protein (IAP) family, as its domain structure and functions in innate immunity also warrant inclusion in the Nod-Like Receptor (NLR) superfamily. NAIP is located in a region of copy number variation, with one full length and four partly deleted copies in the reference human genome. We demonstrate that several of the NAIP paralogues are expressed, and that novel transcripts arise from both internal and upstream transcription start sites. Remarkably, two internal start sites initiate within Alu short interspersed element (SINE) retrotransposons, and a third novel transcription start site exists within the final intron of the GUSBP1 gene, upstream of only two NAIP copies. One Alu functions alone as a promoter in transient assays, while the other likely combines with upstream L1 sequences to form a composite promoter. The novel transcripts encode shortened open reading frames and we show that corresponding proteins are translated in a number of cell lines and primary tissues, in some cases above the level of full length NAIP. Interestingly, some NAIP isoforms lack their caspase-sequestering motifs, suggesting that they have novel functions. Moreover, given that human and mouse NAIP have previously been shown to employ endogenous retroviral long terminal repeats as promoters, exaptation of Alu repeats as additional promoters provides a fascinating illustration of regulatory innovations adopted by a single gene.
BMC Genomics. 2009 Apr 24;10 (1):187 19393062 (P,S,G,E,B)
ABSTRACT: BACKGROUND: The Type I interferons (IFN) have major roles in the innate immune response to viruses, a function that is believed to have led to expansion in the number and complexity of their genes, although these genes have remained confined to single chromosomal region in all mammals so far examined. IFNB and IFNE define the limits of the locus, with all other Type I IFN genes except IFNK distributed between these boundaries, strongly suggesting that the locus has broadened as IFN genes duplicated and then evolved into a series of distinct families. RESULTS: The Type I IFN locus in Bos taurus has undergone significant rearrangement and expansion compared to mouse and human, however, with the constituent genes separated into two sub-loci separated by >700 kb. The IFNW family is greatly expanded, comprising 24 potentially functional genes and at least 8 pseudogenes. The IFNB (n= 6), represented in human and mouse by one copy, are also present as multiple copies in Bos taurus. The IFNT, which encode a non-virally inducible, ruminant-specific IFN secreted by the pre-implantation conceptus, are represented by three genes and two pseudogenes. The latter have sequences intermediate between IFNT and IFNW. A new Type I IFN family (IFNX) of four members, one of which is a pseudogene, appears to have diverged from the IFNA lineage at least 83 million years ago, but is absent in all other sequenced genomes with the possible exception of the horse, a non-ruminant herbivore. CONCLUSION: In summary, we have provided the first comprehensive annotation of the Type I IFN locus in Bos taurus, thereby providing an insight into the functional evolution of the Type I IFN in ruminants. The diversity and global spread of the ruminant species may have required an expansion of the Type I IFN locus and its constituent genes to provide broad anti-viral protection required for foraging and foregut fermentation.
PLoS One. 2009 ;4 (4):e5179 19365549 (P,S,G,E,B,D)
Department of Medical Sciences, Section of Virology, Uppsala University, Uppsala, Sweden.
BACKGROUND: Retroviral LTRs, paired or single, influence the transcription of both retroviral and non-retroviral genomic sequences. Vertebrate genomes contain many thousand endogenous retroviruses (ERVs) and their LTRs. Single LTRs are difficult to detect from genomic sequences without recourse to repetitiveness or presence in a proviral structure. Understanding of LTR structure increases understanding of LTR function, and of functional genomics. Here we develop models of orthoretroviral LTRs useful for detection in genomes and for structural analysis. PRINCIPAL FINDINGS: Although mutated, ERV LTRs are more numerous and diverse than exogenous retroviral (XRV) LTRs. Hidden Markov models (HMMs), and alignments based on them, were created for HML-(human MMTV-like), general-beta-, gamma- and lentiretroviruslike LTRs, plus a general-vertebrate LTR model. Training sets were XRV LTRs and RepBase LTR consensuses. The HML HMM was most sensitive and detected 87% of the HML LTRs in human chromosome 19 at 96% specificity. By combining all HMMs with a low cutoff, for screening, 71% of all LTRs found by RepeatMasker in chromosome 19 were found. HMM consensus sequences had a conserved modular LTR structure. Target site duplications (TG-CA), TATA (occasionally absent), an AATAAA box and a T-rich region were prominent features. Most of the conservation was located in, or adjacent to, R and U5, with evidence for stem loops. Several of the long HML LTRs contained long ORFs inserted after the second A rich module. HMM consensus alignment allowed comparison of functional features like transcriptional start sites (sense and antisense) between XRVs and ERVs. CONCLUSION: The modular conserved and redundant orthoretroviral LTR structure with three A-rich regions is reminiscent of structurally relaxed Giardia promoters. The five HMMs provided a novel broad range, repeat-independent, ab initio LTR detection, with prospects for greater generalisation, and insight into LTR structure, which may aid development of LTR-targeted pharmaceuticals.
J Virol. 2009 Jun ;83 (12):6098-105 19339349 (P,S,G,E,B,D)
Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, Russia.
Many phenotypic differences exist between Homo sapiens and its closest relatives, chimpanzees, and these differences can arise as a result of variations in the regulation of certain genes common to these closely related species. Human-specific endogenous retroviruses (HERVs) and their solitary long terminal repeats (LTRs) are probable candidates for such a role due to the presence of regulatory elements, such as enhancers, promoters, splice sites, and polyadenylation signals. In this study we show for the first time that HERVs can participate in the specific antisense regulation of human gene expression owing to their LTR promoter activity. We found that two HERV LTRs situated in the introns of genes SLC4A8 (for sodium bicarbonate cotransporter) and IFT172 (for intraflagellar transport protein 172) in the antisense orientation serve in vivo as promoters for generating RNAs complementary to the exons of enclosing genes. The antisense transcripts formed from LTR promoter were shown to decrease the mRNA level of the corresponding genes. The human-specific regulation of these genes suggests their involvement in the evolutionary process.
Mol Cell Biol. 2009 May ;29 (9):2456-68 19273603 (P,S,G,E,B,D)
Terry Fox Laboratory, British Columbia Cancer Agency, 675 West 10th Avenue, Vancouver, BC, Canada.
Nonautonomous retrotransposon subfamilies are often amplified in preference to their coding-competent relatives. However, the mechanisms responsible for such replicative success are poorly understood. Here, we demonstrate that the autonomous MusD long terminal repeat (LTR) retrotransposons are subject to greater epigenetic silencing than their nonautonomous cousins, the early transposons (ETns), which are expressed at a 170-fold-higher level than MusD in mouse embryonic stem (ES) cells. We show that, in ES cells, 5' LTRs and the downstream region of MusD elements are more heavily methylated and are associated with less-activating and more-repressive histone modifications than the highly similar ETnII sequences. The internal region of MusD likely contributes to their silencing, as transgenes with MusD, compared to those with ETnII sequences, show reduced reporter gene expression and a higher level of repressive histone marks. Genomic distribution patterns of MusD and ETn elements are consistent with stronger selection against MusD elements within introns, suggesting that MusD-associated silencing marks can negatively impact genes. We propose a model in which nonautonomous retrotransposons may gain transcriptional and retrotranspositional advantages over their coding-competent counterparts by elimination of the CpG-rich retroviral sequence targeting the autonomous subfamilies for silencing.
Genetics. 2007 Dec ;177 (4):1975-85 18073417 (P,S,G,E,B,D) Cited:6
Transposable elements (TEs) represent approximately 45% of the human genome and 50-90% of some grass genomes. While most elements contain inactivating mutations, others are reversibly inactivated (silenced) by epigenetic mechanisms, including cytosine methylation. Previous studies have shown that retrotransposons can influence the expression of adjacent host genes. In this study, the methylation patterns of TEs and their flanking sequences in different tissues were undertaken using a novel technique called transposon methylation display (TMD). TMD was successfully applied on a highly copied ( approximately 1000 copies), newly amplified LTR retrotransposon family in rice called Dasheng. We determined that the methylation status of a subset of LTRs varies in leaves vs. roots. In addition, we determined that tissue-specific LTR methylation correlated with tissue-specific expression of the flanking rice gene. Genes showing tissue-specific expression were in opposite orientation relative to the LTR. Antisense transcripts were detected in the tissue where the sense transcripts from that gene were not detected. Comparative analysis of Dasheng LTR methylation in the two subspecies, japonica vs. indica revealed LTR-mediated differences in subspecies gene expression. Subspecies-specific expression was due either to polymorphic Dasheng insertion sites between the two subspecies or to subspecies-specific methylation of LTRs at the same locus accounted for observed differences in the expression of adjacent genes.
Proc Natl Acad Sci U S A. 2007 Nov 5;: 17984064 (P,S,G,E,B,D) Cited:2
Dactylaplasia, characterized by missing central digital rays, is an inherited mouse limb malformation that depends on two genetic loci. The first locus, Dac, is an insertional mutation around the dactylin gene that is inherited as a semidominant trait. The second locus is an unlinked modifier, mdac/Mdac, that is polymorphic among inbred strains. Mdac dominantly suppresses the dactylaplasia phenotype in mice carrying Dac. However, little is known about either locus or the nature of their interaction. Here we show that Dac is a LTR retrotransposon insertion caused by the type D mouse endogenous provirus element (MusD). This insertion exhibits different epigenetic states and spatiotemporally expresses depending on the mdac/Mdac modifier background. In dactylaplasia mutants (Dac/+ mdac/mdac), the LTRs of the insertion contained unmethylated CpGs and active chromatin. Furthermore, MusD elements expressed ectopically at the apical ectodermal ridge of limb buds, accompanying the dactylaplasia phenotype. On the other hand, in Dac mutants carrying Mdac (Dac/+ Mdac/mdac), the 5' LTR of the insertion was heavily methylated and enriched with inactive chromatin, correlating with inhibition of the dactylaplasia phenotype. Ectopic expression was not observed in the presence of Mdac, which we refined to a 9.4-Mb region on mouse chromosome 13. We report a pathogenic mutation caused by MusD. Our findings indicate that ectopic expression from the MusD insertion correlates with the dactylaplasia phenotype and that Mdac acts as a defensive factor to protect the host genome from pathogenic MusD insertions.
BMC Genomics. 2007 Aug 28;8 (1):291 17725822 (P,S,G,E,B)
ABSTRACT: BACKGROUND: Retrotransposons have been shown to contribute to evolution of both structure and regulation of protein coding genes. It has been postulated that the primary mechanism by which retrotransposons contribute to structural gene evolution is through insertion into an intron or a gene flanking region, and subsequent incorporation into an exon. RESULTS: We found that Long Terminal Repeat (LTR) retrotransposons are associated with 1,057 human genes (5.8%). In 256 cases LTR retrotransposons were observed in protein-coding regions, while 50 distinct protein coding exons in 45 genes were comprised exclusively of LTR RetroTransposon Sequence (LRTS). We go on to reconstruct the evolutionary history of an alternatively spliced exon of the Interleukin 22 receptor, alpha 2 gene (IL22RA2) derived from a sequence of retrotransposon of the Mammalian apparent LTR retrotransposons (MaLR) family. Sequencing and analysis of the homologous regions of genomes of several primates indicate that the LTR retrotransposon was inserted into the IL22RA2 gene at least prior to the divergence of Apes and Old World monkeys from a common ancestor (roughly 25 MYA). We hypothesize that the recruitment of the part of LTR as a novel exon in great ape species occurred prior to the divergence of orangutans and humans from a common ancestor (roughly 14 MYA) as a result of a single mutation in the proto-splice site. CONCLUSIONS: Our analysis of LRTS exonization events has shown that the patterns of LRTS distribution in human exons support the hypothesis that LRTS played a significant role in human gene evolution by providing cis-regulatory sequences. The important alternative way to contribute to the evolution of gene structure and function, a direct incorporation of LTR sequences into protein coding regions was observed less frequently. Combination of computational and experimental approaches used for tracing the history of the LTR exonization process of IL22RA2 gene presents a promising strategy that could facilitate further studies of transposon initiated gene evolution.
Science news