3 research outputs found

    Identification, comprehensive characterization, and comparative genomics of the HERV-K(HML8) integrations in the human genome

    Get PDF
    Around 8% of the human genome is composed by Human Endogenous Retroviruses (HERVs), ancient viral sequences inherited from the primate germ line after their infection by now extinct retroviruses. Given the still underexplored physiological and pathological roles of HERVs, it is fundamental to increase our information about the genomic composition of the different groups, to lay reliable foundation for functional studies. Among HERVs, the most characterized elements belong to the beta-like superfamily HERV-K, comprising 10 groups (HML1-10) with HML2 being the most recent and studied one. Among HMLs, the HML8 group is the only one still lacking a comprehensive genomic description. In the present work, we investigated HML8 sequences' distribution in the human genome (GRCh38/hg38), identifying 23 novel proviruses and characterizing the overall 78 HML8 proviruses in terms of genome structure, phylogeny, and integration pattern. HML8 elements were significantly enriched in human chromosomes 8 and X (p<0.005) while chromosomes 17 and 20 showed fewer integrations than expected (p<0.025 and p<0.005, respectively). Phylogenetic analyses classified HML8 members into 3 clusters, corresponding to the three LTR types MER11A, MER11B and MER11C. Besides different LTR types, common signatures in the internal structure suggested the potential existence of three different ancestral HML8 variants. Accordingly, time of integration estimation coupled with comparative genomics revealed that these three clusters have a different time of integration in the primates' genome, with MER11C elements being significantly younger than MER11A- and MER11B associated proviruses (p<0.005 and p<0.05, respectively). Approximately 30% of the HML8 elements were found co-localized within human genes, sometimes in exonic portions and with the same orientation, deserving further studies for their possible effects on gene expression. Overall, we provide the first detailed picture of the HML8 group distribution and variety among the genome, creating the backbone for the specific analysis of their transcriptional activity in healthy and diseased conditions

    Comprehensive Analysis of HERV Transcriptome in HIV+ Cells: Absence of HML2 Activation and General Downregulation of Individual HERV Loci

    Get PDF
    Human endogenous retrovirus (HERV) expression is currently studied for its possible activation by HIV infection. In this context, the HERV-K(HML2) group is the most investigated: it has been proposed that HIV-1 infection can prompt HML2 transcription, and that HML2 proteins can affect HIV-1 replication, either complementing HIV or possibly influencing antiretroviral therapy. However, little information is available on the expression of other HERV groups in HIV infection. In the present study, we used a bioinformatics pipeline to investigate the transcriptional modulation of approximately 3250 well-characterized HERV loci, comparing their expression in a public RNA-seq profile, including a HIV-1-infected and a control T cell culture. In our pilot study, we found approximately 200 HERV loci belonging to 35 HERV groups that were expressed in one or both conditions, with transcripts per million (TPM) values from 1 to >500. Intriguingly, HML2 elements constituted only the 3% of expressed HERV loci, and in most cases (160) HERV expression was downregulated in the HIV-infected culture, showing from a 1- to 14-fold decrease as compared to uninfected cells. HERV transcriptome has been inferred de novo and employed to predict a total of about 950 HERV open reading frames (ORFs). These have been validated according to the coding potential and estimated abundance of the corresponding transcripts, leading to a set of 57 putative proteins potentially encoded by 23 HERV loci. Analysis showed that some individual loci have a coding potential that deserves further investigation. Among them, a HML6 provirus at locus 19q13.43 was predicted to produce a transcript showing the highest TPM among HERV-derived transcripts, being upregulated in HIV+ cells and inferred to produce Gag and Env puteins with possible biological activity

    HERV-K(HML7) Integrations in the Human Genome: Comprehensive Characterization and Comparative Analysis in Non-Human Primates

    No full text
    Endogenous Retroviruses (ERVs) are ancient relics of infections that affected the primate germ line and constitute about 8% of our genome. Growing evidence indicates that ERVs had a major role in vertebrate evolution, being occasionally domesticated by the host physiology. In addition, human ERV (HERV) expression is highly investigated for a possible pathological role, even if no clear associations have been reported yet. In fact, on the one side, the study of HERV expression in high-throughput data is a powerful and promising tool to assess their actual dysregulation in diseased conditions; but, on the other side, the poor knowledge about the various HERV group genomic diversity and individual members somehow prevented the association between specific HERV loci and a given molecular mechanism of pathogenesis. The present study is focused on the HERV-K(HML7) group that—differently from the other HERV-K members—still remains poorly characterized. Starting from an initial identification performed with the software RetroTector, we collected 23 HML7 proviral insertions and about 160 HML7 solitary LTRs that were analyzed in terms of genomic distribution, revealing a significant enrichment in chromosome X and the frequent localization within human gene introns as well as in pericentromeric and centromeric regions. Phylogenetic analyses showed that HML7 members form a monophyletic group, which based on age estimation and comparative localization in non-human primates had its major diffusion between 20 and 30 million years ago. Structural characterization revealed that besides 3 complete HML7 proviruses, the other group members shared a highly defective structure that, however, still presents recognizable functional domains, making it worth further investigation in the human population to assess the presence of residual coding potential
    corecore