10 research outputs found

    Processed pseudogenes acquired somatically during cancer development

    Get PDF
    Cancer evolves by mutation, with somatic reactivation of retrotransposons being one such mutational process. Germline retrotransposition can cause processed pseudogenes, but whether this occurs somatically has not been evaluated. Here we screen sequencing data from 660 cancer samples for somatically acquired pseudogenes. We find 42 events in 17 samples, especially non-small cell lung cancer (5/27) and colorectal cancer (2/11). Genomic features mirror those of germline LINE element retrotranspositions, with frequent target-site duplications (67%), consensus TTTTAA sites at insertion points, inverted rearrangements (21%), 5′ truncation (74%) and polyA tails (88%). Transcriptional consequences include expression of pseudogenes from UTRs or introns of target genes. In addition, a somatic pseudogene that integrated into the promoter and first exon of the tumour suppressor gene, MGA, abrogated expression from that allele. Thus, formation of processed pseudogenes represents a new class of mutation occurring during cancer development, with potentially diverse functional consequences depending on genomic context. Germline pseudogenes have an important role in human evolution. Here, the authors analyse sequencing data from 660 cancer samples and find evidence for the formation of somatically acquired pseudogenes, a new class of mutation, which may contribute to cancer development

    Processed pseudogenes acquired somatically during cancer development

    Get PDF
    Cancer evolves by mutation, with somatic reactivation of retrotransposons being one such mutational process. Germline retrotransposition can cause processed pseudogenes, but whether this occurs somatically has not been evaluated. Here we screen sequencing data from 660 cancer samples for somatically acquired pseudogenes. We find 42 events in 17 samples, especially non-small cell lung cancer (5/27) and colorectal cancer (2/11). Genomic features mirror those of germline LINE element retrotranspositions, with frequent target-site duplications (67%), consensus TTTTAA sites at insertion points, inverted rearrangements (21%), 5′ truncation (74%) and polyA tails (88%). Transcriptional consequences include expression of pseudogenes from UTRs or introns of target genes. In addition, a somatic pseudogene that integrated into the promoter and first exon of the tumour suppressor gene, MGA, abrogated expression from that allele. Thus, formation of processed pseudogenes represents a new class of mutation occurring during cancer development, with potentially diverse functional consequences depending on genomic context

    PeSV-fisher : identification of somatic and non-somatic structural variants using next generation sequencing data

    No full text
    Next-generation sequencing technologies expedited research to develop efficient computational tools for the identification of structural variants (SVs) and their use to study human diseases. As deeper data is obtained, the existence of higher complexity SVs in some genomes becomes more evident, but the detection and definition of most of these complex rearrangements is still in its infancy. The full characterization of SVs is a key aspect for discovering their biological implications. Here we present a pipeline (PeSV-Fisher) for the detection of deletions, gains, intra- and inter-chromosomal translocations, and inversions, at very reasonable computational costs. We further provide comprehensive information on co-localization of SVs in the genome, a crucial aspect for studying their biological consequences. The algorithm uses a combination of methods based on paired-reads and read-depth strategies. PeSV-Fisher has been designed with the aim to facilitate identification of somatic variation, and, as such, it is capable of analysing two or more samples simultaneously, producing a list of non-shared variants between samples. We tested PeSV-Fisher on available sequencing data, and compared its behaviour to that of frequently deployed tools (BreakDancer and VariationHunter). We have also tested this algorithm on our own sequencing data, obtained from a tumour and a normal blood sample of a patient with chronic lymphocytic leukaemia, on which we have also validated the results by targeted re-sequencing of different kinds of predictions. This allowed us to determine confidence parameters that influence the reliability of breakpoint predictions.Availability:PeSV-Fisher is available at http://gd.crg.eu/tools

    MSIMEP : Predicting microsatellite instability from microarray DNA methylation tumor profiles

    Get PDF
    Altres ajuts: Xunta de Galicia (ED481A-2017/299); Xunta de Galicia (ED481A 2022/491)Deficiency in DNA MMR activity results in tumors with a hypermutator phenotype, termed microsatellite instability (MSI). Beyond its utility in Lynch syndrome screening algorithms, today MSI has gained importance as predictive biomarker for various anti-PD-1 therapies across many different tumor types. Over the past years, many computational methods have emerged to infer MSI using either DNA- or RNA-based approaches. Considering this together with the fact that MSI-high tumors frequently exhibit a hypermethylated phenotype, herein we developed and validated MSIMEP, a computational tool for predicting MSI status from microarray DNA methylation tumor profiles of colorectal cancer samples. We demonstrated that MSIMEP optimized and reduced models have high performance in predicting MSI in different colorectal cancer cohorts. Moreover, we tested its consistency in other tumor types with high prevalence of MSI such as gastric and endometrial cancers. Finally, we demonstrated better performance of both MSIMEP models vis-à-vis a MLH1 promoter methylation-based one in colorectal cancer

    The origins and vulnerabilities of two transmissible cancers in Tasmanian devils

    Get PDF
    Transmissible cancers are clonal lineages that spread through populations via contagious cancer cells. Although rare in nature, two facial tumor clones affect Tasmanian devils. Here we perform comparative genetic and functional characterization of these lineages. The two cancers have similar patterns of mutation and show no evidence of exposure to exogenous mutagens or viruses. Genes encoding PDGF receptors have copy number gains and are present on extrachromosomal double minutes. Drug screening indicates causative roles for receptor tyrosine kinases and sensitivity to inhibitors of DNA repair. Y chromosome loss from a male clone infecting a female host suggests immunoediting. These results imply that Tasmanian devils may have inherent susceptibility to transmissible cancers and present a suite of therapeutic compounds for use in conservation.</p

    Genomic insights into the Ixodes scapularis tick vector of Lyme disease

    No full text
    Ticks transmit more pathogens to humans and animals than any other arthropod. We describe the 2.1 Gbp nuclear genome of the tick, Ixodes scapularis (Say), which vectors pathogens that cause Lyme disease, human granulocytic anaplasmosis, babesiosis and other diseases. The large genome reflects accumulation of repetitive DNA, new lineages of retro-transposons, and gene architecture patterns resembling ancient metazoans rather than pancrustaceans. Annotation of scaffolds representing ∼57% of the genome, reveals 20,486 protein-coding genes and expansions of gene families associated with tick–host interactions. We report insights from genome analyses into parasitic processes unique to ticks, including host ‘questing’, prolonged feeding, cuticle synthesis, blood meal concentration, novel methods of haemoglobin digestion, haem detoxification, vitellogenesis and prolonged off-host survival. We identify proteins associated with the agent of human granulocytic anaplasmosis, an emerging disease, and the encephalitis-causing Langat virus, and a population structure correlated to life-history traits and transmission of the Lyme disease agent

    Genome sequence of Aedes aegypti, a major arbovirus vector

    No full text
    We present a draft sequence of the genome of Aedes aegypti, the primary vector for yellow fever and dengue fever, which at approximately 1376 million base pairs is about 5 times the size of the genome of the malaria vector Anopheles gambiae. Nearly 50% of the Ae. aegypti genome consists of transposable elements. These contribute to a factor of approximately 4 to 6 increase in average gene length and in sizes of intergenic regions relative to An. gambiae and Drosophila melanogaster. Nonetheless, chromosomal synteny is generally maintained among all three insects, although conservation of orthologous gene order is higher (by a factor of approximately 2) between the mosquito species than between either of them and the fruit fly. An increase in genes encoding odorant binding, cytochrome P450, and cuticle domains relative to An. gambiae suggests that members of these protein families underpin some of the biological differences between the two mosquito species
    corecore