    Rule-based knowledge aggregation for large-scale protein sequence analysis of influenza A viruses

    Background: The explosive growth of biological data provides opportunities for new statistical and comparative analyses of large information sets, such as alignments comprising tens of thousands of sequences. In such studies, sequence annotations frequently play an essential role, and reliable results depend on metadata quality. However, the semantic heterogeneity and annotation inconsistencies in biological databases greatly increase the complexity of aggregating and cleaning metadata. Manual curation of datasets, traditionally favoured by life scientists, is impractical for studies involving thousands of records. In this study, we investigate quality issues that affect major public databases, and quantify the effectiveness of an automated metadata extraction approach that combines structural and semantic rules. We applied this approach to more than 90,000 influenza A records, to annotate sequences with protein name, virus subtype, isolate, host, geographic origin, and year of isolation. Results: Over 40,000 annotated Influenza A protein sequences were collected by combining information from more than 90,000 documents from NCBI public databases. Metadata values were automatically extracted, aggregated and reconciled from several document fields by applying user-defined structural rules. For each property, values were recovered from ≥88.8% of records, with accuracy exceeding 96% in most cases. Because of semantic heterogeneity, each property required up to six different structural rules to be combined. Significant quality differences between databases were found: GenBank documents yield values more reliably than documents extracted from GenPept. Using a simple set of semantic rules and a reasoner, we reconstructed relationships between sequences from the same isolate, thus identifying 7640 isolates. Validation of isolate metadata against a simple ontology highlighted more than 400 inconsistencies, leading to over 3,000 property value corrections. 
Conclusion: To overcome the quality issues inherent in public databases, automated knowledge aggregation with embedded intelligence is needed for large-scale analyses. Our results show that user-controlled intuitive approaches, based on combination of simple rules, can reliably automate various curation tasks, reducing the need for manual corrections to approximately 5% of the records. Emerging semantic technologies possess desirable features to support today's knowledge aggregation tasks, with a potential to bring immediate benefits to this field
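The combination of several structural rules per property, as described in the abstract above, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the record field names (`organism`, `definition`, `strain`), the regular expressions, and the helper names `make_rule` and `extract` are hypothetical and are not the authors' rules or the NCBI schema. Each property has an ordered list of rules, and the first rule that yields a value wins.

```python
import re

# Illustrative patterns (assumptions, not the rules used in the study).
SUBTYPE = re.compile(r"\bH\d{1,2}N\d{1,2}\b")   # e.g. H5N1
YEAR = re.compile(r"/((?:19|20)\d{2})\b")       # four-digit year after a slash


def make_rule(field, pattern, group=0):
    """Build a structural rule: search one record field with one pattern."""
    def rule(record):
        match = pattern.search(record.get(field, ""))
        return match.group(group) if match else None
    return rule


# Ordered rules per property; later rules act as fallbacks for earlier ones.
RULES = {
    "subtype": [make_rule("organism", SUBTYPE), make_rule("definition", SUBTYPE)],
    "year": [make_rule("strain", YEAR, 1), make_rule("organism", YEAR, 1)],
}


def extract(record):
    """Return the first non-empty value per property, or None if no rule fires."""
    result = {}
    for prop, rules in RULES.items():
        values = (rule(record) for rule in rules)
        result[prop] = next((v for v in values if v is not None), None)
    return result


# Hypothetical record: the "strain" field is missing, so the year falls back
# to the second rule, which reads the organism field instead.
record = {
    "organism": "Influenza A virus (A/chicken/Hong Kong/220/1997(H5N1))",
    "definition": "hemagglutinin [Influenza A virus]",
}
print(extract(record))
```

Ordering rules by reliability mirrors the abstract's observation that a single property may need several rules because the same value appears in different fields across records.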

    Immunogenicity of Fractional Doses of Tetravalent A/C/Y/W135 Meningococcal Polysaccharide Vaccine: Results from a Randomized Non-Inferiority Controlled Trial in Uganda

    Meningitis is an infection of the lining of the brain and spinal cord that can cause high fever, blood poisoning, and brain damage, and is fatal in up to 10% of cases. Epidemics of meningitis occur almost every year in parts of sub-Saharan Africa, throughout a high-burden area spanning Senegal to Ethiopia dubbed the “Meningitis Belt.” Most epidemics in Africa are caused by Neisseria meningitidis (mostly serogroups A and W135). Mass vaccination campaigns attempt to control epidemics by administering meningococcal vaccines targeted against these serogroups, among others. However, these vaccines are currently in short supply worldwide. We studied the use of fractional (1/5 and 1/10) doses of a licensed vaccine to assess their non-inferiority compared with the normal full dose. In a randomized trial in Uganda, we found that immune response and safety using a 1/5 dose were comparable to full dose for three serogroups (A, Y, W135), though not a fourth (C). In light of current shortages of meningococcal vaccines and their importance in fighting meningitis epidemics around the world, we suggest that fractional doses be considered in mass vaccination campaigns.

    AML1/ETO Oncoprotein Is Directed to AML1 Binding Regions and Co-Localizes with AML1 and HEB on Its Targets

    A reciprocal translocation involving chromosomes 8 and 21 generates the AML1/ETO oncogenic transcription factor that initiates acute myeloid leukemia by recruiting co-repressor complexes to DNA. AML1/ETO interferes with the function of its wild-type counterpart, AML1, by directly targeting AML1 binding sites. However, transcriptional regulation determined by AML1/ETO probably relies on a more complex network, since the fusion protein has been shown to interact with a number of other transcription factors, in particular E-proteins, and may therefore target other sites on DNA. Genome-wide chromatin immunoprecipitation and expression profiling were exploited to identify AML1/ETO-dependent transcriptional regulation. AML1/ETO was found to co-localize with AML1, demonstrating that the fusion protein follows the binding pattern of the wild-type protein but does not function primarily by displacing it. The DNA binding profile of the E-protein HEB was grossly rearranged upon expression of AML1/ETO, and the fusion protein was found to co-localize with both AML1 and HEB on many of its regulated targets. Furthermore, the level of HEB protein was increased in both primary cells and cell lines expressing AML1/ETO. Our results suggest a major role for the functional interaction of AML1/ETO with AML1 and HEB in transcriptional regulation determined by the fusion protein

    Induction of apoptosis in myeloid leukaemic cells by ribozymes targeted against AML1/MTG8

    The translocation (8;21)(q22;q22) is a karyotypic abnormality detected in acute myeloid leukaemia (AML) M2 and results in the formation of the chimeric fusion gene AML1/MTG8. We previously reported that two hammerhead ribozymes against AML1/MTG8 cleave this fusion transcript and also inhibit the proliferation of the myeloid leukaemia cell line Kasumi-1, which possesses t(8;21)(q22;q22). In this study, we investigated the mechanisms by which ribozymes inhibit proliferation in myeloid leukaemic cells with t(8;21)(q22;q22). These ribozymes specifically inhibited the growth of Kasumi-1 cells, but did not affect leukaemic cells without t(8;21)(q22;q22). We observed morphological changes including chromatin condensation, fragmentation and the formation of apoptotic bodies in Kasumi-1 cells incubated with ribozymes for 7 days. In addition, DNA ladder formation was detected after incubation with ribozymes, which suggested the induction of apoptosis in Kasumi-1 cells by the AML1/MTG8 ribozymes. However, the ribozymes did not induce the expression of CD11b and CD14 antigens in Kasumi-1 cells. These data suggest that the ribozymes inhibit the growth of myeloid leukaemic cells with t(8;21)(q22;q22) by inducing apoptosis, not differentiation. We therefore conclude that ribozymes targeted against AML1/MTG8 may have therapeutic potential for patients with AML carrying t(8;21)(q22;q22), and that the product of the chimeric gene is responsible for the pathogenesis of myeloid leukaemia. © 1999 Cancer Research Campaign

    Transgenic Expression of Entire Hepatitis B Virus in Mice Induces Hepatocarcinogenesis Independent of Chronic Liver Injury

    Hepatocellular carcinoma (HCC), the third leading cause of cancer deaths worldwide, is most commonly caused by chronic hepatitis B virus (HBV) infection. However, whether HBV plays any direct role in carcinogenesis, other than indirectly causing chronic liver injury by inciting the host immune response, remains unclear. We have established two independent transgenic mouse lines expressing the complete genome of a mutant HBV (“preS2 mutant”) that is found at much higher frequencies in people with HCC than in those without. The transgenic mice show evidence of stress in the endoplasmic reticulum (ER) and overexpression of cyclin D1 in hepatocytes. These mice do not show any evidence of chronic liver injury, but by 2 years of age a majority of the male mice develop hepatocellular neoplasms, including HCC. Unexpectedly, we also found a significant increase in hepatocarcinogenesis independent of necroinflammation in a transgenic line expressing the entire wildtype HBV. As in the mutant HBV mice, HCC was found only in aged (2-year-old) mice of the wildtype HBV line. The karyotype in all three transgenic lines appears normal and none of the integration sites of the HBV transgene in the mice is near an oncogene or tumor suppressor gene. The significant increase of HCC incidence in all three transgenic lines, expressing either mutant or wildtype HBV, therefore argues strongly that in the absence of chronic necroinflammation, HBV can contribute directly to the development of HCC.

    Contrasting Diversity Patterns of Crenarchaeal, Bacterial and Fungal Soil Communities in an Alpine Landscape

    Background: The advent of molecular techniques in microbial ecology has aroused interest in understanding the spatial distribution of regional pools of soil microbes and the main drivers responsible for these spatial patterns. Here, we assessed the distribution of crenarchaeal, bacterial and fungal communities in an alpine landscape displaying high turnover in plant species over short distances. Our aim is to determine the relative contribution of plant species composition, environmental conditions, and geographic isolation to microbial community distribution. Methodology/Principal Findings: Eleven types of habitats that best represent the landscape heterogeneity were investigated. Crenarchaeal, bacterial and fungal communities were described by means of Single Strand Conformation Polymorphism. Relationships between microbial beta diversity patterns were examined by using Bray-Curtis dissimilarities and Principal Coordinate Analyses. Distance-based redundancy analyses and variation partitioning were used to estimate the relative contributions of different drivers to microbial beta diversity. Microbial communities tended to be habitat-specific and did not display significant spatial autocorrelation. Microbial beta diversity correlated with soil pH. Fungal beta diversity was mainly related to soil organic matter. Though the effect of plant species composition was significant for all microbial groups, it was much stronger for Fungi. In contrast, geographic distances did not have any effect on microbial beta diversity. Conclusions/Significance: Microbial communities exhibit non-random spatial patterns of diversity in alpine landscapes. Crenarchaeal, bacterial and fungal community turnover is high and associated with plant species composition through different sets of soil variables, but is not caused by geographical isolation.
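The Bray-Curtis dissimilarity named in the abstract above compares two community profiles as the sum of absolute abundance differences divided by the total abundance of both samples. The following is a minimal sketch of that standard formula only. The function name and the example vectors are hypothetical, chosen for illustration, and are not data from the study.

```python
def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two non-negative abundance vectors.

    Returns 0.0 for identical profiles and 1.0 for profiles with no shared
    components. Two all-zero profiles are treated as identical (0.0).
    """
    if len(a) != len(b):
        raise ValueError("abundance vectors must have the same length")
    total = sum(a) + sum(b)
    if total == 0:
        return 0.0
    return sum(abs(x - y) for x, y in zip(a, b)) / total


# Hypothetical peak intensities for two habitats (illustrative values only).
habitat_1 = [10, 0, 5]
habitat_2 = [5, 5, 5]
print(bray_curtis(habitat_1, habitat_2))
```

A matrix of such pairwise values between habitats is the usual input to an ordination such as Principal Coordinate Analysis.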

    The changing form of Antarctic biodiversity

    Antarctic biodiversity is much more extensive, ecologically diverse and biogeographically structured than previously thought. Understanding of how this diversity is distributed in marine and terrestrial systems, the mechanisms underlying its spatial variation, and the significance of the microbiota is growing rapidly. Broadly recognizable drivers of diversity variation include energy availability and historical refugia. The impacts of local human activities and global environmental change nonetheless pose challenges to the current and future understanding of Antarctic biodiversity. Life in the Antarctic and the Southern Ocean is surprisingly rich, and as much at risk from environmental change as it is elsewhere.

    Somatic Mutagenesis with a Sleeping Beauty Transposon System Leads to Solid Tumor Formation in Zebrafish

    Large-scale sequencing of human cancer genomes and mouse transposon-induced tumors has identified a vast number of genes mutated in different cancers. One of the outstanding challenges in this field is to determine which genes, when mutated, contribute to cellular transformation and tumor progression. To identify new and conserved genes that drive tumorigenesis we have developed a novel cancer model in a distantly related vertebrate species, the zebrafish, Danio rerio. The Sleeping Beauty (SB) T2/Onc transposon system was adapted for somatic mutagenesis in zebrafish. The carp β-actin promoter was cloned into T2/Onc to create T2/OncZ. Two transgenic zebrafish lines that contain large concatemers of T2/OncZ were isolated by injection of linear DNA into the zebrafish embryo. The T2/OncZ transposons were mobilized throughout the zebrafish genome from the transgene array by injecting SB11 transposase RNA at the 1-cell stage. Alternatively, the T2/OncZ zebrafish were crossed to a transgenic line that constitutively expresses SB11 transposase. T2/OncZ transposon integration sites were cloned by ligation-mediated PCR and sequenced on a Genome Analyzer II. Between 700 and 6,800 unique integration events in individual fish were mapped to the zebrafish genome. The data show that introduction of transposase by transgene expression or RNA injection results in an even distribution of transposon re-integration events across the zebrafish genome. SB11 mRNA injection resulted in neoplasms in 10% of adult fish at ∼10 months of age. T2/OncZ-induced zebrafish tumors contain many mutated genes in common with human and mouse cancer genes. These analyses validate our mutagenesis approach and provide additional support for the involvement of these genes in human cancers. The zebrafish T2/OncZ cancer model will be useful for identifying novel and conserved genetic drivers of human cancers.

    Origin and Evolution of TRIM Proteins: New Insights from the Complete TRIM Repertoire of Zebrafish and Pufferfish

    Tripartite motif proteins (TRIM) constitute a large family of proteins containing a RING-Bbox-Coiled Coil motif followed by different C-terminal domains. Involved in ubiquitination, TRIM proteins participate in many cellular processes including antiviral immunity. The TRIM family is ancient and has been greatly diversified in vertebrates and especially in fish. We analyzed the complete sets of trim genes of the large zebrafish genome and of the compact pufferfish genome. Both contain three large multigene subsets - adding the hsl5/trim35-like genes (hltr) to the ftr and the btr that we previously described - all containing a B30.2 domain that evolved under positive selection. These subsets are conserved among teleosts. By contrast, most human trim genes of the other classes have only one or two orthologues in fish. Loss or gain of C-terminal exons generated proteins with different domain organizations, either by the deletion of the ancestral domain or, remarkably, by the acquisition of a new C-terminal domain. Our survey of trim genes in fish identifies subsets with different evolutionary dynamics. trims encoding RBCC-B30.2 proteins show the same evolutionary trends in fish and tetrapods: they evolve fast, often under positive selection, and they duplicate to create multigenic families. We could identify new combinations of domains, which epitomize how new trim classes appear by domain insertion or exon shuffling. Notably, we found that a cyclophilin-A domain replaces the B30.2 domain of a zebrafish fintrim gene, as reported in the macaque and owl monkey antiretroviral TRIM5α. Finally, trim genes encoding RBCC-B30.2 proteins are preferentially located in the vicinity of MHC or MHC gene paralogues, which suggests that such trim genes may have been part of the ancestral MHC.