450 research outputs found

    Transcriptome Sequencing Demonstrates that Human Papillomavirus Is Not Active in Cutaneous Squamous Cell Carcinoma

    Get PDF
    β-Human papillomavirus (β-HPV) DNA is present in some cutaneous squamous cell carcinomas (cuSCCs), but no mechanism of carcinogenesis has been determined. We used ultra-high-throughput sequencing of the cancer transcriptome to assess whether papillomavirus transcripts are present in these cancers. In all, 67 cuSCC samples were assayed for β-HPV DNA by PCR, and viral loads were measured with type-specific quantitative PCR. A total of 31 SCCs were selected for whole transcriptome sequencing. Transcriptome libraries were prepared in parallel from the HPV18-positive HeLa cervical cancer cell line and HPV16-positive primary cervical and periungual SCCs. Of the tumors, 30% (20/67) were positive for β-HPV DNA, but there was no difference in β-HPV viral load between tumor and normal tissue (P=0.310). Immunosuppression and age were significantly associated with higher viral load (P=0.016 for immunosuppression; P=0.0004 for age). Transcriptome sequencing failed to identify papillomavirus expression in any of the skin tumors. In contrast, HPV16 and HPV18 mRNA transcripts were readily identified in primary cervical and periungual cancers and HeLa cells. These data demonstrate that papillomavirus mRNA expression is not a factor in the maintenance of cuSCCs

    Comparative Analysis of Tandem Repeats from Hundreds of Species Reveals Unique Insights into Centromere Evolution

    Get PDF
    Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. The assumption that the most abundant tandem repeat is the centromere DNA was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and in length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond ~50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution, including the appearance of higher order repeat structures in which several polymorphic monomers make up a larger repeating unit. While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animals and plants. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes

    The NBD-NBD interface is not the sole determinant for transport in ABC transporters

    Get PDF
    International audienceOne of the most exciting scientific challenges in functional genomics concerns the discovery of biologically relevant patterns from gene expression data. For instance, it is extremely useful to provide putative synexpression groups or transcription modules to molecular biologists. We propose a methodology that has been proved useful in real cases. It is described as a prototypical KDD scenario which starts from raw expression data selection until useful patterns are delivered. Our conceptual contribution is (a) to emphasize how to take the most from recent progress in constraint-based mining of set patterns, and (b) to propose a generic approach for gene expression data enrichment. The methodology has been validated on real data sets

    Metagenomic next-generation sequencing of samples from pediatric febrile illness in Tororo, Uganda.

    Get PDF
    Febrile illness is a major burden in African children, and non-malarial causes of fever are uncertain. In this retrospective exploratory study, we used metagenomic next-generation sequencing (mNGS) to evaluate serum, nasopharyngeal, and stool specimens from 94 children (aged 2-54 months) with febrile illness admitted to Tororo District Hospital, Uganda. The most common microbes identified were Plasmodium falciparum (51.1% of samples) and parvovirus B19 (4.4%) from serum; human rhinoviruses A and C (40%), respiratory syncytial virus (10%), and human herpesvirus 5 (10%) from nasopharyngeal swabs; and rotavirus A (50% of those with diarrhea) from stool. We also report the near complete genome of a highly divergent orthobunyavirus, tentatively named Nyangole virus, identified from the serum of a child diagnosed with malaria and pneumonia, a Bwamba orthobunyavirus in the nasopharynx of a child with rash and sepsis, and the genomes of two novel human rhinovirus C species. In this retrospective exploratory study, mNGS identified multiple potential pathogens, including 3 new viral species, associated with fever in Ugandan children

    Complete genome sequence of an astrovirus identified in a domestic rabbit (\u3cem\u3eOryctolagus cuniculus\u3c/em\u3e) with gastroenteritis

    Get PDF
    A colony of domestic rabbits in Tennessee, USA, experienced a high-mortality (~90%) outbreak of enterocolitis. The clinical characteristics were one to six days of lethargy, bloating, and diarrhea, followed by death. Heavy intestinal coccidial load was a consistent finding as was mucoid enteropathy with cecal impaction. Preliminary analysis by electron microscopy revealed the presence of virus-like particles in the stool of one of the affected rabbits. Analysis using the Virochip, a viral detection microarray, suggested the presence of an astrovirus, and follow-up PCR and sequence determination revealed a previously uncharacterized member of that family. Metagenomic sequencing enabled the recovery of the complete viral genome, which contains the characteristic attributes of astrovirus genomes. Attempts to propagate the virus in tissue culture have yet to succeed. Although astroviruses cause gastroenteric disease in other mammals, the pathogenicity of this virus and the relationship to this outbreak remains to be determined. This study therefore defines a viral species and a potential rabbit pathogen

    Plasmodium falciparum Resistance to a Lead Benzoxaborole Due to Blocked Compound Activation and Altered Ubiquitination or Sumoylation.

    Get PDF
    New antimalarial drugs are needed. The benzoxaborole AN13762 showed excellent activity against cultured Plasmodium falciparum, against fresh Ugandan P. falciparum isolates, and in murine malaria models. To gain mechanistic insights, we selected in vitro for P. falciparum isolates resistant to AN13762. In all of 11 independent selections with 100 to 200 nM AN13762, the 50% inhibitory concentration (IC50) increased from 18-118 nM to 180-890 nM, and whole-genome sequencing of resistant parasites demonstrated mutations in prodrug activation and resistance esterase (PfPARE). The introduction of PfPARE mutations led to a similar level of resistance, and recombinant PfPARE hydrolyzed AN13762 to the benzoxaborole AN10248, which has activity similar to that of AN13762 but for which selection of resistance was not readily achieved. Parasites further selected with micromolar concentrations of AN13762 developed higher-level resistance (IC50, 1.9 to 5.0 μM), and sequencing revealed additional mutations in any of 5 genes, 4 of which were associated with ubiquitination/sumoylation enzyme cascades; the introduction of one of these mutations, in SUMO-activating enzyme subunit 2, led to a similar level of resistance. The other gene mutated in highly resistant parasites encodes the P. falciparum cleavage and specificity factor homolog PfCPSF3, previously identified as the antimalarial target of another benzoxaborole. Parasites selected for resistance to AN13762 were cross-resistant with a close analog, AN13956, but not with standard antimalarials, AN10248, or other benzoxaboroles known to have different P. falciparum targets. Thus, AN13762 appears to have a novel mechanism of antimalarial action and multiple mechanisms of resistance, including loss of function of PfPARE preventing activation to AN10248, followed by alterations in ubiquitination/sumoylation pathways or PfCPSF3.IMPORTANCE Benzoxaboroles are under study as potential new drugs to treat malaria. One benzoxaborole, AN13762, has potent activity and promising features, but its mechanisms of action and resistance are unknown. To gain insights into these mechanisms, we cultured malaria parasites with nonlethal concentrations of AN13762 and generated parasites with varied levels of resistance. Parasites with low-level resistance had mutations in PfPARE, which processes AN13762 into an active metabolite; PfPARE mutations prevented this processing. Parasites with high-level resistance had mutations in any of a number of enzymes, mostly those involved in stress responses. Parasites selected for AN13762 resistance were not resistant to other antimalarials, suggesting novel mechanisms of action and resistance for AN13762, a valuable feature for a new class of antimalarial drugs

    Metagenomic characterization of swine slurry in a North American swine farm operation

    Get PDF
    Abstract Modern day large-scale, high-density farming environments are inherently susceptible to viral outbreaks, inadvertently creating conditions that favor increased pathogen transmission and potential zoonotic spread. Metagenomic sequencing has proven to be a useful tool for characterizing the microbial burden in both people, livestock, and environmental samples. International efforts have been successful at characterizing pathogens in commercial farming environments, especially swine farms, however it is unclear whether the full extent of microbial agents have been adequately captured or is representative of farms elsewhere. To augment international efforts we performed metagenomic next-generation sequencing on nine swine slurry and three environmental samples from a United States of America (U.S.A.) farm operation, characterized the microbial composition of slurry, and identified novel viruses. We assembled a remarkable total of 1792 viral genomes, of which 554 were novel/divergent. We assembled 1637 Picobirnavirus genome segments, of which 538 are novel. In addition, we discovered 10 new viruses belonging to a novel taxon: porcine Statoviruses; which have only been previously reported in human, macaques, mouse, and cows. We assembled 3 divergent Posaviruses and 3 swine Picornaviruses. In addition to viruses described, we found other eukaryotic genera such as Entamoeba and Blastocystis, and bacterial genera such as Listeria, Treponema, Peptoclostridium and Bordetella in the slurry. Of these, two species Entamoeba histolytica and Listeria monocytogenes known to cause human disease were detected. Further, antimicrobial resistance genes such as tetracycline and MLS (macrolide, lincosamide, streptogramin) were also identified. Metagenomic surveillance in swine fecal slurry has great potential for novel and antimicrobial resistant pathogen detection

    Identification of a Novel Gammaretrovirus in Prostate Tumors of Patients Homozygous for R462Q RNASEL Variant

    Get PDF
    Ribonuclease L (RNase L) is an important effector of the innate antiviral response. Mutations or variants that impair function of RNase L, particularly R462Q, have been proposed as susceptibility factors for prostate cancer. Given the role of this gene in viral defense, we sought to explore the possibility that a viral infection might contribute to prostate cancer in individuals harboring the R462Q variant. A viral detection DNA microarray composed of oligonucleotides corresponding to the most conserved sequences of all known viruses identified the presence of gammaretroviral sequences in cDNA samples from seven of 11 R462Q-homozygous (QQ) cases, and in one of eight heterozygous (RQ) and homozygous wild-type (RR) cases. An expanded survey of 86 tumors by specific RT-PCR detected the virus in eight of 20 QQ cases (40%), compared with only one sample (1.5%) among 66 RQ and RR cases. The full-length viral genome was cloned and sequenced independently from three positive QQ cases. The virus, named XMRV, is closely related to xenotropic murine leukemia viruses (MuLVs), but its sequence is clearly distinct from all known members of this group. Comparison of gag and pol sequences from different tumor isolates suggested infection with the same virus in all cases, yet sequence variation was consistent with the infections being independently acquired. Analysis of prostate tissues from XMRV-positive cases by in situ hybridization and immunohistochemistry showed that XMRV nucleic acid and protein can be detected in about 1% of stromal cells, predominantly fibroblasts and hematopoietic elements in regions adjacent to the carcinoma. These data provide to our knowledge the first demonstration that xenotropic MuLV-related viruses can produce an authentic human infection, and strongly implicate RNase L activity in the prevention or clearance of infection in vivo. These findings also raise questions about the possible relationship between exogenous infection and cancer development in genetically susceptible individuals

    Gene Function Classification Using Bayesian Models with Hierarchy-Based Priors

    Get PDF
    We investigate the application of hierarchical classification schemes to the annotation of gene function based on several characteristics of protein sequences including phylogenic descriptors, sequence based attributes, and predicted secondary structure. We discuss three Bayesian models and compare their performance in terms of predictive accuracy. These models are the ordinary multinomial logit (MNL) model, a hierarchical model based on a set of nested MNL models, and a MNL model with a prior that introduces correlations between the parameters for classes that are nearby in the hierarchy. We also provide a new scheme for combining different sources of information. We use these models to predict the functional class of Open Reading Frames (ORFs) from the E. coli genome. The results from all three models show substantial improvement over previous methods, which were based on the C5 algorithm. The MNL model using a prior based on the hierarchy outperforms both the non-hierarchical MNL model and the nested MNL model. In contrast to previous attempts at combining these sources of information, our approach results in a higher accuracy rate when compared to models that use each data source alone. Together, these results show that gene function can be predicted with higher accuracy than previously achieved, using Bayesian models that incorporate suitable prior information
    corecore