621 research outputs found
Combining classifiers for improved classification of proteins from sequence or structure
<p>Abstract</p> <p>Background</p> <p>Predicting a protein's structural or functional class from its amino acid sequence or structure is a fundamental problem in computational biology. Recently, there has been considerable interest in using discriminative learning algorithms, in particular support vector machines (SVMs), for classification of proteins. However, because sufficiently many positive examples are required to train such classifiers, all SVM-based methods are hampered by limited coverage.</p> <p>Results</p> <p>In this study, we develop a hybrid machine learning approach for classifying proteins, and we apply the method to the problem of assigning proteins to structural categories based on their sequences or their 3D structures. The method combines a full-coverage but lower accuracy nearest neighbor method with higher accuracy but reduced coverage multiclass SVMs to produce a full coverage classifier with overall improved accuracy. The hybrid approach is based on the simple idea of "punting" from one method to another using a learned threshold.</p> <p>Conclusion</p> <p>In cross-validated experiments on the SCOP hierarchy, the hybrid methods consistently outperform the individual component methods at all levels of coverage.</p> <p>Code and data sets are available at <url>http://noble.gs.washington.edu/proj/sabretooth</url></p
Recommended from our members
Target MRNA Abundance Dilutes MicroRNA and SiRNA Activity
Post-transcriptional regulation by microRNAs and siRNAs depends not only on characteristics of individual binding sites in target mRNA molecules, but also on system-level properties such as overall molecular concentrations. We hypothesize that an intracellular pool of microRNAs/siRNAs faced with a larger number of available predicted target transcripts will downregulate each individual target gene to a lesser extent. To test this hypothesis, we analyzed mRNA expression change from 178 microRNA and siRNA transfection experiments in two cell lines. We find that downregulation of particular genes mediated by microRNAs and siRNAs indeed varies with the total concentration of available target transcripts. We conclude that to interpret and design experiments involving gene regulation by small RNAs, global properties, such as target mRNA abundance, need to be considered in addition to local determinants. We propose that analysis of microRNA/siRNA targeting would benefit from a more quantitative definition, rather than simple categorization of genes as ‘target’ or ‘not a target.’ Our results are important for understanding microRNA regulation and may also have implications for siRNA design and small RNA therapeutics
Recommended from our members
Mouse cytomegalovirus-experienced ILC1s acquire a memory response dependent on the viral glycoprotein m12.
Innate lymphoid cells (ILCs) are tissue-resident sentinels that are essential for early host protection from pathogens at initial sites of infection. However, whether pathogen-derived antigens directly modulate the responses of tissue-resident ILCs has remained unclear. In the present study, it was found that liver-resident type 1 ILCs (ILC1s) expanded locally and persisted after the resolution of infection with mouse cytomegalovirus (MCMV). ILC1s acquired stable transcriptional, epigenetic and phenotypic changes a month after the resolution of MCMV infection, and showed an enhanced protective effector response to secondary challenge with MCMV consistent with a memory lymphocyte response. Memory ILC1 responses were dependent on the MCMV-encoded glycoprotein m12, and were independent of bystander activation by proinflammatory cytokines after heterologous infection. Thus, liver ILC1s acquire adaptive features in an MCMV-specific manner
Arachidonate Metabolism and the Signaling Pathway of Induction of Apoptosis by Oxidized LDL/Oxysterol
Owing at least in part to oxysterol components that can induce apoptosis, oxidized LDL (oxLDL) is cytotoxic to mammalian cells with receptors that can internalize it. Vascular cells possess such receptors, and it appears that the apoptotic response of vascular cells to the oxysterols borne by oxLDL is an important part of the atherogenic effects of oxLDL. Thus, an analysis of the signaling pathway of apoptotic induction by oxysterols is of value in understanding the development of atherosclerotic plaque. In a prior study, we demonstrated an induction of calcium ion flux into cells treated with 25-hydroxycholesterol (25-OHC) and showed that this response is essential for 25-OHC-induced apoptosis. One possible signal transduction pathway initiated by calcium ion fluxes is the activation of cytosolic phospholipase A2 (cPLA2). In the current study, we demonstrate that activation of cPLA2 does occur in both macrophages and fibroblasts treated with 25-OHC or oxLDL. Activation is evidenced by 25-OHC-induced relocalization of cPLA2 to the nuclear envelope and arachidonic acid release. Loss of cPLA2 activity, either through genetic knockout in mice, or by treatment with a cPLA2 inhibitor, results in an attenuation of arachidonic acid release as well as of the apoptotic response to oxLDL in peritoneal macrophages or to 25-OHC in cultured fibroblast and macrophage cell lines
Repetitive Sampling and Control Threshold Improve 16S rRNA Gene Sequencing Results From Produced Waters Associated With Hydraulically Fractured Shale
Sequencing microbial DNA from deep subsurface environments is complicated by a number of issues ranging from contamination to non-reproducible results. Many samples obtained from these environments – which are of great interest due to the potential to stimulate microbial methane generation – contain low biomass. Therefore, samples from these environments are difficult to study as sequencing results can be easily impacted by contamination. In this case, the low amount of sample biomass may be effectively swamped by the contaminating DNA and generate misleading results. Additionally, performing field work in these environments can be difficult, as researchers generally have limited access to and time on site. Therefore, optimizing a sampling plan to produce the best results while collecting the greatest number of samples over a short period of time is ideal. This study aimed to recommend an adequate sampling plan for field researchers obtaining microbial biomass for 16S rRNA gene sequencing, applicable specifically to low biomass oil and gas-producing environments. Forty-nine different samples were collected by filtering specific volumes of produced water from a hydraulically fractured well producing from the Niobrara Shale. Water was collected in two different sampling events 24 h apart. Four to five samples were collected from 11 specific volumes. These samples along with eight different blanks were submitted for analysis. DNA was extracted from each sample, and quantitative polymerase chain reaction (qPCR) and 16S rRNA Illumina MiSeq gene sequencing were performed to determine relative concentrations of biomass and microbial community composition, respectively. The qPCR results varied across sampled volumes, while no discernible trend correlated contamination to volume of water filtered. This suggests that collecting a larger volume of sample may not result in larger biomass concentrations or better representation of a sampled environment. Researchers could prioritize collecting many low volume samples over few high-volume samples. Our results suggest that there also may be variability in the concentration of microbial communities present in produced waters over short (i.e., hours) time scales, which warrants further investigation. Submission of multiple blanks is also vital to determining how contamination or low biomass effects may influence a sample set collected from an unknown environment
Interaction of CarD with RNA polymerase mediates Mycobacterium tuberculosis viability, rifampin resistance, and pathogenesis
Mycobacterium tuberculosis infection continues to cause substantial human suffering. New chemotherapeutic strategies, which require insight into the pathways essential for M. tuberculosis pathogenesis, are imperative. We previously reported that depletion of the CarD protein in mycobacteria compromises viability, resistance to oxidative stress and fluoroquinolones, and pathogenesis. CarD associates with the RNA polymerase (RNAP), but it has been unknown which of the diverse functions of CarD are mediated through the RNAP; this question must be answered to understand the CarD mechanism of action. Herein, we describe the interaction between the M. tuberculosis CarD and the RNAP β subunit and identify point mutations that weaken this interaction. The characterization of mycobacterial strains with attenuated CarD/RNAP β interactions demonstrates that the CarD/RNAP β association is required for viability and resistance to oxidative stress but not for fluoroquinolone resistance. Weakening the CarD/RNAP β interaction also increases the sensitivity of mycobacteria to rifampin and streptomycin. Surprisingly, depletion of the CarD protein did not affect sensitivity to rifampin. These findings define the CarD/RNAP interaction as a new target for chemotherapeutic intervention that could also improve the efficacy of rifampin treatment of tuberculosis. In addition, our data demonstrate that weakening the CarD/RNAP β interaction does not completely phenocopy the depletion of CarD and support the existence of functions for CarD independent of direct RNAP binding
Recommended from our members
FOXA1 mutations alter pioneering activity, differentiation and prostate cancer phenotypes.
Mutations in the transcription factor FOXA1 define a unique subset of prostate cancers but the functional consequences of these mutations and whether they confer gain or loss of function is unknown1-9. Here, by annotating the landscape of FOXA1 mutations from 3,086 human prostate cancers, we define two hotspots in the forkhead domain: Wing2 (around 50% of all mutations) and the highly conserved DNA-contact residue R219 (around 5% of all mutations). Wing2 mutations are detected in adenocarcinomas at all stages, whereas R219 mutations are enriched in metastatic tumours with neuroendocrine histology. Interrogation of the biological properties of wild-type FOXA1 and fourteen FOXA1 mutants reveals gain of function in mouse prostate organoid proliferation assays. Twelve of these mutants, as well as wild-type FOXA1, promoted an exaggerated pro-luminal differentiation program, whereas two different R219 mutants blocked luminal differentiation and activated a mesenchymal and neuroendocrine transcriptional program. Assay for transposase-accessible chromatin using sequencing (ATAC-seq) of wild-type FOXA1 and representative Wing2 and R219 mutants revealed marked, mutant-specific changes in open chromatin at thousands of genomic loci and exposed sites of FOXA1 binding and associated increases in gene expression. Of note, ATAC-seq peaks in cells expressing R219 mutants lacked the canonical core FOXA1-binding motifs (GTAAAC/T) but were enriched for a related, non-canonical motif (GTAAAG/A), which was preferentially activated by R219-mutant FOXA1 in reporter assays. Thus, FOXA1 mutations alter its pioneering function and perturb normal luminal epithelial differentiation programs, providing further support for the role of lineage plasticity in cancer progression
Non-redundant ISGF3 Components Promote NK Cell Survival in an Auto-regulatory Manner during Viral Infection
Global profiling of stimulus-induced polyadenylation in cells using a poly(A) trap
Polyadenylation of mRNA leads to increased protein expression in response to diverse stimuli, but it is difficult to identify mRNAs that become polyadenylated in living cells. Here we describe a click chemistry-compatible nucleoside analog that is selectively incorporated into poly(A) tails of transcripts in cells. Next-generation sequencing of labeled mRNAs enables a transcriptome-wide profile of polyadenylation and provides insights into the mRNA sequence elements that are correlated with polyadenylation
- …