69 research outputs found

    GOATOOLS: A Python library for Gene Ontology analyses.

    Get PDF
    The biological interpretation of gene lists with interesting shared properties, such as up- or down-regulation in a particular experiment, is typically accomplished using gene ontology enrichment analysis tools. Given a list of genes, a gene ontology (GO) enrichment analysis may return hundreds of statistically significant GO results in a "flat" list, which can be challenging to summarize. It can also be difficult to keep pace with rapidly expanding biological knowledge, which often results in daily changes to any of the over 47,000 gene ontologies that describe biological knowledge. GOATOOLS, a Python-based library, makes it more efficient to stay current with the latest ontologies and annotations, perform gene ontology enrichment analyses to determine over- and under-represented terms, and organize results for greater clarity and easier interpretation using a novel GOATOOLS GO grouping method. We performed functional analyses on both stochastic simulation data and real data from a published RNA-seq study to compare the enrichment results from GOATOOLS to two other popular tools: DAVID and GOstats. GOATOOLS is freely available through GitHub: https://github.com/tanghaibao/goatools

    Prediction of HIV-1 virus-host protein interactions using virus and host sequence motifs

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Host protein-protein interaction networks are altered by invading virus proteins, which create new interactions, and modify or destroy others. The resulting network topology favors excessive amounts of virus production in a stressed host cell network. Short linear peptide motifs common to both virus and host provide the basis for host network modification.</p> <p>Methods</p> <p>We focused our host-pathogen study on the binding and competing interactions of HIV-1 and human proteins. We showed that peptide motifs conserved across 70% of HIV-1 subtype B and C samples occurred in similar positions on HIV-1 proteins, and we documented protein domains that interact with these conserved motifs. We predicted which human proteins may be targeted by HIV-1 by taking pairs of human proteins that may interact via a motif conserved in HIV-1 and the corresponding interacting protein domain.</p> <p>Results</p> <p>Our predictions were enriched with host proteins known to interact with HIV-1 proteins ENV, NEF, and TAT (p-value < 4.26E-21). Cellular pathways statistically enriched for our predictions include the T cell receptor signaling, natural killer cell mediated cytotoxicity, cell cycle, and apoptosis pathways. Gene Ontology molecular function level 5 categories enriched with both predicted and confirmed HIV-1 targeted proteins included categories associated with phosphorylation events and adenyl ribonucleotide binding.</p> <p>Conclusion</p> <p>A list of host proteins highly enriched with those targeted by HIV-1 proteins can be obtained by searching for host protein motifs along virus protein sequences. The resulting set of host proteins predicted to be targeted by virus proteins will become more accurate with better annotations of motifs and domains. Nevertheless, our study validates the role of linear binding motifs shared by virus and host proteins as an important part of the crosstalk between virus and host.</p

    GOATOOLS: A Python library for Gene Ontology analyses

    Get PDF
    The biological interpretation of gene lists with interesting shared properties, such as up- or down-regulation in a particular experiment, is typically accomplished using gene ontology enrichment analysis tools. Given a list of genes, a gene ontology (GO) enrichment analysis may return hundreds of statistically significant GO results in a “flat” list, which can be challenging to summarize. It can also be difficult to keep pace with rapidly expanding biological knowledge, which often results in daily changes to any of the over 47,000 gene ontologies that describe biological knowledge. GOATOOLS, a Python-based library, makes it more efficient to stay current with the latest ontologies and annotations, perform gene ontology enrichment analyses to determine over- and under-represented terms, and organize results for greater clarity and easier interpretation using a novel GOATOOLS GO grouping method. We performed functional analyses on both stochastic simulation data and real data from a published RNA-seq study to compare the enrichment results from GOATOOLS to two other popular tools: DAVID and GOstats. GOATOOLS is freely available through GitHub: https://github.com/tanghaibao/goatools

    Reconstructing the history of ocean wildlife around Ascension Island

    Get PDF
    1. In 2016, the UK government announced plans for a large-scale Marine Protected Area around Ascension Island, a UK Overseas Territory in the South Atlantic. 2. To improve baselines for marine life to support ambitious conservation and assess change over time, archives were searched for historical accounts of wildlife from Ascension's discovery in 1501 to the present. For more recent changes, 139 interviews with past and present inhabitants were conducted. 3. Ascension's marine life has, from first discovery to the present, been consistently remarked upon for its exceptional abundance. Historical sources indicate declines in seabird and turtle populations from human exploitation and introduction of rats and cats. They are recovering with good management, although still below pre-settlement abundance. 4. Interviews with residents indicate more recent changes, notably declines in catch per unit of fishing effort at popular shore angling sites, a decline in yellowfin tuna (Thunnus albacares) and increase in Galapagos sharks (Carcharhinus galapagensis). 5. What is very notable, however, based on the interviews, was that there was no temporal signal suggestive of recent systemic decline, in marked contrast to many parts of the world where recent wildlife declines have been pervasive and steep. Ascension represents a remarkable and immensely important centre of abundance in a sea of depletion and change, warranting full protection for all the island's waters

    HIV Protein Sequence Hotspots for Crosstalk with Host Hub Proteins

    Get PDF
    HIV proteins target host hub proteins for transient binding interactions. The presence of viral proteins in the infected cell results in out-competition of host proteins in their interaction with hub proteins, drastically affecting cell physiology. Functional genomics and interactome datasets can be used to quantify the sequence hotspots on the HIV proteome mediating interactions with host hub proteins. In this study, we used the HIV and human interactome databases to identify HIV targeted host hub proteins and their host binding partners (H2). We developed a high throughput computational procedure utilizing motif discovery algorithms on sets of protein sequences, including sequences of HIV and H2 proteins. We identified as HIV sequence hotspots those linear motifs that are highly conserved on HIV sequences and at the same time have a statistically enriched presence on the sequences of H2 proteins. The HIV protein motifs discovered in this study are expressed by subsets of H2 host proteins potentially outcompeted by HIV proteins. A large subset of these motifs is involved in cleavage, nuclear localization, phosphorylation, and transcription factor binding events. Many such motifs are clustered on an HIV sequence in the form of hotspots. The sequential positions of these hotspots are consistent with the curated literature on phenotype altering residue mutations, as well as with existing binding site data. The hotspot map produced in this study is the first global portrayal of HIV motifs involved in altering the host protein network at highly connected hub nodes

    HIVToolbox, an Integrated Web Application for Investigating HIV

    Get PDF
    Many bioinformatic databases and applications focus on a limited domain of knowledge federating links to information in other databases. This segregated data structure likely limits our ability to investigate and understand complex biological systems. To facilitate research, therefore, we have built HIVToolbox, which integrates much of the knowledge about HIV proteins and allows virologists and structural biologists to access sequence, structure, and functional relationships in an intuitive web application. HIV-1 integrase protein was used as a case study to show the utility of this application. We show how data integration facilitates identification of new questions and hypotheses much more rapid and convenient than current approaches using isolated repositories. Several new hypotheses for integrase were created as an example, and we experimentally confirmed a predicted CK2 phosphorylation site. Weblink: [http://hivtoolbox.bio-toolkit.com

    Sequence- and Interactome-Based Prediction of Viral Protein Hotspots Targeting Host Proteins: A Case Study for HIV Nef

    Get PDF
    Virus proteins alter protein pathways of the host toward the synthesis of viral particles by breaking and making edges via binding to host proteins. In this study, we developed a computational approach to predict viral sequence hotspots for binding to host proteins based on sequences of viral and host proteins and literature-curated virus-host protein interactome data. We use a motif discovery algorithm repeatedly on collections of sequences of viral proteins and immediate binding partners of their host targets and choose only those motifs that are conserved on viral sequences and highly statistically enriched among binding partners of virus protein targeted host proteins. Our results match experimental data on binding sites of Nef to host proteins such as MAPK1, VAV1, LCK, HCK, HLA-A, CD4, FYN, and GNB2L1 with high statistical significance but is a poor predictor of Nef binding sites on highly flexible, hoop-like regions. Predicted hotspots recapture CD8 cell epitopes of HIV Nef highlighting their importance in modulating virus-host interactions. Host proteins potentially targeted or outcompeted by Nef appear crowding the T cell receptor, natural killer cell mediated cytotoxicity, and neurotrophin signaling pathways. Scanning of HIV Nef motifs on multiple alignments of hepatitis C protein NS5A produces results consistent with literature, indicating the potential value of the hotspot discovery in advancing our understanding of virus-host crosstalk

    Fishery-Independent Data Reveal Negative Effect of Human Population Density on Caribbean Predatory Fish Communities

    Get PDF
    BACKGROUND: Understanding the current status of predatory fish communities, and the effects fishing has on them, is vitally important information for management. However, data are often insufficient at region-wide scales to assess the effects of extraction in coral reef ecosystems of developing nations. METHODOLOGY/PRINCIPAL FINDINGS: Here, I overcome this difficulty by using a publicly accessible, fisheries-independent database to provide a broad scale, comprehensive analysis of human impacts on predatory reef fish communities across the greater Caribbean region. Specifically, this study analyzed presence and diversity of predatory reef fishes over a gradient of human population density. Across the region, as human population density increases, presence of large-bodied fishes declines, and fish communities become dominated by a few smaller-bodied species. CONCLUSIONS/SIGNIFICANCE: Complete disappearance of several large-bodied fishes indicates ecological and local extinctions have occurred in some densely populated areas. These findings fill a fundamentally important gap in our knowledge of the ecosystem effects of artisanal fisheries in developing nations, and provide support for multiple approaches to data collection where they are commonly unavailable
    corecore