39 research outputs found

    Using PepExplorer to Filter and Organize De Novo Peptide Sequencing Results

    Full text link
    PepExplorer aids in the biological interpretation of de novo sequencing results; this is accomplished by assembling a list of homolog proteins obtained by aligning results from widely adopted de novo sequencing tools against a target‐decoy sequence database. Our tool relies on pattern recognition to ensure that the results satisfy a user‐given false‐discovery rate (FDR). For this, it employs a radial basis function neural network that considers the precursor charge states, de novo sequencing scores, the peptide lengths, and alignment scores. PepExplorer is recommended for studies addressing organisms with no genomic sequence available. PepExplorer is integrated into the PatternLab for proteomics environment, which makes available various tools for downstream data analysis, including the resources for quantitative and differential proteomics. © 2015 by John Wiley & Sons, Inc.Peer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/152617/1/cpbi1327.pd

    Ten Simple Rules for Taking Advantage of Git and GitHub.

    Get PDF
    Bioinformatics is a broad discipline in which one common denominator is the need to produce and/or use software that can be applied to biological data in different contexts. To enable and ensure the replicability and traceability of scientific claims, it is essential that the scientific publication, the corresponding datasets, and the data analysis are made publicly available [1,2]. All software used for the analysis should be either carefully documented (e.g., for commercial software) or, better yet, openly shared and directly accessible to others [3,4]. The rise of openly available software and source code alongside concomitant collaborative development is facilitated by the existence of several code repository services such as SourceForge, Bitbucket, GitLab, and GitHub, among others. These resources are also essential for collaborative software projects because they enable the organization and sharing of programming tasks between different remote contributors. Here, we introduce the main features of GitHub, a popular web-based platform that offers a free and integrated environment for hosting the source code, documentation, and project-related web content for open-source projects. GitHub also offers paid plans for private repositories (see Box 1) for individuals and businesses as well as free plans including private repositories for research and educational use.Biotechnology and Biological Sciences Research CouncilThis is the final version of the article. It first appeared from Public Library of Science via https://doi.org/10.1371/journal.pcbi.1004947

    BioContainers: An open-source and community-driven framework for software standardization

    Get PDF
    Motivation BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters). Availability and Implementation The software is freely available at github.com/BioContainers/.publishedVersio

    Discovering and linking public omics data sets using the Omics Discovery Index.

    Get PDF
    Biomedical data are being produced at an unprecedented rate owing to the falling cost of experiments and wider access to genomics, transcriptomics, proteomics and metabolomics platforms1, 2. As a result, public deposition of omics data is on the increase. This presents new challenges, including finding ways to store, organize and access different types of biomedical data stored on different platforms. Here, we present the Omics Discovery Index (OmicsDI; http://www.omicsdi.org), an open-source platform that enables access, discovery and dissemination of omics data sets

    Caracterização de Genes de Função Desconhecida com Expressão Associada às Formas Infectivas do Trypanosoma cruzi

    No full text
    Submitted by Anderson Silva ([email protected]) on 2013-11-04T12:15:31Z No. of bitstreams: 1 Dissertação Mestrado Felipe Leprevost.pdf: 8130365 bytes, checksum: 2af7aa9ac89832c8e4c552a4d889c5e9 (MD5)Made available in DSpace on 2013-11-04T12:15:31Z (GMT). No. of bitstreams: 1 Dissertação Mestrado Felipe Leprevost.pdf: 8130365 bytes, checksum: 2af7aa9ac89832c8e4c552a4d889c5e9 (MD5) Previous issue date: 2009Fundação Oswaldo Cruz. Instituto Oswaldo Cruz. Rio de Janeiro, RJ, BrasilAtualmente novas abordagens e metodologias de pesquisa nos permitem iniciar projetos de caracterização de um conjunto de proteínas de um determinado organismo em escalas superiores aos realizados há algum tempo. Novas tecnologias para seqüenciamento, clonagem e expressão protéica permitem a geração de uma quantidade grande de novas informações numa fração de tempo relativamente curta. Organismos como o Trypanosoma cruzi, que possuem praticamente metade de seu genoma sem função conhecida, necessitam que estudos sejam realizados para que se consiga atribuir às proteínas de função desconhecida características funcionais. Em um esforço para se conhecer melhor os genes de função desconhecida deste parasita, um grupo de 10 genes foi selecionado com base na sua expressão diferencial durante a metaciclogênese, consistindo de genes anotados como ‘proteína hipotética’ em banco de dados públicos, com domínio identificado pelo banco de famílias protéicas PFAM e com ortólogos em outros organismos. Os 10 genes selecionados foram amplificados e clonados na plataforma de clonagem Gateway e as proteínas recombinantes foram expressas em Escherichia coli. Dos 10 genes destinados à expressão protéica, 8 expressaram proteínas insolúveis as quais foram utilizadas para obtenção de soro policlonal. Os soros obtidos foram utilizados em ensaios de Western Blot para averiguação da expressão protéica ao longo da metaciclogênese do parasita e ensaios de localização celular por microscopia ótica e eletrônica. Foram realizados também ensaios de transfecção em T. cruzi para a expressão das proteínas fusionadas com GFP, visando à determinação da localização celular, utilizando um vetor compatível com a plataforma Gateway, e foram realizados ensaios de imunoprecipitação para todas as 8 proteínas expressas. Os resultados obtidos no presente trabalho auxiliam na atribuição de informações experimentais a genes e proteínas anotados como ‘proteína hipotética de função desconhecida’ e avaliam a possibilidade de implementação de um estudo sistemático com este para uma aplicação em média/larga escalaRecent research approaches and methodologies allow us to develop projects for the characterization of proteins of a given organism in scales greater than possible some time ago. New technologies for sequencing, cloning and protein expression allow the generation of large amounts of new information in relatively short time. Organisms such as Trypanosoma cruzi, which have nearly half of its genome with no known function, require studies to assign functions to proteins of unknown function. In an effort to characterize the genes of unknown function of this parasite, a group of 10 genes was selected based on their differential expression during metacyclogenesis. The genes were annotated as 'hypothetical protein' in public databases, with domains identified in the PFAM database and with orthologs in other organisms. The 10 selected genes were amplified and cloned in Gateway cloning platform and the recombinant proteins were expressed in Escherichia coli. Eight of the 10 genes expressed insoluble proteins which were used to obtain polyclonal serum. The sera were used in Western Blot analysis to verify protein expression throughout the parasite metacyclogenesis as well as cellular localization by optical and electron microscopy. To determine cellular localization, transfection experiments were conducted in T. cruzi for the expression of proteins fused with GFP using a vector compatible with the Gateway platform. Immunoprecipitation assays were also performed for the 8 expressed proteins. The results obtained in this work uncover experimental information of genes annotated as 'hypothetical protein of unknown function' and assess the possibility of implementing a systematic approach with this methodology in medium to large scal

    Nesvilab/philosopher: Philosopher 4.1.1

    No full text
    Added Changed Fixed Fixed an issue with the ion mobility column not showing in empty PSM tables. Fixed an issue where high-scoring peptides mapping to complicated razor cases where not showing in the reports

    Differences in Extracellular Vesicle Protein Cargo Are Dependent on Head and Neck Squamous Cell Carcinoma Cell of Origin and Human Papillomavirus Status

    No full text
    To identify potential extracellular vesicle (EV) biomarkers in head and neck squamous cell carcinoma (HNSCC), we evaluated EV protein cargo and whole cell lysates (WCL) from HPV-positive and -negative HNSCC cell lines, as well as normal oral keratinocytes and HPV16-transformed cells. EVs were isolated from serum-depleted, conditioned cell culture media by polyethylene glycol (PEG) precipitation/ultracentrifugation. EV and WCL preparations were analyzed by LC-MS/MS. Candidate proteins detected at significantly higher levels in EV compared with WCL, or compared with EV from normal oral keratinocytes, were identified and confirmed by Wes Simple Western protein analysis. Our findings suggest that these proteins may be potential HNSCC EV markers as proteins that may be (1) selectively included in EV cargo for export from the cell as a strategy for metastasis, tumor cell survival, or modification of tumor microenvironment, or (2) representative of originating cell composition, which may be developed for diagnostic or prognostic use in clinical liquid biopsy applications. This work demonstrates that our method can be used to reliably detect EV proteins from HNSCC, normal keratinocyte, and transformed cell lines. Furthermore, this work has identified HNSCC EV protein candidates for continued evaluation, specifically tenascin-C, HLA-A, E-cadherin, EGFR, EPHA2, and cytokeratin 19

    Enhancing multiplexed cysteine chemoproteomics by uniting FragPipe with solid-phase compatible dialkoxydiphenylsilane reagents

    No full text
    The human proteome harbors tens of thousands of ligandable or potentially druggable cysteine residues. Consequently, pinpointing the optimal covalent molecule for each cysteine residue represents an exciting means to close the druggability gap, namely the ~96% of human proteins not yet targeted by an FDA approved drug. Realizing the full therapeutic potential of the cysteineome will require comprehensive proteome-wide cysteine-compound structure activity relationship (SAR) analysis. While mass spectrometry-based chemoproteomic platforms have made significant inroads into this challenge, achieving comprehensive cysteine-SAR necessitates technical innovation in two key areas: (1) streamlined sample preparation workflows and (2) high throughput and high coverage data acquisition. Here we report the silane-based Cleavable Linkers for Isotopically labeled Proteomics (sCLIP) method. sCLIP streamlines sample preparation with unparalleled early-stage isobaric labeling and sample pooling, allowing for high coverage and increased sample throughput via customized low cost 6-plex sample multiplexing. The sCLIP method is distinguished by its unprecedented click-assembled isobaric tags, in which the reporter group is encoded in the sCLIP capture reagent and balancer in the pan cysteine-reactive probe. When paired with a custom FragPipe data analysis workflow and applied to cysteine-reactive fragment screens, sCLIP proteomics revealed established and unprecedented cysteine-ligand pairs, including the discovery that the mitochondrial uncoupling agent FCCP acts as a covalent-reversible cysteine-reactive electrophile

    Unveiling the partners of the DRBD2-mRNP complex, an RBP in Trypanosoma cruzi and ortholog to the yeast SR-protein Gbp2

    No full text
    Submitted by Manoel Barata ([email protected]) on 2019-08-02T18:54:45Z No. of bitstreams: 1 s12866-019-1505.pdf: 2567810 bytes, checksum: 8cedb9c28885f4fc96904def1459961f (MD5)Approved for entry into archive by Manoel Barata ([email protected]) on 2019-08-15T19:23:11Z (GMT) No. of bitstreams: 1 s12866-019-1505.pdf: 2567810 bytes, checksum: 8cedb9c28885f4fc96904def1459961f (MD5)Made available in DSpace on 2019-08-15T19:23:11Z (GMT). No. of bitstreams: 1 s12866-019-1505.pdf: 2567810 bytes, checksum: 8cedb9c28885f4fc96904def1459961f (MD5) Previous issue date: 2019Fundação Oswaldo Cruz. Instituto Carlos Chagas.Curitiba, PR, Brasil.Instituto de Biologia Molecular do Paraná. Curitiba, PR, Brasil.Fundação Oswaldo Cruz. Instituto Carlos Chagas.Curitiba, PR, Brasil / Instituto de Biologia Molecular do Paraná. Curitiba, PR, Brasil.Medical Science Unit I. Department of Pathology. University of Michigan. Michigan, USA.Fundação Oswaldo Cruz. Instituto Carlos Chagas.Curitiba, PR, Brasil.Fundação Oswaldo Cruz. Instituto Carlos Chagas.Curitiba, PR, Brasil.Fundação Oswaldo Cruz. Instituto Carlos Chagas. Curitiba, PR, Brasil.RNA-binding proteins (RBPs) are well known as key factors in gene expression regulation in eukaryotes. These proteins associate with mRNAs and other proteins to form mRNP complexes that ultimately determine the fate of target transcripts in the cell. This association is usually mediated by an RNA-recognition motif (RRM). In the case of trypanosomatids, these proteins play a paramount role, as gene expression regulation is mostly posttranscriptional. Despite their relevance in the life cycle of Trypanosoma cruzi, the causative agent of Chagas’ disease, to date, few RBPs have been characterized in this parasite. And as results , we investigated the role of DRBD2 in T. cruzi, an RBP with two RRM domains that is associated with cytoplasmic translational complexes. We show that DRBD2 is an ortholog of the Gbp2 in yeast, an SR-rich protein involved in mRNA quality control and export. We used an immunoprecipitation assay followed by shotgun proteomics and RNA-seq to assess the interaction partners of the DRBD2-mRNP complex in epimastigotes. The analysis identified mostly proteins involved in RNA metabolism and regulation, such as ALBA1, ALBA3, ALBA4, UBP1, UBP2, DRBD3, and PABP2. The RNA-seq results showed that most of the transcripts regulated by the DRBD2 complex mapped to hypothetical proteins related to multiple processes, such as to biosynthetic process, DNA metabolic process, protein modification, and response to stress. Thus, It concluded that, the identification of regulatory proteins in the DRBD2-mRNP complex corroborates the important role of DRBD2 in gene expression regulation in T. cruzi. We consider these results an important contribution to future studies regarding gene expression regulation in T. cruzi, especially in the field of RNA-binding proteins
    corecore