147 research outputs found
Multiplierz: An Extensible API Based Desktop Environment for Proteomics Data Analysis
BACKGROUND. Efficient analysis of results from mass spectrometry-based proteomics experiments requires access to disparate data types, including native mass spectrometry files, output from algorithms that assign peptide sequence to MS/MS spectra, and annotation for proteins and pathways from various database sources. Moreover, proteomics technologies and experimental methods are not yet standardized; hence a high degree of flexibility is necessary for efficient support of high- and low-throughput data analytic tasks. Development of a desktop environment that is sufficiently robust for deployment in data analytic pipelines, and simultaneously supports customization for programmers and non-programmers alike, has proven to be a significant challenge. RESULTS. We describe multiplierz, a flexible and open-source desktop environment for comprehensive proteomics data analysis. We use this framework to expose a prototype version of our recently proposed common API (mzAPI) designed for direct access to proprietary mass spectrometry files. In addition to routine data analytic tasks, multiplierz supports generation of information rich, portable spreadsheet-based reports. Moreover, multiplierz is designed around a "zero infrastructure" philosophy, meaning that it can be deployed by end users with little or no system administration support. Finally, access to multiplierz functionality is provided via high-level Python scripts, resulting in a fully extensible data analytic environment for rapid development of custom algorithms and deployment of high-throughput data pipelines. CONCLUSION. Collectively, mzAPI and multiplierz facilitate a wide range of data analysis tasks, spanning technology development to biological annotation, for mass spectrometry-based proteomics research.Dana-Farber Cancer Institute; National Human Genome Research Institute (P50HG004233); National Science Foundation Integrative Graduate Education and Research Traineeship grant (DGE-0654108
Recommended from our members
Proteomic Analysis Reveals CACN-1 Is a Component of the Spliceosome in Caenorhabditis elegans
Cell migration is essential for embryonic development and tissue formation in all animals. cacn-1 is a conserved gene of unknown molecular function identified in a genome-wide screen for genes that regulate distal tip cell migration in the nematode worm Caenorhabditis elegans. In this study we take a proteomics approach to understand CACN-1 function. To isolate CACN-1−interacting proteins, we used an in vivo tandem-affinity purification strategy. Tandem-affinity purification−tagged CACN-1 complexes were isolated from C. elegans lysate, analyzed by mass spectrometry, and characterized bioinformatically. Results suggest significant interaction of CACN-1 with the C. elegans spliceosome. All of the identified interactors were screened for distal tip cell migration phenotypes using RNAi. Depletion of many of these factors led to distal tip cell migration defects, particularly a failure to stop migrating, a phenotype commonly seen in cacn-1 deficient animals. The results of this screen identify eight novel regulators of cell migration and suggest CACN-1 may participate in a protein network dedicated to high-fidelity gonad development. The composition of proteins comprising the CACN-1 network suggests that this critical developmental module may exert its influence through alternative splicing or other post-transcriptional gene regulation
Recommended from our members
Genome-scale Proteome Quantification by DEEP SEQ Mass Spectrometry
Advances in chemistry and massively parallel detection underlie DNA sequencing platforms that are poised for application in personalized medicine. In stark contrast, systematic generation of protein-level data lags well-behind genomics in virtually every aspect: depth of coverage, throughput, ease of sample preparation, and experimental time. Here, to bridge this gap, we develop an approach based on simple detergent lysis and single-enzyme digest, extreme, orthogonal separation of peptides, and true nanoflow LC-MS/MS that provides high peak capacity and ionization efficiency. This automated, deep efficient peptide sequencing and quantification (DEEP SEQ) mass spectrometry platform provides genome-scale proteome coverage equivalent to RNA-seq ribosomal profiling and accurate quantification for multiplexed isotope labels. In a model of the embryonic to epiblast transition in murine stem cells, we unambiguously quantify 11,352 gene products that span 70% of Swiss-Prot and capture protein regulation across the full detectable range of high-throughput gene expression and protein translation
Recommended from our members
An RS Motif within the Epstein-Barr Virus BLRF2 Tegument Protein Is Phosphorylated by SRPK2 and Is Important for Viral Replication
Epstein-Barr virus (EBV) is a gammaherpesvirus that causes infectious mononucleosis, B cell lymphomas, and nasopharyngeal carcinoma. Many of the genes required for EBV virion morphogenesis are found in all herpesviruses, but some are specific to gammaherpesviruses. One of these gamma-specific genes, BLRF2, encodes a tegument protein that has been shown to be essential for replication in other gammaherpesviruses. In this study, we identify BLRF2 interacting proteins using binary and co-complex protein assays. Serine/Arginine-rich Protein Kinase 2 (SRPK2) was identified by both assays and was further shown to phosphorylate an RS motif in the BLRF2 C-terminus. Mutation of this RS motif (S148A+S150A) abrogated the ability of BLRF2 to support replication of a murine gammaherpesvirus 68 genome lacking the BLRF2 homolog (ORF52). We conclude that the BLRF2 RS motif is phosphorylated by SRPK2 and is important for viral replication
Recommended from our members
Structure of a pseudokinase domain switch that controls oncogenic activation of Jak kinases
The V617F mutation in the Jak2 pseudokinase domain causes myeloproliferative neoplasms, and the equivalent mutation in Jak1 (V658F) is found in T-cell leukemias. Crystal structures of wild type and V658F mutant human Jak1 pseudokinase reveal a conformational switch that remodels a linker segment encoded by exon 12, which is also a site of mutations in Jak2. This switch is required for V617F-mediated Jak2 activation, and possibly for physiologic Jak activation
Identification of FAM111A as an SV40 Host Range Restriction and Adenovirus Helper Factor
The small genome of polyomaviruses encodes a limited number of proteins that are highly dependent on interactions with host cell proteins for efficient viral replication. The SV40 large T antigen (LT) contains several discrete functional domains including the LXCXE or RB-binding motif, the DNA binding and helicase domains that contribute to the viral life cycle. In addition, the LT C-terminal region contains the host range and adenovirus helper functions required for lytic infection in certain restrictive cell types. To understand how LT affects the host cell to facilitate viral replication, we expressed full-length or functional domains of LT in cells, identified interacting host proteins and carried out expression profiling. LT perturbed the expression of p53 target genes and subsets of cell-cycle dependent genes regulated by the DREAM and the B-Myb-MuvB complexes. Affinity purification of LT followed by mass spectrometry revealed a specific interaction between the LT C-terminal region and FAM111A, a previously uncharacterized protein. Depletion of FAM111A recapitulated the effects of heterologous expression of the LT C-terminal region, including increased viral gene expression and lytic infection of SV40 host range mutants and adenovirus replication in restrictive cells. FAM111A functions as a host range restriction factor that is specifically targeted by SV40 LT
Covalent targeting of remote cysteine residues to develop CDK12 and CDK13 inhibitors
Cyclin-dependent kinases 12 and 13 (CDK12 and CDK13) play critical roles in the regulation of gene transcription. However, the absence of CDK12 and CDK13 inhibitors has hindered the ability to investigate the consequences of their inhibition in healthy cells and cancer cells. Here we describe the rational design of a first-in-class CDK12 and CDK13 covalent inhibitor, THZ531. Co-crystallization of THZ531 with CDK12–cyclin K indicates that THZ531 irreversibly targets a cysteine located outside the kinase domain. THZ531 causes a loss of gene expression with concurrent loss of elongating and hyperphosphorylated RNA polymerase II. In particular, THZ531 substantially decreases the expression of DNA damage response genes and key super-enhancer-associated transcription factor genes. Coincident with transcriptional perturbation, THZ531 dramatically induced apoptotic cell death. Small molecules capable of specifically targeting CDK12 and CDK13 may thus help identify cancer subtypes that are particularly dependent on their kinase activities.United States. National Institutes of Health (HG002668)United States. National Institutes of Health (CA109901
- …