226 research outputs found

    mspecLINE: bridging knowledge of human disease with the proteome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Public proteomics databases such as PeptideAtlas contain peptides and proteins identified in mass spectrometry experiments. However, these databases lack information about human disease for researchers studying disease-related proteins. We have developed mspecLINE, a tool that combines knowledge about human disease in MEDLINE with empirical data about the detectable human proteome in PeptideAtlas. mspecLINE associates diseases with proteins by calculating the semantic distance between annotated terms from a controlled biomedical vocabulary. We used an established semantic distance measure that is based on the co-occurrence of disease and protein terms in the MEDLINE bibliographic database.</p> <p>Results</p> <p>The mspecLINE web application allows researchers to explore relationships between human diseases and parts of the proteome that are detectable using a mass spectrometer. Given a disease, the tool will display proteins and peptides from PeptideAtlas that may be associated with the disease. It will also display relevant literature from MEDLINE. Furthermore, mspecLINE allows researchers to select proteotypic peptides for specific protein targets in a mass spectrometry assay.</p> <p>Conclusions</p> <p>Although mspecLINE applies an information retrieval technique to the MEDLINE database, it is distinct from previous MEDLINE query tools in that it combines the knowledge expressed in scientific literature with empirical proteomics data. The tool provides valuable information about candidate protein targets to researchers studying human disease and is freely available on a public web server.</p

    Gene–Environment Interactions at Nucleotide Resolution

    Get PDF
    Interactions among genes and the environment are a common source of phenotypic variation. To characterize the interplay between genetics and the environment at single nucleotide resolution, we quantified the genetic and environmental interactions of four quantitative trait nucleotides (QTN) that govern yeast sporulation efficiency. We first constructed a panel of strains that together carry all 32 possible combinations of the 4 QTN genotypes in 2 distinct genetic backgrounds. We then measured the sporulation efficiencies of these 32 strains across 8 controlled environments. This dataset shows that variation in sporulation efficiency is shaped largely by genetic and environmental interactions. We find clear examples of QTN:environment, QTN: background, and environment:background interactions. However, we find no QTN:QTN interactions that occur consistently across the entire dataset. Instead, interactions between QTN only occur under specific combinations of environment and genetic background. Thus, what might appear to be a QTN:QTN interaction in one background and environment becomes a more complex QTN:QTN:environment:background interaction when we consider the entire dataset as a whole. As a result, the phenotypic impact of a set of QTN alleles cannot be predicted from genotype alone. Our results instead demonstrate that the effects of QTN and their interactions are inextricably linked both to genetic background and to environmental variation

    Identification of Extracellular Segments by Mass Spectrometry Improves Topology Prediction of Transmembrane Proteins

    Get PDF
    Transmembrane proteins play crucial role in signaling, ion transport, nutrient uptake, as well as in maintaining the dynamic equilibrium between the internal and external environment of cells. Despite their important biological functions and abundance, less than 2% of all determined structures are transmembrane proteins. Given the persisting technical difficulties associated with high resolution structure determination of transmembrane proteins, additional methods, including computational and experimental techniques remain vital in promoting our understanding of their topologies, 3D structures, functions and interactions. Here we report a method for the high-throughput determination of extracellular segments of transmembrane proteins based on the identification of surface labeled and biotin captured peptide fragments by LC/MS/MS. We show that reliable identification of extracellular protein segments increases the accuracy and reliability of existing topology prediction algorithms. Using the experimental topology data as constraints, our improved prediction tool provides accurate and reliable topology models for hundreds of human transmembrane proteins

    Transcription Factors Mat2 and Znf2 Operate Cellular Circuits Orchestrating Opposite- and Same-Sex Mating in Cryptococcus neoformans

    Get PDF
    Cryptococcus neoformans is a human fungal pathogen that undergoes a dimorphic transition from a unicellular yeast to multicellular hyphae during opposite sex (mating) and unisexual reproduction (same-sex mating). Opposite- and same-sex mating are induced by similar environmental conditions and involve many shared components, including the conserved pheromone sensing Cpk1 MAPK signal transduction cascade that governs the dimorphic switch in C. neoformans. However, the homeodomain cell identity proteins Sxi1Ξ±/Sxi2a encoded by the mating type locus that are essential for completion of sexual reproduction following cell–cell fusion during opposite-sex mating are dispensable for same-sex mating. Therefore, identification of downstream targets of the Cpk1 MAPK pathway holds the key to understanding molecular mechanisms governing the two distinct developmental fates. Thus far, homology-based approaches failed to identify downstream transcription factors which may therefore be species-specific. Here, we applied insertional mutagenesis via Agrobacterium-mediated transformation and transcription analysis using whole genome microarrays to identify factors involved in C. neoformans differentiation. Two transcription factors, Mat2 and Znf2, were identified as key regulators of hyphal growth during same- and opposite-sex mating. Mat2 is an HMG domain factor, and Znf2 is a zinc finger protein; neither is encoded by the mating type locus. Genetic, phenotypic, and transcriptional analyses of Mat2 and Znf2 provide evidence that Mat2 is a downstream transcription factor of the Cpk1 MAPK pathway whereas Znf2 functions as a more terminal hyphal morphogenesis determinant. Although the components of the MAPK pathway including Mat2 are not required for virulence in animal models, Znf2, as a hyphal morphology determinant, is a negative regulator of virulence. Further characterization of these elements and their target circuits will reveal genes controlling biological processes central to fungal development and virulence

    An analysis-ready and quality controlled resource for pediatric brain white-matter research

    Get PDF
    We created a set of resources to enable research based on openly-available diffusion MRI (dMRI) data from the Healthy Brain Network (HBN) study. First, we curated the HBN dMRI data (N = 2747) into the Brain Imaging Data Structure and preprocessed it according to best-practices, including denoising and correcting for motion effects, susceptibility-related distortions, and eddy currents. Preprocessed, analysis-ready data was made openly available. Data quality plays a key role in the analysis of dMRI. To optimize QC and scale it to this large dataset, we trained a neural network through the combination of a small data subset scored by experts and a larger set scored by community scientists. The network performs QC highly concordant with that of experts on a held out set (ROC-AUC = 0.947). A further analysis of the neural network demonstrates that it relies on image features with relevance to QC. Altogether, this work both delivers resources to advance transdiagnostic research in brain connectivity and pediatric mental health, and establishes a novel paradigm for automated QC of large datasets
    • …
    corecore