72 research outputs found

    In-Depth Analysis of Zero-Length Crosslinking for Structural Mass Spectrometry

    Get PDF
    The completion of the Human Genome Project revealed the sequence identity of essentially every human protein. However, in most cases, amino acid sequences alone convey little implication on the protein static structures, its dynamic conformational changes, and most importantly, its functions. To fully understand the behaviors and properties of macromolecular complexes, solving their 3D structures is necessary and highly critical. Under this rationale, structural genomics collaborations were initiated aiming to determine high-resolution structures of as many proteins and protein folds as possible, relying mostly on X-ray crystallography and NMR spectroscopy. Yet, very large, highly flexible or disordered, and dynamic protein complexes can exceed the capabilities of these high-resolution techniques. Although computational molecular modeling can be utilized, such structures are highly speculative and often inaccurate unless supported by actual experimental data. Structural mass spectrometry recently emerged as an alternative method which can provide medium-resolution spatial information capable of complementing computational approaches, and are applicable to heterogeneous samples with potentially no limit on complex sizes. In particular, chemical crosslinking coupled with mass spectrometry, has recently received considerable interest. Most recent progress focused on developing crosslinkers with special properties such as enrichment tags, isotopic labeling sites, or MS-cleavable bonds along with accompanying data analysis strategies and software packages. These crosslinkers insert their spacer arm between proximal amino acid residues, greatly reducing the stringency of the derived distance constraints. In contrast, zero-length crosslinkers are crosslinks which do not add any extra atoms to the product crosslinked peptides, therefore providing the tightest possible spatial constraints but rendering enrichment and isotopic labeling strategies inapplicable. As a result, zero-length crosslinking received limited attention and no software tools have previously been specifically developed for it. In this thesis project, we developed a multi-tiered mass spectrometry data acquisition and computational data analysis strategy along with a dedicated software tool to enhance identification of zero-length crosslinks in complex samples. Label-free comparison and targeted high-resolution mass spectrometry were utilized to filter out the vast majority of non-crosslinked peptides and increase confidence of crosslink identification, compensating for the lack of enrichment techniques and characteristic MS patterns employed by non-zero-length crosslinking methods. Each step from mass spectrometer acquisition parameters to MS/MS spectra evaluation functions was optimized based on zero-length crosslinking datasets of proteins with known crystal structures. Our pipeline was then applied to probe structures and conformational changes of mini-spectrin, a 90 kDa recombinant protein that closely mimics erythrocyte spectrin\u27s dynamic dimer-tetramer equilibrium. Compared to previous analyses performed in our laboratory, the current strategy more than doubled the number of identified crosslinks and significantly reduced analysis time per experiment from months to just several days. Distance constraints derived from mini-spectrin crosslinks were used as inputs in subsequent homology modeling, allowing development of experimentally-verified medium-resolution structures for wild-type mini-spectrin tetramer and both wild-type and hereditary elliptocytosis (HE) mutant mini-spectrin dimers. The structure models, in combination with independent biophysical experiments, illustrated how such distal HE-related mutations destabilized spectrin dimer-tetramer equilibrium by simultaneously lowering thermal stability of tetramer and giving rise to a more-compact, more-stable closed dimer conformation

    Lower Bound for Convex Hull Area and Universal Cover Problems

    Full text link
    In this paper, we provide a lower bound for an area of the convex hull of points and a rectangle in a plane. We then apply this estimate to establish a lower bound for a universal cover problem. We showed that a convex universal cover for a unit length curve has area at least 0.232239. In addition, we show that a convex universal cover for a unit closed curve has area at least 0.0879873.Comment: 12 pages, 9 figure

    Systematic Analysis of Pleiotropy in C. elegans Early Embryogenesis

    Get PDF
    Pleiotropy refers to the phenomenon in which a single gene controls several distinct, and seemingly unrelated, phenotypic effects. We use C. elegans early embryogenesis as a model to conduct systematic studies of pleiotropy. We analyze high-throughput RNA interference (RNAi) data from C. elegans and identify “phenotypic signatures”, which are sets of cellular defects indicative of certain biological functions. By matching phenotypic profiles to our identified signatures, we assign genes with complex phenotypic profiles to multiple functional classes. Overall, we observe that pleiotropy occurs extensively among genes involved in early embryogenesis, and a small proportion of these genes are highly pleiotropic. We hypothesize that genes involved in early embryogenesis are organized into partially overlapping functional modules, and that pleiotropic genes represent “connectors” between these modules. In support of this hypothesis, we find that highly pleiotropic genes tend to reside in central positions in protein-protein interaction networks, suggesting that pleiotropic genes act as connecting points between different protein complexes or pathways

    Systematic Analysis of Pleiotropy in C. elegans Early Embryogenesis

    Get PDF
    Pleiotropy refers to the phenomenon in which a single gene controls several distinct, and seemingly unrelated, phenotypic effects. We use C. elegans early embryogenesis as a model to conduct systematic studies of pleiotropy. We analyze high-throughput RNA interference (RNAi) data from C. elegans and identify “phenotypic signatures”, which are sets of cellular defects indicative of certain biological functions. By matching phenotypic profiles to our identified signatures, we assign genes with complex phenotypic profiles to multiple functional classes. Overall, we observe that pleiotropy occurs extensively among genes involved in early embryogenesis, and a small proportion of these genes are highly pleiotropic. We hypothesize that genes involved in early embryogenesis are organized into partially overlapping functional modules, and that pleiotropic genes represent “connectors” between these modules. In support of this hypothesis, we find that highly pleiotropic genes tend to reside in central positions in protein-protein interaction networks, suggesting that pleiotropic genes act as connecting points between different protein complexes or pathways

    Processed overexpression RNA-seq (salmon)

    No full text
    Processed RNA-seq data from a gene overexpression experiment (done with salmon).There are three control and three overexpression replicates.For students to perform differential expression analysis with sleuth.</p

    Yeast RNA-seq data and transcriptome for kallisto-sleuth demo session

    No full text
    Paired-end RNA-seq data (indicated by 1 and 2 tags) for Saccharomyces cerevisiae under aerobic and anaerobic growth conditions. Each conditions consist of two replicates, as designated by r1 and r2 tags.The reference transcriptome for Saccharomyces cerevisiae is also provided.For teaching students to run kallisto and sleuth pipeline.</p

    Sample mass spectrometry data for human bottom-up proteomics

    No full text
    Sample mass spectrometry data from a bottom-up proteomics analysis of healthy human proteins. Original data source: https://zenodo.org/record/4274987.Data were collected on Thermo's Q-Exactive Plus mass spectrometer after digestion by trypsin.A human proteome FASTA file from UniProt is also included.</p
    • …
    corecore