6,204 research outputs found
Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets
Motivation: The combination of liquid chromatography and mass spectrometry (LC/MS) has been widely used for large-scale comparative studies in systems biology, including proteomics, glycomics and metabolomics. In almost all experimental design, it is necessary to compare chromatograms across biological or technical replicates and across sample groups. Central to this is the peak alignment step, which is one of the most important but challenging preprocessing steps. Existing alignment tools do not take into account the structural dependencies between related peaks that co-elute and are derived from the same metabolite or peptide. We propose a direct matching peak alignment method for LC/MS data that incorporates related peaks information (within each LC/MS run) and investigate its effect on alignment performance (across runs). The groupings of related peaks necessary for our method can be obtained from any peak clustering method and are built into a pairwise peak similarity score function. The similarity score matrix produced is used by an approximation algorithm for the weighted matching problem to produce the actual alignment result.<p></p>
Results:
We demonstrate that related peak information can improve alignment performance. The performance is evaluated on a set of benchmark datasets, where our method performs competitively compared to other popular alignment tools.<p></p>
Availability: The proposed alignment method has been implemented
as a stand-alone application in Python, available for download at
http://github.com/joewandy/peak-grouping-alignment.<p></p>
Updates in metabolomics tools and resources: 2014-2015
Data processing and interpretation represent the most challenging and time-consuming steps in high-throughput metabolomic experiments, regardless of the analytical platforms (MS or NMR spectroscopy based) used for data acquisition. Improved machinery in metabolomics generates increasingly complex datasets that create the need for more and better processing and analysis software and in silico approaches to understand the resulting data. However, a comprehensive source of information describing the utility of the most recently developed and released metabolomics resources—in the form of tools, software, and databases—is currently lacking. Thus, here we provide an overview of freely-available, and open-source, tools, algorithms, and frameworks to make both upcoming and established metabolomics researchers aware of the recent developments in an attempt to advance and facilitate data processing workflows in their metabolomics research. The major topics include tools and researches for data processing, data annotation, and data visualization in MS and NMR-based metabolomics. Most in this review described tools are dedicated to untargeted metabolomics workflows; however, some more specialist tools are described as well. All tools and resources described including their analytical and computational platform dependencies are summarized in an overview Table
Non-targeted LC-MS based metabolomics analysis of the urinary steroidal profile
The urinary steroidal fraction has been extensively explored as non-invasive alternative to monitor pathological conditions as well as to unveil the illicit intake of pseudo-endogenous anabolic steroids in sport. However, the majority of previous approaches involved the a priori selection of potentially relevant target analytes. Here we describe the non-targeted analysis of the urinary steroidal profiles. The workflow includes minimal sample pretreatment and normalization according to the specific gravity of urine, a 20Â min reverse phase ultra-performance liquid chromatographic separation hyphenated to electrospray time-of-flight mass spectrometry. As initial validation, we analyzed a set of quality control urines spiked with glucurono- and sulfo-conjugated steroids at physiological ranges. We then applied the method for the analysis of samples collected after single transdermal administration of testosterone in hypogonadal men. The method allowed profiling of approximately three thousand metabolic features, including steroids of clinical and forensic relevance. It successfully identified metabolic pathways mostly responsible for groups clustering even in the context of high inter-individual variability and allowed the detection of currently unknown metabolic features correlating with testosterone administration. These outcomes set the stage for future studies aimed at implementing currently monitored urinary steroidal markers both in clinical and forensic analysis
Chemometric tools for automated method-development and data interpretation in liquid chromatography
The thesis explores the challenges and advancements in the field of liquid chromatography (LC), particularly focusing on complex sample analysis using high-resolution mass spectrometry (MS) and two-dimensional (2D) LC techniques. The research addresses the need for efficient optimization and data-handling strategies in modern LC practice. The thesis is divided into several chapters, each addressing specific aspects of LC and polymer analysis. Chapter 2 provides an overview of the need for chemometric tools in LC practice, discussing methods for processing and analyzing data from 1D and 2D-LC systems and how chemometrics can be utilized for method development and optimization. Chapter 3 introduces a novel approach for interpreting the molecular-weight distribution and intrinsic viscosity of polymers, allowing quantitative analysis of polymer properties without prior knowledge of their interactions. This method correlates the curvature parameter of the Mark-Houwink plot with the polymer's structural and chemical properties. Chapters 4 and 5 focus on the analysis of cellulose ethers (CEs), essential in various industrial applications. A new method is presented for mapping the substitution degree and composition of CE samples, providing detailed compositional distributions. Another method involves a comprehensive 2D LC-MS/MS approach for analyzing hydroxypropyl methyl cellulose (HPMC) monomers, revealing subtle differences in composition between industrial HPMC samples. Chapter 6 introduces AutoLC, an algorithm for automated and interpretive development of 1D-LC separations. It uses retention modeling and Bayesian optimization to achieve optimal separation within a few iterations, significantly improving the efficiency of gradient LC separations. Chapter 7 focuses on the development of an open-source algorithm for automated method development in 2D-LC-MS systems. This algorithm improves separation performance by refining gradient profiles and accurately predicting peak widths, enhancing the reliability of complex gradient LC separations. Chapter 8 addresses the challenge of gradient deformation in LC instruments. An algorithm based on the stable function corrects instrument-specific gradient deformations, enabling accurate determination of analyte retention parameters and improving data comparability between different sources. Chapter 9 introduces a novel approach using capacitively-coupled-contactless-conductivity detection (C4D) to measure gradient profiles without adding tracer components. This method enhances inter-system transferability of retention models for polymers, overcoming the limitations of UV-absorbance detectable tracer components. Chapter 10 discusses practical choices and challenges faced in the thesis chapters, highlighting the need for well-defined standard samples in industrial polymer analysis and emphasizing the importance of generalized problem-solving approaches. The thesis identifies future research directions, emphasizing the importance of computational-assisted methods for polymer analysis, the utilization of online reaction modulation techniques, and exploring continuous distributions obtained through size-exclusion chromatography (SEC) in conjunction with triple detection. Chemometric tools are recognized as essential for gaining deeper insights into polymer chemistry and improving data interpretation in the field of LC
Latent protein trees
Unbiased, label-free proteomics is becoming a powerful technique for
measuring protein expression in almost any biological sample. The output of
these measurements after preprocessing is a collection of features and their
associated intensities for each sample. Subsets of features within the data are
from the same peptide, subsets of peptides are from the same protein, and
subsets of proteins are in the same biological pathways, therefore, there is
the potential for very complex and informative correlational structure inherent
in these data. Recent attempts to utilize this data often focus on the
identification of single features that are associated with a particular
phenotype that is relevant to the experiment. However, to date, there have been
no published approaches that directly model what we know to be multiple
different levels of correlation structure. Here we present a hierarchical
Bayesian model which is specifically designed to model such correlation
structure in unbiased, label-free proteomics. This model utilizes partial
identification information from peptide sequencing and database lookup as well
as the observed correlation in the data to appropriately compress features into
latent proteins and to estimate their correlation structure. We demonstrate the
effectiveness of the model using artificial/benchmark data and in the context
of a series of proteomics measurements of blood plasma from a collection of
volunteers who were infected with two different strains of viral influenza.Comment: Published in at http://dx.doi.org/10.1214/13-AOAS639 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
A new family of diverse skin peptides from the microhylid frog genus phrynomantis
A wide range of frogs produce skin poisons composed of bioactive peptides for defence against pathogens, parasites and predators. While several frog families have been thoroughly screened for skin-secreted peptides, others, like the Microhylidae, have remained mostly unexplored. Previous studies of microhylids found no evidence of peptide secretion, suggesting that this defence adaptation was evolutionarily lost. We conducted transcriptome analyses of the skins of Phrynomantis bifasciatus and Phrynomantis microps, two African microhylid species long suspected to be poisonous. Our analyses reveal 17 evolutionary related transcripts that diversified from to those of cytolytic peptides found in other frog families. The 19 peptides predicted to be processed from these transcripts, named phrynomantins, show a striking structural diversity that is distinct from any previously identified frog skin peptide. Functional analyses of five phrynomantins confirm the loss of a cytolytic function and the absence of insecticidal or proinflammatory activity, suggesting that they represent an evolutionary transition to a new, yet unknown function. Our study shows that peptides have been retained in the defence poison of at least one microhylid lineage and encourages research on similarly understudied taxa to further elucidate the diversity and evolution of skin defence molecules
- …