79 research outputs found

    CLUSS: Clustering of protein sequences based on a new similarity measure

    Background: The rapid burgeoning of available protein data makes the use of clustering within families of proteins increasingly important. The challenge is to identify subfamilies of evolutionarily related sequences. This identification reveals phylogenetic relationships, which provide prior knowledge to help researchers understand biological phenomena. A good evolutionary model is essential to achieve a clustering that reflects the biological reality, and an accurate estimate of protein sequence similarity is crucial to the building of such a model. Most existing algorithms estimate this similarity using techniques that are not necessarily biologically plausible, especially for hard-to-align sequences such as proteins with different domain structures, which cause many difficulties for alignment-dependent algorithms. In this paper, we propose a novel similarity measure based on matching amino acid subsequences. This measure, named SMS for Substitution Matching Similarity, is especially designed for application to non-aligned protein sequences. It allows us to develop a new alignment-free algorithm, named CLUSS, for clustering protein families. To the best of our knowledge, this is the first alignment-free algorithm for clustering protein sequences. Unlike other clustering algorithms, CLUSS is effective on both alignable and non-alignable protein families. In the rest of the paper, we use the term "phylogenetic" in the sense of "relatedness of biological functions". Results: To show the effectiveness of CLUSS, we performed an extensive clustering on the COG database. To demonstrate its ability to deal with hard-to-align sequences, we tested it on the GH2 family. In addition, we carried out experimental comparisons of CLUSS with a variety of mainstream algorithms. These comparisons were made on hard-to-align and easy-to-align protein sequences. The results of these experiments show the superiority of CLUSS in yielding clusters of proteins with similar functional activity. Conclusion: We have developed an effective method and tool for clustering protein sequences to meet the needs of biologists in terms of phylogenetic analysis and prediction of biological functions. Compared to existing clustering methods, CLUSS more accurately highlights the functional characteristics of the clustered families. It provides biologists with a new and plausible instrument for the analysis of protein sequences, especially those that cause problems for alignment-dependent algorithms.

    Nuclear magnetic resonance spectroscopy for structural characterization of bioactive compounds

    The structural assignment of a new natural product molecule serves not only to establish the 3D structure of a compound, but potentially to provide the basis for research in a multitude of disciplines, ultimately generating new therapeutic agents and/or new understanding of disease biology. The development of modern spectroscopic techniques has transformed the structure assignment process, which previously was essentially based on chemical degradation or derivatization followed by partial or total synthesis. Notably, it was only in the specialization era of the spectroscopic structural assignment of natural products that the field of marine natural products chemistry took shape. Today the processes of marine and terrestrial natural product isolation and structural determination are frequently streamlined and expeditious due to the spectacular advances in chromatographic and spectroscopic technologies as well as chemical synthesis. NMR spectroscopy is a powerful tool in structure elucidation because the properties it displays can be related to the molecular structure. The chemical environment of a particular nucleus is associated with the chemical shift (δ, ppm), and the area of a resonance, usually presented as its relative integral, is related to the number of nuclei giving rise to the NMR signal. The interactions between individual nuclei, mediated by electrons in a chemical bond, determine the coupling constant (J, Hz). In this chapter we present the commonly used techniques, their basic concepts, and how they are useful for chemists in the structural elucidation of mainly bioactive marine natural products. The complex planar structure of such a compound is determined by 1H and 13C NMR analysis, strongly supported by other 1D (DEPT) and 2D (COSY, TOCSY, HSQC/HMQC, HMBC) NMR techniques. The stereochemistry is generally based on NOE experiments (NOE difference, NOESY, and ROESY), 1H–1H and 1H–13C coupling constants, chiral derivatizing agents, and also on empirical procedures comparing the chemical shifts of unknown vicinal and proximal centers with libraries of configurationally known stereomodels. However, the most reliable option for assigning the complete 3D structure of a marine natural product is still its total synthesis. The use of NMR hyphenated with other chromatographic and spectroscopic techniques, and of microcoil probes and narrow-diameter tube probes, for the structural elucidation of bioactive marine natural products, mainly associated with quantitative NMR determinations, will also be briefly described. The chapter finishes with a description of the structural characterization of several types of marine natural products using all the NMR techniques referred to above, followed by a short note on the misassignments that are still very common.

    Further Support to the Uncoupling-to-Survive Theory: The Genetic Variation of Human UCP Genes Is Associated with Longevity

    In humans, Uncoupling Proteins (UCPs) are a group of five mitochondrial inner membrane transporters with variable tissue expression, which seem to function as regulators of energy homeostasis and as antioxidants. In particular, these proteins uncouple respiration from ATP production, allowing stored energy to be released as heat. Data from experimental models have previously suggested that UCPs may play an important role in aging rate and lifespan. We analyzed the genetic variability of human UCPs in cohorts of subjects ranging between 64 and 105 years of age (for a total of 598 subjects), to determine whether specific UCP variability affects human longevity. Indeed, we found that the genetic variability of UCP2, UCP3 and UCP4 does affect the individual's chances of surviving up to a very old age. This confirms the importance of energy storage, energy use and modulation of ROS production in the aging process. In addition, given the different localization of these UCPs (UCP2 is expressed in various tissues including brain, heart and adipose tissue, while UCP3 is expressed in muscles and Brown Adipose Tissue and UCP4 is expressed in neuronal cells), our results may suggest that the uncoupling process plays an important role in modulating aging especially in muscular and nervous tissues, which are indeed very responsive to metabolic alterations and are very important in estimating health status and survival in the elderly.

    A haplotype map of allohexaploid wheat reveals distinct patterns of selection on homoeologous genomes

    BACKGROUND: Bread wheat is an allopolyploid species with a large, highly repetitive genome. To investigate the impact of selection on variants distributed among homoeologous wheat genomes and to build a foundation for understanding genotype-phenotype relationships, we performed population-scale re-sequencing of a diverse panel of wheat lines. RESULTS: A sample of 62 diverse lines was re-sequenced using the whole exome capture and genotyping-by-sequencing approaches. We describe the allele frequency, functional significance, and chromosomal distribution of 1.57 million single nucleotide polymorphisms and 161,719 small indels. Our results suggest that duplicated homoeologous genes are under purifying selection. We find contrasting patterns of variation and inter-variant associations among wheat genomes; this, in addition to demographic factors, could be explained by differences in the effect of directional selection on duplicated homoeologs. Only a small fraction of the homoeologous regions harboring selected variants overlapped among the wheat genomes in any given wheat line. These selected regions are enriched for loci associated with agronomic traits detected in genome-wide association studies. CONCLUSIONS: Evidence suggests that directional selection in allopolyploids rarely acted on multiple parallel advantageous mutations across homoeologous regions, likely indicating that a fitness benefit could be obtained by a mutation at any one of the homoeologs. Additional advantageous variants in other homoeologs probably either contributed little benefit, or were unavailable in populations subjected to directional selection. We hypothesize that allopolyploidy may have increased the likelihood of beneficial allele recovery by broadening the set of possible selection targets.

    Gender differences in the use of cardiovascular interventions in HIV-positive persons; the D:A:D Study

    Peer reviewed

    Iron Behaving Badly: Inappropriate Iron Chelation as a Major Contributor to the Aetiology of Vascular and Other Progressive Inflammatory and Degenerative Diseases

    The production of peroxide and superoxide is an inevitable consequence of aerobic metabolism, and while these particular "reactive oxygen species" (ROSs) can exhibit a number of biological effects, they are not of themselves excessively reactive and thus they are not especially damaging at physiological concentrations. However, their reactions with poorly liganded iron species can lead to the catalytic production of the very reactive and dangerous hydroxyl radical, which is exceptionally damaging, and a major cause of chronic inflammation. We review the considerable and wide-ranging evidence for the involvement of this combination of (su)peroxide and poorly liganded iron in a large number of physiological and indeed pathological processes and inflammatory disorders, especially those involving the progressive degradation of cellular and organismal performance. These diseases share a great many similarities and thus might be considered to have a common cause (i.e. iron-catalysed free radical and especially hydroxyl radical generation). The studies reviewed include those focused on a series of cardiovascular, metabolic and neurological diseases, where iron can be found at the sites of plaques and lesions, as well as studies showing the significance of iron to aging and longevity. The effective chelation of iron by natural or synthetic ligands is thus of major physiological (and potentially therapeutic) importance. As systems properties, we need to recognise that physiological observables have multiple molecular causes, and studying them in isolation leads to inconsistent patterns of apparent causality when it is the simultaneous combination of multiple factors that is responsible. This explains, for instance, the decidedly mixed effects of antioxidants that have been observed, etc. Comment: 159 pages, including 9 Figs and 2184 references

    The Geomechanics of CO2 Storage in Deep Sedimentary Formations

    This paper provides a review of the geomechanics and modeling of geomechanics associated with geologic carbon storage (GCS), focusing on storage in deep sedimentary formations, in particular saline aquifers. The paper first introduces the concept of storage in deep sedimentary formations, the geomechanical processes and issues related to such an operation, and the relevant geomechanical modeling tools. This is followed by a more detailed review of geomechanical aspects, including reservoir stress-strain and microseismicity, well integrity, caprock sealing performance, and the potential for fault reactivation and notable (felt) seismic events. Geomechanical observations at current GCS field deployments, mainly at the In Salah CO2 storage project in Algeria, are also integrated into the review. The In Salah project, with its injection into a relatively thin, low-permeability sandstone, is an excellent analogue to the saline aquifers that might be used for large-scale GCS in parts of Northwest Europe, the U.S. Midwest, and China. Some of the lessons learned at In Salah related to geomechanics are discussed, including how monitoring of geomechanical responses is used for detecting subsurface geomechanical changes and tracking fluid movements, and how such monitoring and geomechanical analyses have led to preventative changes in the injection parameters. Recently, the importance of geomechanics has become more widely recognized among GCS stakeholders, especially with respect to the potential for triggering notable (felt) seismic events and how such events could impact the long-term integrity of a CO2 repository (as well as how they could impact the public perception of GCS). As described in the paper, to date, no notable seismic event has been reported from any of the current CO2 storage projects, although some unfelt microseismic activity has been detected by geophones. However, potential future commercial GCS operations from large power plants will require injection at a much larger scale. For such large-scale injections, a staged, learn-as-you-go approach is recommended, involving a gradual increase of injection rates combined with continuous monitoring of geomechanical changes, as well as siting beneath a multiple-layered overburden for multiple flow barrier protection, should an unexpected deep fault reactivation occur.

    Selection of boron reagents for Suzuki-Miyaura coupling
