140 research outputs found
Scott : A method for representing graphs asrooted trees for graph canonization
International audienceGraphs increasingly stand out as an essential data structurein the field of data sciences. To study graphs, or sub-graphs, that char-acterize a set of observations, it is necessary to describe them formally,in order to characterize equivalence relations that make sense in thescope of the considered application domain. Hence we seek to define acanonical graph notation, so that two isomorphic (sub) graphs have thesame canonical form. Such notation could subsequently be used to indexand retrieve graphs or to embed them efficiently in some metric space.Sequential optimized algorithms solving this problem exist, but do notdeal with labeled edges, a situation that occurs in important applicationdomains such as chemistry. We present in this article a new algorithmbased on graph rewriting that provides a general and complete solution tothe graph canonization problem. Although not reported here, the formalproof of the validity of our algorithm has been established. This claim isclearly supported empirically by our experimentation on synthetic com-binatorics as well as natural graphs. Furthermore, our algorithm supportsdistributed implementations, leading to efficient computing perspectives
Visual Network Analysis of Dynamic Metabolic Pathways
Abstract. We extend our previous work on the exploration of static metabolic
networks to evolving, and therefore dynamic, pathways. We apply our visualization software to data from a simulation of early metabolism. Thereby, we show
that our technique allows us to test and argue for or against different scenarios for
the evolution of metabolic pathways. This supports a profound and efficient analysis of the structure and properties of the generated metabolic networks and its
underlying components, while giving the user a vivid impression of the dynamics
of the system. The analysis process is inspired by Ben Shneiderman’s mantra of
information visualization. For the overview, user-defined diagrams give insight
into topological changes of the graph as well as changes in the attribute set associated with the participating enzymes, substances and reactions. This way, “interesting features” in time as well as in space can be recognized. A linked view
implementation enables the navigation into more detailed layers of perspective
for in-depth analysis of individual network configuration
Building ProteomeTools based on a complete synthetic human proteome.
We describe ProteomeTools, a project building molecular and digital tools from the human proteome to facilitate biomedical research. Here we report the generation and multimodal liquid chromatography-tandem mass spectrometry analysis of \u3e330,000 synthetic tryptic peptides representing essentially all canonical human gene products, and we exemplify the utility of these data in several applications. The resource (available at http://www.proteometools.org) will be extended to \u3e1 million peptides, and all data will be shared with the community via ProteomicsDB and ProteomeXchange
Multifaceted SlyD from Helicobacter pylori: implication in [NiFe] hydrogenase maturation
SlyD belongs to the FK506-binding protein (FKBP) family with both peptidylprolyl isomerase (PPIase) and chaperone activities, and is considered to be a ubiquitous cytosolic protein-folding facilitator in bacteria. It possesses a histidine- and cysteine-rich C-terminus binding to selected divalent metal ions (e.g., Ni2+, Zn2+), which is important for its involvement in the maturation processes of metalloenzymes. We have determined the solution structure of C-terminus-truncated SlyD from Helicobacter pylori (HpSlyDΔC). HpSlyDΔC folds into two well-separated, orientation-independent domains: the PPIase-active FKBP domain and the chaperone-active insert-in-flap (IF) domain. The FKBP domain consists of a four-stranded antiparallel β-sheet with an α-helix on one side, whereas the IF domain folds into a four-stranded antiparallel β-sheet accompanied by a short α-helix. Intact H. pylori SlyD binds both Ni2+ and Zn2+, with dissociation constants of 2.74 and 3.79 μM respectively. Intriguingly, binding of Ni2+ instead of Zn2+ induces protein conformational changes around the active sites of the FKBP domain, implicating a regulatory role of nickel. The twin-arginine translocation (Tat) signal peptide from the small subunit of [NiFe] hydrogenase (HydA) binds the protein at the IF domain. Nickel binding and the recognition of the Tat signal peptide by the protein suggest that SlyD participates in [NiFe] hydrogenase maturation processes
A Kernel for Open Source Drug Discovery in Tropical Diseases
Open source drug discovery, a promising alternative avenue to conventional patent-based drug development, has so far remained elusive with few exceptions. A major stumbling block has been the absence of a critical mass of preexisting work that volunteers can improve through a series of granular contributions. This paper introduces the results from a newly assembled computational pipeline for identifying protein targets for drug discovery in ten organisms that cause tropical diseases. We have also experimentally tested two promising targets for their binding to commercially available drugs, validating one and invalidating the other. The resulting kernel provides a base of drug targets and lead candidates around which an open source community can nucleate. We invite readers to donate their judgment and in silico and in vitro experiments to develop these targets to the point where drug optimization can begin
Molecular dynamics simulations and in silico peptide ligand screening of the Elk-1 ETS domain
Background: The Elk-1 transcription factor is a member of a group of proteins called ternary complex factors, which serve as a paradigm for gene regulation in response to extracellular signals. Its deregulation has been linked
to multiple human diseases including the development of tumours. The work herein aims to inform the design of
potential peptidomimetic compounds that can inhibit the formation of the Elk-1 dimer, which is key to Elk-1
stability. We have conducted molecular dynamics simulations of the Elk-1 ETS domain followed by virtual screening.
Results: We show the ETS dimerisation site undergoes conformational reorganisation at the a1b1 loop. Through
exhaustive screening of di- and tri-peptide libraries against a collection of ETS domain conformations representing the dynamics of the loop, we identified a series of potential binders for the Elk-1 dimer interface. The di-peptides showed no particular preference toward the binding site; however, the tri-peptides made specific interactions with residues: Glu17, Gln18 and Arg49 that are pivotal to the dimer interface.
Conclusions: We have shown molecular dynamics simulations can be combined with virtual peptide screening to obtain an exhaustive docking protocol that incorporates dynamic fluctuations in a receptor. Based on our findings, we suggest experimental binding studies to be performed on the 12 SILE ranked tri-peptides as possible compounds for the design of inhibitors of Elk-1 dimerisation. It would also be reasonable to consider the score ranked tri-peptides as a comparative test to establish whether peptide size is a determinant factor of binding to the ETS domain
The Carbohydrate-Binding Site in Galectin-3 Is Preorganized To Recognize a Sugarlike Framework of Oxygens: Ultra-High-Resolution Structures and Water Dynamics
The recognition of carbohydrates by proteins is a fundamental aspect of communication within and between living cells. Understanding the molecular basis of carbohydrate-protein interactions is a prerequisite for the rational design of synthetic ligands. Here we report the high- to ultrahigh-resolution crystal structures of the carbohydrate recognition domain of galectin-3 (Gal3C) in the ligand-free state (1.08 angstrom at 100 K, 1.25 angstrom at 298 K) and in complex with lactose (0.86 angstrom) or glycerol (0.9 angstrom). These structures reveal striking similarities in the positions of water and carbohydrate oxygen atoms in all three states, indicating that the binding site of Gal3C is preorganized to coordinate oxygen atoms in an arrangement that is nearly optimal for the recognition of beta-galactosides. Deuterium nuclear magnetic resonance (NMR) relaxation dispersion experiments and molecular dynamics simulations demonstrate that all water molecules in the lactose-binding site exchange with bulk water on a time scale of nanoseconds or shorter. Nevertheless, molecular dynamics simulations identify transient water binding at sites that agree well with those observed by crystallography, indicating that the energy landscape of the binding site is maintained in solution. All heavy atoms of glycerol are positioned like the corresponding atoms of lactose in the Gal3C complexes. However, binding of glycerol to Gal3C is insignificant in solution at room temperature, as monitored by NMR spectroscopy or isothermal titration calorimetry under conditions where lactose binding is readily detected. These observations make a case for protein cryo-crystallography as a valuable screening method in fragment-based drug discovery and further suggest that identification of water sites might inform inhibitor design
Open Babel: An open chemical toolbox
Background: A frequent problem in computational modeling is the interconversion of chemical structures between different formats. While standard interchange formats exist (for example, Chemical Markup Language) and de facto standards have arisen (for example, SMILES format), the need to interconvert formats is a continuing problem due to the multitude of different application areas for chemistry data, differences in the data stored by different formats (0D versus 3D, for example), and competition between software along with a lack of vendorneutral formats. Results: We discuss, for the first time, Open Babel, an open-source chemical toolbox that speaks the many languages of chemical data. Open Babel version 2.3 interconverts over 110 formats. The need to represent such a wide variety of chemical and molecular data requires a library that implements a wide range of cheminformatics algorithms, from partial charge assignment and aromaticity detection, to bond order perception and canonicalization. We detail the implementation of Open Babel, describe key advances in the 2.3 release, and outline a variety of uses both in terms of software products and scientific research, including applications far beyond simple format interconversion. Conclusions: Open Babel presents a solution to the proliferation of multiple chemical file formats. In addition, it provides a variety of useful utilities from conformer searching and 2D depiction, to filtering, batch conversion, and substructure and similarity searching. For developers, it can be used as a programming library to handle chemical data in areas such as organic chemistry, drug design, materials science, and computational chemistry. It is freely available under an open-source license fro
- …