Search CORE

University of St. Andrews - Pure

St Andrews Research Repository

Chemistry in Bioinformatics

Author: Mitchell John B O
Murray-Rust Peter
Rzepa Henry S
Publication venue
Publication date: 19/05/2005
Field of study

A preprint of an invited submission to BioMedCentral Bioinformatics. This short manuscript is an overview or the current problems and opportunities in publishing chemical information. Full details of technology are given in the sibling manuscript http://www.dspace.cam.ac.uk/handle/1810/34579 The manuscript is the authors' preprint although it has been automatically transformed into this archived PDF by the submission system. The authors are not responsible for the formattingChemical information is now seen as critical for most areas of life sciences. But unlike Bioinformatics, where data is Openly available and freely re−usable, most chemical information is closed and cannot be re−distributed without permission. This has led to a failure to adopt modern informatics and software techniques and therefore paucity of chemistry in bioinformatics. New technology, however, offers the hope of making chemical data (compounds and properties) Free during the authoring process. We argue that the technology is already available; we require a collective agreement to enhance publication protocols

University of St. Andrews - Pure

Semantic physical science.

Author: Murray-Rust Peter
Rzepa Henry S
Publication venue: J Cheminform
Publication date: 03/08/2012
Field of study

The articles in this special issue arise from a workshop and symposium held in January 2012 (Semantic Physical Science'). We invited people who shared our vision for the potential of the web to support chemical and related subjects. Other than the initial invitations, we have not exercised any control over the content of the contributed articles.RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are

Crossref

Springer - Publisher Connector

St Andrews Research Repository

The vicinal difluoro motif : the synthesis and conformation of erythro- and threo-diastereoisomers of 1,2-difluorodiphenylethanes, 2,3-difluorosuccinic acids and their derivatives

Author: O'Hagan David
Rzepa Henry S.
Schuler Martin
Slawin Alexandra M. Z.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/12/2013
Field of study

Background: It is well established that vicinal fluorines (RCHF-CHFR) prefer to adopt a gauche rather than an anti conformation when placed along aliphatic chains. This has been particularly recognised for 1,2-difluoroethane and extends to 2,3-difluorobutane and longer alkyl chains. It follows in these latter cases that if erythro and threo vicinal difluorinated stereoisomers are compared, they will adopt different overall conformations if the fluorines prefer to be gauche in each case. This concept is explored in this paper with erythro- and threo- diastereoisomers of 2,3-difluorosuccinates. Results: A synthetic route to 2,3-difluorosuccinates has been developed through erythro- and threo- 1,2-difluoro-1,2-diphenylethane which involved the oxidation of the aryl rings to generate the corresponding 2,3- difluorosuccinic acids. Ester and amide derivatives of the erythro- and threo- 2,3-difluorosuccinic acids were then prepared. The solid and solution state conformation of these compounds was assessed by X-ray crystallography and NMR. Ab initio calculations were also carried out to model the conformation of erythro- and threo- 1,2-difluoro-1,2-diphenylethane as these differed from the 2,3-difluorosuccinates. Conclusion: In general the overall chain conformations of the 2,3-difluorosuccinates diastereoisomers were found to be influenced by the fluorine gauche effect. The study highlights the prospects of utilising the vicinal difluorine motif (RCHF-CHFR) as a tool for influencing the conformation of performance organic molecules and particularly tuning conformation by selecting specific diastereoisomers (erythro or threo).Publisher PDFPeer reviewe

Chemistry in bioinformatics.

Author: Mitchell John BO
Murray-Rust Peter
Rzepa Henry S
Publication venue: BMC Bioinformatics
Publication date: 19/05/2005
Field of study

Chemical information is now seen as critical for most areas of life sciences. But unlike Bioinformatics, where data is openly available and freely re-usable, most chemical information is closed and cannot be re-distributed without permission. This has led to a failure to adopt modern informatics and software techniques and therefore paucity of chemistry in bioinformatics. New technology, however, offers the hope of making chemical data (compounds and properties) free during the authoring process. We argue that the technology is already available; we require a collective agreement to enhance publication protocols.Rights : This article is licensed under the BioMed Central license at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution License'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are

Springer - Publisher Connector

University of St. Andrews - Pure

St Andrews Research Repository

A data-oriented approach to making new molecules as a student experiment: AI-enabling FAIR publication of NMR data for organic esters

Author: Kuhn Stefan
Rzepa Henry S.
Publication venue: 'Wiley'
Publication date: 02/08/2021
Field of study

open access articleThe lack of machine-readable data is a major obstacle in the application of NMR in artificial intelligence. As a way to overcome this, a procedure for capturing primary NMR Spectroscopic instrumental data annotated with rich metadata and publication in a FAIR data repository is described as part of an undergraduate student laboratory experiment in a chemistry department. This couples the techniques of chemical synthesis of a never before made organic ester with illustration of modern data management practices and serves to raise student awareness of how FAIR data might improve research quality and replicability. Searches of the registered metadata are shown which enable actionable Finding and Accessing of such data. The potential for Re-use of the data in AI-applications is discussed

Crossref

De Montfort University Open Research Archive

Recommended from our members

Extracting and re-using research data from chemistry e-theses: the SPECTRa-T project

Author: Downing Jim
Harvey Matt
Morgan Peter
Murray-Rust Peter
Rzepa Henry S
Stewart Diana
Tonge Alan
Townsend Joseph A
Publication venue: 11th International Symposium on Electronic Theses and Dissertations
Publication date: 01/06/2008
Field of study

Scientific e-theses are data-rich resources, but much of the information they contain is not readily accessible. For chemistry, the SPECTRa-T project has addressed this problem by developing data-mining techniques to extract experimental data, creating RDF (Resource Description Framework) triples for exposure to sophisticated Semantic Web searches. We used OSCAR3, an Open Source chemistry text-mining tool, to parse and extract data from theses in PDF, and from theses in Office Open XML document format. Theses in PDF suffered data corruption and a loss of formatting that prevented the identification of chemical objects. Theses in .docx yielded semantically rich SciXML that enabled the additional extraction of associated data. Chemical objects were placed in a data repository, and RDF triples deposited in a triplestore. Data-mining from chemistry e-theses is both desirable and feasible; but the use of PDF, the de facto format standard for deposit in most repositories, prevents the optimal extraction of data for semantic querying. In order to facilitate this, we recommend that universities also require deposition of chemistry e-theses in an XML document format. Further work is required to clarify the complex IPR issues and ensure that they do not become an unwarranted barrier to data extraction and re-use