Search CORE

11 research outputs found

Non-Parametric Approximations for Anisotropy Estimation in Two-dimensional Differentiable Gaussian Random Fields

Author: A Chorti
A Papoulis
AM Schmidt
AM Yaglom
AT Weaver
BV Gnedenko
C Lantuéjoul
D Bihan Le
D Hristopulos
D Hristopulos
D Hristopulos
Dionissios T. Hristopulos
DL Zimmerman
E Pardo-Igúzquiza
E Pebesma
F Richard
GC Wick
H Kazianka
H Wackernagel
J Guilleminot
JP Bouchaud
K Okada
L Feng
L Isserlis
L Wang
M Ecker
M Ecker
M Lillah
M Sambridge
M Siotani
Manolis P. Petrakis
P Fisher
P Levy
SC Olhede
SI Ranganathan
SW Park
T Bobach
W Feller
X Jiang
Z Zhang
Publication venue
Publication date: 30/11/2016
Field of study

Spatially referenced data often have autocovariance functions with elliptical isolevel contours, a property known as geometric anisotropy. The anisotropy parameters include the tilt of the ellipse (orientation angle) with respect to a reference axis and the aspect ratio of the principal correlation lengths. Since these parameters are unknown a priori, sample estimates are needed to define suitable spatial models for the interpolation of incomplete data. The distribution of the anisotropy statistics is determined by a non-Gaussian sampling joint probability density. By means of analytical calculations, we derive an explicit expression for the joint probability density function of the anisotropy statistics for Gaussian, stationary and differentiable random fields. Based on this expression, we obtain an approximate joint density which we use to formulate a statistical test for isotropy. The approximate joint density is independent of the autocovariance function and provides conservative probability and confidence regions for the anisotropy parameters. We validate the theoretical analysis by means of simulations using synthetic data, and we illustrate the detection of anisotropy changes with a case study involving background radiation exposure data. The approximate joint density provides (i) a stand-alone approximate estimate of the anisotropy statistics distribution (ii) informed initial values for maximum likelihood estimation, and (iii) a useful prior for Bayesian anisotropy inference.Comment: 39 pages; 8 figure

arXiv.org e-Print Archive

Crossref

Institutional Repository of the Technical University of Crete

Automatic identification of relevant chemical compounds from patents

Author: Akhondi S.A. (Saber)
Bobach C. (Claudia)
Doornenbal M. (Marius)
Gregory M. (Michelle)
Ilchmann G. (Gabriele)
Irmer M. (Matthias)
Kors J.A. (Jan)
Maier M. (Michael)
Nau H. (Heike)
Rey H. (Hinnerk)
Schwörer M. (Markus)
Sheehan M. (Mark)
Toomey J. (John)
Publication venue: 'Oxford University Press (OUP)'
Publication date: 28/12/2018
Field of study

In commercial research and development projects, public disclosure of new chemical compounds often takes place in patents. Only a small proportion of these compounds are published in journals, usually a few years after the patent. Patent authorities make available the patents but do not provide systematic continuous chemical annotations. Content databases such as Elsevier’s Reaxys provide such services mostly based on manual excerptions, which are time-consuming and costly. Automatic text-mining approaches help overcome some of the limitations of the manual process. Different text-mining approaches exist to extract chemical entities from patents. The majority of them have been developed using sub-sections of patent documents and focus on mentions of compounds. Less attention has been given to relevancy of a compound in a patent. Relevancy of a compound to a patent is based on the patent’s context. A relevant compound plays a major role within a patent. Identification of relevant compounds reduces the size of the extracted data and improves the usefulness of patent resources (e.g. supports identifying the main compounds). Annotators of databases like Reaxys only annotate relevant compounds. In this study, we design an automated system that extracts chemical entities from patents and classifies their relevance. The goldstandard set contained 18 789 chemical entity annotations. Of these, 10% were relevant compounds, 88% were irrelevant and 2% were equivocal. Our compound recognition system was based on proprietary tools. The performance (F-score) of the system on compound recognition was 84% on the development set and 86% on the test set. The relevancy classification system had an F-score of 86% on the development set and 82% on the test set. Our system can extract chemical compounds from patents and classify their relevance with high performance. This enables the extension of the Reaxys database by means of automation

Erasmus University Digital Repository

Entwicklung der periprothetischen Knochendichte nach Implantation einer zementfreien Kurzschaftendoprothese im Zeitraum von 3 Jahren

Author: Ahmed GA
Augustin L
Bobach C
Ishaque BA
Jahnke A
Rickert M
Publication venue: German Medical Science GMS Publishing House; Düsseldorf
Publication date: 23/10/2017
Field of study

German Medical Science

ClassyFire: automated chemical classification with a comprehensive, computable taxonomy

Author: A Dalby
A Zhukova
AC Guo
AF Fliri
AJ Cain
B Smith
C Bobach
Christoph Steinbeck
Craig Knox
D Weininger
D Wishart
David S. Wishart
DM Lowe
DS Wishart
E Fahy
Eoin Fahy
Evan Bolton
FB Rogers
Gareth Owen
HJ Feldman
HP Singh
J Day-Richter
J Hastings
J Hastings
Janna Hastings
Leonid Chepelev
LL Chepelev
M Ashburner
M Gell-Mann
M Kanehisa
N Fridman Noy
P Ertl
P Moreno
R Caspi
R Hoehndorf
Roman Eisner
Russell Greiner
S Kim
SA Rahman
SC Goodacre
Shankar Subramanian
T Jewison
TR Gruber
V Law
V Malyuto
W Bremser
Yannick Djoumbou Feunang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

PubChem chemical structure standardization

Author: Volker D. Hähnke
Sunghwan Kim
Evan E. Bolton
FK Brown
M Hann
J Gasteiger
T Engel
A Varnek
M Vogt
J Brecher
D Weininger
D Weininger
A McNaught
SR Heller
S Ash
RW Homer
AA Gakh
AA Gakh
R Panico
HA Favre
GJ Leigh
A Dalby
WA Warr
S Urbaczek
SA Akhondi
EC Meng
JC Baber
M Hendlich
S Urbaczek
D Young
RA Sayle
AR Katritzky
E Ferrari
RM Balabin
J Elguero
T Scior
M Sitzmann
P Pospisil
F Oellien
NP Todorov
T Kalliokoski
SW Muchmore
HA Duarte
YH Jang
J Hastings
C Bobach
SV Trepalin
YC Martin
F Milletti
JR Greenwood
S Urbaczek
A Gobbi
WA Warr
PV Schleyer
D Lloyd
MK Cyranski
M Randic
A Stanger
E Hückel
E Hückel
A Kekulé
A Kekulé
WC Herndon
M Randic
BDJ Blazic
I Gutman
F Cai
Z Rashid
SK Kearsley
P Hansen
B Blessington
E Martin
D Fourches
RD Clark
KS Egorova
T Oprea
P Tiikkainen
S Kim
S Kim
YL Wang
J McEntyre
EE Bolton
EE Bolton
S Kim
WA Warr
M Fanton
FB Rogers
G Audi
HC Ehrlich
NM O’Boyle
AM Clark
J Brecher
M Razinger
M Perdih
T Cieplak
DJ Wild
G Schneider
RS Cahn
P Ertl
HL Morgan
J Figueras
WD Ihlenfeldt
WD Ihlenfeldt
S Kim
Publication venue: BMC
Publication date: 01/01/2011
Field of study

Abstract Background PubChem is a chemical information repository, consisting of three primary databases: Substance, Compound, and BioAssay. When individual data contributors submit chemical substance descriptions to Substance, the unique chemical structures are extracted and stored into Compound through an automated process called structure standardization. The present study describes the PubChem standardization approaches and analyzes them for their success rates, reasons that cause structures to be rejected, and modifications applied to structures during the standardization process. Furthermore, the PubChem standardization is compared to the structure normalization of the IUPAC International Chemical Identifier (InChI) software, as manifested by conversion of the InChI back into a chemical structure. Results The observed rejection rate for substances processed by PubChem standardization was 0.36%, which is predominantly attributed to structures with invalid atom valences that cannot be readily corrected without additional information from contributors. Of all structures that pass standardization, 44% are modified in the process, reducing the count of unique structures from 53,574,724 in substance to 45,808,881 in compound as identified by de-aromatized canonical isomeric SMILES. Even though the processing time is very low on average (only 0.4% of structures have individual standardization time above 0.1 s), total standardization time is completely dominated by edge cases: 90% of the time to standardize all structures in PubChem substance is spent on the 2.05% of structures with the highest individual standardization time. It is worth noting that 60% of the structures obtained from PubChem structure standardization are not identical to the chemical structure resulting from the InChI (primarily due to preferences for a different tautomeric form). Conclusions Standardization of chemical structures is complicated by the diversity of chemical information and their representations approaches. The PubChem standardization is an effective and efficient tool to account for molecular diversity and to eliminate invalid/incomplete structures. Further development will concentrate on improved tautomer consideration and an expanded stereocenter definition. Modifications are difficult to thoroughly validate, with slight changes often affecting many thousands of structures and various edge cases. The PubChem structure standardization service is accessible as a public resource (https://pubchem.ncbi.nlm.nih.gov/standardize), and via programmatic interfaces

Crossref

ucs.sulsellib.net

Directory of Open Access Journals

Many InChIs and quite some feat

Author: A Barth
A Dalby
A Drefahl
A Gakh
A Gaulton
A Gobbi
A Kazakov
A Kos
A McNaught
A Monge
A Simon
A Toropov
A Tropsha
A Williams
A Yerin
AA Toropov
AA Toropov
AA Toropov
AA Toropov
AE Day
AJ Carroll
AJ Carroll
AJ Lawson
AJ Pawson
AJ Williams
AJ Williams
AJ Williams
AJ Williams
AJ Williams
AJ Williams
AJ Williams
AL Teixeira
AM Richard
AM Richard
AM Wassermann
AP Toropova
AR Kinjo
AT Valko
AV Zakharov
B Chen
B Hardy
B Plainchont
B Zhou
B Zhou
BD McKay
C Bertinetto
C Bertinetto
C Bobach
C Hill
C Laurence
C Ludwig
C Southan
C Southan
C Southan
C Southan
C Steinbeck
C Steinbeck
C Steinbeck
C Zhang
D Goldmann
D Jessop
D Jessop
D Weininger
D Weininger
DR Burgess
DR Burgess
DS Wishart
DS Wishart
DS Wishart
DS Wishart
DS Wishart
DS Wishart
E Fahy
E Gregori-Puigjané
E Martin
E Willighagen
E Zass
E Zass
EE Bolton
EL Schymanski
EL Willighagen
EO Cannon
F Mu
G Grethe
G Ivan
G Ivan
G Iván
G Wohlgemuth
GDJ Davis
GR Magoon
H Haraldsdottir
H Jenkins
H Kalchhauser
H Kraut
H Redestig
HL Morgan
I Pletnev
I Schomburg
ID Brown
IS Yadav
IV Filippov
J Barthelmes
J Chambers
J Chambers
J Choi
J Downing
J Frey
J Frey
J Galgonek
J Gu
J Hastings
J Hastings
J Hummel
J Masciocchi
J Nielsen
J Park
J Peironcely
J Rhodes
J Thibault
J Townsend
JD Westbrook
JG Frey
JG Frey
JJ Langham
JL Sharman
JM Fostel
JN Currano
JR McDaniel
JW May
K Degtyarenko
K Degtyarenko
K Haug
K Henrick
K Hettne
K Nöh
K Tallapragada
K Tanaka
KB Arvidson
KM Hettne
KP Seiler
KR Taylor
L Ahmed
L Chepelev
L Chepelev
L Fabian
L Sumner
LG Nashev
M Annies
M Borkum
M Brown
M Fanton
M Hilbig
M Kuhn
M Kuhn
M Kuhn
M Kuhn
M Lang
M Nowotka
M Rojas-Chertó
M Samwald
M Sitzmann
M Zimmermann
M Zimmermann
MD Prasanna
MD Prasanna
MD Stobbe
MD Stobbe
ME Cass
MH Maeda
MJ Herrgard
MK Gilson
N Jeliazkova
N O’Boyle
N O’Boyle
NM O’Boyle
NM O’Boyle
NT Kochev
O Casher
O Casher
O Fiehn
O Spjuth
O Spjuth
O Spjuth
P Carbonell
P Matos de
P Murray-Rust
P Murray-Rust
P Murray-Rust
P Murray-Rust
P Murray-Rust
P Tiikkainen
PW Rose
R Dunkel
R Gledhill
R Huang
R Kiss
R Klinger
R Ordog
R Ramakrishnan
R Shirley
R Smith
RC Murphy
RD Benz
RD Finn
RJ Schenck
RJM Weber
RW Homer
S Ash
S Bachrach
S Chavan
S Heller
S Kuhn
S Moco
S Muresan
S Muresan
S Orchard
SA Akhondi
SG Spanton
SJ Coles
SJ Coles
SM Bachrach
SP Kelley
SR Heller
SR Johnson
SV Trepalin
T Altman
T Bernard
T Ginex
T Kind
T Liu
T Thalheim
T Thalheim
T Velden
T Will
TJ Bruno
TS Totton
U Rossler
U Schmidt
V Guilloux
V Law
V Ruusmann
V Wakelam
W Bremser
W Ihlenfeldt
W Phadungsukanan
W-D Ihlenfeldt
WA Warr
WA Warr
Wendy A. Warr
X Qu
Y Liu
Y Qiao
Y Sushko
YA Ba
YS Cho
Z Szabadka
Z Szabadka
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref