Modeling reactivity to biological macromolecules with a deep multitask network
Most
small-molecule drug candidates fail before entering the market,
frequently because of unexpected toxicity. Often, toxicity is detected
only late in drug development, because many types of toxicities, especially
idiosyncratic adverse drug reactions (IADRs), are particularly hard
to predict and detect. Moreover, drug-induced liver injury (DILI)
is the most frequent reason drugs are withdrawn from the market and
causes 50% of acute liver failure cases in the United States. A common
mechanism often underlies many types of drug toxicities, including
both DILI and IADRs. Drugs are bioactivated by drug-metabolizing enzymes
into reactive metabolites, which then conjugate to sites in proteins
or DNA to form adducts. DNA adducts are often mutagenic and may alter
the reading and copying of genes and their regulatory elements, causing
gene dysregulation and even triggering cancer. Similarly, protein
adducts can disrupt their normal biological functions and induce harmful
immune responses. Unfortunately, reactive metabolites are not reliably
detected by experiments, and it is also expensive to test drug candidates
for potential to form DNA or protein adducts during the early stages
of drug development. In contrast, computational methods have the potential
to quickly screen for covalent binding potential, thereby flagging
problematic molecules and reducing the total number of necessary experiments.
Here, we train a deep convolutional neural network (the XenoSite
reactivity model) using literature data to accurately predict
both the sites and the probability of reactivity for molecules with glutathione,
cyanide, protein, and DNA. On the site level, cross-validated predictions
had area under the curve (AUC) performances of 89.8% for DNA and 94.4%
for protein. Furthermore, the model separated molecules electrophilically
reactive with DNA and protein from nonreactive molecules with cross-validated
AUC performances of 78.7% and 79.8%, respectively. At both the site
and molecule levels, the model significantly outperformed reactivity
indices derived from quantum simulations reported in the literature.
Moreover, we developed and applied
a selectivity score to assess preferential reactions with the macromolecules
as opposed to the common screening traps. For the entire data set
of 2803 molecules, this approach yielded totals of 257 (9.2%) and
227 (8.1%) molecules predicted to be reactive only with DNA and protein,
respectively, molecules that would therefore be missed by standard reactivity
screening experiments. Site-of-reactivity data are an underutilized
resource that can be used not only to predict whether molecules are reactive,
but also to show where they might be modified to reduce toxicity while
retaining efficacy. The XenoSite reactivity model is available at http://swami.wustl.edu/xenosite/p/reactivity
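As an aside on the metric used in this abstract: the reported AUC values can be computed via the rank (Mann-Whitney) formulation, the probability that a randomly chosen positive example outscores a randomly chosen negative one. A minimal sketch with hypothetical scores and labels, not the authors' evaluation code:

```python
def auc(scores, labels):
    """AUC via the rank (Mann-Whitney) formulation: the probability that
    a randomly chosen positive outscores a randomly chosen negative,
    counting ties as half a win."""
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

A perfect ranking gives 1.0, a random one about 0.5, matching how the 89.8% and 94.4% site-level figures should be read.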
Simple data-driven context-sensitive lemmatization
Lemmatization for languages with rich inflectional morphology is one of the basic, indispensable steps in a language processing pipeline. In this paper we present a simple data-driven context-sensitive approach to lemmatizing word forms in running text. We treat lemmatization as a classification task for Machine Learning, and automatically induce class labels. We achieve this by computing a Shortest Edit Script (SES) between reversed input and output strings. An SES describes the transformations that have to be applied to the
input string (word form) in order to convert it to the output string (lemma). Our approach shows competitive performance on a range of typologically different languages
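The core idea can be sketched in a few lines; this is a minimal illustration using Python's difflib as a stand-in for a true shortest-edit-script computation (difflib's matcher is heuristic, not guaranteed shortest), with the induced class label being the tuple of edit operations:

```python
from difflib import SequenceMatcher

def edit_script(form: str, lemma: str) -> tuple:
    """Derive an edit script between the REVERSED form and lemma.
    Reversal anchors the script at the word ending, where most
    inflectional change happens, so one script generalizes across
    word forms sharing the same inflectional pattern."""
    src, dst = form[::-1], lemma[::-1]
    ops = []
    for tag, i1, i2, j1, j2 in SequenceMatcher(None, src, dst).get_opcodes():
        if tag != "equal":
            ops.append((tag, i1, i2, dst[j1:j2]))  # positions in reversed form
    return tuple(ops)

def apply_script(form: str, script: tuple) -> str:
    """Apply an edit script (a class label) to a possibly unseen form."""
    src = form[::-1]
    out, prev = [], 0
    for tag, i1, i2, repl in script:
        out.append(src[prev:i1])
        out.append(repl)  # '' for deletions, new text for insert/replace
        prev = i2
    out.append(src[prev:])
    return "".join(out)[::-1]
```

For example, the script induced from "walking" → "walk" (delete the reversed prefix "gni") applies unchanged to "talking" → "talk", which is why such scripts work as class labels for a classifier.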
A novel knowledge discovery based approach for supplier risk scoring with application in the HVAC industry
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University London. This research has led to a novel methodology for the assessment and quantification of supply risks in the supply chain. The research has built on advanced Knowledge Discovery techniques and has resulted in a software implementation able to do so. The methodology developed and presented here resembles the well-known consumer credit scoring methods, as it leads to a similar metric, or score, for assessing a supplier’s reliability and the risk of conducting business with that supplier. However, the focus is on a wide range of operational metrics rather than just the financial ones on which credit scoring techniques typically focus.
The core of the methodology comprises the application of Knowledge Discovery techniques to extract the likelihood of possible risks from within a range of available datasets. In combination with cross-impact analysis, those datasets are examined to establish the inter-relationships and mutual connections among several factors that are likely to contribute to risks associated with particular suppliers. This approach is called conjugation analysis. The resulting parameters become the inputs into a logistic regression, which leads to a risk scoring model. The outcome of the process is a standardized risk score, analogous to the well-known consumer risk scoring model better known as the FICO score.
The proposed methodology has been applied to an air conditioning manufacturing company. Two models have been developed. The first identifies supply risks based on data about purchase orders and selected risk factors. With this model, the likelihoods of delivery failures, quality failures and cost failures are obtained. The second model built on the first one but also used actual data about the performance of suppliers to identify the risks of conducting business with particular suppliers. Its target was to provide quantitative measures of an individual supplier’s risk level.
The supplier risk scoring model was tested on data acquired from the company for performance analysis. It achieved 86.2% accuracy, while the area under the curve (AUC) was 0.863, well above the 0.5 threshold required for model validity, indicating the developed model’s validity and reliability on future data. The numerical studies conducted with real-life datasets have demonstrated the effectiveness of the proposed methodology and system, as well as its potential for future industrial adoption
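The logistic-regression-to-score step described above can be illustrated with a toy sketch. The features, data, and scale mapping here are hypothetical, and plain-Python gradient descent stands in for the statistical tooling the thesis would actually use:

```python
import math

def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Plain stochastic-gradient-descent logistic regression
    (illustrative only, not the thesis implementation)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            p = 1 / (1 + math.exp(-(sum(wj * xj for wj, xj in zip(w, xi)) + b)))
            err = p - yi  # gradient of log-loss w.r.t. the linear output
            w = [wj - lr * err * xj for wj, xj in zip(w, xi)]
            b -= lr * err
    return w, b

def risk_score(x, w, b, base=300, span=550):
    """Map failure probability to a FICO-like 300-850 scale:
    lower predicted probability of supply failure -> higher score."""
    p = 1 / (1 + math.exp(-(sum(wj * xj for wj, xj in zip(w, x)) + b)))
    return round(base + span * (1 - p))

# Hypothetical supplier features: [late-delivery rate, defect rate]
X = [[0.05, 0.01], [0.40, 0.10], [0.10, 0.02], [0.60, 0.20]]
y = [0, 1, 0, 1]  # 1 = a supply failure was observed
w, b = fit_logistic(X, y)
```

A reliable supplier (low late-delivery and defect rates) then receives a higher score than a frequently failing one, mirroring how the standardized risk score ranks suppliers.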
On new maximal supergravity and its BPS domain-walls
We revisit the SU(3)-invariant sector of maximal supergravity with
dyonic SO(8) gaugings. By using the embedding tensor formalism, analytic
expressions for the scalar potential, superpotential(s) and fermion mass terms
are obtained as a function of the electromagnetic phase and the
scalars in the theory. Equipped with these results, we explore
non-supersymmetric AdS critical points for which
perturbative stability could not be analysed before. The phase-dependent
superpotential is then used to derive first-order flow equations and obtain new
BPS domain-wall solutions. We numerically look at
steepest-descent paths motivated by the (conjectured) RG flows.
Comment: 40 pages (30 pages + appendices), 3 tables, 6 figures. v2: References
added and discussion in section 4.2 clarified. v3: References added,
published version. v4: Fixed typo
Open-source resources and standards for Arabic word structure analysis: Fine grained morphological analysis of Arabic text corpora
Morphological analyzers are preprocessors for text analysis. Many Text Analytics applications need them to perform their tasks. The aim of this thesis is to develop
standards, tools and resources that widen the scope of Arabic word structure analysis - particularly morphological analysis, to process Arabic text corpora of different domains, formats and genres, of both vowelized and non-vowelized text.
We want to morphologically tag our Arabic Corpus, but evaluation of existing morphological analyzers has highlighted shortcomings and shown that more research is
required. Tag-assignment is significantly more complex for Arabic than for many languages. The morphological analyzer should add the appropriate linguistic information
to each part or morpheme of the word (proclitic, prefix, stem, suffix and enclitic); in effect, instead of a tag for a word, we need a subtag for each part.
Very fine-grained distinctions may cause problems for automatic morphosyntactic analysis, particularly for probabilistic taggers, which require training data, if some words can change grammatical tag depending on function and context; on the other hand, fine-grained distinctions may actually help to disambiguate other words in the local context. The SALMA – Tagger is a fine-grained morphological analyzer which mainly depends on linguistic information extracted from traditional Arabic grammar books and on a prior-knowledge broad-coverage lexical resource, the SALMA – ABCLexicon.
More fine-grained tag sets may be more appropriate for some tasks. The SALMA – Tag Set is a theory standard for encoding, which captures long-established traditional fine-grained morphological features of Arabic in a notation format intended to be compact yet transparent.
The SALMA – Tagger has been used to lemmatize the 176-million-word Arabic Internet Corpus. It has been proposed as a language-engineering toolkit for Arabic lexicography and for phonetically annotating the Qur’an with syllable and primary stress information, as well as for fine-grained morphological tagging
Advances in structure elucidation of small molecules using mass spectrometry
The structural elucidation of small molecules using mass spectrometry plays an important role in modern life sciences and bioanalytical approaches. This review covers different soft and hard ionization techniques and figures of merit for modern mass spectrometers, such as mass resolving power, mass accuracy, isotopic abundance accuracy, accurate mass multiple-stage MS(n) capability, as well as hybrid mass spectrometric and orthogonal chromatographic approaches. The latter part discusses mass spectral data handling strategies, which include background and noise subtraction, adduct formation and detection, charge state determination, accurate mass measurements, elemental composition determinations, and complex data-dependent setups with ion maps and ion trees. The importance of mass spectral library search algorithms for tandem mass spectra and multiple-stage MS(n) mass spectra, as well as mass spectral tree libraries that combine multiple-stage mass spectra, is outlined. The subsequent chapter discusses mass spectral fragmentation pathways, biotransformation reactions and drug metabolism studies, the mass spectral simulation and generation of in silico mass spectra, expert systems for mass spectral interpretation, and the use of computational chemistry to explain gas-phase phenomena. A single chapter discusses data handling for hyphenated approaches including mass spectral deconvolution for clean mass spectra, cheminformatics approaches and structure retention relationships, and retention index predictions for gas and liquid chromatography. The last section reviews the current state of electronic data sharing of mass spectra and discusses the importance of software development for the advancement of structure elucidation of small molecules
Acoustic seafloor classification using the Weyl transform of multibeam echosounder backscatter mosaic
The use of multibeam echosounder systems (MBES) for detailed seafloor mapping is increasing at a fast pace. Due to their design, enabling continuous high-density measurements and the coregistration of the seafloor’s depth and reflectivity, MBES have become a fundamental instrument in the advancing field of acoustic seafloor classification (ASC). With these data becoming available, recent seafloor mapping research focuses on the interpretation of the hydroacoustic data and automated predictive modeling of seafloor composition. While a methodological consensus on seafloor sediment classification algorithms and routines does not exist in the scientific community, it is expected that progress will occur through the refinement of each stage of the ASC pipeline, ranging from data acquisition to the modeling phase. This research focuses on the feature extraction stage, wherein the spatial variables used for the classification are, in this case, derived from the MBES backscatter data. This contribution explored the sediment classification potential of a textural feature based on the recently introduced Weyl transform of 300 kHz MBES backscatter imagery acquired over a nearshore study site in Belgian Waters. The goodness of the Weyl transform textural feature for seafloor sediment classification was assessed in terms of cluster separation of Folk’s sedimentological categories (4-class scheme). Class separation potential was quantified at multiple spatial scales by cluster silhouette coefficients. Weyl features derived from MBES backscatter data were found to exhibit superior thematic class separation compared to other well-established textural features, namely: (1) First-order Statistics, (2) Gray Level Co-occurrence Matrices (GLCM), (3) Wavelet Transform and (4) Local Binary Pattern (LBP).
Finally, by employing a Random Forest (RF) categorical classifier, the value of the proposed textural feature for seafloor sediment mapping was confirmed in terms of global and by-class classification accuracies, highest for models based on the backscatter Weyl features. Further tests on different backscatter datasets and sediment classification schemes are required to further elucidate the use of the Weyl transform of MBES backscatter imagery in the context of seafloor mapping
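The silhouette-coefficient evaluation mentioned above has a compact definition; this is a minimal stdlib sketch with toy feature vectors, not the study's actual pipeline:

```python
def silhouette(features, labels):
    """Mean silhouette coefficient over labeled feature vectors.
    For each point: a = mean distance to its own class, b = mean
    distance to the nearest other class; s = (b - a) / max(a, b).
    Values near +1 indicate compact, well-separated classes."""
    def dist(p, q):
        return sum((pi - qi) ** 2 for pi, qi in zip(p, q)) ** 0.5

    scores = []
    for i, (xi, li) in enumerate(zip(features, labels)):
        same = [dist(xi, xj)
                for j, (xj, lj) in enumerate(zip(features, labels))
                if j != i and lj == li]
        if not same:
            continue  # singleton classes have no defined silhouette
        a = sum(same) / len(same)
        b = min(
            sum(dist(xi, xj) for xj, lj in zip(features, labels) if lj == lab)
            / sum(1 for lj in labels if lj == lab)
            for lab in set(labels) if lab != li
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)
```

Applied to texture features labeled with sediment classes, a higher mean silhouette for the Weyl features than for GLCM or LBP features is exactly the kind of evidence the separation claim rests on.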