Automated Identification and Classification of Stereochemistry: Chirality and Double Bond Stereoisomerism

Author: Falcao Andre O
Leal João P.
Teixeira Ana L.
Publication date: 27/02/2013

Stereoisomers have the same molecular formula and the same atom connectivity, and their existence can be related to the presence of different three-dimensional arrangements. Stereoisomerism is of great importance in many different fields since the molecular properties and biological effects of stereoisomers are often significantly different. Most drugs, for example, are composed of a single stereoisomer of a compound, and while one stereoisomer may have therapeutic effects on the body, another may be toxic. A challenging task is the automatic detection of stereoisomers using line input specifications such as SMILES or InChI, since it requires information about group theory (to distinguish stereoisomers using mathematical information about their symmetry), topology and geometry of the molecule. There are several software packages that include modules to handle stereochemistry, especially those that name a chemical structure and/or view, edit and generate chemical structure diagrams. However, there is a lack of software capable of automatically analyzing a molecule represented as a graph and generating a classification of the type of isomerism present in a given atom or bond. Considering the importance of stereoisomerism when comparing chemical structures, this report describes a computer program for analyzing and processing steric information contained in a chemical structure represented as a molecular graph and providing as output a binary classification of the isomer type based on the recommended conventions. Due to the complexity of the underlying issue, specification of stereochemical information is currently limited to explicit stereochemistry and to the two most common types of stereochemistry caused by asymmetry around carbon atoms: chiral atom and double bond. A Webtool to automatically identify and classify stereochemistry is available at http://nams.lasige.di.fc.ul.pt/tools.ph

arXiv.org e-Print Archive

Universidade de Lisboa: Repositório.UL

Rationale, study design, and analysis plan of the Alveolar Recruitment for ARDS Trial (ART): Study protocol for a randomized controlled trial

Author: Abreu Matheus O.
Acerbi Paulo S. C.
Albuquerque Regis B.
Aldrighi Jose R.
Alencastro Andre
Almeida Marilia S.
Almeida Patricia S.
Almeida Samara D.
Alves Janine D.
Amaral Karine A. E. H.
Amaro Andreson F.
Amato Marcelo B. P.
Amorim Denise S.
Amorim Fabio F.
Andrade Ana H. V.
Andrade Isaac G.
Andrade Lucia C.
Andrade Luciana A. S.
Andrade Wandalvo
Apoena Pablo
Araujo Neto Jose A.
Araujo Arthur C.
Araujo Jose F.
Araujo Marcelo E. U.
Araujo Mario F. A.
Arduini Rodrigo G.
Assef Maria G. P. L.
Azevedo Luciano C. P.
Backes Fabiane
Bainy Marina P.
Baptista Filho Mario L. A.
Barbosa Pierry O.
Barra Williams F.
Barros Dalton
Baruzzi Claudio
Bastos Ana C. A. G.
Bastos Rafael S.
Batista Roseane A.
Becker Daniel A.
Bergo Ricardo R.
Berto Paula
Berwanger Otavio
Bigolin Rodrigo
Bitencourt Wesley S.
Blattner Clarissa
Boldo Rodrigo
Boniatti Marcio M.
Borges Marcos C.
Bozi Giovana G.
Brandao Andre L. S. B.
Brilenger Caroline O.
Brilhante Yuzeth Nobrega
Broilo Fabiano P.
Burigo Ana C.
Cacau Lucas A. P.
Caixeta Carlos R.
Caldeira Milton
Canavessi Hugo Schlebinger
Canzi Regina A.
Cappi Sylas B.
Carbonell Roberto C. C.
Cardoso Lucienne T. Q.
Carmona Cesar V.
Carneiro Mauricio
Carneiro Saul R.
Carvalho Alexandre G. R.
Carvalho Carlos R. R.
Carvalho Fabricio R. T.
Carvalho Fernanda L. G.
Carvalho Frederico B.
Carvalho Ivana L. V.
Carvalho Vitor O.
Carvalho Waneska L. N.
Caser Eliana
Castanelo Carlos
Cavalcante Eulalia
Cavalcanti Alexandre B.
Cavalcanti Raphael Ali
Cerantola Rodrigo B.
Chagas Filho Aldir
Chamy Gauco
Chaves Filho Francisco G.
Chiasso Tatiana M.
Coelho Adalberto M.
Coelho Edward, Jr.
Coelho Milena P. P. M.
Conde Katia A. P.
Cordeiro Rodrigo B.
Correa Fabiano G.
Correa Tiago A.
Correa Viviane M.
Correia Emmanuel I. S.
Cortegiani Andrea
Costa Andre F.
Costa Maristela C.
Costa Ramon T.
Couto Wivian A. D.
Cramer Amanda S.
Cunha Adenard F. C.
Dadam Michelli M.
Damasceno Bruna
de Oliveira Filho Wilson
Decio Janaina C.
Demarzo Sergio E.
Dias Alysson P.
Dias Fernando S.
Dias Polyana P. L. C.
Diaz S. Edgard
Diaz-Quijano Fredi A.
Domingues Sergio M., Jr.
Dragosavak Desanka
Duarte Juliana N.
Duarte Pericles A. D.
Duarte Robson
Dutra Victor G.
Eberhart Neto Ervin
Falcao Antonio
Falcao Jansen G.
Farran Jorge
Felizardo Livia R. S. M.
Ferguson Niall
Ferraz Iris L.
Ferreira Neto Fleury
Ferreira Ana P.
Ferreira Cassia M.
Ferreira Claiton S.
Ferreira Denise M.
Ferreira Edgard V.
Ferreira Elaine M.
Ferreira Firmino H., Jr.
Ferreira Reinaldo
Festti Josiane
Fialkow Lea
Figueiredo Adelaide C.
Figueiredo Luciana C.
Filgueira Feto Jose E.
Flores Dimitri G.
Foernges Rafael B.
Franca Gustavo G. P.
Francisco Renata S.
Franke Cristiano
Galassi Marcela S.
Gallego Raquel C. N.
Galvao Endi L.
Garcez Melissa C. M.
Garcia L. Sandra M. C.
Gatti Ciro
Geha Nadia N.
Germano Almir
Germiniani Bruno C.
Giacomassi Ivens W. S.
Giancursi Thiago S.
Giannini Fabio
Gil Fernando S. U.
Gimenez Francielli M. P.
Giuberti Adriana F. T.
Giuberti Jonas, Jr.
Gois Aecio
Gomes Bruno C.
Gomes Tania M.
Goncalves Neto Graciliano J. L.
Goncalves Fernanda A. F.
Goncalves Iran, Jr.
Gonzalez C. Octavio
Gorski Anthony G.
Grion Cintia M. C.
Guadalupe Erika G. L.
Guerra Andre
Guerreiro Marcio O.
Guimaraes Daniela M. Q. S.
Guimaraes Helio P.
Gurgel Sanderland J. T.
Guyatt Gordon
Hajjar Ludhmila A.
Heirel Debora C. B.
Henriques Lilian A.
Herek Andrea
Hinestrosa Alfredo
Hirota Adriana S.
Holanda Marcelo A.
Hopf Joao L. S.
Horner Marina B. W.
Inagaki Alexandre S.
Infantini Rodrigo M.
Isola Alexandre
Jesus Karinne R.
Kauss Ivanil A. M.
Kawaguchi Ines A. L.
Kazue Priscila
Kleber Wladimy
Kleber Wladmy
Kodama Alessandra A.
Kretzer Lara
Kuroda Cristina M.
Lago Roberto
Laprovita Maria P.
Larangeira Alexandre S.
Laube Gilberto, Jr.
Leao Rosangela M.
Leite Petronio A.
Lima Cyntia M. L. S.
Lima Fernando B.
Lima Maria H. B. S.
Lima Zildamara B.
Lira Jose A.
Lopes Renato D.
Lovato Wilson
Lucena Debora N. L.
Lunardi Maria C.
Luzzi Sergio
Maccari Juara G.
Macedo Pedro L.
Machado Fernando O.
Machado Flavia R.
Machado Luis A. M. W.
Machado Nelma J. N.
Magalhaes Nascimento Francine J.
Maia Israel S.
Maia Marcelo O.
Malbouison Luiz M.
Marcilino Antonielen
Margalho Silviano B.
Margarida Kathia
Marino Nathalia F.
Marques Leonardo S.
Marraccini Thiago
Martinez Amadeu
Martins Edna T. J.
Martins Eliauria R.
Martins Gloria A.
Martins Marcele F.
Martins Marcio A.
Martins Mariana L.
Matsubara Rosely R.
Matsui Mirna
Mazza Bruno F.
Mecatti Giovana C.
Medeiros Luciana G.
Mendez Vanessa M. F.
Mendonca Angela
Menescal Brena
Merluzzi Thalita
Mezzaroba Ana L.
Miranda Whiniton
Miura Claudia
Monteiro Livia L.
Moraes Ana P. P.
Moraes Rafael B.
Morais Jussara E. P.
Morais Mirene O.
Morales Daniela
Moreira Cora L. C. B.
Moreira Fabiana B. R.
Moreira Monique A.
Moreno Marcelo S.
Morong Aline S.
Morsch Rafaela D.
Nassar Antonio P., Jr.
Naves Sergio A.
Nedel Wagner
Neto Jeronimo C. B.
Nienstedt Esteban C.
Nobrega Marciano S.
Nogueira Filho Wilson
Nogueira Eduardo E. F.
Nomoto Silmara H.
Nunes Andre L.
Odir Isaura
Oliveira Aline E.
Oliveira Andre L. V.
Oliveira Clezio S.
Oliveira Glauce L.
Oliveira Ivonaldo M.
Oliveira Katia R.
Oliveira Luiz R. C.
Oliveira Marcelo E.
Oliveira Roselaine P.
Oliveira Tatiana A.
Oliveira Vanessa M.
Oliver Wilson R.
Ornellas Izadora B.
Orsatti Vinicius N.
Park Marcelo
Passos Denise B. V. G.
Patrocinio Ana C. L.
Paula Ludmila N.
Pavia Caio L. P.
Pedro A.
Peixoto Daniela C.
Peliser Priscila
Pellegrini Jose A. S.
Pena Felipe M.
Pereira Antonio J.
Pereira Cesar A.
Pereira Sheila A.
Pincelli Mariangela
Pinheiro Filho Gilvan R.
Pinto Walkyria A. M.
Piovesan Maysa Z. R.
Piras Claudio
Pizarro Camilo
Pompilio Carlos E.
Poquiriqui Rodolfo M. B.
Potratz Jorge L.
Prado Karen F.
Prado Luiz F. A.
Rabelo Livia A.
Rabelo Melina V.
Rahal Luciana
Raineri Santi M.
Ramos Joroastro E., Jr.
Rangel Vivian P. L.
Ray Alexandre
Rech Tatiana H.
Regenga Marisa M.
Rego Leila R. M.
Reis Andrezza T. J. B.
Reis Diego L.
Reis Helder
Rezende Claudnei M.
Rezende Ederlon
Rezende Ederlon A. C.
Rezende Valeria M. C.
Ribeiro Gisele F. M.
Ribeiro Rubens A. B.
Ribeiro Wagner
Rieder Marcelo M.
Rocha Edson P.
Rodrigues Ricardo G.
Romano Edson
Romano Marcelo
Rossetti Santana Heloisa B.
Rosso Deorgelis
Sa Alexandre
Sala Andrea D.
Sales Joao A. L., Jr.
Salgado Diamantino R.
Salomao Maria C.
Sandri Priscila
Santana Hericalizandra S. R.
Santiago Roberta R. S.
Santos Gheisa D.
Santos Grazielle O.
Santos Jose R. P.
Santos Lucio S.
Santos Moreno C.
Santos Patricia L.
Santos Priscila J. C. D.
Santos Thiago M.
Sarat Saturnino Campo, Jr.
Savi Augusto
Scaglia Nris C.
Schaich Felipe
Schettino Guilherme P. P.
Schiavetto Paulo M.
Schievano Fabiana R.
Schulz Luis F.
Schwarz Patricia
Seeger Gabriela M.
Segundo Valerio J.
Severino Marta A.
Silva Albano S.
Silva Aline C. F.
Silva Alline O.
Silva Anselmo C.
Silva Dafne C. B.
Silva Fabiano D.
Silva Nelson B.
Silva Patricia N.
Silva Rosicley S.
Silva Rozangela R.
Silva Sabrina F.
Silva Sandra R. B.
Silva Silvangela G. A.
Smith Thiago C.
Sousa Neto Jefferson A.
Sousa Marcelo F.
Souza Ana R.
Souza Marcia L. V. D.
Souza Marcia M. F.
Starling Claudia M.
Suguitani Edmundo O.
Suzuki Vivian C.
Suzumura Erica A.
Tagliari Luciana
Taino Bruno
Taira Elisabete E.
Takahashi Luzia
Takahashi Luzia N.
Takatani Rodrigo R.
Tallo Fernando S.
Tamazato Edys Y.
Taniguchi Leandro
Tarkieltaub Elcio
Tavares Daniele
Tavares Daniele C. C.
Tavares Marcel V.
Tavares Roberta C.
Tcherniakovisk Leo
Teixeira Cassiano
Teixeira Cristina
Teixeira Luciano O.
Telles Jose M. M.
Thompson Marlus M.
Toledo Diogo
Tomizuka Carlos I.
Torres Franciele C. C.
Toufen Carlos, Jr.
Tozo Tatiane C.
Trindade Renata S.
Tucci Mauro
Uratani Cristiana C. S.
Vanzuita Raquel
Vasconcelos Marcia O. M.
Vasconcelos Paula T.
Vassallo Paula F.
Vendrame Leticia S.
Verdeal Juan C.
Vianna Arthur
Vieira Debora F. V. B.
Vieira Silvia R. R.
Vieira Vitor M.
Viera Filho Edesio
Walter Stephen
Wawrzeniak Iuri C.
Werneck Vinicius
Westphal Glauco
Winveler Georgia F. P.
Wysocki Natacha
Xavier Patricia A.
Yamada Sergio S.
Zanta Camila C.
Zoghbi Karina K.
Publication venue: LONDON
Publication date: 28/08/2012

Background: Acute respiratory distress syndrome (ARDS) is associated with high in-hospital mortality. Alveolar recruitment followed by ventilation at optimal titrated PEEP may reduce ventilator-induced lung injury and improve oxygenation in patients with ARDS, but the effects on mortality and other clinical outcomes remain unknown. This article reports the rationale, study design, and analysis plan of the Alveolar Recruitment for ARDS Trial (ART). Methods/Design: ART is a pragmatic, multicenter, randomized (concealed), controlled trial, which aims to determine if maximum stepwise alveolar recruitment associated with PEEP titration is able to increase 28-day survival in patients with ARDS compared to conventional treatment (ARDSNet strategy). We will enroll adult patients with ARDS of less than 72 h duration. The intervention group will receive an alveolar recruitment maneuver, with stepwise increases of PEEP achieving 45 cmH2O and peak pressure of 60 cmH2O, followed by ventilation with optimal PEEP titrated according to the static compliance of the respiratory system. In the control group, mechanical ventilation will follow a conventional protocol (ARDSNet). In both groups, we will use controlled volume mode with low tidal volumes (4 to 6 mL/kg of predicted body weight), targeting a plateau pressure <= 30 cmH2O. The primary outcome is 28-day survival, and the secondary outcomes are: length of ICU stay; length of hospital stay; pneumothorax requiring chest tube during first 7 days; barotrauma during first 7 days; mechanical ventilation-free days from days 1 to 28; ICU, in-hospital, and 6-month survival. ART is an event-guided trial planned to last until 520 events (deaths within 28 days) are observed. These events allow detection of a hazard ratio of 0.75, with 90% power and two-tailed type I error of 5%. All analyses will follow the intention-to-treat principle.
Discussion: If the ART strategy with maximum recruitment and PEEP titration improves 28-day survival, this will represent a notable advance in the care of ARDS patients. Conversely, if the ART strategy is similar or inferior to the current evidence-based strategy (ARDSNet), this should also change current practice, as many institutions routinely employ recruitment maneuvers and set PEEP levels according to some titration method.

Hospital do Coracao (HCor) as part of the Program 'Hospitais de Excelencia a Servico do SUS (PROADI-SUS)'; Brazilian Ministry of Health

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositório Institucional UNIFESP

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Universidade de São Paulo

Crowdsourced assessment of common genetic contribution to predicting anti-TNF treatment response in rheumatoid arthritis

Author: Aguilar Daniel
Aittokallio Tero
Allaart Cornelia F.
Ammad-ud-din Muhammad
Anton Bernat
Azencott Chloe-Agathe
Balagurusamy Venkat S. K.
Barton Anne
Bellón Víctor
Boeva Valentina
Bonet Jaume
Bridges S. Louis
Bunte Kerstin
Calaza Manuel
Cheng Lu
Chheda Himanshu
Coenen Marieke
Corander Jukka
Criswell Lindsey
Cui Jing
de Vries Niek
Dillenberger Donna
Dumontier Michel
Eksi Ridvan
Elmarakeby Haitham
Falcao Andre O.
Fornés Oriol
Friend Stephen
García-García Javier
Gerlag Danielle
Goldenberg Anna
Gopalacharyulu Peddinti
Greenberg Jeff
Gregersen Peter K.
Guan Yuanfang
Guney Emre
Hajiloo Mohsen
Heath Lenwood S.
Hidru Daniel
Hoff Bruce
Huizinga Tom W. J.
Jaiswal Alok
Kaski Samuel
Khalfaoui Beyrem
Khan Suleiman Ali
Klareskog Lars
Kramer Eric R.
Kremer Joel
Kurreeman Fina
Li Hongdong
Long Quan
Louis Bridges Jr. S.
Mangravite Lara M.
Mariette Xavier
Marttinen Pekka
Marín Manuel Alejandro
Mezlini Aziz M.
Miceli Corinne
Michaud Kaleb
Molparia Bhuvan
Moore Jonathan D.
Moreland Larry
Neto Elias Chaibub
Norman Thea
Oliva Baldo
Opiyo Stephen Obol
Padyukov Leonid
Pandey Gaurav
Panwar Bharat
Pappas Dimitrios
Pirinen Matti
Planas-Iglesias Joan
Plenge Robert
Poglayen Daniel
Pratap Abhishek
Saarela Janna
Saevarsdottir Saedis
Samwald Matthias
Savage Richard S.
Shadick Nancy
Sieberts Solveig K.
Stahl Eli
Stolovitzky Gustavo
Stoven Véronique
Suver Christine
Tak Paul P.
Tang Hao
Tang Jing
Torkamani Ali
Vert Jean-Phillipe
Wang Bo
Wang Tao
Weinblatt Michael
Wennerberg Krister
Wineinger Nathan E.
Xiao Guanghua
Xie Yang
Yeung Rae
Zhan Xiaowei
Zhao Cheng
Zhu Fan
Zhu Jun
Publication date: 01/01/2016

Correction: vol 7, 13205, 2016, doi:10.1038/ncomms13205

Rheumatoid arthritis (RA) affects millions world-wide. While anti-TNF treatment is widely used to reduce disease progression, treatment fails in ~one-third of patients. No biomarker currently exists that identifies non-responders before treatment. A rigorous community-based assessment of the utility of SNP data for predicting anti-TNF treatment efficacy in RA patients was performed in the context of a DREAM Challenge (http://www.synapse.org/RA_Challenge). An open challenge framework enabled the comparative evaluation of predictions developed by 73 research groups using the most comprehensive available data and covering a wide range of state-of-the-art modelling methodologies. Despite a significant genetic heritability estimate of the treatment non-response trait (h² = 0.18, P value = 0.02), no significant genetic contribution to prediction accuracy is observed. Results formally confirm the expectations of the rheumatology community that SNP information does not significantly improve predictive performance relative to standard clinical traits, thereby justifying a refocusing of future efforts on collection of other data.

Peer reviewed

Analysis and Comparison of Vector Space and Metric Space Representations in QSAR Modeling

Author: Andre O. Falcao
Samina Kausar
Publication venue: 'MDPI AG'
Publication date: 01/04/2019

The performance of quantitative structure-activity relationship (QSAR) models largely depends on the relevance of the selected molecular representation used as input data matrices. This work presents a thorough comparative analysis of two main categories of molecular representations (vector space and metric space) for fitting robust machine learning models in QSAR problems. For the assessment of these methods, seven different molecular representations that included RDKit descriptors, five different fingerprint types (MACCS, PubChem, FP2-based, Atom Pair, and ECFP4), and a graph matching approach (non-contiguous atom matching structure similarity; NAMS) in both vector space and metric space, were subjected to state-of-the-art machine learning methods that included different dimensionality reduction methods (feature selection and linear dimensionality reduction). Five distinct QSAR data sets were used for direct assessment and analysis. Results show that, in general, metric-space and vector-space representations are able to produce equivalent models, but there are significant differences between individual approaches. The NAMS-based similarity approach consistently outperformed most fingerprint representations in model quality, closely followed by Atom Pair fingerprints. To further verify these findings, the metric space-based models were fitted to the same data sets with the closest neighbors removed. These latter results further strengthened the above conclusions. The metric space graph-based approach appeared significantly superior to the other representations, albeit at a significant computational cost.

Directory of Open Access Journals

Noncontiguous Atom Matching Structural Similarity Function

Author: Ana L. Teixeira (443973)
Andre O. Falcao (1773145)

Measuring similarity between molecules is a fundamental problem in cheminformatics. Given that similar molecules tend to have similar physical, chemical, and biological properties, the notion of molecular similarity plays an important role in the exploration of molecular data sets, query-retrieval in molecular databases, and in structure–property/activity modeling. Various methods to define structural similarity between molecules are available in the literature, but so far none has been used with consistent and reliable results for all situations. We propose a new similarity method based on atom alignment for the analysis of structural similarity between molecules. This method is based on the comparison of the bonding profiles of atoms on comparable molecules, including features that are seldom found in other structural or graph matching approaches, such as chirality or double bond stereoisomerism. The similarity measure is then defined on the annotated molecular graph, based on an iterative directed graph similarity procedure and optimal atom alignment between atoms using a pairwise matching algorithm. With the proposed approach the similarities detected are more intuitively understood because similar atoms in the molecules are explicitly shown. This noncontiguous atom matching structural similarity method (NAMS) was tested and compared with one of the most widely used similarity methods (fingerprint-based similarity) using three difficult data sets with different characteristics. Despite having a higher computational cost, the method performed well, being able to distinguish either different or very similar hydrocarbons that were indistinguishable using a fingerprint-based approach.
NAMS also verified the similarity principle using a data set of structurally similar steroids with differences in the binding affinity to the corticosteroid binding globulin receptor by showing that pairs of steroids with a high degree of similarity (>80%) tend to have smaller differences in the absolute value of binding activity. Using a highly diverse set of compounds with information about the monoamine oxidase inhibition level, the method was also able to recover a significantly higher average fraction of active compounds when the seed is active for different cutoff threshold values of similarity. In particular, for the cutoff threshold values of 86%, 93%, and 96.5%, NAMS was able to recover a fraction of actives of 0.57, 0.63, and 0.83, respectively, while the fingerprint-based approach was able to recover a fraction of actives of 0.41, 0.40, and 0.39, respectively. NAMS is made freely available to the whole community as a simple Web-based tool, together with the Python source code, at http://nams.lasige.di.fc.ul.pt/
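
To make the "fraction of actives recovered above a similarity cutoff" evaluation concrete, the following is a minimal sketch of a fingerprint-style baseline. It is not the NAMS algorithm and not the authors' code. It assumes fingerprints are represented as sets of on-bit indices and that similarity is the Tanimoto coefficient, which the abstract does not specify; the function names `tanimoto` and `fraction_of_actives` and all example data are hypothetical.

```python
from typing import AbstractSet, List, Optional, Tuple


def tanimoto(fp_a: AbstractSet[int], fp_b: AbstractSet[int]) -> float:
    """Tanimoto coefficient between two fingerprints given as sets of on-bits."""
    union = len(fp_a | fp_b)
    if union == 0:
        # Two empty fingerprints: treated as 0.0 in this sketch (an assumption).
        return 0.0
    return len(fp_a & fp_b) / union


def fraction_of_actives(
    seed_fp: AbstractSet[int],
    library: List[Tuple[AbstractSet[int], bool]],
    cutoff: float,
) -> Optional[float]:
    """Fraction of active compounds among library entries whose similarity
    to the seed is at least `cutoff`. Returns None if nothing passes."""
    hits = [active for fp, active in library if tanimoto(seed_fp, fp) >= cutoff]
    if not hits:
        return None
    return sum(hits) / len(hits)


# Hypothetical toy data: (fingerprint, is_active)
library = [
    ({1, 2, 3, 4}, True),
    ({1, 2, 3, 5}, False),
    ({9, 10}, True),
]
seed = {1, 2, 3, 4}
print(fraction_of_actives(seed, library, cutoff=0.5))
```

Raising the cutoff shrinks the neighbourhood around the seed; the abstract reports how the recovered fraction of actives changes with the cutoff for each similarity method.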

FigShare

Structural Similarity Based Kriging for Quantitative Structure Activity and Property Relationship Modeling

Author: Ana L. Teixeira (443973)
Andre O. Falcao (1773145)

Structurally similar molecules tend to have similar properties, i.e. closer molecules in the molecular space are more likely to yield similar property values while distant molecules are more likely to yield different values. Based on this principle, we propose the use of a new method that takes into account the high dimensionality of the molecular space, predicting chemical, physical, or biological properties based on the most similar compounds with measured properties. This methodology uses ordinary kriging coupled with three different molecular similarity approaches (based on molecular descriptors, fingerprints, and atom matching), which creates an interpolation map over the molecular space that is capable of predicting properties/activities for diverse chemical data sets. The proposed method was tested on two data sets of diverse chemical compounds collected from the literature and preprocessed. One of the data sets contained dihydrofolate reductase inhibition activity data, and the second contained molecules for which aqueous solubility was known. The overall predictive results using kriging for both data sets are in line with the results obtained in the literature using typical QSPR/QSAR approaches. However, the procedure did not involve any type of descriptor selection or even minimal information about each problem, suggesting that this approach is directly applicable to a large spectrum of problems in QSAR/QSPR. Furthermore, the predictive results improve significantly with the similarity threshold between the training and testing compounds, allowing the definition of a confidence threshold of similarity and error estimation for each case inferred. The use of kriging for interpolation over the molecular metric space is independent of the training data set size, no reparametrizations are necessary when compounds are added to or removed from the set, and increasing the size of the database will consequently improve the quality of the estimations.
Finally, it is shown that this model can be used for checking the consistency of measured data and for guiding an extension of the training set by determining the regions of the molecular space for which new experimental measurements could be used to maximize the model’s predictive performance.

FigShare

A novel algorithm for feature selection using Harmony Search and its application for non-technical losses detection

Author: Chiachia Giovani
Falcao Alexandre X.
Papa João Paulo
Ramos Caio C. O.
Souza Andre N.
Publication venue: Pergamon-Elsevier B.V. Ltd

Finding an optimal subset of features that maximizes classification accuracy is still an open problem. In this paper, we exploit the speed of the Harmony Search algorithm and the Optimum-Path Forest classifier in order to propose a new fast and accurate approach for feature selection. Comparisons to some other pattern recognition and feature selection techniques showed that the proposed hybrid algorithm for feature selection outperformed them. The experiments were carried out in the context of identifying non-technical losses in power distribution systems. (C) 2011 Elsevier Ltd. All rights reserved.

Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES); Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)

A Bayesian Approach to in Silico Blood-Brain Barrier Penetration Modeling

Author: Ana L. Teixeira (443973)
Andre O. Falcao (1773145)
Ines Filipa Martins (2071402)
Luis Pinheiro (2071399)

The human blood-brain barrier (BBB) is a membrane that protects the central nervous system (CNS) by restricting the passage of solutes. The development of any new drug must take into account its existence, whether for designing new molecules that target components of the CNS or, on the other hand, to find new substances that should not penetrate the barrier. Several studies in the literature have attempted to predict BBB penetration, so far with limited success and few, if any, applications to real-world drug discovery and development programs. Part of the reason is that only about 2% of small molecules can cross the BBB, and the available data sets are not representative of that reality, being generally biased with an over-representation of molecules that show an ability to permeate the BBB (BBB positives). To circumvent this limitation, the current study aims to devise and use a new approach based on Bayesian statistics, coupled with state-of-the-art machine learning methods, to produce a robust model capable of being applied in real-world drug research scenarios. The data set used, gathered from the literature, totals 1970 curated molecules, one of the largest for similar studies. Random Forests and Support Vector Machines were tested in various configurations against several chemical descriptor set combinations. Models were tested in a 5-fold cross-validation process, and the best one was tested on an independent validation set. The best fitted model produced an overall accuracy of 95%, with a mean square contingency coefficient (ϕ) of 0.74, and showed an overall capacity of 83% for predicting BBB positives and 96% for determining BBB negatives. This model was adapted into a Web-based tool made available to the whole community at http://b3pp.lasige.di.fc.ul.pt

FigShare