358 research outputs found
Entangled random pure states with orthogonal symmetry: exact results
We compute analytically the density of Schmidt
eigenvalues, distributed according to a fixed-trace Wishart-Laguerre measure,
and the average R\'enyi entropy for reduced
density matrices of entangled random pure states with orthogonal symmetry
. The results are valid for arbitrary dimensions of the
corresponding Hilbert space partitions, and are in excellent agreement with
numerical simulations.Comment: 15 pages, 5 figure
Large Deviations of the Maximum Eigenvalue in Wishart Random Matrices
We compute analytically the probability of large fluctuations to the left of
the mean of the largest eigenvalue in the Wishart (Laguerre) ensemble of
positive definite random matrices. We show that the probability that all the
eigenvalues of a (N x N) Wishart matrix W=X^T X (where X is a rectangular M x N
matrix with independent Gaussian entries) are smaller than the mean value
=N/c decreases for large N as , where \beta=1,2 correspond respectively to
real and complex Wishart matrices, c=N/M < 1 and \Phi_{-}(x;c) is a large
deviation function that we compute explicitly. The result for the Anti-Wishart
case (M < N) simply follows by exchanging M and N. We also analytically
determine the average spectral density of an ensemble of constrained Wishart
matrices whose eigenvalues are forced to be smaller than a fixed barrier. The
numerical simulations are in excellent agreement with the analytical
predictions.Comment: Published version. References and appendix adde
In silico assessment of potential druggable pockets on the surface of α1-Antitrypsin conformers
The search for druggable pockets on the surface of a protein is often performed on a single conformer, treated as a rigid body. Transient druggable pockets may be missed in this approach. Here, we describe a methodology for systematic in silico analysis of surface clefts across multiple conformers of the metastable protein α1-antitrypsin (A1AT). Pathological mutations disturb the conformational landscape of A1AT, triggering polymerisation that leads to emphysema and hepatic cirrhosis. Computational screens for small molecule inhibitors of polymerisation have generally focused on one major druggable site visible in all crystal structures of native A1AT. In an alternative approach, we scan all surface clefts observed in crystal structures of A1AT and in 100 computationally produced conformers, mimicking the native solution ensemble. We assess the persistence, variability and druggability of these pockets. Finally, we employ molecular docking using publicly available libraries of small molecules to explore scaffold preferences for each site. Our approach identifies a number of novel target sites for drug design. In particular one transient site shows favourable characteristics for druggability due to high enclosure and hydrophobicity. Hits against this and other druggable sites achieve docking scores corresponding to a Kd in the µM–nM range, comparing favourably with a recently identified promising lead. Preliminary ThermoFluor studies support the docking predictions. In conclusion, our strategy shows considerable promise compared with the conventional single pocket/single conformer approach to in silico screening. Our best-scoring ligands warrant further experimental investigation
ProteoLens: a visual analytic tool for multi-scale database-driven biological network data mining
Background
New systems biology studies require researchers to understand how interplay among myriads of biomolecular entities is orchestrated in order to achieve high-level cellular and physiological functions. Many software tools have been developed in the past decade to help researchers visually navigate large networks of biomolecular interactions with built-in template-based query capabilities. To further advance researchers' ability to interrogate global physiological states of cells through multi-scale visual network explorations, new visualization software tools still need to be developed to empower the analysis. A robust visual data analysis platform driven by database management systems to perform bi-directional data processing-to-visualizations with declarative querying capabilities is needed.
Results
We developed ProteoLens as a JAVA-based visual analytic software tool for creating, annotating and exploring multi-scale biological networks. It supports direct database connectivity to either Oracle or PostgreSQL database tables/views, on which SQL statements using both Data Definition Languages (DDL) and Data Manipulation languages (DML) may be specified. The robust query languages embedded directly within the visualization software help users to bring their network data into a visualization context for annotation and exploration. ProteoLens supports graph/network represented data in standard Graph Modeling Language (GML) formats, and this enables interoperation with a wide range of other visual layout tools. The architectural design of ProteoLens enables the de-coupling of complex network data visualization tasks into two distinct phases: 1) creating network data association rules, which are mapping rules between network node IDs or edge IDs and data attributes such as functional annotations, expression levels, scores, synonyms, descriptions etc; 2) applying network data association rules to build the network and perform the visual annotation of graph nodes and edges according to associated data values. We demonstrated the advantages of these new capabilities through three biological network visualization case studies: human disease association network, drug-target interaction network and protein-peptide mapping network.
Conclusion
The architectural design of ProteoLens makes it suitable for bioinformatics expert data analysts who are experienced with relational database management to perform large-scale integrated network visual explorations. ProteoLens is a promising visual analytic platform that will facilitate knowledge discoveries in future network and systems biology studies
Artificial intelligence in biological activity prediction
Artificial intelligence has become an indispensable resource in chemoinformatics. Numerous machine learning algorithms for activity prediction recently emerged, becoming an indispensable approach to mine chemical information from large compound datasets. These approaches enable the automation of compound discovery to find biologically active molecules with important properties. Here, we present a review of some of the main machine learning studies in biological activity prediction of compounds, in particular for sweetness prediction. We discuss some of the most used compound featurization techniques and the major databases of chemical compounds relevant to these tasks.This study was supported by the European Commission through project SHIKIFACTORY100 - Modular cell factories for the production of 100 compounds from the shikimate pathway (Reference 814408), and by the Portuguese FCT under the scope of the strategic funding of UID/BIO/04469/2019 unit and BioTecNorte operation (NORTE-01-0145-FEDER-000004) funded by the European Regional Development Fund under the scope of Norte2020.info:eu-repo/semantics/publishedVersio
Recommended from our members
Early intervention with Bifidobacterium lactis NCC2818 modulates the host-microbe interface independent of the sustained changes induced by the neonatal environment
Inflammatory and metabolic diseases can originate during early-life and have been correlated with shifts in intestinal microbial ecology. Here we demonstrate that minor environmental fluctuations during the early neonatal period had sustained effects on the developing porcine microbiota and host-microbe interface. These inter-replicate effects appear to originate during the first day of life, and are likely to reflect very early microbiota acquisition from the environment. We statistically link early systemic inflammation with later local increases in inflammatory cytokine (IL-17) production, which could have important enteric health implications. Immunity, intestinal barrier function, host metabolism and host-microbiota co-metabolism were further modified by Bifidobacterium lactis NCC2818 supplementation, although composition of the in situ microbiota remained unchanged. Finally, our robust model identified novel, strong correlations between urinary metabolites (eg malonate, phenylacetylglycine, alanine) and mucosal immunoglobulin (IgM) and cytokine (IL-10, IL-4) production, thus providing the possibility of the development of urinary ‘dipstick’ tests to assess non-accessible mucosal immune development and identify early precursors (biomarkers) of disease. These results have important implications for infants exposed to neonatal factors including caesarean delivery, antibiotic therapy and delayed discharge from hospital environments, which may predispose to the development of inflammatory and metabolic diseases in later life
A linguistic rule-based approach to extract drug-drug interactions from pharmacological documents
<p>Abstract</p> <p>Background</p> <p>A drug-drug interaction (DDI) occurs when one drug influences the level or activity of another drug. The increasing volume of the scientific literature overwhelms health care professionals trying to be kept up-to-date with all published studies on DDI.</p> <p>Methods</p> <p>This paper describes a hybrid linguistic approach to DDI extraction that combines shallow parsing and syntactic simplification with pattern matching. Appositions and coordinate structures are interpreted based on shallow syntactic parsing provided by the UMLS MetaMap tool (MMTx). Subsequently, complex and compound sentences are broken down into clauses from which simple sentences are generated by a set of simplification rules. A pharmacist defined a set of domain-specific lexical patterns to capture the most common expressions of DDI in texts. These lexical patterns are matched with the generated sentences in order to extract DDIs.</p> <p>Results</p> <p>We have performed different experiments to analyze the performance of the different processes. The lexical patterns achieve a reasonable precision (67.30%), but very low recall (14.07%). The inclusion of appositions and coordinate structures helps to improve the recall (25.70%), however, precision is lower (48.69%). The detection of clauses does not improve the performance.</p> <p>Conclusions</p> <p>Information Extraction (IE) techniques can provide an interesting way of reducing the time spent by health care professionals on reviewing the literature. Nevertheless, no approach has been carried out to extract DDI from texts. To the best of our knowledge, this work proposes the first integral solution for the automatic extraction of DDI from biomedical texts.</p
Identifying Compound-Target Associations by Combining Bioactivity Profile Similarity Search and Public Databases Mining
Molecular target identification is of central importance to drug discovery. Here, we developed a computational approach, named bioactivity profile similarity search (BASS), for associating targets to small molecules by using the known target annotations of related compounds from public databases. To evaluate BASS, a bioactivity profile database was constructed using 4296 compounds that were commonly tested in the US National Cancer Institute 60 human tumor cell line anticancer drug screen (NCI-60). Each compound was used as a query to search against the entire bioactivity profile database, and reference compounds with similar bioactivity profiles above a threshold of 0.75 were considered as neighbor compounds of the query. Potential targets were subsequently linked to the identified neighbor compounds by using the known targets o
HelmCoP: An Online Resource for Helminth Functional Genomics and Drug and Vaccine Targets Prioritization
A vast majority of the burden from neglected tropical diseases result from helminth infections (nematodes and platyhelminthes). Parasitic helminthes infect over 2 billion, exerting a high collective burden that rivals high-mortality conditions such as AIDS or malaria, and cause devastation to crops and livestock. The challenges to improve control of parasitic helminth infections are multi-fold and no single category of approaches will meet them all. New information such as helminth genomics, functional genomics and proteomics coupled with innovative bioinformatic approaches provide fundamental molecular information about these parasites, accelerating both basic research as well as development of effective diagnostics, vaccines and new drugs. To facilitate such studies we have developed an online resource, HelmCoP (Helminth Control and Prevention), built by integrating functional, structural and comparative genomic data from plant, animal and human helminthes, to enable researchers to develop strategies for drug, vaccine and pesticide prioritization, while also providing a useful comparative genomics platform. HelmCoP encompasses genomic data from several hosts, including model organisms, along with a comprehensive suite of structural and functional annotations, to assist in comparative analyses and to study host-parasite interactions. The HelmCoP interface, with a sophisticated query engine as a backbone, allows users to search for multi-factorial combinations of properties and serves readily accessible information that will assist in the identification of various genes of interest. HelmCoP is publicly available at: http://www.nematode.net/helmcop.html
- …