8,785 research outputs found

    In Silico and In Vitro Investigation into the Next Generation of New Psychoactive Substances

    Get PDF
    New Psychoactive Substances (NPS) were designed to be legal alternatives to existing established recreational drugs. They have fast become a very popular and up until 2016, NPS were legal, cheap and freely accessible via the internet and high street “head shops”. The rapid expansion in the number of these drugs has reached epidemic proportions, whereby hundreds of NPS have been developed and sold within the last five-year period. As NPS are synthesized in clandestine laboratories there is little to no control in the manufacture, dosage and packaging of these drugs. The public health risks posed by these drugs are therefore far-reaching. Fatalities and severe adverse reactions associated with these compounds have become an ongoing challenge to healthcare services, primarily because these drugs have not previously been abused and therefore there is little pharmacological information available regarding NPS. There are a number of different biological receptors that are implicated in the effects of NPS and the mechanism of action for the majority of these drugs is still largely unknown. It is of great importance to try and establish an understanding of how various classes of NPS interact on a molecular level. In this thesis, structure-based and ligand-based in Silico methodologies were employed to gain a better understanding of how NPS may interact with monoamine transporters (MAT). Key findings included both molecular docking studies and a number of robust and predictive QSAR models for the dopamine and serotonin transporters provided insight into how promiscuity of NPS between the different MAT isoforms could arise. In addition, pharmacophore models were generated to identify chemical entities that were structurally dissimilar to known existing NPS that had the potential to interact with the cannabinoid 1 receptor (CB1) and hence were hypothesised could elicit similar biological responses to known potent synthetic cannabinoids. Thirteen of these compounds were identified and carried forward for in vitro and ex vivo analyses, where preliminary results have shown that two compounds activate the CB1 receptor. Further optimisation of these compounds could yield a novel SC scaffold that was previously unseen. Additionally, the compounds identified and the methodology employed in the generation of these new chemical scaffolds could be used to guide Early Warning Systems (EWS) and facilitate law enforcement with respect to emergent NPS

    Evaluating Overfit and Underfit in Models of Network Community Structure

    Full text link
    A common data mining task on networks is community detection, which seeks an unsupervised decomposition of a network into structural groups based on statistical regularities in the network's connectivity. Although many methods exist, the No Free Lunch theorem for community detection implies that each makes some kind of tradeoff, and no algorithm can be optimal on all inputs. Thus, different algorithms will over or underfit on different inputs, finding more, fewer, or just different communities than is optimal, and evaluation methods that use a metadata partition as a ground truth will produce misleading conclusions about general accuracy. Here, we present a broad evaluation of over and underfitting in community detection, comparing the behavior of 16 state-of-the-art community detection algorithms on a novel and structurally diverse corpus of 406 real-world networks. We find that (i) algorithms vary widely both in the number of communities they find and in their corresponding composition, given the same input, (ii) algorithms can be clustered into distinct high-level groups based on similarities of their outputs on real-world networks, and (iii) these differences induce wide variation in accuracy on link prediction and link description tasks. We introduce a new diagnostic for evaluating overfitting and underfitting in practice, and use it to roughly divide community detection methods into general and specialized learning algorithms. Across methods and inputs, Bayesian techniques based on the stochastic block model and a minimum description length approach to regularization represent the best general learning approach, but can be outperformed under specific circumstances. These results introduce both a theoretically principled approach to evaluate over and underfitting in models of network community structure and a realistic benchmark by which new methods may be evaluated and compared.Comment: 22 pages, 13 figures, 3 table

    Consensus clustering and functional interpretation of gene-expression data

    Get PDF
    Microarray analysis using clustering algorithms can suffer from lack of inter-method consistency in assigning related gene-expression profiles to clusters. Obtaining a consensus set of clusters from a number of clustering methods should improve confidence in gene-expression analysis. Here we introduce consensus clustering, which provides such an advantage. When coupled with a statistically based gene functional analysis, our method allowed the identification of novel genes regulated by NFÎșB and the unfolded protein response in certain B-cell lymphomas

    Empirical Potential Function for Simplified Protein Models: Combining Contact and Local Sequence-Structure Descriptors

    Full text link
    An effective potential function is critical for protein structure prediction and folding simulation. Simplified protein models such as those requiring only CαC_\alpha or backbone atoms are attractive because they enable efficient search of the conformational space. We show residue specific reduced discrete state models can represent the backbone conformations of proteins with small RMSD values. However, no potential functions exist that are designed for such simplified protein models. In this study, we develop optimal potential functions by combining contact interaction descriptors and local sequence-structure descriptors. The form of the potential function is a weighted linear sum of all descriptors, and the optimal weight coefficients are obtained through optimization using both native and decoy structures. The performance of the potential function in test of discriminating native protein structures from decoys is evaluated using several benchmark decoy sets. Our potential function requiring only backbone atoms or CαC_\alpha atoms have comparable or better performance than several residue-based potential functions that require additional coordinates of side chain centers or coordinates of all side chain atoms. By reducing the residue alphabets down to size 5 for local structure-sequence relationship, the performance of the potential function can be further improved. Our results also suggest that local sequence-structure correlation may play important role in reducing the entropic cost of protein folding.Comment: 20 pages, 5 figures, 4 tables. In press, Protein

    Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening

    Full text link
    This work introduces a number of algebraic topology approaches, such as multicomponent persistent homology, multi-level persistent homology and electrostatic persistence for the representation, characterization, and description of small molecules and biomolecular complexes. Multicomponent persistent homology retains critical chemical and biological information during the topological simplification of biomolecular geometric complexity. Multi-level persistent homology enables a tailored topological description of inter- and/or intra-molecular interactions of interest. Electrostatic persistence incorporates partial charge information into topological invariants. These topological methods are paired with Wasserstein distance to characterize similarities between molecules and are further integrated with a variety of machine learning algorithms, including k-nearest neighbors, ensemble of trees, and deep convolutional neural networks, to manifest their descriptive and predictive powers for chemical and biological problems. Extensive numerical experiments involving more than 4,000 protein-ligand complexes from the PDBBind database and near 100,000 ligands and decoys in the DUD database are performed to test respectively the scoring power and the virtual screening power of the proposed topological approaches. It is demonstrated that the present approaches outperform the modern machine learning based methods in protein-ligand binding affinity predictions and ligand-decoy discrimination
    • 

    corecore