15 research outputs found

    AI is a viable alternative to high throughput screening: a 318-target study

    Get PDF
    : High throughput screening (HTS) is routinely used to identify bioactive small molecules. This requires physical compounds, which limits coverage of accessible chemical space. Computational approaches combined with vast on-demand chemical libraries can access far greater chemical space, provided that the predictive accuracy is sufficient to identify useful molecules. Through the largest and most diverse virtual HTS campaign reported to date, comprising 318 individual projects, we demonstrate that our AtomNet® convolutional neural network successfully finds novel hits across every major therapeutic area and protein class. We address historical limitations of computational screening by demonstrating success for target proteins without known binders, high-quality X-ray crystal structures, or manual cherry-picking of compounds. We show that the molecules selected by the AtomNet® model are novel drug-like scaffolds rather than minor modifications to known bioactive compounds. Our empirical results suggest that computational methods can substantially replace HTS as the first step of small-molecule drug discovery

    Finishing the euchromatic sequence of the human genome

    Get PDF
    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead

    Dissection of PIK3CA Aberration for Cervical Adenocarcinoma Outcomes

    No full text
    Personalized treatment of genetically stratified subgroups has the potential to improve outcomes in many malignant tumors. This study distills clinically meaningful prognostic/predictive genomic marker for cervical adenocarcinoma using signature genomic aberrations and single-point nonsynonymous mutation-specific droplet digital PCR (ddPCR). Mutations in PIK3CA E542K, E545K, or H1047R were detected in 41.7% of tumors. PIK3CA mutation detected in the patient’s circulating DNA collected before treatment or during follow-up was significantly associated with decreased progression-free survival or overall survival. PIK3CA mutation in the circulating DNA during follow-up after treatment predicted recurrence with 100% sensitivity and 64.29% specificity. It is the first indication of the predictive power of PIK3CA mutations in cervical adenocarcinoma. The work contributes to the development of liquid biopsies for follow up surveillance and a possibility of tailoring management of this particular women’s cancer
    corecore