7,623 research outputs found

    Differential gene expression graphs: A data structure for classification in DNA microarrays

    Get PDF
    This paper proposes an innovative data structure to be used as a backbone in designing microarray phenotype sample classifiers. The data structure is based on graphs and it is built from a differential analysis of the expression levels of healthy and diseased tissue samples in a microarray dataset. The proposed data structure is built in such a way that, by construction, it shows a number of properties that are perfectly suited to address several problems like feature extraction, clustering, and classificatio

    Machine learning and structural analysis of Mycobacterium tuberculosis pan-genome identifies genetic signatures of antibiotic resistance.

    Get PDF
    Mycobacterium tuberculosis is a serious human pathogen threat exhibiting complex evolution of antimicrobial resistance (AMR). Accordingly, the many publicly available datasets describing its AMR characteristics demand disparate data-type analyses. Here, we develop a reference strain-agnostic computational platform that uses machine learning approaches, complemented by both genetic interaction analysis and 3D structural mutation-mapping, to identify signatures of AMR evolution to 13 antibiotics. This platform is applied to 1595 sequenced strains to yield four key results. First, a pan-genome analysis shows that M. tuberculosis is highly conserved with sequenced variation concentrated in PE/PPE/PGRS genes. Second, the platform corroborates 33 genes known to confer resistance and identifies 24 new genetic signatures of AMR. Third, 97 epistatic interactions across 10 resistance classes are revealed. Fourth, detailed structural analysis of these genes yields mechanistic bases for their selection. The platform can be used to study other human pathogens

    Tubular cell and keratinocyte single-cell transcriptomics applied to lupus nephritis reveal type I IFN and fibrosis relevant pathways.

    Get PDF
    The molecular and cellular processes that lead to renal damage and to the heterogeneity of lupus nephritis (LN) are not well understood. We applied single-cell RNA sequencing (scRNA-seq) to renal biopsies from patients with LN and evaluated skin biopsies as a potential source of diagnostic and prognostic markers of renal disease. Type I interferon (IFN)-response signatures in tubular cells and keratinocytes distinguished patients with LN from healthy control subjects. Moreover, a high IFN-response signature and fibrotic signature in tubular cells were each associated with failure to respond to treatment. Analysis of tubular cells from patients with proliferative, membranous and mixed LN indicated pathways relevant to inflammation and fibrosis, which offer insight into their histologic differences. In summary, we applied scRNA-seq to LN to deconstruct its heterogeneity and identify novel targets for personalized approaches to therapy

    Transcript-indexed ATAC-seq for precision immune profiling.

    Get PDF
    T cells create vast amounts of diversity in the genes that encode their T cell receptors (TCRs), which enables individual clones to recognize specific peptide-major histocompatibility complex (MHC) ligands. Here we combined sequencing of the TCR-encoding genes with assay for transposase-accessible chromatin with sequencing (ATAC-seq) analysis at the single-cell level to provide information on the TCR specificity and epigenomic state of individual T cells. By using this approach, termed transcript-indexed ATAC-seq (T-ATAC-seq), we identified epigenomic signatures in immortalized leukemic T cells, primary human T cells from healthy volunteers and primary leukemic T cells from patient samples. In peripheral blood CD4+ T cells from healthy individuals, we identified cis and trans regulators of naive and memory T cell states and found substantial heterogeneity in surface-marker-defined T cell populations. In patients with a leukemic form of cutaneous T cell lymphoma, T-ATAC-seq enabled identification of leukemic and nonleukemic regulatory pathways in T cells from the same individual by allowing separation of the signals that arose from the malignant clone from the background T cell noise. Thus, T-ATAC-seq is a new tool that enables analysis of epigenomic landscapes in clonal T cells and should be valuable for studies of T cell malignancy, immunity and immunotherapy

    I_MDS: an inflammatory bowel disease molecular activity score to classify patients with differing disease-driving pathways and therapeutic response to anti-TNF treatment

    Get PDF
    Crohn's disease and ulcerative colitis are driven by both common and distinct underlying mechanisms of pathobiology. Both diseases, exhibit heterogeneity underscored by the variable clinical responses to therapeutic interventions. We aimed to identify disease-driving pathways and classify individuals into subpopulations that differ in their pathobiology and response to treatment. We applied hierarchical clustering of enrichment scores derived from gene set variation analysis of signatures representative of various immunological processes and activated cell types, to a colonic biopsy dataset that included healthy volunteers, Crohn's disease and ulcerative colitis patients. Patient stratification at baseline or after anti-TNF treatment in clinical responders and non-responders was queried. Signatures with significantly different enrichment scores were identified using a general linear model. Comparisons to healthy controls were made at baseline in all participants and then separately in responders and non-responders. Fifty-nine percent of the signatures were commonly enriched in both conditions at baseline, supporting the notion of a disease continuum within ulcerative colitis and Crohn's disease. Signatures included T cells, macrophages, neutrophil activation and poly:IC signatures, representing acute inflammation and a complex mix of potential disease-driving biology. Collectively, identification of significantly enriched signatures allowed establishment of an inflammatory bowel disease molecular activity score which uses biopsy transcriptomics as a surrogate marker to accurately track disease severity. This score separated diseased from healthy samples, enabled discrimination of clinical responders and non-responders at baseline with 100% specificity and 78.8% sensitivity, and was validated in an independent data set that showed comparable classification. Comparing responders and non-responders separately at baseline to controls, 43% and 70% of signatures were enriched, respectively, suggesting greater molecular dysregulation in TNF non-responders at baseline. This methodological approach could facilitate better targeted design of clinical studies to test therapeutics, concentrating on patient subsets sharing similar underlying pathobiology, therefore increasing the likelihood of clinical response
    • ā€¦
    corecore