72 research outputs found

    motifDiverge: a model for assessing the statistical significance of gene regulatory motif divergence between two DNA sequences

    Next-generation sequencing technology enables the identification of thousands of gene regulatory sequences in many cell types and organisms. We consider the problem of testing if two such sequences differ in their number of binding site motifs for a given transcription factor (TF) protein. Binding site motifs impart regulatory function by providing TFs the opportunity to bind to genomic elements and thereby affect the expression of nearby genes. Evolutionary changes to such functional DNA are hypothesized to be major contributors to phenotypic diversity within and between species; but despite the importance of TF motifs for gene expression, no method exists to test for motif loss or gain. Assuming that motif counts are Binomially distributed, and allowing for dependencies between motif instances in evolutionarily related sequences, we derive the probability mass function of the difference in motif counts between two nucleotide sequences. We provide a method to numerically estimate this distribution from genomic data and show through simulations that our estimator is accurate. Finally, we introduce the R package motifDiverge that implements our methodology and illustrate its application to gene regulatory enhancers identified by a mouse developmental time course experiment. While this study was motivated by analysis of regulatory motifs, our results can be applied to any problem involving two correlated Bernoulli trials
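
    The idea of a probability mass function for a difference in counts can be illustrated with a minimal Python sketch. It is not the paper's derivation and not the motifDiverge API. It assumes a simplified setting of n independent, identically distributed pairs of correlated Bernoulli trials (X_i, Y_i), where each per-pair difference X_i - Y_i is +1, -1, or 0. The names `diff_pmf`, `two_sided_pvalue`, `p10`, and `p01` are hypothetical, and the two-sided tail used for the p-value is one possible convention.

```python
import numpy as np


def diff_pmf(n, p10, p01):
    """PMF of D = sum_i (X_i - Y_i) over n i.i.d. pairs of Bernoulli trials.

    Assumption (illustrative only): pairs are independent of each other,
    while X_i and Y_i within a pair may be correlated.
    p10 = P(X_i = 1, Y_i = 0), p01 = P(X_i = 0, Y_i = 1).
    Returns (support, pmf) where support runs from -n to n.
    """
    if p10 < 0 or p01 < 0 or p10 + p01 > 1:
        raise ValueError("p10 and p01 must be non-negative and sum to at most 1")
    # Distribution of one per-pair difference over the values (-1, 0, +1).
    step = np.array([p01, 1.0 - p10 - p01, p10])
    pmf = np.array([1.0])  # D = 0 with certainty when there are no pairs
    for _ in range(n):
        # Adding one more independent pair convolves the distributions.
        pmf = np.convolve(pmf, step)
    support = np.arange(-n, n + 1)
    return support, pmf


def two_sided_pvalue(observed, n, p10, p01):
    """P(|D| >= |observed|) under the simplified model above."""
    support, pmf = diff_pmf(n, p10, p01)
    return float(pmf[np.abs(support) >= abs(observed)].sum())
```

    For example, `two_sided_pvalue(3, 20, 0.05, 0.05)` gives the probability, under this toy model, of seeing a count difference of at least 3 in either direction across 20 paired positions.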

    Economic outcomes of percutaneous coronary intervention with drug-eluting stents versus bypass surgery for patients with left main or three-vessel coronary artery disease: One-year results from the SYNTAX trial

    Objectives: To evaluate the cost-effectiveness of alternative approaches to revascularization for patients with three-vessel or left main coronary artery disease (CAD). Background: Previous studies have demonstrated that, despite higher initial costs, long-term costs with bypass surgery (CABG) in multivessel CAD are similar to those for percutaneous coronary intervention (PCI). The impact of drug-eluting stents (DES) on these results is unknown. Methods: The SYNTAX trial randomized 1,800 patients with left main or three-vessel CAD to either CABG (n = 897) or PCI using paclitaxel-eluting stents (n = 903). Resource utilization data were collected prospectively for all patients, and cumulative 1-year costs were assessed from the perspective of the U.S. healthcare system. Results: Total costs for the initial hospitalization were $5,693/patient higher with CABG, whereas follow-up costs were $2,282/patient higher with PCI due mainly to more frequent revascularization procedures and higher outpatient medication costs. Total 1-year costs were thus $3,590/patient higher with CABG, while quality-adjusted life expectancy was slightly higher with PCI. Although PCI was an economically dominant strategy for the overall population, cost-effectiveness varied considerably according to angiographic complexity. For patients with high angiographic complexity (SYNTAX score > 32), total 1-year costs were similar for CABG and PCI, and the incremental cost-effectiveness ratio for CABG was $43,486 per quality-adjusted life-year gained. Conclusions: Among patients with three-vessel or left main CAD, PCI is an economically attractive strategy over the first year for patients with low and moderate angiographic complexity, while CABG is favored among patients with high angiographic complexity

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Long noncoding RNAs (lncRNAs) are commonly dysregulated in tumors, but only a handful are known to play pathophysiological roles in cancer. We inferred lncRNAs that dysregulate cancer pathways, oncogenes, and tumor suppressors (cancer genes) by modeling their effects on the activity of transcription factors, RNA-binding proteins, and microRNAs in 5,185 TCGA tumors and 1,019 ENCODE assays. Our predictions included hundreds of candidate onco- and tumor-suppressor lncRNAs (cancer lncRNAs) whose somatic alterations account for the dysregulation of dozens of cancer genes and pathways in each of 14 tumor contexts. To demonstrate proof of concept, we showed that perturbations targeting OIP5-AS1 (an inferred tumor suppressor) and TUG1 and WT1-AS (inferred onco-lncRNAs) dysregulated cancer genes and altered proliferation of breast and gynecologic cancer cells. Our analysis indicates that, although most lncRNAs are dysregulated in a tumor-specific manner, some, including OIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergistically dysregulate cancer pathways in multiple tumor contexts

    Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas

    Although the MYC oncogene has been implicated in cancer, a systematic assessment of alterations of MYC, related transcription factors, and co-regulatory proteins, forming the proximal MYC network (PMN), across human cancers is lacking. Using computational approaches, we define genomic and proteomic features associated with MYC and the PMN across the 33 cancers of The Cancer Genome Atlas. Pan-cancer, 28% of all samples had at least one of the MYC paralogs amplified. In contrast, the MYC antagonists MGA and MNT were the most frequently mutated or deleted members, proposing a role as tumor suppressors. MYC alterations were mutually exclusive with PIK3CA, PTEN, APC, or BRAF alterations, suggesting that MYC is a distinct oncogenic driver. Expression analysis revealed MYC-associated pathways in tumor subtypes, such as immune response and growth factor signaling; chromatin, translation, and DNA replication/repair were conserved pan-cancer. This analysis reveals insights into MYC biology and is a reference for biomarkers and therapeutics for cancers with alterations of MYC or the PMN

    Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas

    This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smokin

    Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images

    Beyond sample curation and basic pathologic characterization, the digitized H&E-stained images of TCGA samples remain underutilized. To highlight this resource, we present mappings of tumor-infiltrating lymphocytes (TILs) based on H&E images from 13 TCGA tumor types. These TIL maps are derived through computational staining using a convolutional neural network trained to classify patches of images. Affinity propagation revealed local spatial structure in TIL patterns and correlation with overall survival. TIL map structural patterns were grouped using standard histopathological parameters. These patterns are enriched in particular T cell subpopulations derived from molecular measures. TIL densities and spatial structure were differentially enriched among tumor types, immune subtypes, and tumor molecular subtypes, implying that spatial infiltrate state could reflect particular tumor cell aberration states. Obtaining spatial lymphocytic patterns linked to the rich genomic characterization of TCGA samples demonstrates one use for the TCGA image archives with insights into the tumor-immune microenvironment

    A Multicomponent Animal Virus Isolated from Mosquitoes

    RNA viruses exhibit a variety of genome organization strategies, including multicomponent genomes in which each segment is packaged separately. Although multicomponent genomes are common among viruses infecting plants and fungi, their prevalence among those infecting animals remains unclear. We characterize a multicomponent RNA virus isolated from mosquitoes, designated Guaico Culex virus (GCXV). GCXV belongs to a diverse clade of segmented viruses (Jingmenvirus) related to the prototypically unsegmented Flaviviridae. The GCXV genome comprises five segments, each of which appears to be separately packaged. The smallest segment is not required for replication, and its presence is variable in natural infections. We also describe a variant of Jingmen tick virus, another Jingmenvirus, sequenced from a Ugandan red colobus monkey, thus expanding the host range of this segmented and likely multicomponent virus group. Collectively, this study provides evidence for the existence of multicomponent animal viruses and their potential relevance for animal and human health

    Skill formation and precarious labor: the historical role of the industrial training institutes in India 1950-2018

    This paper explores the historical and ideological contestations over the meaning, nature and scope of industrial skill training in state-sponsored Industrial Training Institutes (ITIs) in their attempts to create a disciplined and committed labour force in India. Through a combination of conceptual insights drawn from Indian labour historiography and ethnographic participant research, the paper addresses the challenges faced by ITIs in maintaining a unified, centralized vision for industrial skill-training of workers under conditions of vastly uneven geographical development of the industrial sector and progressively intense interregional capital mobility in contemporary India

    Integrated genomic characterization of pancreatic ductal adenocarcinoma

    We performed integrated genomic, transcriptomic, and proteomic profiling of 150 pancreatic ductal adenocarcinoma (PDAC) specimens, including samples with characteristic low neoplastic cellularity. Deep whole-exome sequencing revealed recurrent somatic mutations in KRAS, TP53, CDKN2A, SMAD4, RNF43, ARID1A, TGFβR2, GNAS, RREB1, and PBRM1. KRAS wild-type tumors harbored alterations in other oncogenic drivers, including GNAS, BRAF, CTNNB1, and additional RAS pathway genes. A subset of tumors harbored multiple KRAS mutations, with some showing evidence of biallelic mutations. Protein profiling identified a favorable prognosis subset with low epithelial-mesenchymal transition and high MTOR pathway scores. Associations of non-coding RNAs with tumor-specific mRNA subtypes were also identified. Our integrated multi-platform analysis reveals a complex molecular landscape of PDAC and provides a roadmap for precision medicine

    Integrated Genomic Analysis of the Ubiquitin Pathway across Cancer Types

    Protein ubiquitination is a dynamic and reversible process of adding single ubiquitin molecules or various ubiquitin chains to target proteins. Here, using multidimensional omic data of 9,125 tumor samples across 33 cancer types from The Cancer Genome Atlas, we perform comprehensive molecular characterization of 929 ubiquitin-related genes and 95 deubiquitinase genes. Among them, we systematically identify top somatic driver candidates, including mutated FBXW7 with cancer-type-specific patterns and amplified MDM2 showing a mutually exclusive pattern with BRAF mutations. Ubiquitin pathway genes tend to be upregulated in cancer mediated by diverse mechanisms. By integrating pan-cancer multiomic data, we identify a group of tumor samples that exhibit worse prognosis. These samples are consistently associated with the upregulation of cell-cycle and DNA repair pathways, characterized by mutated TP53, MYC/TERT amplification, and APC/PTEN deletion. Our analysis highlights the importance of the ubiquitin pathway in cancer development and lays a foundation for developing relevant therapeutic strategies