59 research outputs found

    Stacked Generalizations in Imbalanced Fraud Data Sets using Resampling Methods

    Full text link
    This study uses stacked generalization, which is a two-step process of combining machine learning methods, called meta or super learners, for improving the performance of algorithms in step one (by minimizing the error rate of each individual algorithm to reduce its bias in the learning set) and then in step two inputting the results into the meta learner with its stacked blended output (demonstrating improved performance with the weakest algorithms learning better). The method is essentially an enhanced cross-validation strategy. Although the process uses great computational resources, the resulting performance metrics on resampled fraud data show that increased system cost can be justified. A fundamental key to fraud data is that it is inherently not systematic and, as of yet, the optimal resampling methodology has not been identified. Building a test harness that accounts for all permutations of algorithm sample set pairs demonstrates that the complex, intrinsic data structures are all thoroughly tested. Using a comparative analysis on fraud data that applies stacked generalizations provides useful insight needed to find the optimal mathematical formula to be used for imbalanced fraud data sets.Comment: 19 pages, 3 figures, 8 table

    Developing Community Reinforcement and Family Training (CRAFT) for Parents of Treatment-Resistant Adolescents.

    Get PDF
    We describe a project focused on training parents to facilitate their treatment-resistant adolescent\u27s treatment entry and to manage their child after entry into community-based treatment. Controlled studies show that Community Reinforcement and Family Training (CRAFT) is a unilateral treatment that fosters treatment entry of adults; however, there are no controlled trials for parents with a substance-abusing child. We examined the behavioral parent training literature to guide us in tailoring CRAFT for parents of adolescents. We discuss adaptations to CRAFT, outcomes and experiences gained from a brief pilot of the revised CRAFT program, and the future directions of this work

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Get PDF
    Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts

    Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas

    Get PDF
    This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smokin

    Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas

    Get PDF
    Although theMYConcogene has been implicated incancer, a systematic assessment of alterations ofMYC, related transcription factors, and co-regulatoryproteins, forming the proximal MYC network (PMN),across human cancers is lacking. Using computa-tional approaches, we define genomic and proteo-mic features associated with MYC and the PMNacross the 33 cancers of The Cancer Genome Atlas.Pan-cancer, 28% of all samples had at least one ofthe MYC paralogs amplified. In contrast, the MYCantagonists MGA and MNT were the most frequentlymutated or deleted members, proposing a roleas tumor suppressors.MYCalterations were mutu-ally exclusive withPIK3CA,PTEN,APC,orBRAFalterations, suggesting that MYC is a distinct onco-genic driver. Expression analysis revealed MYC-associated pathways in tumor subtypes, such asimmune response and growth factor signaling; chro-matin, translation, and DNA replication/repair wereconserved pan-cancer. This analysis reveals insightsinto MYC biology and is a reference for biomarkersand therapeutics for cancers with alterations ofMYC or the PMN

    Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images

    Get PDF
    Beyond sample curation and basic pathologic characterization, the digitized H&E-stained images of TCGA samples remain underutilized. To highlight this resource, we present mappings of tumorinfiltrating lymphocytes (TILs) based on H&E images from 13 TCGA tumor types. These TIL maps are derived through computational staining using a convolutional neural network trained to classify patches of images. Affinity propagation revealed local spatial structure in TIL patterns and correlation with overall survival. TIL map structural patterns were grouped using standard histopathological parameters. These patterns are enriched in particular T cell subpopulations derived from molecular measures. TIL densities and spatial structure were differentially enriched among tumor types, immune subtypes, and tumor molecular subtypes, implying that spatial infiltrate state could reflect particular tumor cell aberration states. Obtaining spatial lymphocytic patterns linked to the rich genomic characterization of TCGA samples demonstrates one use for the TCGA image archives with insights into the tumor-immune microenvironment

    Candida albicans Isolates from the Gut of Critically Ill Patients Respond to Phosphate Limitation by Expressing Filaments and a Lethal Phenotype

    Get PDF
    Candida albicans is an opportunistic pathogen that proliferates in the intestinal tract of critically ill patients where it continues to be a major cause of infectious-related mortality. The precise cues that shift intestinal C. albicans from its ubiquitous indolent colonizing yeast form to an invasive and lethal filamentous form remain unknown. We have previously shown that severe phosphate depletion develops in the intestinal tract during extreme physiologic stress and plays a major role in shifting intestinal Pseudomonas aeruginosa to express a lethal phenotype via conserved phosphosensory-phosphoregulatory systems. Here we studied whether phosphate dependent virulence expression could be similarly demonstrated for C. albicans. C. albicans isolates from the stool of critically ill patients and laboratory prototype strains (SC5314, BWP17, SN152) were evaluated for morphotype transformation and lethality against C. elegans and mice during exposure to phosphate limitation. Isolates ICU1 and ICU12 were able to filament and kill C. elegans in a phosphate dependent manner. In a mouse model of intestinal phosphate depletion (30% hepatectomy), direct intestinal inoculation of C. albicans caused mortality that was prevented by oral phosphate supplementation. Prototype strains displayed limited responses to phosphate limitation; however, the pho4Δ mutant displayed extensive filamentation during low phosphate conditions compared to its isogenic parent strain SN152, suggesting that mutation in the transcriptional factor Pho4p may sensitize C. albicans to phosphate limitation. Extensive filamentation was also observed in strain ICU12 suggesting that this strain is also sensitized to phosphate limitation. Analysis of the sequence of PHO4 in strain ICU12, its transcriptional response to phosphate limitation, and phosphatase assays confirmed that ICU12 demonstrates a profound response to phosphate limitation. The emergence of strains of C. albicans with marked responsiveness to phosphate limitation may represent a fitness adaptation to the complex and nutrient scarce environment typical of the gut of a critically ill patient

    Integrated Genomic Analysis of the Ubiquitin Pathway across Cancer Types

    Get PDF
    Protein ubiquitination is a dynamic and reversibleprocess of adding single ubiquitin molecules orvarious ubiquitin chains to target proteins. Here,using multidimensional omic data of 9,125 tumorsamples across 33 cancer types from The CancerGenome Atlas, we perform comprehensive molecu-lar characterization of 929 ubiquitin-related genesand 95 deubiquitinase genes. Among them, we sys-tematically identify top somatic driver candidates,including mutatedFBXW7with cancer-type-specificpatterns and amplifiedMDM2showing a mutuallyexclusive pattern withBRAFmutations. Ubiquitinpathway genes tend to be upregulated in cancermediated by diverse mechanisms. By integratingpan-cancer multiomic data, we identify a group oftumor samples that exhibit worse prognosis. Thesesamples are consistently associated with the upre-gulation of cell-cycle and DNA repair pathways, char-acterized by mutatedTP53,MYC/TERTamplifica-tion, andAPC/PTENdeletion. Our analysishighlights the importance of the ubiquitin pathwayin cancer development and lays a foundation fordeveloping relevant therapeutic strategies

    The Cancer Genome Atlas Comprehensive Molecular Characterization of Renal Cell Carcinoma

    Get PDF

    The genetic architecture of the human cerebral cortex

    Get PDF
    The cerebral cortex underlies our complex cognitive capabilities, yet little is known about the specific genetic loci that influence human cortical structure. To identify genetic variants that affect cortical structure, we conducted a genome-wide association meta-analysis of brain magnetic resonance imaging data from 51,665 individuals. We analyzed the surface area and average thickness of the whole cortex and 34 regions with known functional specializations. We identified 199 significant loci and found significant enrichment for loci influencing total surface area within regulatory elements that are active during prenatal cortical development, supporting the radial unit hypothesis. Loci that affect regional surface area cluster near genes in Wnt signaling pathways, which influence progenitor expansion and areal identity. Variation in cortical structure is genetically correlated with cognitive function, Parkinson's disease, insomnia, depression, neuroticism, and attention deficit hyperactivity disorder
    • …
    corecore