275 research outputs found

    A methodology pruning the search space of six compiler transformations by addressing them together as one problem and by exploiting the hardware architecture details

    Get PDF
    Today’s compilers have a plethora of optimizations-transformations to choose from, and the correct choice, order as well parameters of transformations have a significant/large impact on performance; choosing the correct order and parameters of optimizations has been a long standing problem in compilation research, which until now remains unsolved; the separate sub-problems optimization gives a different schedule/binary for each sub-problem and these schedules cannot coexist, as by refining one degrades the other. Researchers try to solve this problem by using iterative compilation techniques but the search space is so big that it cannot be searched even by using modern supercomputers. Moreover, compiler transformations do not take into account the hardware architecture details and data reuse in an efficient way. In this paper, a new iterative compilation methodology is presented which reduces the search space of six compiler transformations by addressing the above problems; the search space is reduced by many orders of magnitude and thus an efficient solution is now capable to be found. The transformations are the following: loop tiling (including the number of the levels of tiling), loop unroll, register allocation, scalar replacement, loop interchange and data array layouts. The search space is reduced (a) by addressing the aforementioned transformations together as one problem and not separately, (b) by taking into account the custom hardware architecture details (e.g., cache size and associativity) and algorithm characteristics (e.g., data reuse). The proposed methodology has been evaluated over iterative compilation and gcc/icc compilers, on both embedded and general purpose processors; it achieves significant performance gains at many orders of magnitude lower compilation time

    Second tier testing to reduce the number of non-actionable secondary findings and false-positive referrals in newborn screening for severe combined immunodeficiency

    Get PDF
    Purpose Newborn screening (NBS) for severe combined immunodeficiency (SCID) is based on the detection of T-cell receptor excision circles (TRECs). TRECs are a sensitive biomarker for T-cell lymphopenia, but not specific for SCID. This creates a palette of secondary findings associated with low T-cells that require follow-up and treatment or are non-actionable. The high rate of (non-actionable) secondary findings and false-positive referrals raises questions about the harm-benefit-ratio of SCID screening, as referrals are associated with high emotional impact and anxiety for parents. Methods An alternative quantitative TREC PCR with different primers was performed on NBS cards of referred newborns (N = 56) and epigenetic immune cell counting was used as for relative quantification of CD3 + T-cells (N = 59). Retrospective data was used to determine the reduction in referrals with a lower TREC cutoff value or an adjusted screening algorithm. Results When analyzed with a second PCR with different primers, 45% of the referrals (25/56) had TREC levels above cutoff, including four false-positive cases in which two SNPs were identified. With epigenetic qPCR, 41% (24/59) of the referrals were within the range of the relative CD3 + T-cell counts of the healthy controls. Lowering the TREC cutoff value or adjusting the screening algorithm led to lower referral rates but did not prevent all false-positive referrals. Conclusions Second tier tests and adjustments of cutoff values or screening algorithms all have the potential to reduce the number of non-actionable secondary findings in NBS for SCID, although second tier tests are more effective in preventing false-positive referrals.Transplantation and immunomodulatio

    Frequent deletion of the CDKN2A locus in chordoma: analysis of chromosomal imbalances using array comparative genomic hybridisation

    Get PDF
    The initiating somatic genetic events in chordoma development have not yet been identified. Most cytogenetically investigated chordomas have displayed near-diploid or moderately hypodiploid karyotypes, with several numerical and structural rearrangements. However, no consistent structural chromosome aberration has been reported. This is the first array-based study characterising DNA copy number changes in chordoma. Array comparative genomic hybridisation (aCGH) identified copy number alterations in all samples and imbalances affecting 5 or more out of the 21 investigated tumours were seen on all chromosomes. In general, deletions were more common than gains and no high-level amplification was found, supporting previous findings of primarily losses of large chromosomal regions as an important mechanism in chordoma development. Although small imbalances were commonly found, the vast majority of these were detected in single cases; no small deletion affecting all tumours could be discerned. However, the CDKN2A and CDKN2B loci in 9p21 were homo- or heterozygously lost in 70% of the tumours, a finding corroborated by fluorescence in situ hybridisation, suggesting that inactivation of these genes constitute an important step in chordoma development

    Characteristics of predictor sets found using differential prioritization

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Feature selection plays an undeniably important role in classification problems involving high dimensional datasets such as microarray datasets. For filter-based feature selection, two well-known criteria used in forming predictor sets are relevance and redundancy. However, there is a third criterion which is at least as important as the other two in affecting the efficacy of the resulting predictor sets. This criterion is the degree of differential prioritization (DDP), which varies the emphases on relevance and redundancy depending on the value of the DDP. Previous empirical works on publicly available microarray datasets have confirmed the effectiveness of the DDP in molecular classification. We now propose to establish the fundamental strengths and merits of the DDP-based feature selection technique. This is to be done through a simulation study which involves vigorous analyses of the characteristics of predictor sets found using different values of the DDP from toy datasets designed to mimic real-life microarray datasets.</p> <p>Results</p> <p>A simulation study employing analytical measures such as the distance between classes before and after transformation using principal component analysis is implemented on toy datasets. From these analyses, the necessity of adjusting the differential prioritization based on the dataset of interest is established. This conclusion is supported by comparisons against both simplistic rank-based selection and state-of-the-art equal-priorities scoring methods, which demonstrates the superiority of the DDP-based feature selection technique. Reapplying similar analyses to real-life multiclass microarray datasets provides further confirmation of our findings and of the significance of the DDP for practical applications.</p> <p>Conclusion</p> <p>The findings have been achieved based on analytical evaluations, not empirical evaluation involving classifiers, thus providing further basis for the usefulness of the DDP and validating the need for unequal priorities on relevance and redundancy during feature selection for microarray datasets, especially highly multiclass datasets.</p

    An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics

    Get PDF
    For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale. Analysis of clinicopathologic annotations for over 11,000 cancer patients in the TCGA program leads to the generation of TCGA Clinical Data Resource, which provides recommendations of clinical outcome endpoint usage for 33 cancer types

    Before and After: Comparison of Legacy and Harmonized TCGA Genomic Data Commons’ Data

    Get PDF
    We present a systematic analysis of the effects of synchronizing a large-scale, deeply characterized, multi-omic dataset to the current human reference genome, using updated software, pipelines, and annotations. For each of 5 molecular data platforms in The Cancer Genome Atlas (TCGA)—mRNA and miRNA expression, single nucleotide variants, DNA methylation and copy number alterations—comprehensive sample, gene, and probe-level studies were performed, towards quantifying the degree of similarity between the ‘legacy’ GRCh37 (hg19) TCGA data and its GRCh38 (hg38) version as ‘harmonized’ by the Genomic Data Commons. We offer gene lists to elucidate differences that remained after controlling for confounders, and strategies to mitigate their impact on biological interpretation. Our results demonstrate that the hg19 and hg38 TCGA datasets are very highly concordant, promote informed use of either legacy or harmonized omics data, and provide a rubric that encourages similar comparisons as new data emerge and reference data evolve. Gao et al. performed a systematic analysis of the effects of synchronizing the large-scale, widely used, multi-omic dataset of The Cancer Genome Atlas to the current human reference genome. For each of the five molecular data platforms assessed, they demonstrated a very high concordance between the ‘legacy’ GRCh37 (hg19) TCGA data and its GRCh38 (hg38) version as ‘harmonized’ by the Genomic Data Commons

    Genomic and molecular characterization of preterm birth.

    Get PDF
    Preterm birth (PTB) complications are the leading cause of long-term morbidity and mortality in children. By using whole blood samples, we integrated whole-genome sequencing (WGS), RNA sequencing (RNA-seq), and DNA methylation data for 270 PTB and 521 control families. We analyzed this combined dataset to identify genomic variants associated with PTB and secondary analyses to identify variants associated with very early PTB (VEPTB) as well as other subcategories of disease that may contribute to PTB. We identified differentially expressed genes (DEGs) and methylated genomic loci and performed expression and methylation quantitative trait loci analyses to link genomic variants to these expression and methylation changes. We performed enrichment tests to identify overlaps between new and known PTB candidate gene systems. We identified 160 significant genomic variants associated with PTB-related phenotypes. The most significant variants, DEGs, and differentially methylated loci were associated with VEPTB. Integration of all data types identified a set of 72 candidate biomarker genes for VEPTB, encompassing genes and those previously associated with PTB. Notably, PTB-associated genes RAB31 and RBPJ were identified by all three data types (WGS, RNA-seq, and methylation). Pathways associated with VEPTB include EGFR and prolactin signaling pathways, inflammation- and immunity-related pathways, chemokine signaling, IFN-γ signaling, and Notch1 signaling. Progress in identifying molecular components of a complex disease is aided by integrated analyses of multiple molecular data types and clinical data. With these data, and by stratifying PTB by subphenotype, we have identified associations between VEPTB and the underlying biology

    Mechanisms of ring chromosome formation, ring instability and clinical consequences

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The breakpoints and mechanisms of ring chromosome formation were studied and mapped in 14 patients.</p> <p>Methods</p> <p>Several techniques were performed such as genome-wide array, MLPA (Multiplex Ligation-Dependent Probe Amplification) and FISH (Fluorescent <it>in situ </it>Hybridization).</p> <p>Results</p> <p>The ring chromosomes of patients I to XIV were determined to be, respectively: r(3)(p26.1q29), r(4)(p16.3q35.2), r(10)(p15.3q26.2), r(10)(p15.3q26.13), r(13)(p13q31.1), r(13)(p13q34), r(14)(p13q32.33), r(15)(p13q26.2), r(18)(p11.32q22.2), r(18)(p11.32q21.33), r(18)(p11.21q23), r(22)(p13q13.33), r(22)(p13q13.2), and r(22)(p13q13.2). These rings were found to have been formed by different mechanisms, such as: breaks in both chromosome arms followed by end-to-end reunion (patients IV, VIII, IX, XI, XIII and XIV); a break in one chromosome arm followed by fusion with the subtelomeric region of the other (patients I and II); a break in one chromosome arm followed by fusion with the opposite telomeric region (patients III and X); fusion of two subtelomeric regions (patient VII); and telomere-telomere fusion (patient XII). Thus, the r(14) and one r(22) can be considered complete rings, since there was no loss of relevant genetic material. Two patients (V and VI) with r(13) showed duplication along with terminal deletion of 13q, one of them proved to be inverted, a mechanism known as inv-dup-del. Ring instability was detected by ring loss and secondary aberrations in all but three patients, who presented stable ring chromosomes (II, XIII and XIV).</p> <p>Conclusions</p> <p>We concluded that the clinical phenotype of patients with ring chromosomes may be related with different factors, including gene haploinsufficiency, gene duplications and ring instability. Epigenetic factors due to the circular architecture of ring chromosomes must also be considered, since even complete ring chromosomes can result in phenotypic alterations, as observed in our patients with complete r(14) and r(22).</p
    corecore