298 research outputs found

    TTC: A Tensor Transposition Compiler for Multiple Architectures

    Full text link
    We consider the problem of transposing tensors of arbitrary dimension and describe TTC, an open source domain-specific parallel compiler. TTC generates optimized parallel C++/CUDA C code that achieves a significant fraction of the system's peak memory bandwidth. TTC exhibits high performance across multiple architectures, including modern AVX-based systems (e.g.,~Intel Haswell, AMD Steamroller), Intel's Knights Corner as well as different CUDA-based GPUs such as NVIDIA's Kepler and Maxwell architectures. We report speedups of TTC over a meaningful baseline implementation generated by external C++ compilers; the results suggest that a domain-specific compiler can outperform its general purpose counterpart significantly: For instance, comparing with Intel's latest C++ compiler on the Haswell and Knights Corner architecture, TTC yields speedups of up to 8×8\times and 32×32\times, respectively. We also showcase TTC's support for multiple leading dimensions, making it a suitable candidate for the generation of performance-critical packing functions that are at the core of the ubiquitous BLAS 3 routines

    What am I not seeing? An Interactive Approach to Social Content Discovery in Microblogs

    Get PDF
    In this paper, we focus on the informational and user experience benefits of user-driven topic exploration in microblog communities, such as Twitter, in an inspectable, controllable and personalized manner. To this end, we introduce ``HopTopics'' -- a novel interactive tool for exploring content that is popular just beyond a user's typical information horizon in a microblog, as defined by the network of individuals that they are connected to. We present results of a user study (N=122) to evaluate HopTopics with varying complexity against a typical microblog feed in both personalized and non-personalized conditions. Results show that the HopTopics system, leveraging content from both the direct and extended network of a user, succeeds in giving users a better sense of control and transparency. Moreover, participants had a poor mental model for the degree of novel content discovered when presented with non-personalized data in the Inspectable interface

    When transcriptome meets metabolome: fast cellular responses of yeast to sudden relief of glucose limitation

    Get PDF
    Within the first 5 min after a sudden relief from glucose limitation, Saccharomyces cerevisiae exhibited fast changes of intracellular metabolite levels and a major transcriptional reprogramming. Integration of transcriptome and metabolome data revealed tight relationships between the changes at these two levels. Transcriptome as well as metabolite changes reflected a major investment in two processes: adaptation from fully respiratory to respiro-fermentative metabolism and preparation for growth acceleration. At the metabolite level, a severe drop of the AXP pools directly after glucose addition was not accompanied by any of the other three NXP. To counterbalance this loss, purine biosynthesis and salvage pathways were transcriptionally upregulated in a concerted manner, reflecting a sudden increase of the purine demand. The short-term dynamics of the transcriptome revealed a remarkably fast decrease in the average half-life of downregulated genes. This acceleration of mRNA decay can be interpreted both as an additional nucleotide salvage pathway and an additional level of glucose-induced regulation of gene expression

    A methodology pruning the search space of six compiler transformations by addressing them together as one problem and by exploiting the hardware architecture details

    Get PDF
    Today’s compilers have a plethora of optimizations-transformations to choose from, and the correct choice, order as well parameters of transformations have a significant/large impact on performance; choosing the correct order and parameters of optimizations has been a long standing problem in compilation research, which until now remains unsolved; the separate sub-problems optimization gives a different schedule/binary for each sub-problem and these schedules cannot coexist, as by refining one degrades the other. Researchers try to solve this problem by using iterative compilation techniques but the search space is so big that it cannot be searched even by using modern supercomputers. Moreover, compiler transformations do not take into account the hardware architecture details and data reuse in an efficient way. In this paper, a new iterative compilation methodology is presented which reduces the search space of six compiler transformations by addressing the above problems; the search space is reduced by many orders of magnitude and thus an efficient solution is now capable to be found. The transformations are the following: loop tiling (including the number of the levels of tiling), loop unroll, register allocation, scalar replacement, loop interchange and data array layouts. The search space is reduced (a) by addressing the aforementioned transformations together as one problem and not separately, (b) by taking into account the custom hardware architecture details (e.g., cache size and associativity) and algorithm characteristics (e.g., data reuse). The proposed methodology has been evaluated over iterative compilation and gcc/icc compilers, on both embedded and general purpose processors; it achieves significant performance gains at many orders of magnitude lower compilation time

    Frequent deletion of the CDKN2A locus in chordoma: analysis of chromosomal imbalances using array comparative genomic hybridisation

    Get PDF
    The initiating somatic genetic events in chordoma development have not yet been identified. Most cytogenetically investigated chordomas have displayed near-diploid or moderately hypodiploid karyotypes, with several numerical and structural rearrangements. However, no consistent structural chromosome aberration has been reported. This is the first array-based study characterising DNA copy number changes in chordoma. Array comparative genomic hybridisation (aCGH) identified copy number alterations in all samples and imbalances affecting 5 or more out of the 21 investigated tumours were seen on all chromosomes. In general, deletions were more common than gains and no high-level amplification was found, supporting previous findings of primarily losses of large chromosomal regions as an important mechanism in chordoma development. Although small imbalances were commonly found, the vast majority of these were detected in single cases; no small deletion affecting all tumours could be discerned. However, the CDKN2A and CDKN2B loci in 9p21 were homo- or heterozygously lost in 70% of the tumours, a finding corroborated by fluorescence in situ hybridisation, suggesting that inactivation of these genes constitute an important step in chordoma development

    An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics

    Get PDF
    For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale. Analysis of clinicopathologic annotations for over 11,000 cancer patients in the TCGA program leads to the generation of TCGA Clinical Data Resource, which provides recommendations of clinical outcome endpoint usage for 33 cancer types

    Explorative Analysis of Recommendations Through Interactive Visualization

    Get PDF
    Even though today's recommender algorithms are highly sophisticated, they can hardly take into account the users' situational needs. An obvious way to address this is to initially inquire the users' momentary preferences, but the users' inability to accurately state them upfront may lead to the loss of several good alternatives. Hence, this paper suggests to generate the recommendations without such additional input data from the users and let them interactively explore the recommended items on their own. To support this explorative analysis, a novel visualization tool based on treemaps is developed. The analysis of the prototype demonstrates that the interactive treemap visualization facilitates the users' comprehension of the big picture of available alternatives and the reasoning behind the recommendations. This helps the users get clear about their situational needs, inspect the most relevant recommendations in detail, and finally arrive at informed decisions

    EuroFlow Standardized Approach to Diagnostic Immunopheneotyping of Severe PID in Newborns and Young Children

    Get PDF
    The EuroFlow PID consortium developed a set of flow cytometry tests for evaluation of patients with suspicion of primary immunodeficiency (PID). In this technical report we evaluate the performance of the SCID-RTE tube that explores the presence of recent thymic emigrants (RTE) together with T-cell activation status and maturation stages and discuss its applicability in the context of the broader EuroFlow PID flow cytometry testing algorithm for diagnostic orientation of PID of the lymphoid system. We have analyzed peripheral blood cells of 26 patients diagnosed between birth and 2 years of age with a genetically defined primary immunodeficiency disorder: 1
    corecore