669 research outputs found

    Automatic Generation of Efficient Sparse Tensor Format Conversion Routines

    Full text link
    This paper shows how to generate code that efficiently converts sparse tensors between disparate storage formats (data layouts) such as CSR, DIA, ELL, and many others. We decompose sparse tensor conversion into three logical phases: coordinate remapping, analysis, and assembly. We then develop a language that precisely describes how different formats group together and order a tensor's nonzeros in memory. This lets a compiler emit code that performs complex remappings of nonzeros when converting between formats. We also develop a query language that can extract statistics about sparse tensors, and we show how to emit efficient analysis code that computes such queries. Finally, we define an abstract interface that captures how data structures for storing a tensor can be efficiently assembled given specific statistics about the tensor. Disparate formats can implement this common interface, thus letting a compiler emit optimized sparse tensor conversion code for arbitrary combinations of many formats without hard-coding for any specific combination. Our evaluation shows that the technique generates sparse tensor conversion routines with performance between 1.00 and 2.01×\times that of hand-optimized versions in SPARSKIT and Intel MKL, two popular sparse linear algebra libraries. And by emitting code that avoids materializing temporaries, which both libraries need for many combinations of source and target formats, our technique outperforms those libraries by 1.78 to 4.01×\times for CSC/COO to DIA/ELL conversion.Comment: Presented at PLDI 202

    Developing Single-Molecule TPM Experiments for Direct Observation of Successful RecA-Mediated Strand Exchange Reaction

    Get PDF
    RecA recombinases play a central role in homologous recombination. Once assembled on single-stranded (ss) DNA, RecA nucleoprotein filaments mediate the pairing of homologous DNA sequences and strand exchange processes. We have designed two experiments based on tethered particle motion (TPM) to investigate the fates of the invading and the outgoing strands during E. coli RecA-mediated pairing and strand exchange at the single-molecule level in the absence of force. TPM experiments measure the tethered bead Brownian motion indicative of the DNA tether length change resulting from RecA binding and dissociation. Experiments with beads labeled on either the invading strand or the outgoing strand showed that DNA pairing and strand exchange occurs successfully in the presence of either ATP or its non-hydrolyzable analog, ATPγS. The strand exchange rates and efficiencies are similar under both ATP and ATPγS conditions. In addition, the Brownian motion time-courses suggest that the strand exchange process progresses uni-directionally in the 5′-to-3′ fashion, using a synapse segment with a wide and continuous size distribution

    Mapping structural diversity in networks sharing a given degree distribution and global clustering: Adaptive resolution grid search evolution with Diophantine equation-based mutations

    Get PDF
    Methods that generate networks sharing a given degree distribution and global clustering can induce changes in structural properties other than that controlled for. Diversity in structural properties, in turn, can affect the outcomes of dynamical processes operating on those networks. Since exhaustive sampling is not possible, we propose a novel evolutionary framework for mapping this structural diversity. The three main features of this framework are: (a) subgraph-based encoding of networks, (b) exact mutations based on solving systems of Diophantine equations, and (c) heuristic diversity-driven mechanism to drive resolution changes in the MapElite algorithm.We show that our framework can elicit networks with diversity in their higher-order structure and that this diversity affects the behaviour of the complex contagion model. Through a comparison with state of the art clustered network generation methods, we demonstrate that our approach can uncover a comparably diverse range of networks without needing computationally unfeasible mixing times. Further, we suggest that the subgraph-based encoding provides greater confidence in the diversity of higher-order network structure for low numbers of samples and is the basis for explaining our results with complex contagion model. We believe that this framework could be applied to other complex landscapes that cannot be practically mapped via exhaustive sampling

    Review of the literature and suggestions for the design of rodent survival studies for the identification of compounds that increase health and life span

    Get PDF
    Much of the literature describing the search for agents that increase the life span of rodents was found to suffer from confounds. One-hundred-six studies, absent 20 contradictory melatonin studies, of compounds or combinations of compounds were reviewed. Only six studies reported both life span extension and food consumption data, thereby excluding the potential effects of caloric restriction. Six other studies reported life span extension without a change in body weight. However, weight can be an unreliable surrogate measure of caloric consumption. Twenty studies reported that food consumption or weight was unchanged, but it was unclear whether these data were anecdotal or systematic. Twenty-nine reported extended life span likely due to induced caloric restriction. Thirty-six studies reported no effect on life span, and three a decrease. The remaining studies suffer from more serious confounds. Though still widely cited, studies showing life span extension using short-lived or “enfeebled” rodents have not been shown to predict longevity effects in long-lived animals. We suggest improvements in experimental design that will enhance the reliability of the rodent life span literature. First, animals should receive measured quantities of food and its consumption monitored, preferably daily, and reported. Weights should be measured regularly and reported. Second, a genetically heterogeneous, long-lived rodent should be utilized. Third, chemically defined diets should be used. Fourth, a positive control (e.g., a calorically restricted group) is highly desirable. Fifth, drug dosages should be chosen based on surrogate endpoints or accepted cross-species scaling factors. These procedures should improve the reliability of the scientific literature and accelerate the identification of longevity and health span-enhancing agents

    A Common Left Occipito-Temporal Dysfunction in Developmental Dyslexia and Acquired Letter-By-Letter Reading?

    Get PDF
    We used fMRI to examine functional brain abnormalities of German-speaking dyslexics who suffer from slow effortful reading but not from a reading accuracy problem. Similar to acquired cases of letter-by-letter reading, the developmental cases exhibited an abnormal strong effect of length (i.e., number of letters) on response time for words and pseudowords.Corresponding to lesions of left occipito-temporal (OT) regions in acquired cases, we found a dysfunction of this region in our developmental cases who failed to exhibit responsiveness of left OT regions to the length of words and pseudowords. This abnormality in the left OT cortex was accompanied by absent responsiveness to increased sublexical reading demands in phonological inferior frontal gyrus (IFG) regions. Interestingly, there was no abnormality in the left superior temporal cortex which--corresponding to the onological deficit explanation--is considered to be the prime locus of the reading difficulties of developmental dyslexia cases.The present functional imaging results suggest that developmental dyslexia similar to acquired letter-by-letter reading is due to a primary dysfunction of left OT regions

    A Powerful Test of Parent-of-Origin Effects for Quantitative Traits Using Haplotypes

    Get PDF
    Imprinting is an epigenetic phenomenon where the same alleles have unequal transcriptions and thus contribute differently to a trait depending on their parent of origin. This mechanism has been found to affect a variety of human disorders. Although various methods for testing parent-of-origin effects have been proposed in linkage analysis settings, only a few are available for association analysis and they are usually restricted to small families and particular study designs. In this study, we develop a powerful maximum likelihood test to evaluate the parent-of-origin effects of SNPs on quantitative phenotypes in general family studies. Our method incorporates haplotype distribution to take advantage of inter-marker LD information in genome-wide association studies (GWAS). Our method also accommodates missing genotypes that often occur in genetic studies. Our simulation studies with various minor allele frequencies, LD structures, family sizes, and missing schemes have uniformly shown that using the new method significantly improves the power of detecting imprinted genes compared with the method using the SNP at the testing locus only. Our simulations suggest that the most efficient strategy to investigate parent-of-origin effects is to recruit one parent and as many offspring as possible under practical constraints. As a demonstration, we applied our method to a dataset from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) to test the parent-of-origin effects of the SNPs within the PPARGC1A, MTP and FABP2 genes on diabetes-related phenotypes, and found that several SNPs in the MTP gene show parent-of-origin effects on insulin and glucose levels
    corecore