150 research outputs found

    Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs

    Full text link
    As deep learning models nowadays are widely adopted by both cloud services and edge devices, reducing the latency of deep learning model inferences becomes crucial to provide efficient model serving. However, it is challenging to develop efficient tensor programs for deep learning operators due to the high complexity of modern accelerators and the rapidly growing number of operators. Deep learning compilers, such as Apache TVM, adopt declarative scheduling primitives to lower the bar of developing tensor programs. However, we show that this approach is insufficient to cover state-of-the-art tensor program optimizations. In this paper, we propose to embed the scheduling process into tensor programs and use dedicated mappings, called task mappings, to define the computation assignment and ordering. This new approach greatly enriches the expressible optimizations by allowing developers to manipulate tensor programs at a much finer granularity. We call the proposed method the task-mapping programming paradigm. In addition, we propose a new post-scheduling fusion optimization that allows developers to focus on scheduling every single operator and automates the fusion after scheduling. It greatly reduces the engineering efforts for operator fusion. Our proposed paradigm also constructs an efficient hardware-centric schedule space, which is agnostic to the program input size and greatly reduces the tuning time. With the proposed paradigm, we implement a deep learning compiler Hidet. Extensive experiments on modern convolution and transformer models show that Hidet outperforms state-of-the-art DNN inference framework, ONNX Runtime, and compiler, TVM equipped with scheduler AutoTVM and Ansor, by up to 1.48x (1.22x on average). It also reduces the tuning time by 20x and 11x compared with AutoTVM and Ansor, respectively. We open-sourced hidet at https://www.github.com/hidet-org/hidet.Comment: 15 pages, 22 figures, 1 tabl

    Influences of mental accounting on consumption decisions: asymmetric effect of a scarcity mindset

    Get PDF
    A scarcity mindset is considered to impact consumer behaviors. Our research aimed to examine the moderating effect of the scarcity mindset on the relationship between mental accounting and hedonic (vs. utilitarian) consumption. We conducted an online experimental design (mental accounting: windfall gains vs. hard-earning gains; consumption: hedonic products vs. utilitarian products) and verified our hypotheses in two distinct samples: a student sample and an adult sample. Our results showed that consumers who received windfall gains tended to use it for hedonic consumption rather than utilitarian consumption. Intriguingly, such an effect was insignificant under a high level of a scarcity mindset but significant under a low level of the scarcity mindset. Moreover, consumers who received hard-earning gains tended to spend the money on utilitarian (vs. hedonic) consumption. However, we did not detect the impact of the scarcity mindset on such effects. Our research suggested an asymmetric effect of the scarcity mindset on hedonic (vs. utilitarian) consumption under two different mental accounts. It highlights the important role of the scarcity mindset in consumer behaviors, which leaves avenues for future research to understand marketing promotion strategies for distinct products

    Assessment of multi-source observation merged 1 km-grid precipitation product during the disastrous rainstorms in Guangdong

    Get PDF
    This paper aims to assess the latest 1 km-grid Analysis Real Time (ART_1 km) precipitation product developed by the National Meteorological Information Center of China Meteorological Administration (CMA), which can provide great support for disaster weather monitoring and warning, intelligent grid forecasting and weather services. Observed precipitation data from the independent stations (including non-uploaded regional meteorological stations and hydrometric stations) that were not integrated into the ART_1 km precipitation product as well as precipitation classification inspection are used to assess the quality of this product during twenty disastrous rainstorm cases from May to August during 2019-2022 in Guangdong. The results show that the ART_1 km precipitation product successfully reproduces the precipitation location, strength, and trends in these cases, with the best performance in the Pearl River Delta, the east of eastern Guangdong, and the north of northern Guangdong. The stronger the precipitation, the greater the correlation as well as the root mean square error (RMSE) and mean error (ME) between the ART_1 km precipitation and the observed precipitation. When the hourly precipitation is not classified, about 60% of these independent stations present a correlation efficient ≥ 0.8, more than 90% of the stations present an RMSE within the range of [1.0, 5.0) mm, and more than 60% of the stations present a ME within ±0.1 mm. When the hourly precipitation is < 5 mm, most of the stations have a correlation efficient < 0.5, an RMSE within the range of [1.0, 5.0) mm, and a ME within [0.0, 0.5] mm. When the hourly precipitation is ≥ 20 mm, 42%~56% of the stations have a correlation efficient ≥ 0.5, and most of the stations have an RMSE ≥ 10 mm and a ME < 0 mm, even when the hourly precipitation is ≥ 50 mm, most of the stations have a ME < -10 mm. Overall, ART_1 km precipitation is usually underestimated at the independent stations, and integrating observations from more sites into producing ART_1 km precipitation is helpful to improve the quality of the products

    YeastFab:the design and construction of standard biological parts for metabolic engineering in Saccharomyces cerevisiae

    Get PDF
    It is a routine task in metabolic engineering to introduce multicomponent pathways into a heterologous host for production of metabolites. However, this process sometimes may take weeks to months due to the lack of standardized genetic tools. Here, we present a method for the design and construction of biological parts based on the native genes and regulatory elements in Saccharomyces cerevisiae. We have developed highly efficient protocols (termed YeastFab Assembly) to synthesize these genetic elements as standardized biological parts, which can be used to assemble transcriptional units in a single-tube reaction. In addition, standardized characterization assays are developed using reporter constructs to calibrate the function of promoters. Furthermore, the assembled transcription units can be either assayed individually or applied to construct multi-gene metabolic pathways, which targets a genomic locus or a receiving plasmid effectively, through a simple in vitro reaction. Finally, using β-carotene biosynthesis pathway as an example, we demonstrate that our method allows us not only to construct and test a metabolic pathway in several days, but also to optimize the production through combinatorial assembly of a pathway using hundreds of regulatory biological parts

    The Crystal Structure and RNA-Binding of an Orthomyxovirus Nucleoprotein

    Get PDF
    Genome packaging for viruses with segmented genomes is often a complex problem. This is particularly true for influenza viruses and other orthomyxoviruses, whose genome consists of multiple negative-sense RNAs encapsidated as ribonucleoprotein (RNP) complexes. To better understand the structural features of orthomyxovirus RNPs that allow them to be packaged, we determined the crystal structure of the nucleoprotein (NP) of a fish orthomyxovirus, the infectious salmon anemia virus (ISAV) (genus Isavirus). As the major protein component of the RNPs, ISAV-NP possesses a bi-lobular structure similar to the influenza virus NP. Because both RNA-free and RNA-bound ISAV NP forms stable dimers in solution, we were able to measure the NP RNA binding affinity as well as the stoichiometry using recombinant proteins and synthetic oligos. Our RNA binding analysis revealed that each ISAV-NP binds ,12 nts of RNA, shorter than the 24ï¾–28 nts originally estimated for the influenza A virus NP based on population average. The 12-nt stoichiometry was further confirmed by results from electron microscopy and dynamic light scattering. Considering that RNPs of ISAV and the influenza viruses have similar morphologies and dimensions, our findings suggest that NP-free RNA may exist on orthomyxovirus RNPs, and selective RNP packaging may be accomplished through direct RNA-RNA interactions

    Structure and assembly of the influenza A virus ribonucleoprotein complex

    Get PDF
    The genome of influenza A viruses consists of eight segments of single-stranded, negative-sense RNA that are encapsidated as individual rod-shaped ribonucleoprotein complexes (RNPs). Each RNP contains a viral RNA, a viral polymerase and multiple copies of the viral nucleoprotein (NP). Influenza A virus RNPs play important roles during virus infection by directing viral RNA replication and transcription, intracellular transport of the viral RNA, gene reassortment as well as viral genome packaging into progeny particles. As a unique genomic entity, the influenza A virus RNP has been extensively studied since the 1960s. Recently, exciting progress has been made in studying the RNP structure and its assembly, leading to a better understanding of the structural basis of various RNP functions
    • …
    corecore