366 research outputs found

    Evaluating Emerging CXL-enabled Memory Pooling for HPC Systems

    Full text link
    Current HPC systems provide memory resources that are statically configured and tightly coupled with compute nodes. However, workloads on HPC systems are evolving. Diverse workloads lead to a need for configurable memory resources to achieve high performance and utilization. In this study, we evaluate a memory subsystem design leveraging CXL-enabled memory pooling. Two promising use cases of composable memory subsystems are studied -- fine-grained capacity provisioning and scalable bandwidth provisioning. We developed an emulator to explore the performance impact of various memory compositions. We also provide a profiler to identify the memory usage patterns in applications and their optimization opportunities. Seven scientific and six graph applications are evaluated on various emulated memory configurations. Three out of seven scientific applications had less than 10% performance impact when the pooled memory backed 75% of their memory footprint. The results also show that a dynamically configured high-bandwidth system can effectively support bandwidth-intensive unstructured mesh-based applications like OpenFOAM. Finally, we identify interference through shared memory pools as a practical challenge for adoption on HPC systems.Comment: 10 pages, 13 figures. Accepted for publication in Workshop on Memory Centric High Performance Computing (MCHPC'22) at SC2

    A Quantitative Approach for Adopting Disaggregated Memory in HPC Systems

    Full text link
    Memory disaggregation has recently been adopted in data centers to improve resource utilization, motivated by cost and sustainability. Recent studies on large-scale HPC facilities have also highlighted memory underutilization. A promising and non-disruptive option for memory disaggregation is rack-scale memory pooling, where shared memory pools supplement node-local memory. This work outlines the prospects and requirements for adoption and clarifies several misconceptions. We propose a quantitative method for dissecting application requirements on the memory system from the top down in three levels, moving from general, to multi-tier memory systems, and then to memory pooling. We provide a multi-level profiling tool and LBench to facilitate the quantitative approach. We evaluate a set of representative HPC workloads on an emulated platform. Our results show that prefetching activities can significantly influence memory traffic profiles. Interference in memory pooling has varied impacts on applications, depending on their access ratios to memory tiers and arithmetic intensities. Finally, in two case studies, we show the benefits of our findings at the application and system levels, achieving 50% reduction in remote access and 13% speedup in BFS, and reducing performance variation of co-located workloads in interference-aware job scheduling.Comment: Accepted to SC23 (The International Conference for High Performance Computing, Networking, Storage, and Analysis 2023

    One Region: Promoting Prosperity Across Race

    Get PDF
    Identifies communities with limited opportunities in New York City and environs. Examines the links between housing, jobs, education, and other indicators and racial inequity, poverty, lack of investment, and suburban sprawl. Makes policy recommendations

    Pertanggungjawaban Para Pihak Yang Terlibat Dalam Perjudian Poker Online

    Get PDF
    ABSTRACT: Gambling is a crime, namely betting an amount of money where the winner gets the betting money or in other words a battle of fate. The problem of gambling can be detrimental to society and the morals of our nation, basically this crime causes public order, peace and security to be disrupted. Besides that, the influence on children is very large, they will join in committing gambling crimes that they see happening in their environment, which will negatively affect the psychology of children and cause material losses for those who do it. Even though the prohibition of gambling both offline and using electronic media or more familiarly known as online gambling has been very strictly regulated in several laws and regulations, in fact this gambling is increasingly widespread in society. Even with the many regulations that tighten the scope of gambling, gamblers are even more innovative in running this business. Gambling means that an act can be punished by the existence of regulations. If someone has committed a crime of gambling, then he can only be tried based on the criminal laws and regulations that were in effect at the time the act was committed. Online gambling is one type of Cyber ​​Crime action. namely the misuse of technology for gambling, online gambling itself has been regulated in Law No. 11 of 2008 concerning Information and Electronic Transactions. And the article that regulates online gambling is listed in article 27 paragraph 2

    Analisis Keuntungan Usaha Keripik Pisang Goroho Marindi Kelurahan Karombasan Utara Kecamatan Wanea Kota Manado

    Get PDF
    The purpose of the study was to analyze the profit of goroho banana chips business “Marindi” in Karombasan Utara Village, Wanea District, Manado City. The research was conducted from February to March 2023. This research was conducted at the “Marindi” goroho banana chips business located in North Karombasan Village, Wanea District, Manado City. The data taken is primary data using direct interview techniques with business owners, based on questions (questionnaires) that have been prepared. The sampling method used in this research is a case study by determining the Goroho Banana Chips business in North Karombasan Village, Wanea District, Manado City. Data analysis carried out in this research is descriptive analysis method and presented in tabular form. Then measured using revenue analysis, profit, total costs incurred, and revenue cost (R/C). The results showed that the total revenue of the “Marindi” goroho banana chips business during January 2023 amounted to Rp13,140,000 / month with production costs / month of Rp6,062,765 overall the total profit earned by the “Marindi” business amounted to Rp7,077,235. The feasibility of the “Marindi” goroho banana chips business shows an R / C Ratio of 2.16 with an R / C value> 1, so the business profit is feasible to run

    Ramsey and Tur\'an numbers of sparse hypergraphs

    Full text link
    Degeneracy plays an important role in understanding Tur\'an- and Ramsey-type properties of graphs. Unfortunately, the usual hypergraphical generalization of degeneracy fails to capture these properties. We define the skeletal degeneracy of a kk-uniform hypergraph as the degeneracy of its 11-skeleton (i.e., the graph formed by replacing every kk-edge by a kk-clique). We prove that skeletal degeneracy controls hypergraph Tur\'an and Ramsey numbers in a similar manner to (graphical) degeneracy. Specifically, we show that kk-uniform hypergraphs with bounded skeletal degeneracy have linear Ramsey number. This is the hypergraph analogue of the Burr-Erd\H{o}s conjecture (proved by Lee). In addition, we give upper and lower bounds of the same shape for the Tur\'an number of a kk-uniform kk-partite hypergraph in terms of its skeletal degeneracy. The proofs of both results use the technique of dependent random choice. In addition, the proof of our Ramsey result uses the `random greedy process' introduced by Lee in his resolution of the Burr-Erd\H{o}s conjecture.Comment: 33 page

    Language classification using n-grams accelerated by FPGA-based Bloom filters

    Full text link
    N-Gram (n-character sequences in text documents) count-ing is a well-established technique used in classifying the language of text in a document. In this paper, n-gram processing is accelerated through the use of reconfigurable hardware on the XtremeData XD1000 system. Our design employs parallelism at multiple levels, with parallel Bloom Filters accessing on-chip RAM, parallel language classifiers, and parallel document processing. In contrast to another hardware implementation (HAIL algorithm) that uses off-chip SRAM for lookup, our highly scalable implementation uses only on-chip memory blocks. Our implementation of end-to-end language classification runs at 85 Ă— comparable software and 1.45 Ă— the competing hardware design. 1
    • …
    corecore