74 research outputs found

    Multi-GPU Graph Analytics

    Full text link
    We present a single-node, multi-GPU programmable graph processing library that allows programmers to easily extend single-GPU graph algorithms to achieve scalable performance on large graphs with billions of edges. Directly using the single-GPU implementations, our design only requires programmers to specify a few algorithm-dependent concerns, hiding most multi-GPU related implementation details. We analyze the theoretical and practical limits to scalability in the context of varying graph primitives and datasets. We describe several optimizations, such as direction optimizing traversal, and a just-enough memory allocation scheme, for better performance and smaller memory consumption. Compared to previous work, we achieve best-of-class performance across operations and datasets, including excellent strong and weak scalability on most primitives as we increase the number of GPUs in the system.Comment: 12 pages. Final version submitted to IPDPS 201

    The Problem and Countermeasures of Cross-border E-commerce Logistics

    Get PDF
    With the development of free trade zone, strategy of One Belt and One Road and the policy of cross-border e-commerce, many domestic electric business platforms enter into industry of cross-border electricity. Cross-border e-commerce which is a new business model has advantages, such as, fewer links, short processing cycle and low cost. It can powerfully develop today. However, cross-border logistics which is short slab of cross-border e-commerce restricts the development of cross-border e-commerce. Cross-border e-commerce that has multi-function, multiple frequency operation and comprehensive characteristics require logistics service has the characteristics of agility, high efficiency, low cost and visualization. The model of cross-border logistics in our country has many problems. For example, high cost, long time, incomplete logistics information system, lack of large-scale logistics service enterprises. This factors seriously restrict the development of cross-border e-commerce. In this paper, cross-border electricity development present in our country and its existing problems have been discussed, and we put forward relevant countermeasures. And the goal of the paper is to provide electric commercial enterprises some reference and experience

    Gunrock: GPU Graph Analytics

    Full text link
    For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and the complexity of programming GPUs, have presented two significant challenges to developing a programmable high-performance graph library. "Gunrock", our graph-processing system designed specifically for the GPU, uses a high-level, bulk-synchronous, data-centric abstraction focused on operations on a vertex or edge frontier. Gunrock achieves a balance between performance and expressiveness by coupling high performance GPU computing primitives and optimization strategies with a high-level programming model that allows programmers to quickly develop new graph primitives with small code size and minimal GPU programming knowledge. We characterize the performance of various optimization strategies and evaluate Gunrock's overall performance on different GPU architectures on a wide range of graph primitives that span from traversal-based algorithms and ranking algorithms, to triangle counting and bipartite-graph-based algorithms. The results show that on a single GPU, Gunrock has on average at least an order of magnitude speedup over Boost and PowerGraph, comparable performance to the fastest GPU hardwired primitives and CPU shared-memory graph libraries such as Ligra and Galois, and better performance than any other GPU high-level graph library.Comment: 52 pages, invited paper to ACM Transactions on Parallel Computing (TOPC), an extended version of PPoPP'16 paper "Gunrock: A High-Performance Graph Processing Library on the GPU

    Performance Characterization of High-Level Programming Models for GPU Graph Analytics

    Full text link
    We identify several factors that are critical to high-performance GPU graph analytics: efficient building block operators, synchronization and data movement, workload distribution and load balancing, and memory access patterns. We analyze the impact of these critical factors through three GPU graph analytic frameworks, Gunrock, MapGraph, and VertexAPI2. We also examine their effect on different workloads: four common graph primitives from multiple graph application domains, evaluated through real-world and synthetic graphs. We show that efficient building block operators enable more powerful operations for fast information propagation and result in fewer device kernel invocations, less data movement, and fewer global synchronizations, and thus are key focus areas for efficient large-scale graph analytics on the GPU

    Distributed Differential Privacy via Shuffling vs Aggregation: a Curious Study

    Get PDF
    How to achieve distributed differential privacy (DP) without a trusted central party is of great interest in both theory and practice. Recently, the shuffle model has attracted much attention. Unlike the local DP model in which the users send randomized data directly to the data collector/analyzer, in the shuffle model an intermediate untrusted shuffler is introduced to randomly permute the data, which have already been randomized by the users, before they reach the analyzer. The most appealing aspect is that while shuffling does not explicitly add more noise to the data, it can make privacy better. The privacy amplification effect in consequence means the users need to add less noise to the data than in the local DP model, but can achieve the same level of differential privacy. Thus, protocols in the shuffle model can provide better accuracy than those in the local DP model. What looks interesting to us is that the architecture of the shuffle model is similar to private aggregation, which has been studied for more than a decade. In private aggregation, locally randomized user data are aggregated by an intermediate untrusted aggregator. Thus, our question is whether aggregation also exhibits some sort of privacy amplification effect? And if so, how good is this ``aggregation model\u27\u27 in comparison with the shuffle model. We conducted the first comparative study between the two, covering privacy amplification, functionalities, protocol accuracy, and practicality. The results as yet suggest that the new shuffle model does not have obvious advantages over the old aggregation model. On the contrary, protocols in the aggregation model outperform those in the shuffle model, sometimes significantly, in many aspects

    Benefits and risks of drug combination therapy for diabetes mellitus and its complications: a comprehensive review

    Get PDF
    Diabetes is a chronic metabolic disease, and its therapeutic goals focus on the effective management of blood glucose and various complications. Drug combination therapy has emerged as a comprehensive treatment approach for diabetes. An increasing number of studies have shown that, compared with monotherapy, combination therapy can bring significant clinical benefits while controlling blood glucose, weight, and blood pressure, as well as mitigating damage from certain complications and delaying their progression in diabetes, including both type 1 diabetes (T1D), type 2 diabetes (T2D) and related complications. This evidence provides strong support for the recommendation of combination therapy for diabetes and highlights the importance of combined treatment. In this review, we first provided a brief overview of the phenotype and pathogenesis of diabetes and discussed several conventional anti-diabetic medications currently used for the treatment of diabetes. We then reviewed several clinical trials and pre-clinical animal experiments on T1D, T2D, and their common complications to evaluate the efficacy and safety of different classes of drug combinations. In general, combination therapy plays a pivotal role in the management of diabetes. Integrating the effectiveness of multiple drugs enables more comprehensive and effective control of blood glucose without increasing the risk of hypoglycemia or other serious adverse events. However, specific treatment regimens should be tailored to individual patients and implemented under the guidance of healthcare professionals

    STC3141 improves acute lung injury through neutralizing circulating histone in rat with experimentally-induced acute respiratory distress syndrome

    Get PDF
    Background: Acute respiratory distress syndrome (ARDS) remains a challenge because of its high morbidity and mortality. Circulation histones levels in ARDS patients were correlated to disease severity and mortality. This study examined the impact of histone neutralization in a rat model of acute lung injury (ALI) induced by a lipopolysaccharide (LPS) double-hit.Methods: Sixty-eight male Sprague-Dawley rats were randomized to sham (N = 8, received saline only) or LPS (N = 60). The LPS double-hit consisted of a 0.8 mg/kg intraperitoneal injection followed after 16 h by 5 mg/kg intra-tracheal nebulized LPS. The LPS group was then randomized into five groups: LPS only; LPS +5, 25, or 100 mg/kg intravenous STC3141 every 8 h (LPS + L, LPS + M, LPS + H, respectively); or LPS + intraperitoneal dexamethasone 2.5 mg/kg every 24 h for 56 h (LPS + D). The animals were observed for 72 h.Results: LPS animals developed ALI as suggested by lower oxygenation, lung edema formation, and histological changes compared to the sham animals. Compared to the LPS group, LPS + H and +D groups had significantly lower circulating histone levels and lung wet-to-dry ratio, and the LPS + D group also had lower BALF histone concentrations; the blood neutrophils and platelets counts in LPS + D group did not change, meanwhile, the LPS + L, +M and +H groups had significantly lower neutrophil counts and higher platelet counts in the blood; the total number of BALF WBC, platelet counts, MPO and H3 were significantly lower in the LPS + L, +M, +H and +D groups than in the LPS only group; and the degree of inflammation was significantly less in the LPS + L, +M, +H and +D groups, moreover, inflammation in the LPS + L, +M and +H animals showed a dose-dependent response; finally, the LPS + L, +M, +H and +D groups had improved oxygenation compared to the LPS group, and there were no statistical differences in PCO2 or pH among groups. All animals survived.Conclusion: Neutralization of histone using STC3141, especially at high dose, had similar therapeutic effects to dexamethasone in this LPS double-hit rat ALI model, with significantly decreased circulating histone concentration, improved acute lung injury and oxygenation

    Adverse Effect of Nano-Silicon Dioxide on Lung Function of Rats with or without Ovalbumin Immunization

    Get PDF
    BACKGROUND: The great advances of nanomaterials have brought out broad important applications, but their possible nanotoxicity and risks have not been fully understood. It is confirmed that exposure of environmental particulate matter (PM), especially ultrafine PM, are responsible for many lung function impairment and exacerbation of pre-existing lung diseases. However, the adverse effect of nanoparticles on allergic asthma is seldom investigated and the mechanism remains undefined. For the first time, this work investigates the relationship between allergic asthma and nanosized silicon dioxide (nano-SiO₂). METHODOLOGY/PRINCIPAL FINDINGS: Ovalbumin (OVA)-treated and saline-treated control rats were daily intratracheally administered 0.1 ml of 0, 40 and 80 µg/ml nano-SiO₂ solutions, respectively for 30 days. Increased nano-SiO₂ exposure results in adverse changes on inspiratory and expiratory resistance (Ri and Re), but shows insignificant effect on rat lung dynamic compliance (Cldyn). Lung histological observation reveals obvious airway remodeling in 80 µg/ml nano-SiO₂-introduced saline and OVA groups, but the latter is worse. Additionally, increased nano-SiO₂ exposure also leads to more severe inflammation. With increasing nano-SiO₂ exposure, IL-4 in lung homogenate increases and IFN-γ shows a reverse but insignificant change. Moreover, at a same nano-SiO₂ exposure concentration, OVA-treated rats exhibit higher (significant) IL-4 and lower (not significant) IFN-γ compared with the saline-treated rats. The percentages of eosinophil display an unexpected result, in which higher exposure results lower eosinophil percentages. CONCLUSIONS/SIGNIFICANCE: This was a preliminary study which for the first time involved the effect of nano-SiO₂ to OVA induced rat asthma model. The results suggested that intratracheal administration of nano-SiO₂ could lead to the airway hyperresponsiveness (AHR) and the airway remolding with or without OVA immunization. This occurrence may be due to the Th1/Th2 cytokine imbalance accelerated by the nano-SiO₂ through increasing the tissue IL-4 production

    Real-time Monitoring for the Next Core-Collapse Supernova in JUNO

    Full text link
    Core-collapse supernova (CCSN) is one of the most energetic astrophysical events in the Universe. The early and prompt detection of neutrinos before (pre-SN) and during the SN burst is a unique opportunity to realize the multi-messenger observation of the CCSN events. In this work, we describe the monitoring concept and present the sensitivity of the system to the pre-SN and SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), which is a 20 kton liquid scintillator detector under construction in South China. The real-time monitoring system is designed with both the prompt monitors on the electronic board and online monitors at the data acquisition stage, in order to ensure both the alert speed and alert coverage of progenitor stars. By assuming a false alert rate of 1 per year, this monitoring system can be sensitive to the pre-SN neutrinos up to the distance of about 1.6 (0.9) kpc and SN neutrinos up to about 370 (360) kpc for a progenitor mass of 30M⊙M_{\odot} for the case of normal (inverted) mass ordering. The pointing ability of the CCSN is evaluated by using the accumulated event anisotropy of the inverse beta decay interactions from pre-SN or SN neutrinos, which, along with the early alert, can play important roles for the followup multi-messenger observations of the next Galactic or nearby extragalactic CCSN.Comment: 24 pages, 9 figure
    • …
    corecore