74 research outputs found
Multi-GPU Graph Analytics
We present a single-node, multi-GPU programmable graph processing library
that allows programmers to easily extend single-GPU graph algorithms to achieve
scalable performance on large graphs with billions of edges. Directly using the
single-GPU implementations, our design only requires programmers to specify a
few algorithm-dependent concerns, hiding most multi-GPU related implementation
details. We analyze the theoretical and practical limits to scalability in the
context of varying graph primitives and datasets. We describe several
optimizations, such as direction optimizing traversal, and a just-enough memory
allocation scheme, for better performance and smaller memory consumption.
Compared to previous work, we achieve best-of-class performance across
operations and datasets, including excellent strong and weak scalability on
most primitives as we increase the number of GPUs in the system.Comment: 12 pages. Final version submitted to IPDPS 201
The Problem and Countermeasures of Cross-border E-commerce Logistics
With the development of free trade zone, strategy of One Belt and One Road and the policy of cross-border e-commerce, many domestic electric business platforms enter into industry of cross-border electricity. Cross-border e-commerce which is a new business model has advantages, such as, fewer links, short processing cycle and low cost. It can powerfully develop today. However, cross-border logistics which is short slab of cross-border e-commerce restricts the development of cross-border e-commerce. Cross-border e-commerce that has multi-function, multiple frequency operation and comprehensive characteristics require logistics service has the characteristics of agility, high efficiency, low cost and visualization. The model of cross-border logistics in our country has many problems. For example, high cost, long time, incomplete logistics information system, lack of large-scale logistics service enterprises. This factors seriously restrict the development of cross-border e-commerce. In this paper, cross-border electricity development present in our country and its existing problems have been discussed, and we put forward relevant countermeasures. And the goal of the paper is to provide electric commercial enterprises some reference and experience
Gunrock: GPU Graph Analytics
For large-scale graph analytics on the GPU, the irregularity of data access
and control flow, and the complexity of programming GPUs, have presented two
significant challenges to developing a programmable high-performance graph
library. "Gunrock", our graph-processing system designed specifically for the
GPU, uses a high-level, bulk-synchronous, data-centric abstraction focused on
operations on a vertex or edge frontier. Gunrock achieves a balance between
performance and expressiveness by coupling high performance GPU computing
primitives and optimization strategies with a high-level programming model that
allows programmers to quickly develop new graph primitives with small code size
and minimal GPU programming knowledge. We characterize the performance of
various optimization strategies and evaluate Gunrock's overall performance on
different GPU architectures on a wide range of graph primitives that span from
traversal-based algorithms and ranking algorithms, to triangle counting and
bipartite-graph-based algorithms. The results show that on a single GPU,
Gunrock has on average at least an order of magnitude speedup over Boost and
PowerGraph, comparable performance to the fastest GPU hardwired primitives and
CPU shared-memory graph libraries such as Ligra and Galois, and better
performance than any other GPU high-level graph library.Comment: 52 pages, invited paper to ACM Transactions on Parallel Computing
(TOPC), an extended version of PPoPP'16 paper "Gunrock: A High-Performance
Graph Processing Library on the GPU
Performance Characterization of High-Level Programming Models for GPU Graph Analytics
We identify several factors that are critical to high-performance GPU graph analytics: efficient building block operators, synchronization and data movement, workload distribution and load balancing, and memory access patterns. We analyze the impact of these critical factors through three GPU graph analytic frameworks, Gunrock, MapGraph, and VertexAPI2. We also examine their effect on different workloads: four common graph primitives from multiple graph application domains, evaluated through real-world and synthetic graphs. We show that efficient building block operators enable more powerful operations for fast information propagation and result in fewer device kernel invocations, less data movement, and fewer global synchronizations, and thus are key focus areas for efficient large-scale graph analytics on the GPU
Distributed Differential Privacy via Shuffling vs Aggregation: a Curious Study
How to achieve distributed differential privacy (DP) without a trusted central party is of great interest in both theory and practice. Recently, the shuffle model has attracted much attention. Unlike the local DP model in which the users send randomized data directly to the data collector/analyzer, in the shuffle model an intermediate untrusted shuffler is introduced to randomly permute the data, which have already been randomized by the users, before they reach the analyzer. The most appealing aspect is that while shuffling does not explicitly add more noise to the data, it can make privacy better. The privacy amplification effect in consequence means the users need to add less noise to the data than in the local DP model, but can achieve the same level of differential privacy. Thus, protocols in the shuffle model can provide better accuracy than those in the local DP model. What looks interesting to us is that the architecture of the shuffle model is similar to private aggregation, which has been studied for more than a decade. In private aggregation, locally randomized user data are aggregated by an intermediate untrusted aggregator. Thus, our question is whether aggregation also exhibits some sort of privacy amplification effect? And if so, how good is this ``aggregation model\u27\u27 in comparison with the shuffle model. We conducted the first comparative study between the two, covering privacy amplification, functionalities, protocol accuracy, and practicality. The results as yet suggest that the new shuffle model does not have obvious advantages over the old aggregation model. On the contrary, protocols in the aggregation model outperform those in the shuffle model, sometimes significantly, in many aspects
Benefits and risks of drug combination therapy for diabetes mellitus and its complications: a comprehensive review
Diabetes is a chronic metabolic disease, and its therapeutic goals focus on the effective management of blood glucose and various complications. Drug combination therapy has emerged as a comprehensive treatment approach for diabetes. An increasing number of studies have shown that, compared with monotherapy, combination therapy can bring significant clinical benefits while controlling blood glucose, weight, and blood pressure, as well as mitigating damage from certain complications and delaying their progression in diabetes, including both type 1 diabetes (T1D), type 2 diabetes (T2D) and related complications. This evidence provides strong support for the recommendation of combination therapy for diabetes and highlights the importance of combined treatment. In this review, we first provided a brief overview of the phenotype and pathogenesis of diabetes and discussed several conventional anti-diabetic medications currently used for the treatment of diabetes. We then reviewed several clinical trials and pre-clinical animal experiments on T1D, T2D, and their common complications to evaluate the efficacy and safety of different classes of drug combinations. In general, combination therapy plays a pivotal role in the management of diabetes. Integrating the effectiveness of multiple drugs enables more comprehensive and effective control of blood glucose without increasing the risk of hypoglycemia or other serious adverse events. However, specific treatment regimens should be tailored to individual patients and implemented under the guidance of healthcare professionals
STC3141 improves acute lung injury through neutralizing circulating histone in rat with experimentally-induced acute respiratory distress syndrome
Background: Acute respiratory distress syndrome (ARDS) remains a challenge because of its high morbidity and mortality. Circulation histones levels in ARDS patients were correlated to disease severity and mortality. This study examined the impact of histone neutralization in a rat model of acute lung injury (ALI) induced by a lipopolysaccharide (LPS) double-hit.Methods: Sixty-eight male Sprague-Dawley rats were randomized to sham (N = 8, received saline only) or LPS (N = 60). The LPS double-hit consisted of a 0.8Â mg/kg intraperitoneal injection followed after 16Â h by 5Â mg/kg intra-tracheal nebulized LPS. The LPS group was then randomized into five groups: LPS only; LPS +5, 25, or 100Â mg/kg intravenous STC3141 every 8Â h (LPS + L, LPS + M, LPS + H, respectively); or LPS + intraperitoneal dexamethasone 2.5Â mg/kg every 24Â h for 56Â h (LPS + D). The animals were observed for 72Â h.Results: LPS animals developed ALI as suggested by lower oxygenation, lung edema formation, and histological changes compared to the sham animals. Compared to the LPS group, LPS + H and +D groups had significantly lower circulating histone levels and lung wet-to-dry ratio, and the LPS + D group also had lower BALF histone concentrations; the blood neutrophils and platelets counts in LPS + D group did not change, meanwhile, the LPS + L, +M and +H groups had significantly lower neutrophil counts and higher platelet counts in the blood; the total number of BALF WBC, platelet counts, MPO and H3 were significantly lower in the LPS + L, +M, +H and +D groups than in the LPS only group; and the degree of inflammation was significantly less in the LPS + L, +M, +H and +D groups, moreover, inflammation in the LPS + L, +M and +H animals showed a dose-dependent response; finally, the LPS + L, +M, +H and +D groups had improved oxygenation compared to the LPS group, and there were no statistical differences in PCO2 or pH among groups. All animals survived.Conclusion: Neutralization of histone using STC3141, especially at high dose, had similar therapeutic effects to dexamethasone in this LPS double-hit rat ALI model, with significantly decreased circulating histone concentration, improved acute lung injury and oxygenation
Adverse Effect of Nano-Silicon Dioxide on Lung Function of Rats with or without Ovalbumin Immunization
BACKGROUND: The great advances of nanomaterials have brought out broad important applications, but their possible nanotoxicity and risks have not been fully understood. It is confirmed that exposure of environmental particulate matter (PM), especially ultrafine PM, are responsible for many lung function impairment and exacerbation of pre-existing lung diseases. However, the adverse effect of nanoparticles on allergic asthma is seldom investigated and the mechanism remains undefined. For the first time, this work investigates the relationship between allergic asthma and nanosized silicon dioxide (nano-SiO₂). METHODOLOGY/PRINCIPAL FINDINGS: Ovalbumin (OVA)-treated and saline-treated control rats were daily intratracheally administered 0.1 ml of 0, 40 and 80 µg/ml nano-SiO₂ solutions, respectively for 30 days. Increased nano-SiO₂ exposure results in adverse changes on inspiratory and expiratory resistance (Ri and Re), but shows insignificant effect on rat lung dynamic compliance (Cldyn). Lung histological observation reveals obvious airway remodeling in 80 µg/ml nano-SiO₂-introduced saline and OVA groups, but the latter is worse. Additionally, increased nano-SiO₂ exposure also leads to more severe inflammation. With increasing nano-SiO₂ exposure, IL-4 in lung homogenate increases and IFN-γ shows a reverse but insignificant change. Moreover, at a same nano-SiO₂ exposure concentration, OVA-treated rats exhibit higher (significant) IL-4 and lower (not significant) IFN-γ compared with the saline-treated rats. The percentages of eosinophil display an unexpected result, in which higher exposure results lower eosinophil percentages. CONCLUSIONS/SIGNIFICANCE: This was a preliminary study which for the first time involved the effect of nano-SiO₂ to OVA induced rat asthma model. The results suggested that intratracheal administration of nano-SiO₂ could lead to the airway hyperresponsiveness (AHR) and the airway remolding with or without OVA immunization. This occurrence may be due to the Th1/Th2 cytokine imbalance accelerated by the nano-SiO₂ through increasing the tissue IL-4 production
Real-time Monitoring for the Next Core-Collapse Supernova in JUNO
Core-collapse supernova (CCSN) is one of the most energetic astrophysical
events in the Universe. The early and prompt detection of neutrinos before
(pre-SN) and during the SN burst is a unique opportunity to realize the
multi-messenger observation of the CCSN events. In this work, we describe the
monitoring concept and present the sensitivity of the system to the pre-SN and
SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), which is
a 20 kton liquid scintillator detector under construction in South China. The
real-time monitoring system is designed with both the prompt monitors on the
electronic board and online monitors at the data acquisition stage, in order to
ensure both the alert speed and alert coverage of progenitor stars. By assuming
a false alert rate of 1 per year, this monitoring system can be sensitive to
the pre-SN neutrinos up to the distance of about 1.6 (0.9) kpc and SN neutrinos
up to about 370 (360) kpc for a progenitor mass of 30 for the case
of normal (inverted) mass ordering. The pointing ability of the CCSN is
evaluated by using the accumulated event anisotropy of the inverse beta decay
interactions from pre-SN or SN neutrinos, which, along with the early alert,
can play important roles for the followup multi-messenger observations of the
next Galactic or nearby extragalactic CCSN.Comment: 24 pages, 9 figure
- …