5,147 research outputs found
swTVM: Exploring the Automated Compilation for Deep Learning on Sunway Architecture
The proliferation of deep learning frameworks and hardware platforms
demands an efficient compiler that can hide the diversity in both software
and hardware in order to provide application portability. Among the existing
deep learning compilers, TVM is well known for its efficiency in code
generation and optimization across diverse hardware devices. Meanwhile,
the Sunway many-core processor has emerged as a competitive candidate owing to
its attractive computational power in both scientific and deep learning
applications. This paper combines the trends in these two directions.
Specifically, we propose swTVM, which extends the original TVM to support
ahead-of-time compilation for architectures that require cross-compilation,
such as Sunway. In addition, during compilation we leverage architecture
features such as core groups for massive parallelism, DMA for high-bandwidth
memory transfer, and local device memory for data locality, in order to generate
efficient code for deep learning applications on Sunway. The experimental
results show that swTVM can automatically generate code for various
deep neural network models on Sunway. For AlexNet and VGG-19, the code
automatically generated by swTVM achieves average speedups of 6.71x and 2.45x
over hand-optimized OpenACC implementations on convolution and fully
connected layers, respectively. This work is the first attempt from the compiler
perspective to bridge the gap between deep learning and high-performance
architectures, particularly with productivity and efficiency in mind. We would
like to open source the implementation so that more people can embrace the
power of deep learning compilers and the Sunway many-core processor.
Research on Military-Political Factors in the Transition of the Financial Center in the Sichuan-Chongqing Area Before the Anti-Japanese War
A major event in modern times was the transition of the financial center in the Sichuan-Chongqing area, which was triggered by the opening of Chongqing as a commercial port. The transition was basically complete before the Anti-Japanese War. Scholars have studied factors such as Chongqing’s geography and transport in depth, but the military-political factors in the transition have seldom been examined. This is especially true of “the War of Two Warlords Surnamed Liu”, the war between the warlords Liu Xiang and Liu Wenhui before the unification of the Sichuan Government, which resulted in the establishment of Chongqing’s financial advantages. There has also been little research on how the Nanjing Government selected and established Chongqing as the financial center of the home front. In fact, before “the War of Two Warlords Surnamed Liu”, the Chongqing-centered financial circles held an advantage, but not one sufficient to cover and radiate to the Chengdu-centered financial circles. The ultimate victory of Liu Xiang’s military-political group played a pivotal role in the long-term transition. Thereafter, Chiang Kai-shek’s group drove Liu Xiang’s group out of its Chongqing base under the dual political scheme of “suppressing the communists and pursuing designs on Sichuan”, and furthermore strove to establish Chongqing as the financial center for the Anti-Japanese War and the founding of the state. Finally, this goal was realized through the baptism of the Anti-Japanese War.
Calcineurin B-Like Proteins CBL4 and CBL10 Mediate Two Independent Salt Tolerance Pathways in Arabidopsis.
In Arabidopsis, the salt overly sensitive (SOS) pathway, consisting of calcineurin B-like protein 4 (CBL4/SOS3), CBL-interacting protein kinase 24 (CIPK24/SOS2) and SOS1, has been well defined as a crucial mechanism for controlling cellular ion homeostasis by extruding Na+ to the extracellular space, thus conferring salt tolerance in plants. CBL10 also plays a critical role in salt tolerance, possibly by activating Na+ compartmentation into the vacuole. However, the functional relationship between the SOS and CBL10-regulated processes remains unclear. Here, we analyzed the genetic interaction between CBL4 and CBL10 and found that the cbl4 cbl10 double mutant was dramatically more sensitive to salt than the cbl4 and cbl10 single mutants, suggesting that CBL4 and CBL10 each direct a different salt-tolerance pathway. Furthermore, the cbl4 cbl10 and cipk24 cbl10 double mutants were more sensitive than the cipk24 single mutant, suggesting that CBL10 directs a process involving CIPK24 and other partners different from the SOS pathway. Although the cbl4 cbl10, cipk24 cbl10, and sos1 cbl10 double mutants showed a salt-sensitive phenotype comparable to that of sos1 at the whole-plant level, they all accumulated much less Na+ than sos1 under high-salt conditions, suggesting that CBL10 regulates additional unknown transport processes that play roles distinct from that of SOS1 in Na+ homeostasis.