915 research outputs found
eXCloud: Transparent runtime support for scaling mobile applications in cloud
Cloud computing augments applications with ease-of-access to the enormous resources on the Internet. Combined with mobile computing technologies, mobile applications can exploit the Cloud everywhere by statically distributing code segments or dynamically migrating running processes onto cloud services. Existing migration techniques are however too coarse-grained for mobile devices, so the overheads often offset the benefits of migration. To build a truly elastic mobile cloud computing infrastructure, we introduce eXCloud (eXtensible Cloud) - a middleware system with multi-level mobility support, ranging from as coarse as a VM instance to as fine as a runtime stack frame, and allows resources to be integrated and used dynamically. In eXCloud, a stack-on-demand (SOD) approach is used to support computation mobility throughout the mobile cloud environment. The approach is fully adaptive, goal-driven and transparent. By downward task migration, applications running on the cloud nodes can exploit or take control of special resources in mobile devices such as GPS and cameras. With a restorable MPI layer, task migrations of MPI parallel programs can happen between cloud nodes or be initiated from a mobile device. Our evaluation shows that SOD outperforms several existing migration mechanisms in terms of migration overhead and latency. All our techniques result in better resource utilization through task migrations among cloud nodes and mobile nodes.published_or_final_versionThe 2011 International Conference on Cloud and Service Computing (CSC), Hong Kong, China, 12-14 December 2011. In Proceedings of CSC, 2011, p. 103-11
Spectral modeling of switched-mode power converters in discontinuous conduction mode
Analytical spectral modeling of switched-mode PWM converters, operating in the discontinuous conduction mode, is presented. The modeling approach is to derive analytical expressions of the converter output response using the extended Volterra functional series. Nonlinear frequency responses, including additional DC offset as well as significant subharmonic and higher harmonic components, due to large-signal perturbations of both the duty ratio and source voltage, are investigated. Their vulnerability to switching frequencies are also described. This modeling approach has been applied to various PWM converters, boost, buck and buck-boost types, operating in the discontinuous conduction mode, and the results are verified.published_or_final_versio
Adaptive sampling-based profiling techniques for optimizing the distributed JVM runtime
Extending the standard Java virtual machine (JVM) for cluster-awareness is a transparent approach to scaling out multithreaded Java applications. While this clustering solution is gaining momentum in recent years, efficient runtime support for fine-grained object sharing over the distributed JVM remains a challenge. The system efficiency is strongly connected to the global object sharing profile that determines the overall communication cost. Once the sharing or correlation between threads is known, access locality can be optimized by collocating highly correlated threads via dynamic thread migrations. Although correlation tracking techniques have been studied in some page-based sof Tware DSM systems, they would entail prohibitively high overheads and low accuracy when ported to fine-grained object-based systems. In this paper, we propose a lightweight sampling-based profiling technique for tracking inter-thread sharing. To preserve locality across migrations, we also propose a stack sampling mechanism for profiling the set of objects which are tightly coupled with a migrant thread. Sampling rates in both techniques can vary adaptively to strike a balance between preciseness and overhead. Such adaptive techniques are particularly useful for applications whose sharing patterns could change dynamically. The profiling results can be exploited for effective thread-to-core placement and dynamic load balancing in a distributed object sharing environment. We present the design and preliminary performance result of our distributed JVM with the profiling implemented. Experimental results show that the profiling is able to obtain over 95% accurate global sharing profiles at a cost of only a few percents of execution time increase for fine- to medium- grained applications. © 2010 IEEE.published_or_final_versionThe 24th IEEE International Symposium on Parallel & Distributed Processing (IPDPS 2010), Atlanta, GA., 19-23 April 2010. In Proceedings of the 24th IPDPS, 2010, p. 1-1
Adaptive thread scheduling techniques for improving scalability of software transactional memory
Software transactional memory (STM) enhances both ease-of-use and concurrency, and is considered state-of-the-art for parallel applications to scale on modern multi-core hardware. However, there are certain situations where STM performs even worse than traditional locks. Upon hotspots where most threads contend over a few pieces of shared data, going transactional will result in excessive conflicts and aborts that adversely degrade performance. We present a new design of adaptive thread scheduler that manages concurrency when the system is about entering and leaving hotspots. The scheduler controls the number of threads spawning new transactions according to the live commit throughput. We implemented two feedback-control policies called Throttle and Probe to realize this adaptive scheduling. Performance evaluation with the STAMP benchmarks shows that enabling Throttle and Probe obtain best-case speedups of 87.5% and 108.7% respectively.postprintThe 10th IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN 2011), Innsbruck, Austria, 15-17 February 2011. In Proceedings of the 10th IASTED-PDCN, 2011, p. 91-9
Cache affinity optimization techniques for scaling software transactional memory systems on multi-CMP architectures
Software transactional memory (STM) enhances both ease-of-use and concurrency, and is considered one of the next-generation paradigms for parallel programming. Application programs may see hotspots where data conflicts are intensive and seriously degrade the performance. So advanced STM systems employ dynamic concurrency control techniques to curb the conflict rate through properly throttling the rate of spawning transactions. High-end computers may have two or more multicore processors so that data sharing among cores goes through a non-uniform cache memory hierarchy. This poses challenges to concurrency control designs as improper metadata placement and sharing will introduce scalability issues to the system. Poor thread-to-core mappings that induce excessive cache invalidation are also detrimental to the overall performance. In this paper, we share our experience in designing and implementing a new dynamic concurrency controller for Tiny STM, which helps keeping the system concurrency at a near-optimal level. By decoupling unfavourable metadata sharing, our controller design avoids costly inter-processor communications. It also features an affinity-aware thread migration technique that fine-tunes thread placements by observing inter-thread transactional conflicts. We evaluate our implementation using the STAMP benchmark suite and show that the controller can bring around 21% average speedup over the baseline execution. © 2015 IEEE.postprin
Adaptive live VM migration over a WAN: modeling and implementation
Recent advances in virtualization technology enable high mobility of virtual machines and resource provisioning at the data-center level. To streamline the migration process, various migration strategies have been proposed for VM live migration over a local-area network (LAN). The most common solution uses memory pre-copying and assumes the storage is shared on the LAN. While applied to a wide-area network (WAN), the VM live migration algorithms need a new design philosophy to address the challenges of long latency, limited bandwidth, unstable network conditions and the movement of storage. This paper proposes a three-phase fractional hybrid pre-copy and post-copy solution for both memory and storage to achieve highly adaptive migration over a WAN. In this hybrid solution, we selectively migrate an important fraction of memory and storage in the pre-copy and freeze-and-copy phase, while the rest (non-critical data set) is migrated during post-copying. We propose a new metric called performance restoration agility, which considers both the downtime and the VM speed degradation during the post-copy phase, to evaluate the migration process. We also develop a profiling framework and a novel probabilistic prediction model to adaptively find a predictably optimal combination of the memory and storage fractions to migrate. This model-based hybrid solution is implemented on Xen and evaluated in an emulated WAN environment. Experimental results show that our solution wins over all others in adaptiveness for various applications over a WAN, while retaining the responsiveness of post-copy algorithms.published_or_final_versio
Rhymes: a shared virtual memory system for non-coherent tiled many-core architectures
The rising core count per processor is pushing chip complexity to a level that hardware-based cache coherency protocols become too hard and costly to scale. We need new designs of many-core hardware and software other than traditional technologies to keep up with the ever-increasing scalability demands. The Intel Single-chip Cloud Computer (SCC) is a recent research processor exemplifying a new cluster-on-chip architecture which promotes a software-oriented approach instead of hardware support to implementing shared memory coherence. This paper presents a shared virtual memory (SVM) system, dubbed Rhymes, tailored to such a new processor kind of non-coherent and hybrid memory architectures. Rhymes features a two-way cache coherence protocol to enforce release consistency for pages allocated in shared physical memory (SPM) and scope consistency for pages in per-core private memory. It also supports page remapping on a per-core basis to boost data locality. We implement Rhymes on the SCC port of the Barrelfish OS. Experimental results show that our SVM outperforms the pure SPM approach used by Intel's software managed coherence (SMC) library by up to 12 times, with superlinear speedups (due to L2 cache effect) noted for applications with strong data reuse patterns.published_or_final_versio
Epidemiological and Evolutionary Dynamics of Influenza B Viruses in Malaysia, 2012-2014
Epidemiological and evolutionary dynamics of influenza B Victoria and Yamagata lineages remained poorly understood in the tropical Southeast Asia region, despite causing seasonal outbreaks worldwide. From 2012-2014, nasopharyngeal swab samples collected from outpatients experiencing acute upper respiratory tract infection symptoms in Kuala Lumpur, Malaysia, were screened for influenza viruses using a multiplex RT-PCR assay. Among 2,010/3,935 (51.1%) patients infected with at least one respiratory virus, 287 (14.3%) and 183 (9.1%) samples were tested positive for influenza A and B viruses, respectively. Influenza-positive cases correlate significantly with meteorological factors-total amount of rainfall, relative humidity, number of rain days, ground temperature and particulate matter (PM10). Phylogenetic reconstruction of haemagglutinin (HA) gene from 168 influenza B viruses grouped them into Yamagata Clade 3 (65, 38.7%), Yamagata Clade 2 (48, 28.6%) and Victoria Clade 1 (55, 32.7%). With neuraminidase (NA) phylogeny, 30 intra-clade (29 within Yamagata Clade 3, 1 within Victoria Clade 1) and 1 inter-clade (Yamagata Clade 2-HA/Yamagata Clade 3-NA) reassortants were identified. Study of virus temporal dynamics revealed a lineage shift from Victoria to Yamagata (2012-2013), and a clade shift from Yamagata Clade 2 to Clade 3 (2013-2014). Yamagata Clade 3 predominating in 2014 consisted of intra-clade reassortants that were closely related to a recent WHO vaccine candidate strain (B/Phuket/3073/2013), with the reassortment event occurred approximately 2 years ago based on Bayesian molecular clock estimation. Malaysian Victoria Clade 1 viruses carried H274Y substitution in the active site of neuraminidase, which confers resistance to oseltamivir. Statistical analyses on clinical and demographic data showed Yamagata-infected patients were older and more likely to experience headache while Victoria-infected patients were more likely to experience nasal congestion and sore throat. This study describes the evolution of influenza B viruses in Malaysia and highlights the importance of continuous surveillance for better vaccination policy in this region.published_or_final_versio
Effectiveness of a Chinese herbal medicine preparation in the treatment of cough in uncomplicated upper respiratory tract infection: a randomised double-blinded placebo-control trial
BACKGROUND: Rigorous scientific and well-designed clinical trials to evaluate the effect of traditional Chinese medicine (TCM) is lacking. We, therefore, designed this study to evaluate the effectiveness of a commonly used TCM preparation in treating acute cough of uncomplicated URTI in adults and to search for a safe, effective and affordable alternative treatment for this common condition. METHODS: A randomised, double-blinded, placebo-control study comparing this TCM preparation with a placebo was conducted in 82 patients who attended the Family Medicine Training Centre, Prince of Wales Hospital, Hong Kong between November and December, 2003. The TCM herbal preparation includes nine commonly used TCM herbs for cough such as chuanbei, fangfeng, jiegeng, gancao and baibu (see Table 1) The treatment lasted for 5 days and patients were followed-up for another 6 days. Patients were asked to fill in a cough score and validated Leicester cough questionnaire (LCQ). RESULTS: 62 patients (75.6%) had completed the trial and no adverse effects were reported. Both intervened and control groups had improved in cough score and LCQ in the follow-up period, despite no overall statistical significance was observed in the differences of scores between the two groups. Women taking TCM had significantly fewer problems with sputum production (p = 0.03) and older subjects (>35 years of age) reported a significant improvement in hoarseness (p = 0.05) when compared to those using placebo. CONCLUSION: TCM was well-tolerated and received among the Hong Kong Chinese population. This TCM preparation appeared to have some benefits in the treatment of cough. Future research on TCM should concentrate more on commonly encountered conditions such as UTRI and cough. Our experience on the sensitivity of assessment tools used in detecting subtle differences in an otherwise self-limiting illness and clinical trial methodology when applying the underlying theory of how TCM works in disease management was invaluable
Simultaneous dual-band optical coherence tomography for endoscopic applications
published_or_final_versio
- …