Search CORE

10 research outputs found

Exploiting commutativity to reduce the cost of updates to shared data in cache-coherent systems

Author: Horn Webb H
Sanchez Daniel
Zhang Guowei
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/12/2015
Field of study

We present Coup, a technique to lower the cost of updates to shared data in cache-coherent systems. Coup exploits the insight that many update operations, such as additions and bitwise logical operations, are commutative: they produce the same final result regardless of the order they are performed in. Coup allows multiple private caches to simultaneously hold update-only permission to the same cache line. Caches with update-only permission can locally buffer and coalesce updates to the line, but cannot satisfy read requests. Upon a read request, Coup reduces the partial updates buffered in private caches to produce the final value. Coup integrates seamlessly into existing coherence protocols, requires inexpensive hardware, and does not affect the memory consistency model. We apply Coup to speed up single-word updates to shared data. On a simulated 128-core, 8-socket system, Coup accelerates state-of-the-art implementations of update-heavy algorithms by up to 2.4×.Center for Future Architectures ResearchNational Science Foundation (U.S.) (CAREER-1452994)Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Grier Presidential Fellowship)Microelectronics Advanced Research CorporationUnited States. Defense Advanced Research Projects Agenc

DSpace@MIT

Crossref

StakeNet: using social networks to analyse the stakeholders of large-scale software projects

Author: Finkelstein A.
Lim S.L.
Quercia D.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

Many software projects fail because they overlook stakeholders or involve the wrong representatives of significant groups. Unfortunately, existing methods in stakeholder analysis are likely to omit stakeholders, and consider all stakeholders as equally influential. To identify and prioritise stakeholders, we have developed StakeNet, which consists of three main steps: identify stakeholders and ask them to recommend other stakeholders and stakeholder roles, build a social network whose nodes are stakeholders and links are recommendations, and prioritise stakeholders using a variety of social network measures. To evaluate StakeNet, we conducted one of the first empirical studies of requirements stakeholders on a software project for a 30,000-user system. Using the data collected from surveying and interviewing 68 stakeholders, we show that StakeNet identifies stakeholders and their roles with high recall, and accurately prioritises them. StakeNet uncovers a critical stakeholder role overlooked in the project, whose omission significantly impacted project success

CiteSeerX

UCL Discovery

A Personalised Ranking Framework with Multiple Sampling Criteria for Venue Recommendation

Author: Burges Christopher J.
Cheng Chen
Manotumruksa Jarana
Rendle Steffen
Yuan Fajie
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

Recommending a ranked list of interesting venues to users based on their preferences has become a key functionality in Location-Based Social Networks (LBSNs) such as Yelp and Gowalla. Bayesian Personalised Ranking (BPR) is a popular pairwise recommendation technique that is used to generate the ranked list of venues of interest to a user, by leveraging the user's implicit feedback such as their check-ins as instances of positive feedback, while randomly sampling other venues as negative instances. To alleviate the sparsity that affects the usefulness of recommendations by BPR for users with few check-ins, various approaches have been proposed in the literature to incorporate additional sources of information such as the social links between users, the textual content of comments, as well as the geographical location of the venues. However, such approaches can only readily leverage one source of additional information for negative sampling. Instead, we propose a novel Personalised Ranking Framework with Multiple sampling Criteria (PRFMC) that leverages both geographical influence and social correlation to enhance the effectiveness of BPR. In particular, we apply a multi-centre Gaussian model and a power-law distribution method, to capture geographical influence and social correlation when sampling negative venues, respectively. Finally, we conduct comprehensive experiments using three large-scale datasets from the Yelp, Gowalla and Brightkite LBSNs. The experimental results demonstrate the effectiveness of fusing both geographical influence and social correlation in our proposed PRFMC framework and its superiority in comparison to BPR-based and other similar ranking approaches. Indeed, our PRFMC approach attains a 37% improvement in MRR over a recently proposed approach that identifies negative venues only from social links

Crossref

Enlighten

Sorting Search Results of Literature Digital Libraries: Recent Developments and Future Research Directions

Author: Sulieman Bani-Ahmad
Publication venue: 'IntechOpen'
Publication date: 04/04/2011
Field of study

IntechOpen

Efficient Algorithm for Destabilization of Terrorist Networks

Author
Publication venue: 'MECS Publisher'
Publication date
Field of study

Crossref

가상화 환경을 위한 원격 메모리

Author: 조창연
Publication venue: 서울대학교 대학원
Publication date: 01/08/2021
Field of study

학위논문(박사) -- 서울대학교대학원 : 공과대학 전기·컴퓨터공학부, 2021.8. Bernhard Egger.클라우드 환경은 거대한 연산 자원을 상시 가동할 필요 없고 원하는 순간 원하는 양의 대한 연산 비용만을 지불하면 되기 때문에, 최근 인공지능 및 빅데이터 연산의 유행으로 인해 그 수요가 크게 증가하고 있다. 이러한 클라우드 컴퓨팅의 도입으로인해 고객은 서버 유지에 대한 비용을 크게 절감할 수 있고 서비스 제공자는 연산 자원의 이용 효율을 극대화 할 수 있다. 이러한 시나리오에서 데이터센터 입장에서는 연산 자원 활용 효율을 개선하는 것이 중요한 목표가 된다. 특히 최근 폭증하고 있는 데이터 센터의 규모를 고려하면 작은 효율 개선으로도 막대한 경제적 가치를 창출 할 수 있다. 데이터 센터의 효율은 위치 선정, 구조 설계, 냉각 시스템, 하드웨어 구성 등등 다양한 요소들에 영향을 받지만, 이 논문에서는 특히 연산 및 메모리 자원을 관리하는 소프트웨어 설계 및 구현을 다룬다. 본 논문에서는 데이터 센터 효율 개선을 획기적으로 개선하는 두가지 소프트웨어 기반 기술을 제안한다. 첫 째로 가상화 환경을 위한 소프트웨어 기반 메모리 분리 시스템을 제안한다. 최근 고속 네트워크의 발전으로 인해 원격 메모리 접근 비용이 획기적으로 줄어 들었고, 이 논문에서는 고성능 네트워킹 하드웨어를 이용하여 원격 메모리 위에서 실행되는 가상 머신의 큰 성능 저하 없이 실행할 수 있음을 보인다. 제안된 기술을 QEMU/KVM 가상머신 하이퍼바이저를 통해 평가한 결과, 본 논문에서 제안한 기법은 기존 시스템 대비 원격 페이징에 대한 꼬리 지연시간을 98.2% 개선함을 보인다. 또한 랙 규모의 작업처리 시뮬레이션을 통한 실험에서, 제안된 시스템은 전체 작업 처리 시간을 기존 시스템 대비 40.9% 줄일 수 있음을 보인다. 두 번째로 원격 메모리를 이용하는 즉각적인 가상머신 이주 기법을 제안하다. 가상화 환경의 원격 메모리 활용에 대한 확장은 그것만으로 자원 이용률 향상에 대해 큰 기여를 하지만, 여전히 한 서버에서 여러 어플리케이션이 경쟁적으로 자원을 이용하는 경우 성능이 크게 저하 될 수 있다. 이 논문에서 제안하는 즉각적인 가상머신 이주 기법은 원격 메모리 상에서 아주 작은 메타데이터의 전송만으로 가상머신의 이주를 가능하게 하며, 메모리 상에 키와 값을 저장하는 데이터베이스 벤치마크를 실행하는 가상머신을 기반으로 한 평가에서 기존 기법대비 실질적인 서비스 중단시간을 최대 92.6% 개선함을 보인다.The raising importance of big data and artificial intelligence (AI) has led to an unprecedented shift in moving local computation into the cloud. One of the key drivers behind this transformation was the exploding cost of owning and maintaining large computing systems powerful enough to process these new workloads. Customers experience a reduced cost by renting only the required resources and only when needed, while data center operators benefit from efficiency at scale. A key factor in operating a profitable data center is a high overall utilization of its resources. Due to the scale of modern data centers, small improvements in efficiency translate to significant savings in the total cost of ownership (TCO). There are many important elements that constitute an efficient data center such as its location, architecture, cooling system, or the employed hardware. In this thesis, we focus on software-related aspects, namely the utilization of computational and memory resources. Reports from data centers operated by Alibaba and Google show that the overall resource utilization has stagnated at a level of around 50 to 60 percent over the past decade. This low average utilization is mostly attributable to peak demand-driven resource allocation despite the high variability of modern workloads in their resource usage. In other words, data centers today lack an efficient way to put idle resources that are reserved but not used to work. In this dissertation we present RackMem, a software-based solution to address the problem of low resource utilization through two main contributions. First, we introduce a disaggregated memory system tailored for virtual environments. We observe that virtual machines can use remote memory without noticeable performance degradation under moderate memory pressure on modern networking infrastructure. We implement a specialized remote paging system for QEMU/KVM that reduces the remote paging tail-latency by 98.2% in comparison to the state of the art. A job processing simulation at rack-scale shows that the total makespan can be reduced by 40.9% under our memory system. While seamless disaggregated memory helps to balance memory usage across nodes, individual nodes can still suffer overloaded resources if co-located workloads exhibit high resource usage at the same time. In a second contribution, we present a novel live migration technique for machines running on top of our remote paging system. Under this instant live migration technique, entire virtual machines can be migrated in as little as 100 milliseconds. An evaluation with in-memory key-value database workloads shows that the presented migration technique improves the state of the art by a wide margin in all key performance metrics. The presented software-based solutions lay the technical foundations that allow data center operators to significantly improve the utilization of their computational and memory resources. As future work, we propose new job schedulers and load balancers to make full use of these new technical foundations.Chapter 1. Introduction 1 1.1 Contributions of the Dissertation 3 Chapter 2. Background 5 2.1 Resource Disaggregation 5 2.2 Transparent Remote Paging 7 2.3 Remote Direct Memory Access (RDMA) 9 2.4 Live Migration of Virtual Machines 10 Chapter 3. RackMem Overview 13 3.1 RackMem Virtual Memory 13 3.2 RackMem Distributed Virtual Storage 14 3.3 RackMem Networking 15 3.4 Instant VM Live Migration 16 Chapter 4. Virtual Memory 17 4.1 Design Considerations for Achieving Low-latency 19 4.2 Pagefault handling 20 4.2.1 Fast-path and slow-path in the pagefault handler 21 4.2.2 State transition of RackVM page 23 4.3 Latency Hiding Techniques 25 4.4 Implementation 26 4.4.1 RackMem Virtual Memory Module 27 4.4.2 Dynamic Rebalancing of Local Memory 29 4.4.3 RackVM for Virtual Machines 29 4.4.4 Running Unmodified Applications 30 Chapter 5. RackMem Distributed Virtual Storage 31 5.1 The distributed Storage Abstraction 32 5.2 Memory Management 33 5.2.1 Remote memory allocation 33 5.2.2 Remote memory reclamation 33 5.3 Fault Tolerance 34 5.3.1 Fault-tolerance and Write-duplication 34 5.4 Multiple Storage Support in RackMem 36 5.5 Implementation 38 5.5.1 The Remote Memory Backend 38 5.5.2 Linux Demand Paging on RackDVS 39 Chapter 6. Networking 40 6.1 Design of RackNet 40 6.2 Implementation 41 6.2.1 RPC message layout 41 6.2.2 RackNet RPC Implementation 42 Chapter 7. Instant VM Live Migration 44 7.1 Motivation 45 7.1.1 The need for a tailored live migration technique 45 7.1.2 Software Bottlenecks 46 7.1.3 Utilizing workload variability 46 7.2 Design of Instant 47 7.2.1 Instant Region Migration 47 7.3 Implementation 48 7.3.1 Extension of RackVM for Instant 49 7.3.2 Instant region migration 49 7.3.3 Pre-fetch optimizations 51 7.3.4 Downtime optimizations 51 7.3.5 QEMU modification for Instant 52 Chapter 8. Evaluation - RackMem 53 8.1 Execution Environment 54 8.2 Pagefault Handler Latency 56 8.3 Single Application Performance 57 8.3.1 Batch-oriented Applications 58 8.3.2 Internal Pagesize and Performance 59 8.3.3 Write-duplication overhead 60 8.3.4 RackDVS slab size and performance 62 8.3.5 Latency-oriented Applications 63 8.3.6 Network Bandwidth Analysis 64 8.3.7 Dynamic Local Memory Partitioning 66 8.3.8 Rack-scale Job Processing Simulation 67 Chapter 9. Evaluation - Instant VM Live Migration 69 9.1 Experimental setup 69 9.2 Target Applications 70 9.3 Comparison targets 70 9.4 Database and client setups 71 9.5 Memory disaggregation scenarios 71 9.6.1 Time-to-responsiveness 71 9.6.2 Effective Downtime 73 9.6.3 Effect of Instant optimizations 75 Chapter 10. Conclusion 77 10.1 Future Directions 78 요약 89박

SNU Open Repository and Archive

Reduced Order Model of a Spouted Fluidized Bed Utilizing Proper Orthogonal Decomposition

Author: Beck-Roth Stephanie R.
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2011
Field of study

A reduced order model utilizing proper orthogonal decomposition for approximation of gas and solids velocities as well as pressure, solids granular temperature and gas void fraction for use in multiphase incompressible fluidized beds is developed and presented. The methodology is then tested on data representing a flat-bottom spouted fluidized bed and comparative results against the software Multiphase Flow with Interphase eXchanges (MFIX) are provided. The governing equations for the model development are based upon those implemented in the (MFIX) software. The three reduced order models explored are projective, extrapolative and interpolative. The first is an extension of the system solution beyond an original time sequence. The second is a numerical approximation to a new solution based on a small selected parameter deviation from an existing CFD data set. Finally an interpolative methodology approximates a solution between two existing CFD data sets both which vary a single parameter

The Research Repository @ WVU (West Virginia University)

Dealing with Intransitivity, Non-Convexity, and Algorithmic Bias in Preference Learning

Author: Bower Amanda Ruth
Publication venue
Publication date: 01/01/2020
Field of study

Rankings are ubiquitous since they are a natural way to present information to people who are making decisions. There are seemingly countless scenarios where rankings arise, such as deciding whom to hire at a company, determining what movies to watch, purchasing products, understanding human perception, judging science fair projects, voting for political candidates, and so on. In many of these scenarios, the number of items in consideration is prohibitively large, such that asking someone to rank all of the choices is essentially impossible. On the other hand, collecting preference data on a small subset of the items is feasible, e.g., collecting answers to ``Do you prefer item A or item B?" or ``Is item A closer to item B or item C?". Therefore, an important machine learning task is to learn a ranking of the items based on this preference data. This thesis theoretically and empirically addresses three key challenges of preference learning: intransitivity in preference data, non-convex optimization, and algorithmic bias. Chapter 2 addresses the challenge of learning a ranking given pairwise comparison data that violates rational choice such as intransitivity. Our key observation is that two items compared in isolation from other items may be compared based on only a salient subset of features. Formalizing this framework, we propose the salient feature preference model and prove a sample complexity result for learning the parameters of our model and the underlying ranking with maximum likelihood estimation. Chapter 3 addresses the non-convexity of an optimization problem inspired by ordinal embedding, which is a preference learning task. We aim to understand the landscape, that is local minimizers and global minimizers, of the non-convex objective, which corresponds to the hinge loss arising from quadratic constraints. Under certain assumptions, we give necessary conditions for non-global, local minimizers of our objective and additionally show that in two dimensions, every local minimizer is a global minimizer. Chapters 4 and 5 address the challenge of algorithmic bias. We consider training machine learning models that are fair in the sense that their performance is invariant under certain sensitive perturbations to the inputs. For example, the performance of a resume screening system should be invariant under changes to the gender and ethnicity of the applicant. We formalize this notion of algorithmic fairness as a variant of individual fairness. In Chapter 4, we consider classification and develop a distributionally robust optimization approach, SenSR, that enforces this notion of individual fairness during training and provably learns individually fair classifiers. Chapter 5 builds upon Chapter 4. We develop a related algorithm, SenSTIR, to train provably individually fair learning-to-rank (LTR) models. The proposed approach ensures items from minority groups appear alongside similar items from majority groups. This notion of fair ranking is based on the individual fairness definition considered in Chapter 4 for the supervised learning context and is more nuanced than prior fair LTR approaches that simply provide underrepresented items with a basic level of exposure. The crux of our method is an optimal transport-based regularizer that enforces individual fairness and an efficient algorithm for optimizing the regularizer.PHDApplied and Interdisciplinary MathematicsUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/166120/1/amandarg_1.pd

Deep Blue Documents at the University of Michigan

Effective neural architectures for context-aware venue recommendation

Author: Manotumruksa Jarana
Publication venue
Publication date: 01/01/2019
Field of study

Users in Location-Based Social Networks (LBSNs), such as Yelp and Foursquare, can search for interesting venues such as restaurants and museums to visit, or share their location with their friends by making an implicit feedback (e.g. checking in at venues they have visited). The users can also leave explicit feedback on the venues they have visited by providing rat- ings and/or comments. Such explicit and implicit feedback by the users provide rich infor- mation about both users and venues, and thus can be leveraged to study the users’ movement in urban cities, as well as enhance the quality of personalised venue recommendations. Un- like traditional recommendation systems (e.g. book and movie recommendation systems), making effective venue recommendations is more challenging because we need to take into account the users’ current context (e.g. time of the day, user’s current location as well as his recently visited venues). Two common techniques that are widely used in the literature for venue recommen- dation systems are Matrix Factorisation (MF) and Bayesian Personalised Ranking (BPR). MF is a popular Collaborative Filtering (CF) technique that can leverage the users’ explicit feedback (e.g. the numerical ratings) to predict the users’ ratings on the venues and hence relevant venues can be suggested to the users based on these predicted ratings. On the other hand, BPR is a pairwise ranking-based model that can leverage implicit feedback to generate effective top-K venue recommendations. In this thesis, based upon MF and BPR models, we aim to generate effective context-aware venue recommendation that a user may wish to visit based on the user’s historical explicit and implicit feedbacks, the user’s contextual informa- tion (e.g. the user’s current location and time of the day) and additional information (e.g. the geographical location of venues and users’ social relationships). To achieve this goal, we need to address the following challenges: namely (C1) modelling the users’ preferences and the characteristic of venues, (C2) capturing the complex structure of user-venue inter- actions in a Collaborative Filtering manner, (C3) modelling the users’ short-term (dynamic) preferences from the sequential order of user’s observed feedback as well as the contextual information associated with the successive feedback, (C4) generating accurate top-K venue recommendations based on the users’ preferences using a pairwise ranking-based model and (C5) appropriately sampling potential negative instances to train a ranking-based model. First, to address challenge C1, we leverage the users’ explicit feedback (e.g. their rat- ings and the textual content of the comments) and additional information (e.g. users’ social relationships) to effectively model the users’ preferences and the characteristics of venues. In particular, we propose a novel regularisation technique and a factorisation-based model that leverages the users’ explicit feedback and the additional information to improve the rat- ing prediction accuracy of the traditional MF model. Experiments conducted on a large scale rating dataset on LBSN demonstrate that the textual content of comments plays an important role in enhancing the accuracy of rating prediction. Second, we investigate how to leverage the users’ implicit feedback and additional in- formation such as the users’ social relationship and the geographical location of venues to improve the quality of top-K venue recommendations. We argue that the potential negative instances can be effectively sampled based on the social correlations between users and their friends as well as the geographical influences between the users’ and venues’ geographi- cal location. In particular, to address challenges C4 and C5, we propose a novel pairwise ranking-based framework for top-K venue recommendations that can incorporate multiple sources of additional information (e.g. the users’ social relationship and the geographical location of venues) to effectively sample the potential negative instances. Experimental re- sults on three large scale checkin and rating datasets from LBSNs demonstrate that the social correlations and the geographical influences play an important role to the quality of sampled negative instances and hence can improve the quality of top-K venue recommendations. Finally, to address challenges C2 and C3, we propose a framework for context-aware venue recommendations that exploits Deep Neural Network (DNN) models to effectively capture the complex structure of user-venue interactions and the users’ long-term (dynamic) preferences from their sequential order of checkins. In particular, within the framework, we propose a novel Recurrent Neural Network (RNN) architecture that can effectively in- corporate the contextual information associated with the successive implicit feedback (e.g. the time interval and the geographical distance between two successive checkins) to gener- ate high quality context-aware venue recommendations. Experimental results on three large scale checkin and rating datasets from LBSNs demonstrate the effectiveness and robustness of our proposed framework for context-aware venue recommendations. In particular, the results demonstrate that the sequential order of users’ implicit feedback can be leveraged to effectively improve the effectiveness of context-aware venue recommendation system. In addition, the time intervals and the geographical distances between two successive checkins play an important role in capturing the users’ short-term preferences

Glasgow Theses Service