240,222 research outputs found

    Lustre, Hadoop, Accumulo

    Full text link
    Data processing systems impose multiple views on data as it is processed by the system. These views include spreadsheets, databases, matrices, and graphs. There are a wide variety of technologies that can be used to store and process data through these different steps. The Lustre parallel file system, the Hadoop distributed file system, and the Accumulo database are all designed to address the largest and the most challenging data storage problems. There have been many ad-hoc comparisons of these technologies. This paper describes the foundational principles of each technology, provides simple models for assessing their capabilities, and compares the various technologies on a hypothetical common cluster. These comparisons indicate that Lustre provides 2x more storage capacity, is less likely to loose data during 3 simultaneous drive failures, and provides higher bandwidth on general purpose workloads. Hadoop can provide 4x greater read bandwidth on special purpose workloads. Accumulo provides 10,000x lower latency on random lookups than either Lustre or Hadoop but Accumulo's bulk bandwidth is 10x less. Significant recent work has been done to enable mix-and-match solutions that allow Lustre, Hadoop, and Accumulo to be combined in different ways.Comment: 6 pages; accepted to IEEE High Performance Extreme Computing conference, Waltham, MA, 201

    Geo-Spotting: Mining Online Location-based Services for Optimal Retail Store Placement

    Full text link
    The problem of identifying the optimal location for a new retail store has been the focus of past research, especially in the field of land economy, due to its importance in the success of a business. Traditional approaches to the problem have factored in demographics, revenue and aggregated human flow statistics from nearby or remote areas. However, the acquisition of relevant data is usually expensive. With the growth of location-based social networks, fine grained data describing user mobility and popularity of places has recently become attainable. In this paper we study the predictive power of various machine learning features on the popularity of retail stores in the city through the use of a dataset collected from Foursquare in New York. The features we mine are based on two general signals: geographic, where features are formulated according to the types and density of nearby places, and user mobility, which includes transitions between venues or the incoming flow of mobile users from distant areas. Our evaluation suggests that the best performing features are common across the three different commercial chains considered in the analysis, although variations may exist too, as explained by heterogeneities in the way retail facilities attract users. We also show that performance improves significantly when combining multiple features in supervised learning algorithms, suggesting that the retail success of a business may depend on multiple factors.Comment: Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, Chicago, 2013, Pages 793-80

    Impact of Performance and Expressiveness Value of Store Service Quality on the Mediating Role of Satisfaction with Store

    Get PDF
    This study explores the extent to which store service attributes having appeal for consumer self-image impacts store satisfaction and patronage intentions and discovers that this "expressiveness" value has significant associations with both. By using the adapted RSQS for measuring service quality in the Indian appare! retail context, this paper finds that service expressiveness value is distinct from the performance value obtained from service delivery. This paper provides empirical evidence that the mediation effect of satisfaction varies depending on consumer perceived value from service and that it is neither as universal nor as strong as retailers and researchers lend to believe.

    Ensemble evaluation of hydrological model hypotheses

    Get PDF
    It is demonstrated for the first time how model parameter, structural and data uncertainties can be accounted for explicitly and simultaneously within the Generalized Likelihood Uncertainty Estimation (GLUE) methodology. As an example application, 72 variants of a single soil moisture accounting store are tested as simplified hypotheses of runoff generation at six experimental grassland field-scale lysimeters through model rejection and a novel diagnostic scheme. The fields, designed as replicates, exhibit different hydrological behaviors which yield different model performances. For fields with low initial discharge levels at the beginning of events, the conceptual stores considered reach their limit of applicability. Conversely, one of the fields yielding more discharge than the others, but having larger data gaps, allows for greater flexibility in the choice of model structures. As a model learning exercise, the study points to a “leaking” of the fields not evident from previous field experiments. It is discussed how understanding observational uncertainties and incorporating these into model diagnostics can help appreciate the scale of model structural error

    A decision-making approach for investigating the potential effects of near sourcing on supply chain

    Get PDF
    Purpose - Near sourcing is starting to be regarded as a valid alternative to global sourcing in order to leverage supply chain (SC) responsiveness and economic efficiency. The present work proposes a decision-making approach developed in collaboration with a leading Italian retailer that was willing to turn the global store furniture procurement process into near sourcing. Design/methodology/approach - Action research is employed. The limitations of the traditional SC organisation and purchasing process of the company are first identified. On such basis, an inventory management model is applied to run spreadsheet estimates where different purchasing and SC management strategies are adopted to determine the solution providing the lowest cost performance. Finally, a risk analysis of the selected best SC arrangement is conducted and results are discussed. Findings - Switching from East Asian suppliers to continental vendors enables a SC reengineering that increases flexibility and responsiveness to demand uncertainty which, together with decreased transportation costs, assures economic viability, thus proving the benefits of near sourcing. Research limitations/implications - The decision-making framework provides a methodological roadmap to address the comparison between near and global sourcing policies and to calculate the savings of the former against the latter. The approach could include additional organisational aspects and cost categories impacting on near sourcing and could be adapted to investigate different products, services, and business sectors. Originality/value - The work provides SC researchers and practitioners with a structured approach for understanding what drives companies to adopt near sourcing and for quantitatively assessing its advantage

    Notions of Knowledge Management

    Get PDF
    {Excerpt} Knowledge management is getting the right knowledge to the right people at the right time, and helping them (with incentives) to apply it in ways that strive to improve organizational performance. Data are facts, and information is interpreted data. Knowledge is created and organized by flows of information, shaped by their holder. It is tacit or explicit. Tacit knowledge is nonverbalized, intuitive, and unarticulated knowledge that people carry in their heads. It is hard to formalize and communicate because it is rooted in skills, experiences, insight, intuition, and judgment, but it can be shared in discussion, storytelling, and personal interactions. It has a technical dimension, which encompasses skills and capabilities referred to as know-how. It has a cognitive dimension, which consists of beliefs, ideals,values, schemata, or mental models. Explicit knowledge is codified knowledge that can be expressed in writing, drawings, or computer programs, for example, and transmitted in various forms. Tacit knowledge and explicit knowledge are mutually complementary forms of meaning
    • 

    corecore