299,193 research outputs found

    Multi-node approach for map data processing

    Get PDF
    OpenStreetMap (OSM) is a popular collaborative open-source project that offers free editable map across the whole world. However, this data often needs a further on-purpose processing to become the utmost valuable information to work with. That is why the main motivation of this paper is to propose a design for big data processing along with data mining leading to the obtaining of statistics with a focus on the detail of a traffic data as a result in order to create graphs representing a road network. To ensure our High-Performance Computing (HPC) platform routing algorithms work correctly, it is absolutely essential to prepare OSM data to be useful and applicable for above-mentioned graph, and to store this persistent data in both spatial database and HDF5 format.Web of Science8971049

    Integrating big data into a sustainable mobility policy 2.0 planning support system

    Get PDF
    It is estimated that each of us, on a daily basis, produces a bit more than 1 GB of digital content through our mobile phone and social networks activities, bank card payments, location-based positioning information, online activities, etc. However, the implementation of these large data amounts in city assets planning systems still remains a rather abstract idea for several reasons, including the fact that practical examples are still very strongly services-oriented, and are a largely unexplored and interdisciplinary field; hence, missing the cross-cutting dimension. In this paper, we describe the Policy 2.0 concept and integrate user generated content into Policy 2.0 platform for sustainable mobility planning. By means of a real-life example, we demonstrate the applicability of such a big data integration approach to smart cities planning process. Observed benefits range from improved timeliness of the data and reduced duration of the planning cycle to more informed and agile decision making, on both the citizens and the city planners end. The integration of big data into the planning process, at this stage, does not have uniform impact across all levels of decision making and planning process, therefore it should be performed gradually and with full awareness of existing limitations

    How do top- and bottom-performing companies differ in using business analytics?

    Get PDF
    Purpose Business analytics (BA) has attracted growing attention mainly due to the phenomena of big data. While studies suggest that BA positively affects organizational performance, there is a lack of academic research. The purpose of this paper, therefore, is to examine the extent to which top- and bottom-performing companies differ regarding their use and organizational facilitation of BA. Design/methodology/approach Hypotheses are developed drawing on the information processing view and contingency theory, and tested using multivariate analysis of variance to analyze data collected from 117 UK manufacture companies. Findings Top- and bottom-performing companies differ significantly in their use of BA, data-driven environment, and level of fit between BA and data-drain environment. Practical implications Extensive use of BA and data-driven decisions will lead to superior firm performance. Companies wishing to use BA to improve decision making and performance need to develop relevant analytical strategy to guide BA activities and design its structure and business processes to embed BA activities. Originality/value This study provides useful management insights into the effective use of BA for improving organizational performance

    Distributed Holistic Clustering on Linked Data

    Full text link
    Link discovery is an active field of research to support data integration in the Web of Data. Due to the huge size and number of available data sources, efficient and effective link discovery is a very challenging task. Common pairwise link discovery approaches do not scale to many sources with very large entity sets. We here propose a distributed holistic approach to link many data sources based on a clustering of entities that represent the same real-world object. Our clustering approach provides a compact and fused representation of entities, and can identify errors in existing links as well as many new links. We support a distributed execution of the clustering approach to achieve faster execution times and scalability for large real-world data sets. We provide a novel gold standard for multi-source clustering, and evaluate our methods with respect to effectiveness and efficiency for large data sets from the geographic and music domains
    corecore