20 research outputs found

    An Efficient Approach of Discovery of Frequent Data Set from Big Operational Database

    Get PDF
    Currently in real world scenario data uncertainty is the most major issue in the real time applications where these data are generated from various devices daily from various users. So, the important part is to find the important data from them. In this paper, we propose to measure pattern frequentness based on the various possible world semantics. We are looking to establish two uncertain sequence data models abstracted from many real-life applications involving uncertain sequence data, and based on that formulate the problem of mining probabilistically frequent sequential patterns (or p-FSPs) from data that conform to our models. By using the projection strategy of famous prefixspan algorithm, we are looking to develop an algorithm called U-PrefixSpan for probabilistically frequent sequential pattern mining. UPrefixSpan avoids the problem of “possible world explosion” and when combined with pruning techniques and one validating technique achieves good performance. Theoretically study and analysis shows that our work proposed do the better with compare to existing system. DOI: 10.17762/ijritcc2321-8169.15078

    FP-Growth Tree Based Algorithms Analysis: CP-Tree and K Map

    Get PDF
    We propose a novel frequent-pattern tree (FP-tree) structure; our performance study shows that the FP-growth method is efficient and scalable for mining both long and short frequent patterns, and is about an order of magnitude faster than the Apriori algorithm and also faster than some recently reported new frequent-pattern mining methods. FP-tree method is efficient algorithm in association mining to mine frequent patterns in data mining, in spite of long or short frequent data patterns. By using compact best tree structure and partitioning-based and divide-and-conquer data mining searching method, it can be reduces the costs searchsubstantially .it just as the analysis multi-CPU or reduce computer memory to solve problem. But this approach can be apparently decrease the costs for exchanging and combining control information and the algorithm complexity is also greatly decreased, solve this problem efficiently. Even if main adopting multi-CPU technique, raising the requirement is basically hardware, best performanceimprovement is still to be limited. Is there any other way that most one may it can reduce these costs in FP-tree construction, performance best improvement is still limited

    Digitizing Offline Shopping Behavior Towards Mobile Marketing

    Get PDF
    The proliferation of mobile technologies makes it possible for mobile advertisers to go beyond the real-time snapshot of the static location and contextual information about consumers. In this study, we propose a novel mobile advertising strategy that leverages full information on consumers’ offline moving trajectories. To evaluate the effectiveness of this strategy, we design a large-scale randomized field experiment in a large shopping mall in Asia based on 83,370 unique user responses for two weeks in 2014. We found the new mobile trajectory-based advertising is significantly more effective for focal advertising store compared to several existing baselines. It is especially effective in attracting high-income consumers. Interestingly, it becomes less effective during the weekend. This indicates closely targeted mobile ads may constrict consumer focus and significantly reduce the impulsive purchase behavior. Our finding suggests marketers should carefully design mobile advertising strategy, depending on different business contexts

    Spatial Data Quality in the IoT Era:Management and Exploitation

    Get PDF
    Within the rapidly expanding Internet of Things (IoT), growing amounts of spatially referenced data are being generated. Due to the dynamic, decentralized, and heterogeneous nature of the IoT, spatial IoT data (SID) quality has attracted considerable attention in academia and industry. How to invent and use technologies for managing spatial data quality and exploiting low-quality spatial data are key challenges in the IoT. In this tutorial, we highlight the SID consumption requirements in applications and offer an overview of spatial data quality in the IoT setting. In addition, we review pertinent technologies for quality management and low-quality data exploitation, and we identify trends and future directions for quality-aware SID management and utilization. The tutorial aims to not only help researchers and practitioners to better comprehend SID quality challenges and solutions, but also offer insights that may enable innovative research and applications

    A conceptual framework for developing dashboards for big mobility data

    Full text link
    Dashboards are an increasingly popular form of data visualization. Large, complex, and dynamic mobility data present a number of challenges in dashboard design. The overall aim for dashboard design is to improve information communication and decision making, though big mobility data in particular require considering privacy alongside size and complexity. Taking these issues into account, a gap remains between wrangling mobility data and developing meaningful dashboard output. Therefore, there is a need for a framework that bridges this gap to support the mobility dashboard development and design process. In this paper we outline a conceptual framework for mobility data dashboards that provides guidance for the development process while considering mobility data structure, volume, complexity, varied application contexts, and privacy constraints. We illustrate the proposed framework’s components and process using example mobility dashboards with varied inputs, end-users and objectives. Overall, the framework offers a basis for developers to understand how informational displays of big mobility data are determined by end-user needs as well as the types of data selection, transformation, and display available to particular mobility datasets

    How you move reveals who you are: understanding human behavior by analyzing trajectory data

    Get PDF
    The widespread use of mobile devices is producing a huge amount of trajectory data, making the discovery of movement patterns possible, which are crucial for understanding human behavior. Significant advances have been made with regard to knowledge discovery, but the process now needs to be extended bearing in mind the emerging field of behavior informatics. This paper describes the formalization of a semantic-enriched KDD process for supporting meaningful pattern interpretations of human behavior. Our approach is based on the integration of inductive reasoning (movement pattern discovery) and deductive reasoning (human behavior inference). We describe the implemented Athena system, which supports such a process, along with the experimental results on two different application domains related to traffic and recreation management
    corecore