78 research outputs found

    Forecasting Player Behavioral Data and Simulating in-Game Events

    Full text link
    Understanding player behavior is fundamental in game data science. Video games evolve as players interact with the game, so being able to foresee player experience would help to ensure a successful game development. In particular, game developers need to evaluate beforehand the impact of in-game events. Simulation optimization of these events is crucial to increase player engagement and maximize monetization. We present an experimental analysis of several methods to forecast game-related variables, with two main aims: to obtain accurate predictions of in-app purchases and playtime in an operational production environment, and to perform simulations of in-game events in order to maximize sales and playtime. Our ultimate purpose is to take a step towards the data-driven development of games. The results suggest that, even though the performance of traditional approaches such as ARIMA is still better, the outcomes of state-of-the-art techniques like deep learning are promising. Deep learning comes up as a well-suited general model that could be used to forecast a variety of time series with different dynamic behaviors

    Crop protection from animals based on machine learning

    Get PDF
    This paper will discuss the present level of research and development on human-wildlife conflicts, there are bad interactions between humans and wild animals that are bad for both the resources of the human population as well as the ecosystems of wildlife. Animal detection has an impact on both human food security and animal welfare because it arises from the conflict between humans and wildlife over natural resources. As the population has grown and many communities' land use patterns have changed, these disputes have become more prevalent in recent years. The Mobile Net SSD type is used in this automatic intrusion and deterrent system for enhanced performance. When a dangerous animal is discovered, the system emits an alarm sound and notifies the relevant authorities, alerting them to the discovery. It is more human- friendly due to the quick detection process, and it is more animal-friendly due to the gentle repulsive process

    A text segmentation approach for automated annotation of online customer reviews, based on topic modeling

    Full text link
    Online customer review classification and analysis have been recognized as an important problem in many domains, such as business intelligence, marketing, and e-governance. To solve this problem, a variety of machine learning methods was developed in the past decade. Existing methods, however, either rely on human labeling or have high computing cost, or both. This makes them a poor fit to deal with dynamic and ever-growing collections of short but semantically noisy texts of customer reviews. In the present study, the problem of multi-topic online review clustering is addressed by generating high quality bronze-standard labeled sets for training efficient classifier models. A novel unsupervised algorithm is developed to break reviews into sequential semantically homogeneous segments. Segment data is then used to fine-tune a Latent Dirichlet Allocation (LDA) model obtained for the reviews, and to classify them along categories detected through topic modeling. After testing the segmentation algorithm on a benchmark text collection, it was successfully applied in a case study of tourism review classification. In all experiments conducted, the proposed approach produced results similar to or better than baseline methods. The paper critically discusses the main findings and paves ways for future work

    Development of a Common Framework for Analysing Public Transport Smart Card Data

    Get PDF
    The data generated in public transport systems have proven to be of great importance in improving knowledge of public transport systems, being very valuable in promoting the sustainability of public transport through rational management. However, the analysis of this data involves numerous tasks, so that when the value of analysing the data is finally verified, the effort has already been very great. The management and analysis of the collected data face some difficulties. This is the case of the data collected by the current automated fare collection systems. These systems do not follow any open standards and are not usually designed with a multipurpose nature, so they do not facilitate the data analysis workflow (i.e., acquisition, storage, quality control, integration and quantitative analysis). Intending to reduce this workload, we propose a conceptual framework for analysing data from automated fare collection systems in mobility studies. The main components of this framework are (1) a simple data model, (2) scripts for creating and querying the database and (3) a system for reusing the most useful queries. This framework has been tested in a real public transport consortium in a Spanish region shaped by tourism. The outcomes of this research work could be reused and applied, with a lower initial effort, in other areas that have data recorded by an automated fare collection system but are not sure if it is worth investing in exploiting the data. After this experience, we consider that, even with the legal limitations applicable to the analysis of this type of data, the use of open standards by automated fare collection systems would facilitate the use of this type of data to its full potential. Meanwhile, the use of a common framework may be enough to start analysing the data

    The Winning Solution to the IEEE CIG 2017 Game Data Mining Competition

    Full text link
    Machine learning competitions such as those organized by Kaggle or KDD represent a useful benchmark for data science research. In this work, we present our winning solution to the Game Data Mining competition hosted at the 2017 IEEE Conference on Computational Intelligence and Games (CIG 2017). The contest consisted of two tracks, and participants (more than 250, belonging to both industry and academia) were to predict which players would stop playing the game, as well as their remaining lifetime. The data were provided by a major worldwide video game company, NCSoft, and came from their successful massively multiplayer online game Blade and Soul. Here, we describe the long short-term memory approach and conditional inference survival ensemble model that made us win both tracks of the contest, as well as the validation procedure that we followed in order to prevent overfitting. In particular, choosing a survival method able to deal with censored data was crucial to accurately predict the moment in which each player would leave the game, as censoring is inherent in churn. The selected models proved to be robust against evolving conditions---since there was a change in the business model of the game (from subscription-based to free-to-play) between the two sample datasets provided---and efficient in terms of time cost. Thanks to these features and also to their a ability to scale to large datasets, our models could be readily implemented in real business settings

    Video Image Enhancement and Machine Learning Pipeline for Underwater Animal Detection and Classification at Cabled Observatories

    Get PDF
    Corrección de una afiliación en Sensors 2023, 23, 16. https://doi.org/10.3390/s23010016An understanding of marine ecosystems and their biodiversity is relevant to sustainable use of the goods and services they offer. Since marine areas host complex ecosystems, it is important to develop spatially widespread monitoring networks capable of providing large amounts of multiparametric information, encompassing both biotic and abiotic variables, and describing the ecological dynamics of the observed species. In this context, imaging devices are valuable tools that complement other biological and oceanographic monitoring devices. Nevertheless, large amounts of images or movies cannot all be manually processed, and autonomous routines for recognizing the relevant content, classification, and tagging are urgently needed. In this work, we propose a pipeline for the analysis of visual data that integrates video/image annotation tools for defining, training, and validation of datasets with video/image enhancement and machine and deep learning approaches. Such a pipeline is required to achieve good performance in the recognition and classification tasks of mobile and sessile megafauna, in order to obtain integrated information on spatial distribution and temporal dynamics. A prototype implementation of the analysis pipeline is provided in the context of deep-sea videos taken by one of the fixed cameras at the LoVe Ocean Observatory network of Lofoten Islands (Norway) at 260 m depth, in the Barents Sea, which has shown good classification results on an independent test dataset with an accuracy value of 76.18% and an area under the curve (AUC) value of 87.59%.This work was developed within the framework of the Tecnoterra (ICM-CSIC/UPC) and the following project activities: ARIM (Autonomous Robotic Sea-Floor Infrastructure for Benthopelagic Monitoring; MarTERA ERA-Net Cofound) and RESBIO (TEC2017-87861-R; Ministerio de Ciencia, Innovación y Universidades)

    Exploring mobile network data for tourism statistics: the collaboration between Istat and Vodafone Business Italia

    Get PDF
    The paper describes the collaboration between Istat and Vodafone Business Italia to innovate and enhance tourism statistics. The common goal is to evaluate the potential uses of mobile phone data in current surveys and to investigate new outputs for official statistics, such as visiting routes and means of transport. The analysis concerned inbound tourism (foreigners in Italy), domestic tourism (Italians in Italy), and outbound tourism (Italians abroad). The work presents analyses and results for the Province of Rimini and the Municipality of Roma, referred to August 2019/2020 and April 2020, and a trial of the use of the “Welcome SMS” for the estimate of the residents in Italy who travel to foreign countries. Phone data required specific treatments to meet the definitions of official statistics. Some aspects related to the location and definition of overnight stays will require further investigation

    Exploring mobile network data for tourism statistics: the collaboration between Istat and Vodafone Business Italia

    Get PDF
    The paper describes the collaboration between Istat and Vodafone Business Italia to innovate and enhance tourism statistics. The common goal is to evaluate the potential uses of mobile phone data in current surveys and to investigate new outputs for official statistics, such as visiting routes and means of transport. The analysis concerned inbound tourism (foreigners in Italy), domestic tourism (Italians in Italy), and outbound tourism (Italians abroad). The work presents analyses and results for the Province of Rimini and the Municipality of Roma, referred to August 2019/2020 and April 2020, and a trial of the use of the “Welcome SMS” for the estimate of the residents in Italy who travel to foreign countries. Phone data required specific treatments to meet the definitions of official statistics. Some aspects related to the location and definition of overnight stays will require further investigation
    corecore