2 research outputs found

    Data Mining Techniques for Complex User-Generated Data

    Get PDF
    Nowadays, the amount of collected information is continuously growing in a variety of different domains. Data mining techniques are powerful instruments to effectively analyze these large data collections and extract hidden and useful knowledge. Vast amount of User-Generated Data (UGD) is being created every day, such as user behavior, user-generated content, user exploitation of available services and user mobility in different domains. Some common critical issues arise for the UGD analysis process such as the large dataset cardinality and dimensionality, the variable data distribution and inherent sparseness, and the heterogeneous data to model the different facets of the targeted domain. Consequently, the extraction of useful knowledge from such data collections is a challenging task, and proper data mining solutions should be devised for the problem under analysis. In this thesis work, we focus on the design and development of innovative solutions to support data mining activities over User-Generated Data characterised by different critical issues, via the integration of different data mining techniques in a unified frame- work. Real datasets coming from three example domains characterized by the above critical issues are considered as reference cases, i.e., health care, social network, and ur- ban environment domains. Experimental results show the effectiveness of the proposed approaches to discover useful knowledge from different domains

    Predicting cardiopulmonary response to incremental exercise test

    No full text
    Cardiopulmonary exercise testing is a non-invasive method widely used to monitor various physiological signals, describing the cardiac and respiratory response of the patient to increasing workload. Since this method is physically very demanding, innovative data analysis techniques are needed to predict patient response thus lowering body stress and avoiding cardiopulmonary overload. This paper proposes the Cardiopulmonary Response Prediction (CRP) framework for early predicting the physiological signal values that can be reached during an incremental exercise test. The learning phase creates different models tailored to specific conditions (i.e., single test and multiple-test models). Each model can be exploited in the real-time stream prediction phase to periodically predict, during the test execution, signal values achievable by the patient. Experimental results on a real dataset showed that CRP prediction is performed with a limited and acceptable erro
    corecore