1,928 research outputs found
Big data analytics:Computational intelligence techniques and application areas
Big Data has significant impact in developing functional smart cities and supporting modern societies. In this paper, we investigate the importance of Big Data in modern life and economy, and discuss challenges arising from Big Data utilization. Different computational intelligence techniques have been considered as tools for Big Data analytics. We also explore the powerful combination of Big Data and Computational Intelligence (CI) and identify a number of areas, where novel applications in real world smart city problems can be developed by utilizing these powerful tools and techniques. We present a case study for intelligent transportation in the context of a smart city, and a novel data modelling methodology based on a biologically inspired universal generative modelling approach called Hierarchical Spatial-Temporal State Machine (HSTSM). We further discuss various implications of policy, protection, valuation and commercialization related to Big Data, its applications and deployment
Electricity clustering framework for automatic classification of customer loads
Clustering in energy markets is a top topic with high significance on expert and intelligent systems. The main impact of is paper is the proposal of a new clustering framework for the automatic classification of electricity customers’ loads. An automatic selection of the clustering classification algorithm is also highlighted. Finally, new customers can be assigned to a predefined set of clusters in the classificationphase. The computation time of the proposed framework is less than that of previous classification tech- niques, which enables the processing of a complete electric company sample in a matter of minutes on a personal computer. The high accuracy of the predicted classification results verifies the performance of the clustering technique. This classification phase is of significant assistance in interpreting the results, and the simplicity of the clustering phase is sufficient to demonstrate the quality of the complete mining framework.Ministerio de Economía y Competitividad TEC2013-40767-RMinisterio de Economía y Competitividad IDI- 2015004
Deep Generative Models for Reject Inference in Credit Scoring
Credit scoring models based on accepted applications may be biased and their
consequences can have a statistical and economic impact. Reject inference is
the process of attempting to infer the creditworthiness status of the rejected
applications. In this research, we use deep generative models to develop two
new semi-supervised Bayesian models for reject inference in credit scoring, in
which we model the data generating process to be dependent on a Gaussian
mixture. The goal is to improve the classification accuracy in credit scoring
models by adding reject applications. Our proposed models infer the unknown
creditworthiness of the rejected applications by exact enumeration of the two
possible outcomes of the loan (default or non-default). The efficient
stochastic gradient optimization technique used in deep generative models makes
our models suitable for large data sets. Finally, the experiments in this
research show that our proposed models perform better than classical and
alternative machine learning models for reject inference in credit scoring
Recommended from our members
Identifying residential consumption patterns using data-mining techniques: A large-scale study of smart meter data in Chengdu, China
The fine-grained electricity consumption data created by advanced metering technologies offers an opportunity to understand residential demand from new angles. Although there exists a large body of research on demand response in short- and long-term forecasting, a comprehensive analysis to identify household consumption behaviour in different scenarios has not been conducted. The study’s novelty lies in its use of unsupervised machine learning tools to explore residential customers’ demand patterns and response without the assistance of traditional survey tools. We investigate behavioural response in three different contexts: 1) seasonal (using weekly consumption profiles); 2) holidays/festivals; and 3) extreme weather situations. The analysis is based on the smart metering data of 2,000 households in Chengdu, China over three years from 2014 to 2016. Workday/weekend profiles indicate that there are two distinct groups of households that appear to be white-collar or relatively affluent families. Demand patterns at the major festivals in China, especially the Spring Festival, reveal various types of lifestyle and households. In terms of extreme weather response, the most striking finding was that in summer, at night-time, over 72% of households doubled (or more) their electricity usage, while consumption changes in winter do not seem to be significant. Our research offers more detailed insight into Chinese residential consumption and provides a practical framework to understand households’ behaviour patterns in different settings
Recommended from our members
An Overview of the Use of Neural Networks for Data Mining Tasks
In the recent years the area of data mining has experienced a considerable demand for technologies that extract knowledge from large and complex data sources. There is a substantial commercial interest as well as research investigations in the area that aim to develop new and improved approaches for extracting information, relationships, and patterns from datasets. Artificial Neural Networks (NN) are popular biologically inspired intelligent methodologies, whose classification, prediction and pattern recognition capabilities have been utilised successfully in many areas, including science, engineering, medicine, business, banking, telecommunication, and many other fields. This paper highlights from a data mining perspective the implementation of NN, using supervised and unsupervised learning, for pattern recognition, classification, prediction and cluster analysis, and focuses the discussion on their usage in bioinformatics and financial data analysis tasks
Modified Kohonen network algorithm for selection of the initial centres of Gustafson-Kessel algorithm in credit scoring
Credit risk assessment has become an important topic in financial risk administration. Fuzzy clustering analysis has been applied in credit scoring. Gustafson-Kessel (GK) algorithm has been utilised to cluster creditworthy customers as against non-creditworthy ones. A good clustering analysis implemented by good Initial Centres of clusters should be selected. To overcome this problem of Gustafson-Kessel (GK) algorithm, we proposed a modified version of Kohonen Network (KN) algorithm to select the initial centres. Utilising similar degree between points to get similarity density, and then by means of maximum density points selecting; the modified Kohonen Network method generate clustering initial centres to get more reasonable clustering results. The comparative was conducted using three credit scoring datasets: Australian, German and Taiwan. Internal and external indexes of validity clustering are computed and the proposed method was found to have the best performance in these three data sets
Behavioural patterns in aggregated demand response developments for communities targeting renewables
Encouraging consumers to embrace renewable energies and energy-efficient technologies is at stake, and so the energy players such as utilities and policy-makers are opening up a range of new value propositions towards more sustainable communities. For instance, developments of turn-key demand response aggregation and optimisation of distributed loads are rapidly emerging across the globe in a variety of business models focused on maximising the inherent flexibility and diversity of the behind-the-meter assets. However, even though these developments" added value is understood and of wide interest, measurement of the desired levels of consumer engagement is still on demonstration stages and assessment of technology readiness. In this paper, we analyse the characteristics of the loads, the behaviour of parameters, and in a final extent, the behaviour of each kind of consumer participating in aggregated demand scheduling. We apply both non-automatic and machine learning methods to extract the relevant factors and to recognise the potential consumer behaviour on a series of scenarios that are drawn using both synthetic data and living labs datasets. Our experimentation showcases a number of three patterns in which factors like the community"s demand volume and the consumer"s flexibility dominate and impact the performance of the tested development. The experimentation also makes current limitations arise within the existing electricity consumption datasets and their potential for inference and forecasting demand flexibility analytics.Comunidad de Madri
- …