5,965 research outputs found
Improving Knowledge-Based Systems with statistical techniques, text mining, and neural networks for non-technical loss detection
Currently, power distribution companies have several problems that are related to energy losses. For
example, the energy used might not be billed due to illegal manipulation or a breakdown in the customer’s
measurement equipment. These types of losses are called non-technical losses (NTLs), and these
losses are usually greater than the losses that are due to the distribution infrastructure (technical losses).
Traditionally, a large number of studies have used data mining to detect NTLs, but to the best of our
knowledge, there are no studies that involve the use of a Knowledge-Based System (KBS) that is created
based on the knowledge and expertise of the inspectors. In the present study, a KBS was built that is
based on the knowledge and expertise of the inspectors and that uses text mining, neural networks,
and statistical techniques for the detection of NTLs. Text mining, neural networks, and statistical techniques
were used to extract information from samples, and this information was translated into rules,
which were joined to the rules that were generated by the knowledge of the inspectors. This system
was tested with real samples that were extracted from Endesa databases. Endesa is one of the most
important distribution companies in Spain, and it plays an important role in international markets in
both Europe and South America, having more than 73 million customers
Detection of Non-Technical Losses: The Project MIDAS
The MIDAS project began in 2006 as collaboration between Endesa, Sadiel, and the University of
Seville. The objective of the MIDAS project is the detection of Non-Technical Losses (NTLs) on power
utilities. The NTLs represent the non-billed energy due to faults or illegal manipulations in clients’ fa cilities. Initially, research lines study the application of techniques of data mining and neural networks.
After several researches, the studies are expanded to other research fields: expert systems, text mining,
statistical techniques, pattern recognition, etc. These techniques have provided an automated system
for detection of NTLs on company databases. This system is in the test phase, and it is applied in real
cases in company databases
Review of Non-Technical Losses Identification Techniques
Illegally consumption of electric power, termed as non-technical losses for the distribution companies is one of the dominant factors all over the world for many years. Although there are some conventional methods to identify these irregularities, such as physical inspection of meters at the consumer premises etc, but it requires large number of manpower and time; then also it does not seem to be adequate. Now a days there are various methods and algorithms have been developed that are proposed in different research papers, to detect non-technical losses. In this paper these methods are reviewed, their important features are highlighted and also the limitations are identified. Finally, the qualitative comparison of various non-technical losses identification algorithms is presented based on their performance, costs, data handling, quality control and execution times. It can be concluded that the graph-based classifier, Optimum-Path Forest algorithm that have both supervised and unsupervised variants, yields the most accurate result to detect non-technical losses
A real application on non-technical losses detection: the MIDAS Project
The MIDAS project began at 2006 as collaboration between Endesa, Sadiel and the University of Seville. The objective of the MIDAS project is the detection of Non-Technical Losses (NTLs) on power utilities. The NTLs represent the non-billed energy due to faults or illegal manipulations in clients’ facilities. Initially, research lines study the application of techniques of data mining and neural networks. After several researches, the studies are expanded to other research fields: expert systems, text mining, statistical techniques, pattern recognition, etc. These techniques have provided an automated system for detection of NTLs on company databases. This system is in test phase and it is applied in real cases in company databases
SmartFD: A Real Big Data Application for Electrical Fraud Detection
The main objective of this paper is the application of big
data analytics to a real case in the field of smart electric networks. Smart
meters are not only elements to measure consumption, but they also con stitute a network of millions of sensors in the electricity network. These
sensors provide a huge amount of data that, once analyzed, can lead to
significant advances for the society. In this way, tools are being developed
in order to reach certain goals, such as obtaining a better consumption
estimation (which would imply a better production planning), finding
better rates based on the time discrimination or the contracted power,
or minimizing the non-technical losses in the network, whose actual costs
are eventually paid by end-consumers, among others. In this work, real
data from Spanish consumers have been analyzed to detect fraud in con sumption. First, 1 TB of raw data was preprocessed in a HDFS-Spark
infrastructure. Second, data duplication and outliers were removed, and
missing values handled with specific big data algorithms. Third, cus tomers were characterized by means of clustering techniques in different
scenarios. Finally, several key factors in fraud consumption were found.
Very promising results were achieved, verging on 80% accuracyMinisterio de Economía y Competitividad TIN2014-55894-C2-RMinisterio de Economía y Competitividad TIN2017-88209-C2-
Analyzing the Production and Use of Fossil Fuels: A Case for Data Mining and GIS
As technology progresses and data grows both larger and more complex, techniques are being developed to keep up with the exponential growth of information. The term “data mining” is a blanket term used to describe an approach to find anomalies and correlations in a large dataset. This approach involves leveraging data mining software to manipulate and prepare data, apply statistics to quantify trends and characteristics in the data from a high level, and potentially apply advanced techniques like machine learning to identify patterns that wouldn’t be apparent otherwise. In this case study, data mining aided a GIS in displaying substantial amounts of oil, gas, and coal data to make observations regarding two groups: OPEC and the largest non-OPEC fossil fuel producers from 1980 to 2020. To make more sophisticated observations and apply additional context to the trends observed in the data, populations and GDP data for the same period were included in the analysis to enrich the hydrocarbon production and consumption data and to help explain how these valuable resources are traded and consumed. This case study will apply appropriate data mining methods to feed data to a GIS and showcase trends that wouldn’t be apparent otherwise and will additionally identify topics for further research
Performance Analysis of Different Types of Machine Learning Classifiers for Non-Technical Loss Detection
With the ever-growing demand of electric power, it is quite challenging to detect and prevent Non-Technical Loss (NTL) in power industries. NTL is committed by meter bypassing, hooking from the main lines, reversing and tampering the meters. Manual on-site checking and reporting of NTL remains an unattractive strategy due to the required manpower and associated cost. The use of machine learning classifiers has been an attractive option for NTL detection. It enhances data-oriented analysis and high hit ratio along with less cost and manpower requirements. However, there is still a need to explore the results across multiple types of classifiers on a real-world dataset. This paper considers a real dataset from a power supply company in Pakistan to identify NTL. We have evaluated 15 existing machine learning classifiers across 9 types which also include the recently developed CatBoost, LGBoost and XGBoost classifiers. Our work is validated using extensive simulations. Results elucidate that ensemble methods and Artificial Neural Network (ANN) outperform the other types of classifiers for NTL detection in our real dataset. Moreover, we have also derived a procedure to identify the top-14 features out of a total of 71 features, which are contributing 77% in predicting NTL. We conclude that including more features beyond this threshold does not improve performance and thus limiting to the selected feature set reduces the computation time required by the classifiers. Last but not least, the paper also analyzes the results of the classifiers with respect to their types, which has opened a new area of research in NTL detection
- …