Search CORE

5,965 research outputs found

Improving Knowledge-Based Systems with statistical techniques, text mining, and neural networks for non-technical loss detection

Author: Biscarri Triviño Félix
Biscarri Triviño Jesús
Guerrero Alonso Juan Ignacio
León de Mora Carlos
Monedero Goicoechea Iñigo Luis
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014
Field of study

Currently, power distribution companies have several problems that are related to energy losses. For example, the energy used might not be billed due to illegal manipulation or a breakdown in the customer’s measurement equipment. These types of losses are called non-technical losses (NTLs), and these losses are usually greater than the losses that are due to the distribution infrastructure (technical losses). Traditionally, a large number of studies have used data mining to detect NTLs, but to the best of our knowledge, there are no studies that involve the use of a Knowledge-Based System (KBS) that is created based on the knowledge and expertise of the inspectors. In the present study, a KBS was built that is based on the knowledge and expertise of the inspectors and that uses text mining, neural networks, and statistical techniques for the detection of NTLs. Text mining, neural networks, and statistical techniques were used to extract information from samples, and this information was translated into rules, which were joined to the rules that were generated by the knowledge of the inspectors. This system was tested with real samples that were extracted from Endesa databases. Endesa is one of the most important distribution companies in Spain, and it plays an important role in international markets in both Europe and South America, having more than 73 million customers

idUS. Depósito de Investigación Universidad de Sevilla

Detection of Non-Technical Losses: The Project MIDAS

Author: Acharjya D.P. (Coordinador)
Biscarri Triviño Félix
Biscarri Triviño Jesús
Guerrero Alonso Juan Ignacio
Monedero Goicoechea Iñigo Luis
Tripathy B.K. (Coordinador)
Publication venue: 'IGI Global'
Publication date: 01/01/2014
Field of study

The MIDAS project began in 2006 as collaboration between Endesa, Sadiel, and the University of Seville. The objective of the MIDAS project is the detection of Non-Technical Losses (NTLs) on power utilities. The NTLs represent the non-billed energy due to faults or illegal manipulations in clients’ fa cilities. Initially, research lines study the application of techniques of data mining and neural networks. After several researches, the studies are expanded to other research fields: expert systems, text mining, statistical techniques, pattern recognition, etc. These techniques have provided an automated system for detection of NTLs on company databases. This system is in the test phase, and it is applied in real cases in company databases

idUS. Depósito de Investigación Universidad de Sevilla

Review of Non-Technical Losses Identification Techniques

Author: Pal Kalyan
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 31/03/2021
Field of study

Illegally consumption of electric power, termed as non-technical losses for the distribution companies is one of the dominant factors all over the world for many years. Although there are some conventional methods to identify these irregularities, such as physical inspection of meters at the consumer premises etc, but it requires large number of manpower and time; then also it does not seem to be adequate. Now a days there are various methods and algorithms have been developed that are proposed in different research papers, to detect non-technical losses. In this paper these methods are reviewed, their important features are highlighted and also the limitations are identified. Finally, the qualitative comparison of various non-technical losses identification algorithms is presented based on their performance, costs, data handling, quality control and execution times. It can be concluded that the graph-based classifier, Optimum-Path Forest algorithm that have both supervised and unsupervised variants, yields the most accurate result to detect non-technical losses

International Journal on Recent and Innovation Trends in Computing and Communication

A real application on non-technical losses detection: the MIDAS Project

Author: Biscarri Triviño Félix
Biscarri Triviño Jesús
Guerrero Alonso Juan Ignacio
León de Mora Carlos
Millán Navarro María del Rocío
Monedero Goicoechea Iñigo Luis
Publication venue
Publication date: 01/01/2011
Field of study

The MIDAS project began at 2006 as collaboration between Endesa, Sadiel and the University of Seville. The objective of the MIDAS project is the detection of Non-Technical Losses (NTLs) on power utilities. The NTLs represent the non-billed energy due to faults or illegal manipulations in clients’ facilities. Initially, research lines study the application of techniques of data mining and neural networks. After several researches, the studies are expanded to other research fields: expert systems, text mining, statistical techniques, pattern recognition, etc. These techniques have provided an automated system for detection of NTLs on company databases. This system is in test phase and it is applied in real cases in company databases

idUS. Depósito de Investigación Universidad de Sevilla

SmartFD: A Real Big Data Application for Electrical Fraud Detection

Author: Arcos Vargas Ángel
Fábregas J. A.
Gutiérrez Avilés David
Martínez Álvarez Francisco
Riquelme Santos José Cristóbal
Tejedor Javier
Troncoso Lora Alicia
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

The main objective of this paper is the application of big data analytics to a real case in the field of smart electric networks. Smart meters are not only elements to measure consumption, but they also con stitute a network of millions of sensors in the electricity network. These sensors provide a huge amount of data that, once analyzed, can lead to significant advances for the society. In this way, tools are being developed in order to reach certain goals, such as obtaining a better consumption estimation (which would imply a better production planning), finding better rates based on the time discrimination or the contracted power, or minimizing the non-technical losses in the network, whose actual costs are eventually paid by end-consumers, among others. In this work, real data from Spanish consumers have been analyzed to detect fraud in con sumption. First, 1 TB of raw data was preprocessed in a HDFS-Spark infrastructure. Second, data duplication and outliers were removed, and missing values handled with specific big data algorithms. Third, cus tomers were characterized by means of clustering techniques in different scenarios. Finally, several key factors in fraud consumption were found. Very promising results were achieved, verging on 80% accuracyMinisterio de Economía y Competitividad TIN2014-55894-C2-RMinisterio de Economía y Competitividad TIN2017-88209-C2-

idUS. Depósito de Investigación Universidad de Sevilla

Analyzing the Production and Use of Fossil Fuels: A Case for Data Mining and GIS

Author: Conde Alejandro
Publication venue: Digital Commons @ DU
Publication date: 02/10/2022
Field of study

As technology progresses and data grows both larger and more complex, techniques are being developed to keep up with the exponential growth of information. The term “data mining” is a blanket term used to describe an approach to find anomalies and correlations in a large dataset. This approach involves leveraging data mining software to manipulate and prepare data, apply statistics to quantify trends and characteristics in the data from a high level, and potentially apply advanced techniques like machine learning to identify patterns that wouldn’t be apparent otherwise. In this case study, data mining aided a GIS in displaying substantial amounts of oil, gas, and coal data to make observations regarding two groups: OPEC and the largest non-OPEC fossil fuel producers from 1980 to 2020. To make more sophisticated observations and apply additional context to the trends observed in the data, populations and GDP data for the same period were included in the analysis to enrich the hydrocarbon production and consumption data and to help explain how these valuable resources are traded and consumed. This case study will apply appropriate data mining methods to feed data to a GIS and showcase trends that wouldn’t be apparent otherwise and will additionally identify topics for further research

University of Denver

Performance Analysis of Different Types of Machine Learning Classifiers for Non-Technical Loss Detection

Author: Abbasi Rabeeh Ayaz
Awais Muhammad
Ghori Khawaja Moyeezullah
Imran Muhammad
Szathmary Laszlo
Ullah Ata
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

With the ever-growing demand of electric power, it is quite challenging to detect and prevent Non-Technical Loss (NTL) in power industries. NTL is committed by meter bypassing, hooking from the main lines, reversing and tampering the meters. Manual on-site checking and reporting of NTL remains an unattractive strategy due to the required manpower and associated cost. The use of machine learning classifiers has been an attractive option for NTL detection. It enhances data-oriented analysis and high hit ratio along with less cost and manpower requirements. However, there is still a need to explore the results across multiple types of classifiers on a real-world dataset. This paper considers a real dataset from a power supply company in Pakistan to identify NTL. We have evaluated 15 existing machine learning classifiers across 9 types which also include the recently developed CatBoost, LGBoost and XGBoost classifiers. Our work is validated using extensive simulations. Results elucidate that ensemble methods and Artificial Neural Network (ANN) outperform the other types of classifiers for NTL detection in our real dataset. Moreover, we have also derived a procedure to identify the top-14 features out of a total of 71 features, which are contributing 77% in predicting NTL. We conclude that including more features beyond this threshold does not improve performance and thus limiting to the selected feature set reduces the computation time required by the classifiers. Last but not least, the paper also analyzes the results of the classifiers with respect to their types, which has opened a new area of research in NTL detection

University of Debrecen Electronic Archive

Edge Hill University Research Information Repository

Federation ResearchOnline

University of East Anglia digital repository