20,954 research outputs found
Improving Knowledge-Based Systems with statistical techniques, text mining, and neural networks for non-technical loss detection
Currently, power distribution companies have several problems that are related to energy losses. For
example, the energy used might not be billed due to illegal manipulation or a breakdown in the customer’s
measurement equipment. These types of losses are called non-technical losses (NTLs), and these
losses are usually greater than the losses that are due to the distribution infrastructure (technical losses).
Traditionally, a large number of studies have used data mining to detect NTLs, but to the best of our
knowledge, there are no studies that involve the use of a Knowledge-Based System (KBS) that is created
based on the knowledge and expertise of the inspectors. In the present study, a KBS was built that is
based on the knowledge and expertise of the inspectors and that uses text mining, neural networks,
and statistical techniques for the detection of NTLs. Text mining, neural networks, and statistical techniques
were used to extract information from samples, and this information was translated into rules,
which were joined to the rules that were generated by the knowledge of the inspectors. This system
was tested with real samples that were extracted from Endesa databases. Endesa is one of the most
important distribution companies in Spain, and it plays an important role in international markets in
both Europe and South America, having more than 73 million customers
Object Detection in 20 Years: A Survey
Object detection, as of one the most fundamental and challenging problems in
computer vision, has received great attention in recent years. Its development
in the past two decades can be regarded as an epitome of computer vision
history. If we think of today's object detection as a technical aesthetics
under the power of deep learning, then turning back the clock 20 years we would
witness the wisdom of cold weapon era. This paper extensively reviews 400+
papers of object detection in the light of its technical evolution, spanning
over a quarter-century's time (from the 1990s to 2019). A number of topics have
been covered in this paper, including the milestone detectors in history,
detection datasets, metrics, fundamental building blocks of the detection
system, speed up techniques, and the recent state of the art detection methods.
This paper also reviews some important detection applications, such as
pedestrian detection, face detection, text detection, etc, and makes an in-deep
analysis of their challenges as well as technical improvements in recent years.Comment: This work has been submitted to the IEEE TPAMI for possible
publicatio
Detection of Non-Technical Losses: The Project MIDAS
The MIDAS project began in 2006 as collaboration between Endesa, Sadiel, and the University of
Seville. The objective of the MIDAS project is the detection of Non-Technical Losses (NTLs) on power
utilities. The NTLs represent the non-billed energy due to faults or illegal manipulations in clients’ fa cilities. Initially, research lines study the application of techniques of data mining and neural networks.
After several researches, the studies are expanded to other research fields: expert systems, text mining,
statistical techniques, pattern recognition, etc. These techniques have provided an automated system
for detection of NTLs on company databases. This system is in the test phase, and it is applied in real
cases in company databases
Combination of Domain Knowledge and Deep Learning for Sentiment Analysis of Short and Informal Messages on Social Media
Sentiment analysis has been emerging recently as one of the major natural
language processing (NLP) tasks in many applications. Especially, as social
media channels (e.g. social networks or forums) have become significant sources
for brands to observe user opinions about their products, this task is thus
increasingly crucial. However, when applied with real data obtained from social
media, we notice that there is a high volume of short and informal messages
posted by users on those channels. This kind of data makes the existing works
suffer from many difficulties to handle, especially ones using deep learning
approaches. In this paper, we propose an approach to handle this problem. This
work is extended from our previous work, in which we proposed to combine the
typical deep learning technique of Convolutional Neural Networks with domain
knowledge. The combination is used for acquiring additional training data
augmentation and a more reasonable loss function. In this work, we further
improve our architecture by various substantial enhancements, including
negation-based data augmentation, transfer learning for word embeddings, the
combination of word-level embeddings and character-level embeddings, and using
multitask learning technique for attaching domain knowledge rules in the
learning process. Those enhancements, specifically aiming to handle short and
informal messages, help us to enjoy significant improvement in performance once
experimenting on real datasets.Comment: A Preprint of an article accepted for publication by Inderscience in
IJCVR on September 201
On the role of pre and post-processing in environmental data mining
The quality of discovered knowledge is highly depending on data quality. Unfortunately real data use to contain noise, uncertainty, errors, redundancies or even irrelevant information. The more complex is the reality to be analyzed, the higher the risk of getting low quality data. Knowledge Discovery from Databases (KDD) offers a global framework to prepare data in the right form to perform correct analyses. On the other hand, the quality of decisions taken upon KDD results, depend not only on the quality of the results themselves, but on the capacity of the system to communicate those results in an understandable form. Environmental systems are particularly complex and environmental users particularly require clarity in their results. In this paper some details about how this can be achieved are provided. The role of the pre and post processing in the whole process of Knowledge Discovery in environmental systems is discussed
- …