44 research outputs found

    Applying Kohonen Vector Quantization Networks for Profiling Customers of Mobile Telecommunication Services

    Get PDF
    Customer clustering is used to understand customers’ preferences and behaviors by examining the differences and similarities between customers. Kohonen vector quantization clustering technology is used in this research and is compared with Kmeans clustering. The data set consists of customer records obtained from a mobile telecommunications service provider. The customers are clustered using various attributes that are broadly grouped under usage, revenue, handset, and service. The clustering results are examined to see the relationships between different types of attributes. The analysis leads to the discovery of several interesting facts about customers that may be of use to mobile service providers

    User profiling and classification for fraud detection in mobile communications networks

    Get PDF
    The topic of this thesis is fraud detection in mobile communications networks by means of user profiling and classification techniques. The goal is to first identify relevant user groups based on call data and then to assign a user to a relevant group. Fraud may be defined as a dishonest or illegal use of services, with the intention to avoid service charges. Fraud detection is an important application, since network operators lose a relevant portion of their revenue to fraud. Whereas the intentions of the mobile phone users cannot be observed, it is assumed that the intentions are reflected in the call data. The call data is subsequently used in describing behavioral patterns of users. Neural networks and probabilistic models are employed in learning these usage patterns from call data. These models are used either to detect abrupt changes in established usage patterns or to recognize typical usage patterns of fraud. The methods are shown to be effective in detecting fraudulent behavior by empirically testing the methods with data from real mobile communications networks.reviewe

    Data mining using neural networks

    Get PDF
    Data mining is about the search for relationships and global patterns in large databases that are increasing in size. Data mining is beneficial for anyone who has a huge amount of data, for example, customer and business data, transaction, marketing, financial, manufacturing and web data etc. The results of data mining are also referred to as knowledge in the form of rules, regularities and constraints. Rule mining is one of the popular data mining methods since rules provide concise statements of potentially important information that is easily understood by end users and also actionable patterns. At present rule mining has received a good deal of attention and enthusiasm from data mining researchers since rule mining is capable of solving many data mining problems such as classification, association, customer profiling, summarization, segmentation and many others. This thesis makes several contributions by proposing rule mining methods using genetic algorithms and neural networks. The thesis first proposes rule mining methods using a genetic algorithm. These methods are based on an integrated framework but capable of mining three major classes of rules. Moreover, the rule mining processes in these methods are controlled by tuning of two data mining measures such as support and confidence. The thesis shows how to build data mining predictive models using the resultant rules of the proposed methods. Another key contribution of the thesis is the proposal of rule mining methods using supervised neural networks. The thesis mathematically analyses the Widrow-Hoff learning algorithm of a single-layered neural network, which results in a foundation for rule mining algorithms using single-layered neural networks. Three rule mining algorithms using single-layered neural networks are proposed for the three major classes of rules on the basis of the proposed theorems. The thesis also looks at the problem of rule mining where user guidance is absent. The thesis proposes a guided rule mining system to overcome this problem. The thesis extends this work further by comparing the performance of the algorithm used in the proposed guided rule mining system with Apriori data mining algorithm. Finally, the thesis studies the Kohonen self-organization map as an unsupervised neural network for rule mining algorithms. Two approaches are adopted based on the way of self-organization maps applied in rule mining models. In the first approach, self-organization map is used for clustering, which provides class information to the rule mining process. In the second approach, automated rule mining takes the place of trained neurons as it grows in a hierarchical structure

    Exploration of customer churn routes using machine learning probabilistic models

    Get PDF
    The ongoing processes of globalization and deregulation are changing the competitive framework in the majority of economic sectors. The appearance of new competitors and technologies entails a sharp increase in competition and a growing preoccupation among service providing companies with creating stronger bonds with customers. Many of these companies are shifting resources away from the goal of capturing new customers and are instead focusing on retaining existing ones. In this context, anticipating the customer¿s intention to abandon, a phenomenon also known as churn, and facilitating the launch of retention-focused actions represent clear elements of competitive advantage. Data mining, as applied to market surveyed information, can provide assistance to churn management processes. In this thesis, we mine real market data for churn analysis, placing a strong emphasis on the applicability and interpretability of the results. Statistical Machine Learning models for simultaneous data clustering and visualization lay the foundations for the analyses, which yield an interpretable segmentation of the surveyed markets. To achieve interpretability, much attention is paid to the intuitive visualization of the experimental results. Given that the modelling techniques under consideration are nonlinear in nature, this represents a non-trivial challenge. Newly developed techniques for data visualization in nonlinear latent models are presented. They are inspired in geographical representation methods and suited to both static and dynamic data representation

    Adaptive scheduling in cellular access, wireless mesh and IP networks

    Get PDF
    Networking scenarios in the future will be complex and will include fixed networks and hybrid Fourth Generation (4G) networks, consisting of both infrastructure-based and infrastructureless, wireless parts. In such scenarios, adaptive provisioning and management of network resources becomes of critical importance. Adaptive mechanisms are desirable since they enable a self-configurable network that is able to adjust itself to varying traffic and channel conditions. The operation of adaptive mechanisms is heavily based on measurements. The aim of this thesis is to investigate how measurement based, adaptive packet scheduling algorithms can be utilized in different networking environments. The first part of this thesis is a proposal for a new delay-based scheduling algorithm, known as Delay-Bounded Hybrid Proportional Delay (DBHPD), for delay adaptive provisioning in DiffServ-based fixed IP networks. This DBHPD algorithm is thoroughly evaluated by ns2-simulations and measurements in a FreeBSD prototype router network. It is shown that DBHPD results in considerably more controllable differentiation than basic static bandwidth sharing algorithms. The prototype router measurements also prove that a DBHPD algorithm can be easily implemented in practice, causing less processing overheads than a well known CBQ algorithm. The second part of this thesis discusses specific scheduling requirements set by hybrid 4G networking scenarios. Firstly, methods for joint scheduling and transmit beamforming in 3.9G or 4G networks are described and quantitatively analyzed using statistical methods. The analysis reveals that the combined gain of channel-adaptive scheduling and transmit beamforming is substantial and that an On-off strategy can achieve the performance of an ideal Max SNR strategy if the feedback threshold is optimized. Finally, a novel cross-layer energy-adaptive scheduling and queue management framework EAED (Energy Aware Early Detection), for preserving delay bounds and minimizing energy consumption in WLAN mesh networks, is proposed and evaluated with simulations. The simulations show that our scheme can save considerable amounts of transmission energy without violating application level QoS requirements when traffic load and distances are reasonable

    Non intrusive load monitoring & identification for energy management system using computational intelligence approach

    Get PDF
    Includes bibliography.Electrical energy is the life line to every nation’s or continent development and economic progress. Referable to the recent growth in the demand for electricity and shortage in production, it is indispensable to develop strategies for effective energy management and system delivery. Load monitoring such as intrusive load monitoring, non-intrusive load monitoring, and identification of domestic electrical appliances is proposed especially at the residential level since it is the major energy consumer. The intrusive load monitoring provides accurate results and would allow each individual appliance's energy consumption to be transmitted to a central hub. Nevertheless, there are many practical disadvantages to this method that have motivated the introduction of non-intrusive load monitoring system. The fiscal cost of manufacturing and installing enough monitoring devices to match the number of domestic appliances is considered to be a disadvantage. In addition, the installation of one meter per household appliances would lead to congestion in the house and thus cause inconvenience to the occupants of the house, therefore, non-intrusive load monitoring technique was developed to alleviate the aforementioned challenges of intrusive load monitoring. Non-intrusive load monitoring (NILM) is the process of disaggregating a household’s total energy consumption into its contributing appliances. The total household load is monitored via a single monitoring device such as smart meter (SM). NILM provides cost effective and convenient means of load monitoring and identification. Several nonintrusive load monitoring and identification techniques are reviewed. However, the literature lacks a comprehensive system that can identify appliances with small energy consumption, appliances with overlapping energy consumption and a group of appliance ranges at once. This has been the major setback to most of the adopted techniques. In this dissertation, we propose techniques that overcome these setbacks by combining artificial neural networks (ANN) with a developed algorithm to identify appliances ranges that contribute to the energy consumption within a given period of time usually an hour interval

    Exploraty Multivariate Statistical Methods Applied to Pharmaceutical Industry CRM Data

    Get PDF
    Dissertação apresentada como requisito parcial para obtenção do grau de Mestre em Estatística e Gestão de InformaçãoAn analysis of the current CRM systems in the Pharmaceutical Industry, the way the pharmaceutical companies developed them and a comparison between Europe and United States was done in this study. Overall the CRM in the pharmaceutical industry is far-behind, when compared with other business areas, like consumer goods, finance (banking) or insurance companies, being pharmaceutical CRM specifically less developed in Europe when compared to United States. One of the big obstacles for the success of CRM in the pharmaceutical industry is the poor analytics applied to the current CRM programs. Improving Sales and Marketing Effectiveness by apllying, multivariate exploratory statistical methods, specifically Factor Analysis and Clustering into pharmaceutical CRM data from a Portuguese pharmaceutical company was the main goal of this thesis. Their overall usefulness when applied to the business was demonstrated, and specifically in relation to the cluster methods, SOMs outperformed the hierarchical methods by producing a more meaningful business solution

    Data mining industry : emerging trends and new opportunities

    Get PDF
    Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, June 2000."May 2000."Includes bibliographical references (leaves 170-179).by Walter Alberto Aldana.M.Eng

    Improving Electricity Distribution System State Estimation with AMR-Based Load Profiles

    Get PDF
    The ongoing battle against global warming is rapidly increasing the amount of renewable power generation, and smart solutions are needed to integrate these new generation units into the existing distribution systems. Smart grids answer this call by introducing intelligent ways of controlling the network and active resources connected to it. However, before the network can be controlled, the automation system must know what the node voltages and line currents defining the network state are.Distribution system state estimation (DSSE) is needed to find the most likely state of the network when the number and accuracy of measurements are limited. Typically, two types of measurements are used in DSSE: real-time measurements and pseudomeasurements. In recent years, finding cost-efficient ways to improve the DSSE accuracy has been a popular subject in the literature. While others have focused on optimizing the type, amount and location of real-time measurements, the main hypothesis of this thesis is that it is possible to enhance the DSSE accuracy by using interval measurements collected with automatic meter reading (AMR) to improve the load profiles used as pseudo-measurements.The work done in this thesis can be divided into three stages. In the first stage, methods for creating new AMR-based load profiles are studied. AMR measurements from thousands of customers are used to test and compare the different options for improving the load profiling accuracy. Different clustering algorithms are tested and a novel twostage clustering method for load profiling is developed. In the second stage, a DSSE algorithm suited for smart grid environment is developed. Simulations and real-life demonstrations are conducted to verify the accuracy and applicability of the developed state estimator. In the third and final stage, the AMR-based load profiling and DSSE are combined. Matlab simulations with real AMR data and a real distribution network model are made and the developed load profiles are compared with other commonly used pseudo-measurements.The results indicate that clustering is an efficient way to improve the load profiling accuracy. With the help of clustering, both the customer classification and customer class load profiles can be updated simultaneously. Several of the tested clustering algorithms were suited for clustering electricity customers, but the best results were achieved with a modified k-means algorithm. Results from the third stage simulations supported the main hypothesis that the new AMR-based load profiles improve the DSSE accuracy.The results presented in this thesis should motivate distribution system operators and other actors in the field of electricity distribution to utilize AMR data and clustering algorithms in load profiling. It improves not only the DSSE accuracy but also many other functions that rely on load flow calculation and need accurate load estimates or forecasts

    Quality of service analysis of internet links with minimal information

    Full text link
    Tesis doctoral inédita. Universidad Autónoma de Madrid, Escuela Politécnica Superior, julio de 201
    corecore