5 research outputs found
A TOOL FOR EFFECTIVE DETECTION OF FRAUD IN CREDIT CARD SYSTEM
The rapid growth of e-commerce has dramatically increased the use of credit cards for online purchases, which in turn has caused an explosion in credit card fraud. Fraud is one of the major ethical issues in the credit card industry. As credit cards become the most popular mode of payment for both online and regular purchases, the associated fraud cases are also rising. In real life, fraudulent transactions are scattered among genuine transactions, and simple pattern-matching techniques are often not sufficient to detect them accurately. Implementing efficient fraud detection systems has thus become imperative for all credit card issuing banks to minimize their losses. Many modern techniques based on artificial intelligence, data mining, fuzzy logic, machine learning, sequence alignment, genetic programming, etc. have evolved for detecting various fraudulent credit card transactions.
Optimizing Solar Energy Harvesting: Supervised Machine Learning-Driven Peak Power Point Tracking for Diverse Weather Conditions
Solar power is one of the most prevalent forms of clean energy because it is perceived to be pollution-free and easily accessible. The market for renewable energy was established by the rapid growth in electrical energy consumption and the depletion of conventional energy resources (CER). Under varying weather conditions, the energy extracted from a solar system is neither constant nor maximal. This study investigates the applicability of machine learning algorithms (MLAs) in peak power point tracking (P3T) methods to maximize the power of a photovoltaic (PV) arrangement under varying weather conditions. Machine learning methods optimize peak power point tracking in solar PV systems by bringing agility, data-driven decision-making, and increased accuracy. Because MLAs can learn and adjust to the non-linear relationship between solar intensity and PV system (PVS) output, they improve the overall efficiency, stability, and dependability of these systems by handling the unpredictability of solar energy production under varying weather conditions and partial shading conditions (PSCs). In this study, the squared exponential Gaussian process regression algorithm (SGPRA) is tested under three rapidly varying environmental conditions. The performance of the ML-P3T methods is validated using Matlab/Simulink, and the simulation outcomes are compared with one of the most widely used algorithms, the variable step size incremental conductance algorithm (VINA). The Matlab/Simulink findings show that SGPRA operates significantly better under varying weather conditions, achieving a higher peak power efficiency (90%), a shorter tracking time (0.13 s), a mean error of 0.042, and superior stability.
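To make the regression idea concrete, the following is a minimal sketch of Gaussian process regression with a squared exponential kernel, written in plain Python. It is not the authors' SGPRA or their Matlab/Simulink model. The training pairs are synthetic, generated from a toy logarithmic relation between irradiance and peak-power-point voltage that is assumed here purely for illustration, and every function name and hyperparameter (`length_scale`, `noise_var`, `toy_vmpp`) is hypothetical.

```python
import math

def sq_exp_kernel(a, b, length_scale=200.0, signal_var=1.0):
    """Squared exponential covariance between two scalar inputs."""
    return signal_var * math.exp(-((a - b) ** 2) / (2.0 * length_scale ** 2))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def gp_fit(xs, ys, noise_var=1e-4):
    """Precompute the weights alpha = (K + noise * I)^-1 (y - mean(y))."""
    mean_y = sum(ys) / len(ys)
    K = [[sq_exp_kernel(a, b) + (noise_var if i == j else 0.0)
          for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    alpha = solve(K, [y - mean_y for y in ys])
    return xs, alpha, mean_y

def gp_predict(model, x_new):
    """Posterior mean of the Gaussian process at x_new."""
    xs, alpha, mean_y = model
    return mean_y + sum(a * sq_exp_kernel(x_new, x) for a, x in zip(alpha, xs))

def toy_vmpp(irradiance):
    """ASSUMPTION: toy relation used only to generate synthetic training data."""
    return 30.0 + 2.0 * math.log(irradiance / 1000.0)

# Synthetic training set: irradiance in W/m^2 -> peak-power-point voltage in V.
train_g = [200.0, 400.0, 600.0, 800.0, 1000.0]
train_v = [toy_vmpp(g) for g in train_g]
model = gp_fit(train_g, train_v)

# Estimate the voltage reference at an irradiance level not seen in training.
v_ref = gp_predict(model, 700.0)
print(f"predicted V_mpp at 700 W/m^2: {v_ref:.2f} V")
```

In a tracker, a prediction like `v_ref` would serve as the voltage reference handed to the converter controller. A practical model would also take cell temperature as an input and would tune the kernel hyperparameters from measured data.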
Performance Evaluation of an Independent Time Optimized Infrastructure for Big Data Analytics that Maintains Symmetry
Traditional data analytics tools are designed to deal with asymmetrical types of data, i.e., structured, semi-structured, and unstructured. The diverse behavior of data produced by different sources requires the selection of suitable tools. Limited resources for handling huge volumes of data are a challenge for these tools and degrade their execution time. Therefore, in the present paper, we propose a time optimization model that shares a common HDFS (Hadoop Distributed File System) among three Name-nodes (Master Nodes), three Data-nodes, and one Client-node. These nodes work within a demilitarized zone (DMZ) to maintain symmetry. Machine learning jobs are run from an independent platform to realize this model. On the first node (Name-node 1), Mahout is installed with all machine learning libraries through the Maven repositories. On the second node (Name-node 2), R is connected to Hadoop and runs through the shiny-server. Splunk is configured on the third node (Name-node 3) and is used to analyze the logs. Experiments compare the proposed model with the legacy model in terms of response time, execution time, and throughput. K-means clustering, Naive Bayes, and recommender algorithms are run on three different data sets, i.e., movie rating, newsgroup, and Spam SMS, representing structured, semi-structured, and unstructured data, respectively. The selection of tools defines data independence; e.g., the newsgroup data set runs on Mahout because the other tools are not compatible with this data. The experimental results support the hypothesis that the proposed model overcomes the resource limitations of the legacy model. In addition, the proposed model can process any kind of algorithm on different data sets that reside in their native formats.
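As an illustration of the kind of job and measurement described above, here is a minimal sketch in plain Python of a multinomial Naive Bayes classifier for short messages, together with a helper that reports execution time and throughput. It does not use Mahout, R, or Hadoop and does not reproduce the paper's experiments. The four training messages are made up for the example, and all function names are hypothetical.

```python
import math
import time
from collections import Counter

def train_nb(messages):
    """Fit a multinomial Naive Bayes model from (label, text) pairs."""
    word_counts = {}        # label -> Counter of word frequencies
    doc_counts = Counter()  # label -> number of training messages
    vocab = set()
    for label, text in messages:
        words = text.lower().split()
        word_counts.setdefault(label, Counter()).update(words)
        doc_counts[label] += 1
        vocab.update(words)
    return word_counts, doc_counts, vocab

def classify(model, text):
    """Return the label with the highest log posterior (Laplace smoothing)."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        score = math.log(doc_counts[label] / total_docs)
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in text.lower().split():
            if word in vocab:  # words never seen in training are skipped
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

def timed_throughput(model, texts):
    """Classify all texts; return labels, elapsed seconds, messages per second."""
    start = time.perf_counter()
    labels = [classify(model, t) for t in texts]
    elapsed = time.perf_counter() - start
    rate = len(texts) / elapsed if elapsed > 0 else float("inf")
    return labels, elapsed, rate

# ASSUMPTION: made-up toy messages, not the Spam SMS data set used in the paper.
TRAIN = [
    ("spam", "win a free prize now"),
    ("spam", "free cash offer call now"),
    ("ham", "are we meeting for lunch today"),
    ("ham", "call me when you get home"),
]
nb_model = train_nb(TRAIN)
labels, elapsed, rate = timed_throughput(
    nb_model, ["free prize offer", "lunch today at home"])
print(labels, f"{elapsed:.6f} s", f"{rate:.0f} msg/s")
```

Running the same timing helper against two deployments of the same job is one simple way to compare execution time and throughput. Response time would additionally include submission and scheduling delays, which this sketch does not model.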