1,935 research outputs found

    Handwritten Digit Recognition Using Machine Learning Algorithms

    Get PDF
    Handwritten character recognition is one of the practically important issues in pattern recognition applications. The applications of digit recognition includes in postal mail sorting, bank check processing, form data entry, etc. The heart of the problem lies within the ability to develop an efficient algorithm that can recognize hand written digits and which is submitted by users by the way of a scanner, tablet, and other digital devices. This paper presents an approach to off-line handwritten digit recognition based on different machine learning technique. The main objective of this paper is to ensure effective and reliable approaches for recognition of handwritten digits. Several machines learning algorithm namely, Multilayer Perceptron, Support Vector Machine, NaFDA5; Bayes, Bayes Net, Random Forest, J48 and Random Tree has been used for the recognition of digits using WEKA. The result of this paper shows that highest 90.37% accuracy has been obtained for Multilayer Perceptron

    Performance Evaluation of Smart Decision Support Systems on Healthcare

    Get PDF
    Medical activity requires responsibility not only from clinical knowledge and skill but also on the management of an enormous amount of information related to patient care. It is through proper treatment of information that experts can consistently build a healthy wellness policy. The primary objective for the development of decision support systems (DSSs) is to provide information to specialists when and where they are needed. These systems provide information, models, and data manipulation tools to help experts make better decisions in a variety of situations. Most of the challenges that smart DSSs face come from the great difficulty of dealing with large volumes of information, which is continuously generated by the most diverse types of devices and equipment, requiring high computational resources. This situation makes this type of system susceptible to not recovering information quickly for the decision making. As a result of this adversity, the information quality and the provision of an infrastructure capable of promoting the integration and articulation among different health information systems (HIS) become promising research topics in the field of electronic health (e-health) and that, for this same reason, are addressed in this research. The work described in this thesis is motivated by the need to propose novel approaches to deal with problems inherent to the acquisition, cleaning, integration, and aggregation of data obtained from different sources in e-health environments, as well as their analysis. To ensure the success of data integration and analysis in e-health environments, it is essential that machine-learning (ML) algorithms ensure system reliability. However, in this type of environment, it is not possible to guarantee a reliable scenario. This scenario makes intelligent SAD susceptible to predictive failures, which severely compromise overall system performance. On the other hand, systems can have their performance compromised due to the overload of information they can support. To solve some of these problems, this thesis presents several proposals and studies on the impact of ML algorithms in the monitoring and management of hypertensive disorders related to pregnancy of risk. The primary goals of the proposals presented in this thesis are to improve the overall performance of health information systems. In particular, ML-based methods are exploited to improve the prediction accuracy and optimize the use of monitoring device resources. It was demonstrated that the use of this type of strategy and methodology contributes to a significant increase in the performance of smart DSSs, not only concerning precision but also in the computational cost reduction used in the classification process. The observed results seek to contribute to the advance of state of the art in methods and strategies based on AI that aim to surpass some challenges that emerge from the integration and performance of the smart DSSs. With the use of algorithms based on AI, it is possible to quickly and automatically analyze a larger volume of complex data and focus on more accurate results, providing high-value predictions for a better decision making in real time and without human intervention.A atividade médica requer responsabilidade não apenas com base no conhecimento e na habilidade clínica, mas também na gestão de uma enorme quantidade de informações relacionadas ao atendimento ao paciente. É através do tratamento adequado das informações que os especialistas podem consistentemente construir uma política saudável de bem-estar. O principal objetivo para o desenvolvimento de sistemas de apoio à decisão (SAD) é fornecer informações aos especialistas onde e quando são necessárias. Esses sistemas fornecem informações, modelos e ferramentas de manipulação de dados para ajudar os especialistas a tomar melhores decisões em diversas situações. A maioria dos desafios que os SAD inteligentes enfrentam advêm da grande dificuldade de lidar com grandes volumes de dados, que é gerada constantemente pelos mais diversos tipos de dispositivos e equipamentos, exigindo elevados recursos computacionais. Essa situação torna este tipo de sistemas suscetível a não recuperar a informação rapidamente para a tomada de decisão. Como resultado dessa adversidade, a qualidade da informação e a provisão de uma infraestrutura capaz de promover a integração e a articulação entre diferentes sistemas de informação em saúde (SIS) tornam-se promissores tópicos de pesquisa no campo da saúde eletrônica (e-saúde) e que, por essa mesma razão, são abordadas nesta investigação. O trabalho descrito nesta tese é motivado pela necessidade de propor novas abordagens para lidar com os problemas inerentes à aquisição, limpeza, integração e agregação de dados obtidos de diferentes fontes em ambientes de e-saúde, bem como sua análise. Para garantir o sucesso da integração e análise de dados em ambientes e-saúde é importante que os algoritmos baseados em aprendizagem de máquina (AM) garantam a confiabilidade do sistema. No entanto, neste tipo de ambiente, não é possível garantir um cenário totalmente confiável. Esse cenário torna os SAD inteligentes suscetíveis à presença de falhas de predição que comprometem seriamente o desempenho geral do sistema. Por outro lado, os sistemas podem ter seu desempenho comprometido devido à sobrecarga de informações que podem suportar. Para tentar resolver alguns destes problemas, esta tese apresenta várias propostas e estudos sobre o impacto de algoritmos de AM na monitoria e gestão de transtornos hipertensivos relacionados com a gravidez (gestação) de risco. O objetivo das propostas apresentadas nesta tese é melhorar o desempenho global de sistemas de informação em saúde. Em particular, os métodos baseados em AM são explorados para melhorar a precisão da predição e otimizar o uso dos recursos dos dispositivos de monitorização. Ficou demonstrado que o uso deste tipo de estratégia e metodologia contribui para um aumento significativo do desempenho dos SAD inteligentes, não só em termos de precisão, mas também na diminuição do custo computacional utilizado no processo de classificação. Os resultados observados buscam contribuir para o avanço do estado da arte em métodos e estratégias baseadas em inteligência artificial que visam ultrapassar alguns desafios que advêm da integração e desempenho dos SAD inteligentes. Como o uso de algoritmos baseados em inteligência artificial é possível analisar de forma rápida e automática um volume maior de dados complexos e focar em resultados mais precisos, fornecendo previsões de alto valor para uma melhor tomada de decisão em tempo real e sem intervenção humana

    Machine Learning Guided Exploration of an Empirical Ribozyme Fitness Landscape

    Get PDF
    Okinawa Institute of Science and Technology Graduate UniversityDoctor of PhilosophyFitness landscape of a biomolecule is a representation of its activity as a function of its sequence. Properties of a fitness landscape determine how evolution proceeds. Therefore, the distribution of functional variants and more importantly, the connectivity of these variants within the sequence space are important scientific questions. Exploration of these spaces, however, is impeded by the combinatorial explosion of the sequence space. High-throughput experimental methods have recently reduced this impediment but only modestly. Better computational methods are needed to fully utilize the rich information from these experimental data to better understand the properties of the fitness landscape. In this work, I seek to improve this exploration process by combining data from massively parallel experimental assay with smart library design using advanced computational techniques. I focus on an artificial RNA enzyme or ribozyme that can catalyze a ligation reaction between two RNA fragments. This chemistry is analogous to that of the modern RNA polymeraseenzymes, therefore, represents an important reaction in the origin of life. In the first chapter, I discuss the background to this work in the context of evolutionary theory of fitness landscape and its implications in biotechnology. In chapter 2, I explore the use of processes borrowed from the field of evolutionary computation to solve optimization problems using real experimental sequence-activity data. In chapter 3, I investigate the use of supervised machine learning models to extract information on epistatic interactions from the dataset collected during multiple rounds of directed evolution. I investigate and experimentally validate the extent to which a deep learning model can be used to guide a completely computational evolutionary algorithm towards distant regions of the fitness landscape. In the final chapter, I perform a comprehensive experimental assay of the combinatorial region explored by the deep learning-guided evolutionary algorithm. Using this dataset, I analyze higher-order epistasis and attempt to explain the increased predictability of the region sampled by the algorithm. Finally, I provide the first experimental evidence of a large RNA ‘neutral network’. Altogether, this work represents the most comprehensive experimental and computational study of the RNA ligase ribozyme fitness landscape to date, providing important insights into the evolutionary search space possibly explored during the earliest stages of life.doctoral thesi

    Multi-step time series prediction intervals using neuroevolution

    Get PDF
    Multi-step time series forecasting (TSF) is a crucial element to support tactical decisions (e.g., designing production or marketing plans several months in advance). While most TSF research addresses only single-point prediction, prediction intervals (PIs) are useful to reduce uncertainty related to important decision making variables. In this paper, we explore a large set of neural network methods for multi-step TSF and that directly optimize PIs. This includes multi-step adaptations of recently proposed PI methods, such as lower--upper bound estimation (LUBET), its ensemble extension (LUBEXT), a multi-objective evolutionary algorithm LUBE (MLUBET) and a two-phase learning multi-objective evolutionary algorithm (M2LUBET). We also explore two new ensemble variants for the evolutionary approaches based on two PI coverage--width split methods (radial slices and clustering), leading to the MLUBEXT, M2LUBEXT, MLUBEXT2 and M2LUBEXT2 methods. A robust comparison was held by considering the rolling window procedure, nine time series from several real-world domains and with different characteristics, two PI quality measures (coverage error and width) and the Wilcoxon statistic. Overall, the best results were achieved by the M2LUBET neuroevolution method, which requires a reasonable computational effort for time series with a few hundreds of observations.This article is a result of the project NORTE-01- 0247-FEDER-017497, supported by Norte Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund (ERDF). We would also like to thank the anonymous reviewers for their helpful suggestionsinfo:eu-repo/semantics/publishedVersio

    Development of soft computing and applications in agricultural and biological engineering

    Get PDF
    Soft computing is a set of “inexact” computing techniques, which are able to model and analyze very complex problems. For these complex problems, more conventional methods have not been able to produce cost-effective, analytical, or complete solutions. Soft computing has been extensively studied and applied in the last three decades for scientific research and engineering computing. In agricultural and biological engineering, researchers and engineers have developed methods of fuzzy logic, artificial neural networks, genetic algorithms, decision trees, and support vector machines to study soil and water regimes related to crop growth, analyze the operation of food processing, and support decision-making in precision farming. This paper reviews the development of soft computing techniques. With the concepts and methods, applications of soft computing in the field of agricultural and biological engineering are presented, especially in the soil and water context for crop management and decision support in precision agriculture. The future of development and application of soft computing in agricultural and biological engineering is discussed

    A machine learning approach

    Get PDF
    Castelli, M., Groznik, A., & Popovič, A. (2020). Forecasting electricity prices: A machine learning approach. Algorithms, 13(5), 1-16. [119]. https://doi.org/10.3390/A13050119The electricity market is a complex, evolutionary, and dynamic environment. Forecasting electricity prices is an important issue for all electricity market participants. In this study, we shed light on how to improve electricity price forecasting accuracy through the use of a machine learning technique-namely, a novel genetic programming approach. Drawing on empirical data from the largest EU energy markets, we propose a forecasting model that considers variables related to weather conditions, oil prices, and CO2 coupons and predicts energy prices 24 h ahead. We show that the proposed model provides more accurate predictions of future electricity prices than existing prediction methods. Our important findings will assist the electricity market participants in forecasting future price movements.publishersversionpublishe

    Determining uncertainties in AI applications in AEC sector and their corresponding mitigation strategies

    Get PDF
    The Artificial Intelligence (AI) methodologies and techniques have been used to solve a wide spectrum of engineering problems in Architectural, Engineering and Construction (AEC) industry with the aim of improving overall productivity and optimized decision throughout full project life cycle (planning, design, construction and maintenance). However, many AI applications are facing different limitations and constrains due to the lack of comprehensive understanding about the inherent uncertainty fundamentally and mathematically, hence the use of AI has not achieved a satisfactory level. It requires different actions to tackle different types of uncertainties which varies according to different types of applications. This paper therefore reviews 5 type of popular AI algorithms, including Primary Component Analysis, Multilayer Perceptron, Fuzzy Logic, Support Vector Machine and Genetic Algorithm; then examines how these artificial intelligence techniques can assist the decision-making process by mitigating uncertainty meanwhile achieving the expected high efficiency. The paper reviews each germane technique, mathematical explanation, analysis of reasons causing uncertainty, and concludes a set of guidelines and an application framework for optimizing their informed uncertainty for AEC applications. This work will pave the way for the fundamental understanding and in turn to provide a valuable reference for applying AI techniques in AEC sector properly to achieve better overall performance

    AI based residential load forecasting

    Get PDF
    The increasing levels of energy consumption worldwide is raising issues with respect to surpassing supply limits, causing severe effects on the environment, and the exhaustion of energy resources. Buildings are one of the most relevant sectors in terms of energy consumption in the world. Many researches have been carried out in the recent years with primary concentration on efficient Home or Building Management Systems. In addition, by increasing renewable energy penetration, modern power grids demand more accurate consumption predictions to provide the optimized power supply which is stochastic in nature. This study will present an analytic comparison of day-ahead load forecasting during a period of two years by applying AI based data driven models. The unit of analysis in this thesis project is based on households smart meter data in England. The collected and collated data for this study includes historical electricity consumption of 75 houses over two years of 2012 to 2014 city of London. Predictive models divided in two main forecasting groups of deterministic and probabilistic forecasting. In deterministic step, Random Forest Regression and MLP Regression employed to make a forecasting models. In the probabilistic phase,DeepAR, FFNN and Gaussian Process Estimator were employed to predict days ahead load forecasting. The models are trained based on subset of various groups of customers with registered diversified load volatility level. Daily weather data are also added as new feature in this study into subset to check model sensitivity to external factors and validate the performance of the model. The results of implemented models are evaluated by well-known error metrics as RMSE,MAE, MSE and CRPS separately for each phase of this study. The findings of this master thesis study shows that the Deep Learning methods of FNN, DeepAR and MLP compared to other utilized methods like Random Forest and Gaussian provide better data prediction reslts in terms of less deviance to real load trend, lower forecasting error and computation time. Considering probabilistic forecasting methods it is observed that DeepAR can provide better results than FFNN and Gaussian Process model. Although the computation time of FFNN was lower than other

    Estimation of real traffic radiated emissions from electric vehicles in terms of the driving profile using neural networks

    Get PDF
    The increment of the use of electric vehicles leads to a worry about measuring its principal source of environmental pollution: electromagnetic emissions. Given the complexity of directly measuring vehicular radiated emissions in real traffic, the main contribution of this PhD thesis is to propose an indirect solution to estimate such type of vehicular emissions. Relating the on-road vehicular radiated emissions with the driving profile is a complicated task. This is because it is not possible to directly measure the vehicular radiated interferences in real traffic due to potential interferences from another electromagnetic wave sources. This thesis presents a microscopic artificial intelligence model based on neural networks to estimate real traffic radiated emissions of electric vehicles in terms of the driving dynamics. Instantaneous values of measured speed and calculated acceleration have been used to characterize the driving profile. Experimental electromagnetic interference tests have been carried out with a Vectrix electric motorcycle as well as Twizy electric cars in semi-anechoic chambers. Both the motorcycle and the car have been subjected to different urban and interurban driving profiles. Time Domain measurement methodology of electromagnetic radiated emissions has been adopted in this work to save the overall measurement time. The relationship between the magnetic radiated emissions of the Twizy and the corresponding speed has been very noticeable. Maximum magnetic field levels have been observed during high speed cruising in extra-urban driving and acceleration in urban environments. A comparative study of the prediction performance between various static and dynamic neural models has been introduced. The Multilayer Perceptron feedforward neural network trained with Extreme Learning Machines has achieved the best estimation results of magnetic radiated disturbances as function of instantaneous speed and acceleration. In this way, on-road magnetic radiated interferences from an electric vehicle equipped with a Global Positioning System can be estimated. This research line will allow quantify the pollutant electromagnetic emissions of electric vehicles and study new policies to preserve the environment

    Estimation of real traffic radiated emissions from electric vehicles in terms of the driving profile using neural networks

    Get PDF
    The increment of the use of electric vehicles leads to a worry about measuring its principal source of environmental pollution: electromagnetic emissions. Given the complexity of directly measuring vehicular radiated emissions in real traffic, the main contribution of this PhD thesis is to propose an indirect solution to estimate such type of vehicular emissions. Relating the on-road vehicular radiated emissions with the driving profile is a complicated task. This is because it is not possible to directly measure the vehicular radiated interferences in real traffic due to potential interferences from another electromagnetic wave sources. This thesis presents a microscopic artificial intelligence model based on neural networks to estimate real traffic radiated emissions of electric vehicles in terms of the driving dynamics. Instantaneous values of measured speed and calculated acceleration have been used to characterize the driving profile. Experimental electromagnetic interference tests have been carried out with a Vectrix electric motorcycle as well as Twizy electric cars in semi-anechoic chambers. Both the motorcycle and the car have been subjected to different urban and interurban driving profiles. Time Domain measurement methodology of electromagnetic radiated emissions has been adopted in this work to save the overall measurement time. The relationship between the magnetic radiated emissions of the Twizy and the corresponding speed has been very noticeable. Maximum magnetic field levels have been observed during high speed cruising in extra-urban driving and acceleration in urban environments. A comparative study of the prediction performance between various static and dynamic neural models has been introduced. The Multilayer Perceptron feedforward neural network trained with Extreme Learning Machines has achieved the best estimation results of magnetic radiated disturbances as function of instantaneous speed and acceleration. In this way, on-road magnetic radiated interferences from an electric vehicle equipped with a Global Positioning System can be estimated. This research line will allow quantify the pollutant electromagnetic emissions of electric vehicles and study new policies to preserve the environment
    corecore