10 research outputs found

    Comparison of artificial neural network and logistic regression models for prediction of mortality in head trauma based on initial clinical data

    Get PDF
    BACKGROUND: In recent years, outcome prediction models using artificial neural network and multivariable logistic regression analysis have been developed in many areas of health care research. Both these methods have advantages and disadvantages. In this study we have compared the performance of artificial neural network and multivariable logistic regression models, in prediction of outcomes in head trauma and studied the reproducibility of the findings. METHODS: 1000 Logistic regression and ANN models based on initial clinical data related to the GCS, tracheal intubation status, age, systolic blood pressure, respiratory rate, pulse rate, injury severity score and the outcome of 1271 mainly head injured patients were compared in this study. For each of one thousand pairs of ANN and logistic models, the area under the receiver operating characteristic (ROC) curves, Hosmer-Lemeshow (HL) statistics and accuracy rate were calculated and compared using paired T-tests. RESULTS: ANN significantly outperformed logistic models in both fields of discrimination and calibration but under performed in accuracy. In 77.8% of cases the area under the ROC curves and in 56.4% of cases the HL statistics for the neural network model were superior to that for the logistic model. In 68% of cases the accuracy of the logistic model was superior to the neural network model. CONCLUSIONS: ANN significantly outperformed the logistic models in both fields of discrimination and calibration but lagged behind in accuracy. This study clearly showed that any single comparison between these two models might not reliably represent the true end results. External validation of the designed models, using larger databases with different rates of outcomes is necessary to get an accurate measure of performance outside the development population

    Sentiment without Sentiment Analysis: Using the Recommendation Outcome of Steam Game Reviews as Sentiment Predictor

    Get PDF
    This paper presents and explores a novel way to determine the sentiment of a Steam game review based on the predicted recommendation of the review, testing different regression models on a combination of Term Frequency-Inverse Document Frequency (TF-IDF) and Latent Dirichlet Allocation (LDA) features. A dataset of Steam game reviews extracted from the Programming games genre consisting of 21 games along with other significant features such as the number of helpful likes on the recommendation, number of hours played, and others. Based on the features, they are grouped into three datasets: 1) either having keyword features only, 2) keyword features with the numerical features, and 3) numerical features only. The three datasets were trained using five different regression models: Multilinear Regression, Lasso Regression, Ridge Regression, Support Vector Regression, and Multi-layer Perceptron Regression, which were then evaluated using RMSE, MAE, and MAPE. The review recommendation was predicted from each model, and the accuracy of the predictions were measured using the different error rates. The results of this research may prove helpful in the convergence of Machine Learning and Educational Games

    Ensemble of optimised machine learning algorithms for predicting surface soil moisture content at a global scale

    Get PDF
    Accurate information on surface soil moisture (SSM) content at a global scale under different climatic conditions is important for hydrological and climatological applications. Machine-learning-based systematic integration of inย situ hydrological measurements, complex environmental and climate data, and satellite observation facilitate the generation of reliable data products to monitor and analyse the exchange of water, energy, and carbon in the Earth system at a proper spaceโ€“time resolution. This study investigates the estimation of daily SSM using 8 optimised machine learning (ML) algorithms and 10 ensemble models (constructed via model bootstrap aggregating techniques and five-fold cross-validation). The algorithmic implementations were trained and tested using International Soil Moisture Network (ISMN) data collected from 1722 stations distributed across the world. The result showed that the K-neighbours Regressor (KNR) had the lowest root-mean-square error (0.0379โ€‰cm3โ€‰cmโˆ’3) on the โ€œtest_randomโ€ set (for testing the performance of randomly split data during training), the Random Forest Regressor (RFR) had the lowest RMSE (0.0599โ€‰cm3โ€‰cmโˆ’3) on the โ€œtest_temporalโ€ set (for testing the performance on the period that was not used in training), and AdaBoost (AB) had the lowest RMSE (0.0786โ€‰cm3โ€‰cmโˆ’3) on the โ€œtest_independent-stationsโ€ set (for testing the performance on the stations that were not used in training). Independent evaluation on novel stations across different climate zones was conducted. For the optimised ML algorithms, the median RMSE values were below 0.1โ€‰cm3โ€‰cmโˆ’3. GradientBoosting (GB), Multi-layer Perceptron Regressor (MLPR), Stochastic Gradient Descent Regressor (SGDR), and RFR achieved a median r score of 0.6 in 12, 11, 9, and 9 climate zones, respectively, out of 15 climate zones. The performance of ensemble models improved significantly, with the median RMSE value below 0.075โ€‰cm3โ€‰cmโˆ’3 for all climate zones. All voting regressors achieved r scores of above 0.6 in 13 climate zones; BSh (hot semi-arid climate) and BWh (hot desert climate) were the exceptions because of the sparse distribution of training stations. The metric evaluation showed that ensemble models can improve the performance of single ML algorithms and achieve more stable results. Based on the results computed for three different test sets, the ensemble model with KNR, RFR and Extreme Gradient Boosting (XB) performed the best. Overall, our investigation shows that ensemble machine learning algorithms have a greater capability with respect to predicting SSM compared with the optimised or base ML algorithms; this indicates their huge potential applicability in estimating water cycle budgets, managing irrigation, and predicting crop yields.</p

    Predicting student satisfaction of emergency remote learning in higher education during COVID-19 using machine learning techniques

    Get PDF
    Despite the wide adoption of emergency remote learning (ERL) in higher education during the COVID-19 pandemic, there is insufficient understanding of influencing factors predicting student satisfaction for this novel learning environment in crisis. The present study investigated important predictors in determining the satisfaction of undergraduate students (N = 425) from multiple departments in using ERL at a self-funded university in Hong Kong while Moodle and Microsoft Team are the key learning tools. By comparing the predictive accuracy between multiple regression and machine learning models before and after the use of random forest recursive feature elimination, all multiple regression, and machine learning models showed improved accuracy while the most accurate model was the elastic net regression with 65.2% explained variance. The results show only neutral (4.11 on a 7-point Likert scale) regarding the overall satisfaction score on ERL. Even majority of students are competent in technology and have no obvious issue in accessing learning devices or Wi-Fi, face-to-face learning is more preferable compared to ERL and this is found to be the most important predictor. Besides, the level of efforts made by instructors, the agreement on the appropriateness of the adjusted assessment methods, and the perception of online learning being well delivered are shown to be highly important in determining the satisfaction scores. The results suggest that the need of reviewing the quality and quantity of modified assessment accommodated for ERL and structured class delivery with the suitable amount of interactive learning according to the learning culture and program nature

    A Data-Driven Approach for Modeling Agents

    Get PDF
    Agents are commonly created on a set of simple rules driven by theories, hypotheses, and assumptions. Such modeling premise has limited use of real-world data and is challenged when modeling real-world systems due to the lack of empirical grounding. Simultaneously, the last decade has witnessed the production and availability of large-scale data from various sensors that carry behavioral signals. These data sources have the potential to change the way we create agent-based models; from simple rules to driven by data. Despite this opportunity, the literature has neglected to offer a modeling approach to generate granular agent behaviors from data, creating a gap in the literature. This dissertation proposes a novel data-driven approach for modeling agents to bridge the research gap. The approach is composed of four detailed steps including data preparation, attribute model creation, behavior model creation, and integration. The connection between and within each step is established using data flow diagrams. The practicality of the approach is demonstrated with a human mobility model that uses millions of location footprints collected from social media. In this model, the generation of movement behavior is tested with five machine learning/statistical modeling techniques covering a large number of model/data configurations. Results show that Random Forest-based learning is the most effective for the mobility use case. Furthermore, agent attribute values are obtained/generated with machine learning and translational assignment techniques. The proposed approach is evaluated in two ways. First, the use case model is compared to another model which is developed using a state-of-the-art data-driven approach. The modelโ€™s prediction performance is comparable to the state-of-the-art model. The plausibility of behaviors and model structure in the use case model is found to be closer to real-world than the state-of-the-art model. This outcome indicates that the proposed approach produces realistic results. Second, a standard mobility dataset is used for driving the mobility model in place of social media data. Despite its small size, the data and model resembled the results gathered from the primary use case indicating the possibility of using different datasets with the proposed approach

    - Case of next-generation transportation market -

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ํ˜‘๋™๊ณผ์ • ๊ธฐ์ˆ ๊ฒฝ์˜ยท๊ฒฝ์ œยท์ •์ฑ…์ „๊ณต, 2020. 8. ์ด์ข…์ˆ˜.The present dissertation aims to provide insights into the application of different artificial neural network models in the analysis of consumer choice regarding next-generation transportation services (NGT). It categorizes consumers decisions regarding the adoption of new services according to Deweys buyer decision process and then analyzes these decisions using a variety of different methods. In particular, various artificial neural network (ANN) models are applied to predict consumers intentions. Also, the dissertation proposes an attention-based ANN model that identifies the key features that affect consumers choices. Consumers preferences for different types of NGT services are analyzed using a hierarchical Bayesian model. The analyzed consumer preferences are utilized to forecast demand for NGT services, evaluate government policies within the transportation market, and provide evidence regarding the social conflicts among traditional and new transportation services. The dissertation uses the Multiple Discrete-Continuous Extreme Value (MDCEV) model to analyze consumers decisions regarding the use of different transportation modes. It also utilizes this MDCEV model analysis to estimate the effect of NGT services on consumers travel mode selection behavior and the environmental effects of the transportation sector. Finally, the findings of the dissertations analyses are combined to generate marketing and policy insights that will promote NGT services in Korea.๋ณธ ์—ฐ๊ตฌ๋Š” ๊ธฐ๊ณ„ํ•™์Šต ๊ธฐ๋ฐ˜์˜ ์ธ๊ณต์ง€๋Šฅ๋ง๊ณผ ๊ธฐ์กด์˜ ํ†ต๊ณ„์  ๋งˆ์ผ€ํŒ… ์„ ํƒ๋ชจํ˜•์„ ํ†ตํ•ฉ์ ์œผ๋กœ ํ™œ์šฉํ•˜์—ฌ ์ œํ’ˆ ๋ฐ ์„œ๋น„์Šค ์ˆ˜์šฉ ์ด๋ก ์œผ๋กœ ์ •์˜๋œ ์†Œ๋น„์ž๋“ค์˜ ์ œํ’ˆ ์ˆ˜์šฉ ํ–‰์œ„๋ฅผ ๋ถ„์„ํ•˜์˜€๋‹ค. ๊ธฐ์กด์˜ ์ œํ’ˆ ์ˆ˜์šฉ ์ด๋ก ๋“ค์€ ์†Œ๋น„์ž๋“ค์˜ ์„ ํƒ์— ๋ผ์น˜๋Š” ์˜ํ–ฅ์„ ๋‹จ๊ณ„๋ณ„๋กœ ์ •์˜ํ•˜์˜€์ง€๋งŒ, ๋Œ€๋ถ€๋ถ„์˜ ์ด๋ก ์€ ์ œํ’ˆ ํŠน์„ฑ์ด ์†Œ๋น„์ž ์„ ํƒ์— ๋ฏธ์น˜๋Š” ์˜ํ–ฅ์„ ๋ถ„์„ํ•˜๊ธฐ ๋ณด๋‹ค๋Š” ์†Œ๋น„์ž๋“ค์˜ ์˜ํ–ฅ, ์ œํ’ˆ์˜ ๋Œ€ํ•œ ์˜๊ฒฌ, ์ง€๊ฐ ์ˆ˜์ค€๊ณผ ์†Œ๋น„์ž ์„ ํƒ์˜ ๊ด€๊ณ„ ๋ถ„์„์— ์ง‘์ค‘ํ•˜์˜€๋‹ค. ๋”ฐ๋ผ์„œ ๋ณธ ์—ฐ๊ตฌ๋Š” ์†Œ๋น„์ž์˜ ์ œํ’ˆ ์ˆ˜์šฉ ์˜ํ–ฅ, ๋Œ€์•ˆ ํ‰๊ฐ€ ๊ทธ๋ฆฌ๊ณ  ์ œํ’ˆ ๋ฐ ์‚ฌ์šฉ๋Ÿ‰ ์„ ํƒ์„ ํฌํ•จํ•˜์—ฌ ๋”์šฑ ํฌ๊ด„์ ์ธ ์ธก๋ฉด์—์„œ ์†Œ๋น„์ž ์ œํ’ˆ ์ˆ˜์šฉ ํ–‰์œ„๋ฅผ ๋ถ„์„ํ•˜์˜€๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์†Œ๋น„์ž์˜ ์ œํ’ˆ ์ˆ˜์šฉ ๊ด€๋ จ ์„ ํƒ์„ ์ด ์„ธ ๋‹จ๊ณ„๋กœ ๋ถ„๋ฅ˜ํ•˜์˜€๋‹ค. ์ฒซ ๋ฒˆ์งธ๋Š” ์†Œ๋น„์ž์˜ ์ œํ’ˆ ์‚ฌ์šฉ ์˜ํ–ฅ์„ ๊ฒฐ์ •ํ•˜๋Š” ๋‹จ๊ณ„, ๋‘ ๋ฒˆ์งธ๋Š” ์ œํ’ˆ๋“ค์˜ ๋Œ€์•ˆ์„ ํ‰๊ฐ€ํ•˜๋Š” ๋‹จ๊ณ„, ์„ธ ๋ฒˆ์งธ๋Š” ์ œํ’ˆ์˜ ์‚ฌ์šฉ๋Ÿ‰์„ ์„ ํƒํ•˜๋Š” ๋‹จ๊ณ„๋กœ, ๊ฐ ๋‹จ๊ณ„๋ฅผ ๋ถ„์„ํ•˜๊ธฐ ์œ„ํ•ด์„œ ๋ณธ ์—ฐ๊ตฌ๋Š” ์ธ๊ณต์ง€๋Šฅ๋ง๊ณผ ํ†ต๊ณ„์  ๋งˆ์ผ€ํŒ… ์„ ํƒ๋ชจํ˜•์„ ํ™œ์šฉํ•˜์˜€๋‹ค. ์ธ๊ณต์ง€๋Šฅ๋ง์€ ์˜ˆ์ธก๊ณผ ๋ถ„๋ฅ˜ํ•˜๋Š” ์ž‘์—…์—์„œ ์›”๋“ฑํ•œ ์„ฑ๋Šฅ์„ ๊ฐ€์ง„ ๋ชจํ˜•์œผ๋กœ ์†Œ๋น„์ž๋“ค์˜ ์ œํ’ˆ ์ˆ˜์šฉ ์˜ํ–ฅ์„ ์˜ˆ์ธกํ•˜๊ณ , ์˜ํ–ฅ ์„ ํƒ์— ์˜ํ–ฅ์„ ์ฃผ๋Š” ์ฃผ์š” ๋ณ€์ˆ˜๋“ค์„ ์‹๋ณ„ํ•˜๋Š” ๋ฐ ํ™œ์šฉ๋˜์—ˆ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ ์ œ์•ˆํ•œ ์ฃผ์š” ๋ณ€์ˆ˜ ์‹๋ณ„์„ ์œ„ํ•œ ์ธ๊ณต์ง€๋Šฅ๋ง์€ ๊ธฐ์กด์˜ ๋ณ€์ˆ˜ ์„ ํƒ ๊ธฐ๋ฒ• ๋ณด๋‹ค ๋ชจํ˜• ์ถ”์ • ์ ํ•ฉ๋„ ์ธก๋ฉด์—์„œ ๋†’์€ ์„ฑ๋Šฅ์„ ๋ณด์˜€๋‹ค. ๋ณธ ๋ชจํ˜•์€ ํ–ฅํ›„ ๋น…๋ฐ์ดํ„ฐ์™€ ๊ฐ™์ด ๋งŽ์€ ์–‘์˜ ์†Œ๋น„์ž ๊ด€๋ จ ๋ฐ์ดํ„ฐ๋ฅผ ์ฒ˜๋ฆฌํ•˜๋Š”๋ฐ ํ™œ์šฉ๋  ๊ฐ€๋Šฅ์„ฑ์ด ํด ๋ฟ๋งŒ ์•„๋‹ˆ๋ผ, ๊ธฐ์กด์˜ ์„ค๋ฌธ ์„ค๊ณ„ ๊ธฐ๋ฒ•์„ ๊ฐœ์„ ํ•˜๋Š”๋ฐ ์šฉ์ดํ•œ ๋ฐฉ๋ฒ•๋ก ์œผ๋กœ ํŒ๋‹จ๋œ๋‹ค. ์†Œ๋น„์ž ์„ ํ˜ธ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ ๋Œ€์•ˆ ํ‰๊ฐ€ ๋ฐ ์‚ฌ์šฉ๋Ÿ‰์„ ๋ถ„์„ํ•˜๊ธฐ ์œ„ํ•ด์„œ ํ†ต๊ณ„์  ์„ ํƒ ๋ชจํ˜• ์ค‘ ๊ณ„์ธต์  ๋ฒ ์ด์ง€์•ˆ ๋ชจํ˜•๊ณผ ํ˜ผํ•ฉ MDCEV ๋ชจํ˜•์„ ํ™œ์šฉํ•˜์˜€๋‹ค. ๊ณ„์ธต์  ๋ฒ ์ด์ง€์•ˆ ๋ชจํ˜•์€๊ฐœ๋ณ„์ ์ธ ์†Œ๋น„์ž ์„ ํ˜ธ๋ฅผ ์ถ”์ •ํ•  ์ˆ˜ ์žˆ๋Š” ์žฅ์ ์ด ์žˆ๊ณ , ํ˜ผํ•ฉ MDCEV ๋ชจํ˜•์˜ ๊ฒฝ์šฐ ์†Œ๋น„์ž๋“ค์˜ ์„ ํ˜ธ๋ฅผ ๊ธฐ๋ฐ˜ํ•˜์—ฌ ์„ ํƒ๋œ ๋Œ€์•ˆ๋“ค๋กœ ๋‹ค์–‘ํ•œ ํฌํŠธํด๋ฆฌ์˜ค๋ฅผ ๊ตฌ์„ฑํ•  ์ˆ˜ ์žˆ๊ณ , ๊ฐ ๋Œ€์•ˆ์— ๋Œ€ํ•œ ์‚ฌ์šฉ๋Ÿ‰์„ ๋ถ„์„ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ œ์•ˆ๋œ ๋ชจํ˜•๋“ค์˜ ์‹ค์ฆ ์—ฐ๊ตฌ๋ฅผ ์œ„ํ•ด ์ฐจ์„ธ๋Œ€ ์ž๋™์ฐจ ์ˆ˜์†ก ์„œ๋น„์Šค์— ๋Œ€ํ•œ ์†Œ๋น„์ž๋“ค์˜ ์‚ฌ์šฉ ์˜ํ–ฅ, ์„œ๋น„์Šค ๋Œ€์•ˆ์— ๋Œ€ํ•œ ์„ ํ˜ธ, ์ˆ˜์†ก ์„œ๋น„์Šค๋ณ„ ์‚ฌ์šฉ๋Ÿ‰์„ ๋ถ„์„ํ•˜์˜€๋‹ค. ์‹ค์ฆ ์—ฐ๊ตฌ์—์„œ๋Š” ์ฐจ์„ธ๋Œ€ ์ž๋™์ฐจ ์ˆ˜์†ก ์„œ๋น„์Šค๋ฅผ ์ˆ˜์šฉํ•˜๊ธฐ๊นŒ์ง€ ์†Œ๋น„์ž๋“ค์ด ๊ฒฝํ—˜ํ•˜๋Š” ๋‹จ๊ณ„๋ณ„ ์„ ํƒ ์ƒํ™ฉ์„ ๋ฐ˜์˜ํ•˜์˜€์œผ๋ฉฐ, ๊ฐ ๋‹จ๊ณ„์—์„œ ๋„์ถœ๋œ ๊ฒฐ๊ณผ๋ฅผ ํ†ตํ•ด ํ–ฅํ›„ ์ฐจ์„ธ๋Œ€ ์ž๋™์ฐจ ์ˆ˜์†ก ์„œ๋น„์Šค์˜ ์„ฑ์žฅ ๊ฐ€๋Šฅ์„ฑ๊ณผ ์†Œ๋น„์ž๋“ค์˜ ์ด๋™ ํ–‰์œ„ ๋ณ€ํ™”์— ๋Œ€ํ•ด ์˜ˆ์ธกํ•˜์˜€๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋ฅผ ํ†ตํ•ด ์ธ๊ณต์ง€๋Šฅ๋ง์ด ์†Œ๋น„์ž ๊ด€๋ จ ์—ฐ๊ตฌ์—์„œ ์œ ์šฉํ•˜๊ฒŒ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€์œผ๋ฉฐ, ์ธ๊ณต์ง€๋Šฅ๋ง๊ณผ ํ†ต๊ณ„์  ๋งˆ์ผ€ํŒ… ์„ ํƒ๋ชจํ˜•์ด ๊ฒฐํ•ฉ๋  ๊ฒฝ์šฐ ์†Œ๋น„์ž๋“ค์˜ ์ œํ’ˆ ์„ ํƒ ํ–‰์œ„๋ฟ๋งŒ ์•„๋‹ˆ๋ผ, ์ œํ’ˆ ์„ ํƒ ์˜์‚ฌ๊ฒฐ์ • ๊ณผ์ • ์ „๋ฐ˜์— ๊ฑธ์ณ ์†Œ๋น„์ž ์„ ํ˜ธ๋ฅผ ํฌ๊ด„์ ์œผ๋กœ ๋ถ„์„ํ•  ์ˆ˜ ์žˆ์Œ์„ ํ™•์ธํ•˜์˜€๋‹ค.Chapter 1. Introduction 1 1.1 Research Background 1 1.2 Research Objective 7 1.3 Research Outline 12 Chapter 2. Literature Review 14 2.1 Product and Technology Diffusion Theory 14 2.1.1. Extension of Adoption Models 19 2.2 Artificial Neural Network 22 2.2.1 General Component of the Artificial Neural Network 22 2.2.2 Activation Functions of Artificial Neural Network 26 2.3 Modeling Consumer Choice: Discrete Choice Model 32 2.3.1 Multinomial Logit Model 32 2.3.2 Mixed Logit Model 34 2.3.3 Latent Class Model 37 2.4 Modeling Consumer Heuristics in Discrete Choice Model 39 2.4.1 Consumer Decision Rule in Discrete Choice Model: Compensatory and Non-Compensatory Models 39 2.4.2 Choice Set Formation Behaviors: Semi-Compensatory Models 42 2.4.3 Modeling Consumer Usage: MDCEV Model 50 2.5 Difference between Artificial Neural Network and Choice Modeling 53 2.6 Limitations of Previous Studies and Research Motivation 58 Chapter 3. Methodology 63 3.1 Artificial Neural Network Models for Prediction 63 3.1.1 Multiple Perceptron Model 63 3.1.2 Convolutional Neural Network 69 3.1.3 Bayesian Neural Network 72 3.2 Feature Identification Model through Attention 77 3.3 Hierarchical Bayesian Model 83 3.4 Multiple Discrete-Continuous Extreme Value Model 86 Chapter 4. Empirical Analysis: Consumer Preference and Selection of Transportation Mode 98 4.1 Empirical Analysis Framework 98 4.2 Data 101 4.2.1 Overview of the Survey 101 4.3 Empirical Study I: Consumer Intention to New Type of Transportation 110 4.3.1 Research Motivation and Goal 110 4.3.2 Data and Model Setup 114 4.3.3 Result and Discussion 123 4.4 Empirical Study II: Consumer Choice and Preference for New Types of Transportation 142 4.4.1 Research Motivation and Goal 142 4.4.2 Data and Model Setup 144 4.4.3 Result and Discussion 149 4.5 Empirical Study III: Impact of New Transportation Mode on Consumers Travel Behavior 163 4.5.1 Research Motivation and Goal 163 4.5.2 Data and Model Setup 164 4.5.3 Result and Discussion 166 Chapter 5. Discussion 182 Bibliography 187 Appendix: Survey used in the analysis 209 Abstract (Korean) 241Docto

    Comparison of the performance of multi-layer perceptron and linear regression for epidemiological data

    No full text
    International audienceNeural networks are used increasingly as statistical models. The performance of multilayer perceptron (MLP) and that of linear regression (LR) were compared, with regard to the quality of prediction and estimation and the robustness to deviations from underlying assumptions of normality, homoscedasticity and independence of errors. Taking into account those deviations, รฟve designs were constructed, and, for each of them, 3000 data were simulated. The comparison between connectionist and linear models was achieved by graphic means including prediction intervals , as well as by classical criteria including goodness-of-รฟt and relative errors. The empirical distribution of estimations and the stability of MLP and LR were studied by re-sampling methods. MLP and linear regression had comparable performance and robustness. Despite the exibility of connectionist models, their predictions were stable. The empirical variances of weight estimations result from the distributed representation of the information among the processing elements. This emphasizes the major role of variances of weight estimations in the interpretation of neural networks. This needs, however, to be conรฟrmed by further studies. Therefore MLP could be useful statistical models, as long as convergence conditions are respected

    Comparison of the performance of multi-layer perceptron and linear regression for epidemiological data

    No full text
    International audienceNeural networks are used increasingly as statistical models. The performance of multilayer perceptron (MLP) and that of linear regression (LR) were compared, with regard to the quality of prediction and estimation and the robustness to deviations from underlying assumptions of normality, homoscedasticity and independence of errors. Taking into account those deviations, รฟve designs were constructed, and, for each of them, 3000 data were simulated. The comparison between connectionist and linear models was achieved by graphic means including prediction intervals , as well as by classical criteria including goodness-of-รฟt and relative errors. The empirical distribution of estimations and the stability of MLP and LR were studied by re-sampling methods. MLP and linear regression had comparable performance and robustness. Despite the exibility of connectionist models, their predictions were stable. The empirical variances of weight estimations result from the distributed representation of the information among the processing elements. This emphasizes the major role of variances of weight estimations in the interpretation of neural networks. This needs, however, to be conรฟrmed by further studies. Therefore MLP could be useful statistical models, as long as convergence conditions are respected

    Modeling the risks of age-related eye diseases in a population in South India

    Full text link
    The objective of this research was to determine whether an artificial intelligence methodology such as artificial neural network (ANN), a new type of predictive model offers an increased performance over a conventional logistic regression model (LR) in predicting the ranking of risk factors for irreversible age-related chronic eye diseases age-related macular degeneration (AMD), diabetic retinopathy (DR), primary open-angle glaucoma (POAG) and primary angle-closure glaucoma (PACG) in a South Indian population. The LR and ANN models were derived and validated for their respective models predictive accuracy based on a sample (n=3,723) aged >=40 years old by using a large scale population-based epidemiologic study. Sub-population data were drawn from this sample by appropriate standard techniques that used for modeling. The LR based risk score models (RS) were derived and the model fit was assessed in a standard manner including the bootstrap method for internal validity. The ANN model was built by using the multi-layer feed-forward back propagation network. The ANN models predictive ability was compared with that of traditional model with respect to the Area under the Receiver Operating Characteristic Curve (AUROC). The sensitivity and specificity of the fitted models with a threshold criterion ranged from 70% to nearly 99% overall for all models. The ANN model outperformed the traditional LR model in a sub-population analysis in predicting AMD and DR. The predictive accuracy of ANN and LR model in predicting AMD was statistically significant (AUROC=89% vs 79%; p=10 year (RS ranged from 29 to 42) was a highest priority predictor for DR. The modifiable risk factor intraocular pressure was in order of highest priority predictor for POAG and PACG. Population attributable risk percentage and population attributable fractions revealed that there is an urgent need of prioritizing modifying the modifiable factors as a public health approach. This was supported by a sensitivity analysis of the ANN model which indicated the relative importance of prioritizing modifiable risk factors on which to base preventive interventions to reduce the impact of onset or progression of these diseases
    corecore