523 research outputs found

    Predicting Automobile Accident Severity and Hotspots Using Multinomial Logistic Regression

    Get PDF
    Title: Predicting automobile accident severity and hotspots using multinomial logistic regression. Americans are now driving more than ever [1]. In 2010, close to 33,000 lives were lost and another estimated 3.9 million people were injured in automobile accidents; all things considered, these accidents accounted for $836 billion in damages [2]. Since then, the rate of automobile-related deaths per 100 million miles traveled has not shown signs of improvement [3]. This research expands upon a previous year’s poster presented at the South Dakota State University Data Science Symposium 2019 [4]. While the previous research focuses on a data visualization of automobile accident hotspots on a map based on the severity and frequency of accidents, this research aims to train a multinomial logistic regression machine learning model using data related to weather conditions, speed limit, and GPS coordinates to predict the severity of automobile accidents. The development of such a machine learning model can help inform emergency services better manage resources in anticipation of potential automobile accidents based on prevailing weather conditions, speed limit along a stretch of road, and location data. An updated version of the previous dataset will be used. This dataset contains approximately 1.5 million automobile accident data points, collected over a span of over four years, from February 2016 to December 2020. References [1] US Department of Transportation. Federal Highway Administration. (May, 2019). Strong economy has Americans driving more than ever before. Retrieved from https://cms8.fhwa.dot.gov/newsroom/strong-economy-has-americans-driving-more-ever [2] Blincoe, L. J., Miller, T. R., Zaloshnja, E., & Lawrence, B. A. (2015, May). The economic and societal impact of motor vehicle crashes, 2010. (Revised)(Report No. DOT HS 812 013). Washington, DC: National Highway Traffic Safety Administration. [3] National Center for Statistics and Analysis (2019, December). Early estimate of motor vehicle traffic fatalities for the first 9 months (Jan – Sep) of 2019. (Crash Stats Brief Statistical Summary. Report No. DOT HS 812 874). Washington, DC: Highway Traffic Safety Administration. [4] Identification of Automobile Accident Hotspots using Countrywide Traffic Accident Dataset, B. Z. Yang & S. Z. Sajal, Ph.D., Presented at 2020 South Dakota State University Data Science Symposium

    GR-232 - Accident Prediction using Big Data Analysis using Ensemble Learning

    Get PDF
    Many country wide internal roads and national highways have dim lights or no street lights all over the world. We usually observe some turns on roads more prone to accidents than other places. In our paper we used Logistic Regression and Decision Trees that together build an Ensemble learning for predicting these accident zones. For validating the results, five evaluation metrics such as Accuracy, Precision, f-measures, Re-call and Area under curve are used. State of art model for US accident dataset gives F1 score of 57%. We are implementing using ensemble learning wherein logistic regression gives F1-score of 53%

    Traffic incident duration prediction via a deep learning framework for text description encoding

    Full text link
    Predicting the traffic incident duration is a hard problem to solve due to the stochastic nature of incident occurrence in space and time, a lack of information at the beginning of a reported traffic disruption, and lack of advanced methods in transport engineering to derive insights from past accidents. This paper proposes a new fusion framework for predicting the incident duration from limited information by using an integration of machine learning with traffic flow/speed and incident description as features, encoded via several Deep Learning methods (ANN autoencoder and character-level LSTM-ANN sentiment classifier). The paper constructs a cross-disciplinary modelling approach in transport and data science. The approach improves the incident duration prediction accuracy over the top-performing ML models applied to baseline incident reports. Results show that our proposed method can improve the accuracy by 60%60\% when compared to standard linear or support vector regression models, and a further 7%7\% improvement with respect to the hybrid deep learning auto-encoded GBDT model which seems to outperform all other models. The application area is the city of San Francisco, rich in both traffic incident logs (Countrywide Traffic Accident Data set) and past historical traffic congestion information (5-minute precision measurements from Caltrans Performance Measurement System)

    Accident Risk Prediction based on Heterogeneous Sparse Data: New Dataset and Insights

    Full text link
    Reducing traffic accidents is an important public safety challenge, therefore, accident analysis and prediction has been a topic of much research over the past few decades. Using small-scale datasets with limited coverage, being dependent on extensive set of data, and being not applicable for real-time purposes are the important shortcomings of the existing studies. To address these challenges, we propose a new solution for real-time traffic accident prediction using easy-to-obtain, but sparse data. Our solution relies on a deep-neural-network model (which we have named DAP, for Deep Accident Prediction); which utilizes a variety of data attributes such as traffic events, weather data, points-of-interest, and time. DAP incorporates multiple components including a recurrent (for time-sensitive data), a fully connected (for time-insensitive data), and a trainable embedding component (to capture spatial heterogeneity). To fill the data gap, we have - through a comprehensive process of data collection, integration, and augmentation - created a large-scale publicly available database of accident information named US-Accidents. By employing the US-Accidents dataset and through an extensive set of experiments across several large cities, we have evaluated our proposal against several baselines. Our analysis and results show significant improvements to predict rare accident events. Further, we have shown the impact of traffic information, time, and points-of-interest data for real-time accident prediction.Comment: In Proceedings of the 27th ACM SIGSPATIAL, International Conference on Advances in Geographic Information Systems (2019). arXiv admin note: substantial text overlap with arXiv:1906.0540

    Predicting Accident Severity: An Analysis Of Factors Affecting Accident Severity Using Random Forest Model

    Full text link
    Road accidents have significant economic and societal costs, with a small number of severe accidents accounting for a large portion of these costs. Predicting accident severity can help in the proactive approach to road safety by identifying potential unsafe road conditions and taking well-informed actions to reduce the number of severe accidents. This study investigates the effectiveness of the Random Forest machine learning algorithm for predicting the severity of an accident. The model is trained on a dataset of accident records from a large metropolitan area and evaluated using various metrics. Hyperparameters and feature selection are optimized to improve the model's performance. The results show that the Random Forest model is an effective tool for predicting accident severity with an accuracy of over 80%. The study also identifies the top six most important variables in the model, which include wind speed, pressure, humidity, visibility, clear conditions, and cloud cover. The fitted model has an Area Under the Curve of 80%, a recall of 79.2%, a precision of 97.1%, and an F1 score of 87.3%. These results suggest that the proposed model has higher performance in explaining the target variable, which is the accident severity class. Overall, the study provides evidence that the Random Forest model is a viable and reliable tool for predicting accident severity and can be used to help reduce the number of fatalities and injuries due to road accidents in the United StatesComment: 15 page

    Correlating Extreme Weather Conditions With Road Traffic Safety: A Unified Latent Space Model

    Get PDF
    The presence of extreme weather conditions is known to expose drivers to a higher risk to incur in road accidents. Quantifying the correlation between adverse weather conditions and road traffic safety is useful for several reasons such as planning preventive actions, managing vehicle fleets, and configuring alerting systems. However, since the risk of road accidents occurrences within a specific spatial region is influenced by several factors other than the weather conditions, quantifying the actual impact of adverse weather phenomena regardless of the effect of weather-unrelated conditions can be challenging. To tackle the aforesaid issue, this paper proposes to adopt a unified latent space model based on time series embeddings. Firstly, it encodes a subset of historical series reporting weather-related accident occurrences in specific risky areas into the high-dimensional vector representation. It also encodes the weather element measurements acquired by meteorological stations spread over the analyzed area. Then, to estimate the risk level of each region within the same spatial context it seeks the temporal risk patterns that are most similar to those observed in risky areas. The experiments carried out in a real case study confirm the applicability of the proposed approach

    Burden of injuries in Nepal, 1990–2017: Findings from the Global Burden of Disease Study 2017

    Get PDF
    Background: Nepal is a low-income country undergoing rapid political, economic and social development. To date, there has been little evidence published on the burden of injuries during this period of transition.Methods: The Global Burden of Disease Study (GBD) is a comprehensive measurement of population health outcomes in terms of morbidity and mortality. We analysed the GBD 2017 estimates for deaths, years of life lost, years lived with disability, incidence and disability-adjusted life years (DALYs) from injuries to ascertain the burden of injuries in Nepal from 1990 to 2017.Results: There were 16 831 (95% uncertainty interval 13 323 to 20 579) deaths caused by injuries (9.21% of all-cause deaths (7.45% to 11.25%)) in 2017 while the proportion of deaths from injuries was 6.31% in 1990. Overall, the injury-specific age-standardised mortality rate declined from 88.91 (71.54 to 105.31) per 100 000 in 1990 to 70.25 (56.75 to 85.11) per 100 000 in 2017. In 2017, 4.11% (2.47% to 6.10%) of all deaths in Nepal were attributed to transport injuries, 3.54% (2.86% to 4.08%) were attributed to unintentional injuries and 1.55% (1.16% to 1.85%) were attributed to self-harm and interpersonal violence. From 1990 to 2017, road injuries, falls and self-harm all rose in rank for all causes of death.Conclusions: The increase in injury-related deaths and DALYs in Nepal between 1990 and 2017 indicates the need for further research and prevention interventions. Injuries remain an important public health burden in Nepal with the magnitude and trend of burden varying over time by cause-specific, sex and age group. Findings from this study may be used by the federal, provincial and local governments in Nepal to prioritise injury prevention as a public health agenda and as evidence for country-specific interventions
    • …
    corecore