2 research outputs found

    An Exploration of Features Impacting Respiratory Diseases in Urban Areas

    No full text
    Air pollution exposure has become ubiquitous and is increasingly detrimental to human health. Small Particulate matter (PM) is one of the most harmful forms of air pollution. It can easily infiltrate the lungs and trigger several respiratory diseases, especially in vulnerable populations such as children and elderly people. In this work, we start by leveraging a retrospective study of 416 children suffering from respiratory diseases. The study revealed that asthma prevalence was the most common among several respiratory diseases, and that most patients suffering from those diseases live in areas of high traffic, noise, and greenness. This paved the way to the construction of the MOREAIR dataset by combining feature abstraction and micro-level scale data collection. Unlike existing data sets, MOREAIR is rich in context-specific components, as it includes 52 temporal or geographical features, in addition to air-quality measurements. The use of Random Forest uncovered the most important features for the understanding of air-quality distribution in Moroccan urban areas. By linking the medical data and the MOREAIR dataset, we observed that the patients included in the medical study come mostly from neighborhoods that are characterized by either high average or high variations of pollution levels

    Machine Learning for Predicting the Risk for Childhood Asthma Using Prenatal, Perinatal, Postnatal and Environmental Factors

    No full text
    The prevalence rate for childhood asthma and its associated risk factors vary significantly across countries and regions. In the case of Morocco, the scarcity of available medical data makes scientific research on diseases such as asthma very challenging. In this paper, we build machine learning models to predict the occurrence of childhood asthma using data from a prospective study of 202 children with and without asthma. The association between different factors and asthma diagnosis is first assessed using a Chi-squared test. Then, predictive models such as logistic regression analysis, decision trees, random forest and support vector machine are used to explore the relationship between childhood asthma and the various risk factors. First, data were pre-processed using a Chi-squared feature selection, 19 out of the 36 factors were found to be significantly associated (p-value < 0.05) with childhood asthma; these include: history of atopic diseases in the family, presence of mites, cold air, strong odors and mold in the child’s environment, mode of birth, breastfeeding and early life habits and exposures. For asthma prediction, random forest yielded the best predictive performance (accuracy = 84.9%), followed by logistic regression (accuracy = 82.57%), support vector machine (accuracy = 82.5%) and decision trees (accuracy = 75.19%). The decision tree model has the advantage of being easily interpreted. This study identified important maternal and prenatal risk factors for childhood asthma, the majority of which are avoidable. Appropriate steps are needed to raise awareness about the prenatal risk factors
    corecore