7 research outputs found

    Identification of patient classes in low back pain data using crisp and fuzzy clustering methods

    Get PDF
    We performed a cluster analysis of the low back pain dataset in the framework of the IFCS-2017 data challenge. Because the original data contained missing values, the first part of our analysis concerned the imputation of missing values using the Fully Conditional Specification model. The Local Outlier Factor method was then used to detect and eliminate the outliers. After the data normalization, we removed highly correlated variables from the transformed dataset and carried out k-means clustering of the remaining variables based on their correlations, i.e., the variables with the highest mutual correlations were assigned to the same cluster. Once the variables were assigned to different clusters, one representative per cluster, i.e., the variable with the highest contribution score at the first principal component, was selected. Among the 13 selected variables, there are representatives of each of the 6 variable domains (contextual factor, participation, pain, psychological, activity and physical impairment), specified as important in the paper by Nielsen et al. (2016). Different clustering methods, including DAPC, k-means and k-medoids, were then carried out to cluster the reduced low back pain data. Consensus solutions, both crisp and fuzzy, were calculated using the GV3 method. The obtained crisp consensus clustering, including 5 classes, was described in detail and compared to the meta-data annotation

    The clinical relevance of formal thought disorder in the early stages of psychosis: results from the PRONIA study

    Get PDF
    Background: Formal thought disorder (FTD) has been associated with more severe illness courses and functional deficits in patients with psychotic disorders. However, it remains unclear whether the presence of FTD characterises a specific subgroup of patients showing more prominent illness severity, neurocognitive and functional impairments. This study aimed to identify stable and generalizable FTD-subgroups of patients with recent-onset psychosis (ROP) by applying a comprehensive data-driven clustering approach and to test the validity of these subgroups by assessing associations between this FTD-related stratification, social and occupational functioning, and neurocognition. Methods: 279 patients with ROP were recruited as part of the multi-site European PRONIA study (Personalised Prognostic Tools for Early Psychosis Management; www.pronia.eu). Five FTD-related symptoms (conceptual disorganization, poverty of content of speech, difficulty in abstract thinking, increased latency of response and poverty of speech) were assessed with Positive and Negative Symptom Scale (PANSS) and the Scale for the Assessment of Negative Symptoms (SANS). Results: The results with two patient subgroups showing different levels of FTD were the most stable and generalizable clustering solution (predicted clustering strength value = 0.86). FTD-High subgroup had lower scores in social (p fdr < 0.001) and role (p fdr < 0.001) functioning, as well as worse neurocognitive performance in semantic (p fdr < 0.001) and phonological verbal fluency (p fdr < 0.001), short-term verbal memory (p fdr = 0.002) and abstract thinking (p fdr = 0.010), in comparison to FTD-Low group. Conclusions: Clustering techniques allowed us to identify patients with more pronounced FTD showing more severe deficits in functioning and neurocognition, thus suggesting that FTD may be a relevant marker of illness severity in the early psychosis pathway

    The clinical relevance of formal thought disorder in the early stages of psychosis: results from the PRONIA study

    Get PDF
    Background Formal thought disorder (FTD) has been associated with more severe illness courses and functional deficits in patients with psychotic disorders. However, it remains unclear whether the presence of FTD characterises a specific subgroup of patients showing more prominent illness severity, neurocognitive and functional impairments. This study aimed to identify stable and generalizable FTD-subgroups of patients with recent-onset psychosis (ROP) by applying a comprehensive data-driven clustering approach and to test the validity of these subgroups by assessing associations between this FTD-related stratification, social and occupational functioning, and neurocognition. Methods 279 patients with ROP were recruited as part of the multi-site European PRONIA study (Personalised Prognostic Tools for Early Psychosis Management; www.pronia.eu). Five FTD-related symptoms (conceptual disorganization, poverty of content of speech, difficulty in abstract thinking, increased latency of response and poverty of speech) were assessed with Positive and Negative Symptom Scale (PANSS) and the Scale for the Assessment of Negative Symptoms (SANS). Results The results with two patient subgroups showing different levels of FTD were the most stable and generalizable clustering solution (predicted clustering strength value = 0.86). FTD-High subgroup had lower scores in social (p(fdr) Conclusions Clustering techniques allowed us to identify patients with more pronounced FTD showing more severe deficits in functioning and neurocognition, thus suggesting that FTD may be a relevant marker of illness severity in the early psychosis pathway.</p

    Detecting metro service disruptions via large-scale vehicle location data

    Get PDF
    Urban metro systems are often affected by disruptions such as infrastructure malfunctions, rolling stock breakdowns and accidents. The crucial prerequisite of any disruption analytics is to have accurate information about the location, occurrence time, duration and propagation of disruptions. To pursue this goal, we detect the abnormal deviations in trains’ headway relative to their regular services by using Gaussian mixture models. Our method is a unique contribution in the sense that it proposes a novel, probabilistic, unsupervised clustering framework and it can effectively detect any type of service interruptions, including minor delays of just a few minutes. In contrast to traditional manual inspections and other detection methods based on social media data or smart card data, which suffer from human errors, limited monitoring coverage, and potential bias, our approach uses information on train trajectories derived from automated vehicle location (train movement) data. As an important research output, this paper delivers innovative analyses of the propagation progress of disruptions along metro lines, which enables us to distinguish primary and secondary disruptions as well as effective recovery interventions performed by operators

    Disruption analytics in urban metro systems with large-scale automated data

    Get PDF
    Urban metro systems are frequently affected by disruptions such as infrastructure malfunctions, rolling stock breakdowns and accidents. Such disruptions give rise to delays, congestion and inconvenience for public transport users, which in turn, lead to a wider range of negative impacts on the social economy and wellbeing. This PhD thesis aims to improve our understanding of disruption impacts and improve the ability of metro operators to detect and manage disruptions by using large-scale automated data. The crucial precondition of any disruption analytics is to have accurate information about the location, occurrence time, duration and propagation of disruptions. In pursuit of this goal, the thesis develops statistical models to detect disruptions via deviations in trains’ headways relative to their regular services. Our method is a unique contribution in the sense that it is based on automated vehicle location data (data-driven) and the probabilistic framework is effective to detect any type of service interruptions, including minor delays that last just a few minutes. As an important research outcome, the thesis delivers novel analyses of the propagation progress of disruptions along metro lines, thus enabling us to distinguish primary and secondary disruptions as well as recovery interventions performed by operators. The other part of the thesis provides new insights for quantifying disruption impacts and measuring metro vulnerability. One of our key messages is that in metro systems there are factors influencing both the occurrence of disruptions and their outcomes. With such confounding factors, we show that causal inference is a powerful tool to estimate unbiased impacts on passenger demand and journey time, which is also capable of quantifying the spatial-temporal propagation of disruption impacts within metro networks. The causal inference approaches are applied to empirical studies based on the Hong Kong Mass Transit Railway (MTR). Our conclusions can assist researchers and practitioners in two applications: (i) the evaluation of metro performance such as service reliability, system vulnerability and resilience, and (ii) the management of future disruptions.Open Acces

    Climatology of warm season heat waves in Saudi Arabia: a time-sensitive approach

    Get PDF
    Doctor of PhilosophyDepartment of GeographyJohn A. Harrington JrThe climate of the Middle East is warming and extreme hot temperature events are becoming more common, as observed by the significant upward trends in mean and extreme temperatures during the last few decades. Climate modeling studies suggest that the frequency, intensity, and duration of extreme temperature events are expected to increase as the global and local climate continues to warm. Existing literature about heat waves (HWs) in Saudi Arabia provides information about HW duration using a single index, without considering the observed effects of climate change and the subtropical arid climate. With that in mind, this dissertation provides a series of three stand-alone papers evaluating temporal, geographic, and atmospheric aspects of the character of warm season (May-September) HWs in Saudi Arabia for 1985 to 2014. Chapter 2 examines the temporal behavior(s) of the frequency, duration, and intensity of HWs under the observed recent climate change. Several issues are addressed including the identification of some improved methodological practices for HW indices. A time-sensitive approach to define and detect HWs is proposed and assessed. HW events and their duration are considered as count data; thus, different Poisson models were used for trend detection. Chapter 3 addresses the spatio-temporal patterns of the frequency and intensity of hot days and nights, and HWs. The chapter reemphasizes the importance of considering the on-goings effects of climate warming and applies a novel time-series clustering approach to recognize hot temperature event behavior through time and space. Chapter 4 explores the atmospheric circulation conditions that are associated with warm season HW event occurrence and how different HWs aspects are related to different circulation types. Further, possible teleconnections between HWs and sea surface temperature (SST) anomalies of nearby large bodies are examined. Results from Chapters 2 and 3 detected systematic upward trends in maximum and minimum temperatures at most of the 25 stations, suggesting an on-going change in the climatology of the upper-tail of the frequency distribution. The analysis demonstrated the value of using a time-sensitive approach in studying extreme thermal events. Different patterns were observed over time and space not only across stations but also among extreme temperature events (i.e., hot days and nights, and HWs). The overall results suggest that not only local and regional factors, such as elevation, latitude, land cover, atmospheric humidity, and distance from a large body of water, but also large-scale factors such as atmospheric circulation patterns are responsible for the observed temporal and spatial patterns. Chapter 4 confirmed that as the Indian Summer Monsoon Trough and the Arabian heat low were key atmospheric features related to HW days. SST anomalies seemed to be a more important factor for HWs intensity. Extreme thermal events in Saudi Arabia tended to occur during regional warming due to atmospheric circulation conditions and SSTs teleconnections. This study documents the value of a time-sensitive approach and should initiate further research as some of temporal and spatial variabilities were not fully explaine
    corecore