7 research outputs found

    Steering a Historical Disease Forecasting Model Under a Pandemic: Case of Flu and COVID-19

    Full text link
    Forecasting influenza in a timely manner aids health organizations and policymakers in adequate preparation and decision making. However, effective influenza forecasting still remains a challenge despite increasing research interest. It is even more challenging amidst the COVID pandemic, when the influenza-like illness (ILI) counts are affected by various factors such as symptomatic similarities with COVID-19 and shift in healthcare seeking patterns of the general population. Under the current pandemic, historical influenza models carry valuable expertise about the disease dynamics but face difficulties adapting. Therefore, we propose CALI-Net, a neural transfer learning architecture which allows us to 'steer' a historical disease forecasting model to new scenarios where flu and COVID co-exist. Our framework enables this adaptation by automatically learning when it should emphasize learning from COVID-related signals and when it should learn from the historical model. Thus, we exploit representations learned from historical ILI data as well as the limited COVID-related signals. Our experiments demonstrate that our approach is successful in adapting a historical forecasting model to the current pandemic. In addition, we show that success in our primary goal, adaptation, does not sacrifice overall performance as compared with state-of-the-art influenza forecasting approaches.Comment: Appears in AAAI-2

    Triaging Content Severity in Online Mental Health Forums

    Get PDF
    Mental health forums are online communities where people express their issues and seek help from moderators and other users. In such forums, there are often posts with severe content indicating that the user is in acute distress and there is a risk of attempted self-harm. Moderators need to respond to these severe posts in a timely manner to prevent potential self-harm. However, the large volume of daily posted content makes it difficult for the moderators to locate and respond to these critical posts. We present a framework for triaging user content into four severity categories which are defined based on indications of self-harm ideation. Our models are based on a feature-rich classification framework which includes lexical, psycholinguistic, contextual and topic modeling features. Our approaches improve the state of the art in triaging the content severity in mental health forums by large margins (up to 17% improvement over the F-1 scores). Using the proposed model, we analyze the mental state of users and we show that overall, long-term users of the forum demonstrate a decreased severity of risk over time. Our analysis on the interaction of the moderators with the users further indicates that without an automatic way to identify critical content, it is indeed challenging for the moderators to provide timely response to the users in need.Comment: Accepted for publication in Journal of the Association for Information Science and Technology (2017

    Using large-scale syndromic datasets to support epidemiology and surveillance

    Get PDF
    Using large-scale syndromic datasets to support epidemiology and surveillance Healthcare and the healthcare industry have traditionally produced huge amounts of data and information; patient care necessitates accurate record keeping, records of attendances and often details of the reason for contact with healthcare and outcomes.7 During the past decade, there has been a dramatic shift to digitize healthcare related information, with a view to both increasing efficiencies in these areas, and to generate new insights.8 These rich, but often unstructured data sources can present both opportunities and challenges to data scientists and epidemiologists. Syndromic surveillance (SS) is the real-time (or near real-time) collection, analysis, interpretation, and dissemination of health-related data to enable the early identification of the impact (or absence of impact) of potential human or veterinary public-health threats which require effective public-health action.9 In England, Public Health England (PHE) coordinates a suite of national real-time syndromic surveillance systems. Underpinning their operation is the collation, analysis and interpretation of large-scale datasets (“big data”). This PhD by Published Works describes work which has evaluated, developed or utilised a number of these large healthcare datasets for both surveillance and epidemiology of public health events. The thesis is divided into four themes covering critical aspects of SS. Firstly, developing SS systems using novel data sources; something which is currently under-reported in the literature. Secondly, using syndromic data systems for non-infectious disease epidemiology; understanding how these systems can inform public health insight and action outside of their original remit. Thirdly, determining the utility in identifying outbreaks which was one of the original envisioned purposes of SS, using gastrointestinal illness (GI) as a case-study. The final theme is understanding how SS is used in the context of mass gatherings; again, a key original aspect of syndromic surveillance. The thesis collates a portfolio of indexed works, all of which use (combined with other data sources) large, health-related data collated and operated by the PHE Real-Time Syndromic Surveillance Team (ReSST) and employ a range of different methodologies to translate data into public health action. These include describing the development of a novel system, observational studies and time series analysis. Key findings from the papers include; learning how to develop these systems, demonstration of their utility in non-infectious disease epidemiology, leading to new insights into the socio-demographic distribution and causes of presentations to healthcare with Allergic Rhinitis, understanding the challenges and limitations of syndromic surveillance in identifying outbreaks of GI disease and how they can be used during mass gatherings. Using diverse methodologies and data as a collective, the papers have led to significant public health impacts; both in terms of how these systems are used in England currently and how they have influenced global development of this small but growing specialit

    Comparative evaluation of methods that adjust for reporting biases in participatory surveillance systems

    Get PDF
    Over the past decade the widespread proliferation of mobile devices and wearable technology has significantly changed the landscape of epidemiological data gathering and evolved into a field known as Digital Epidemiology. One source of active digital data collection is online participatory syndromic surveillance systems. These systems actively engage the general public in reporting health-related information and provide timely information about disease trends within the community. This dissertation comprehensively addresses how researchers can effectively use this type of data to answer questions about Influenza-like Illness (ILI) disease burden in the general population. We assess the representativeness and reporting habits of volunteers for these systems and use this information to develop statistically rigorous methods that adjust for potential biases. Specifically, we evaluate how different missing data methods, such as complete case and multiple imputation models, affect estimates of ILI disease burden using both simulated data as well as data from the Australian system, Flutracking.net. We then extend these methods to data from the American system, Flu Near You, which has different patterns. Finally, we provide examples of how this data has been used to answer questions about ILI in the general community and promote better understanding of disease surveillance and data literacy among volunteers
    corecore