2 research outputs found

    Functional clustering methods for binary longitudinal data with temporal heterogeneity

    Full text link
    In the analysis of binary longitudinal data, it is of interest to model a dynamic relationship between a response and covariates as a function of time, while also investigating similar patterns of time-dependent interactions. We present a novel generalized varying-coefficient model that accounts for within-subject variability and simultaneously clusters varying-coefficient functions, without restricting the number of clusters nor overfitting the data. In the analysis of a heterogeneous series of binary data, the model extracts population-level fixed effects, cluster-level varying effects, and subject-level random effects. Various simulation studies show the validity and utility of the proposed method to correctly specify cluster-specific varying-coefficients when the number of clusters is unknown. The proposed method is applied to a heterogeneous series of binary data in the German Socioeconomic Panel (GSOEP) study, where we identify three major clusters demonstrating the different varying effects of socioeconomic predictors as a function of age on the working status

    What impacts matriculation decisions? A double-blind experiment via an AI-led chatbot trained with social media data

    Get PDF
    This thesis explores students’ matriculation decision factors via an AI-led chatbot trained with social media data. The novelty of this thesis resides in the following methodological approaches: Firstly, it employs data mining and text analytics techniques to explore the use of topic modelling and a systematic literature reviewing technique called algorithmic document sequencing to identify decision factors from social media to be integrated to the internal model of the AI through a methodological pluralist approach. Secondly, it introduces a chatbot design and strategy for an AI-led chat survey generating both unstructured qualitative and structured quantitative primary data. Finally, upon interviewing 1193 participants around the world, a double-blind true experiment was run seamlessly without human intervention by the AI testing hypotheses and determining the factors that impact students' university choices. The thesis showcases how AI can efficiently interview participants and collect their input, offering robust evidence through an RCT (Gold standard) to establish causal relationships between interventions and their outcomes. One significant contribution of the thesis lies in aiding higher education institutions in understanding the global factors influencing students' university choices and the role of electronic word-of-mouth on social media platforms. More importantly, the research enhances knowledge in identifying themes from social media and literature, facilitating the training of AI-augmented chatbots with these themes, and designing such chatbots to run large scale social RCTs. These developments may enable researchers from a wide range of fields to collect qualitative and quantitative data from large samples, run double-blind true experiments with the AI and produce statistically reproducible, reliable, and generalisable results
    corecore