Supporting health data science education with learning analytics and artificial intelligence

Abstract

Health Data Science (HDS) is a new and interdisciplinary field that leverages computational methods, such as data science, statistics, and artificial intelligence, to analyse health, medical, or biological data. There is limited knowledge about how to teach and learn HDS topics effectively. Despite the urgent need to train health data scientists, educational institutions worldwide struggle to teach these interdisciplinary concepts effectively. This challenge is particularly evident in postgraduate studies, where learners come from diverse backgrounds. Additionally, there is a lack of clear guidelines on effective learning strategies in HDS to support students in addressing their educational challenges. To address this gap, this thesis aims to offer insights into health data science education through three key objectives. Firstly, it aims to uncover engagement patterns with HDS resources, including how learners interact with various types of educational resources and topics (Objective 1). Secondly, it aims to identify students’ learning tactics and strategies in health data science and explore how they impact student performance (Objective 2). Lastly, the thesis aims to develop an Artificial Intelligence (AI) assistant for the early prediction of struggling students in HDS and to provide them with personalised suggestions (Objective 3). A range of methods were employed, including qualitative content analysis, quantitative analysis of questionnaire data, and AI techniques on log data from online learning platforms. The qualitative content analysis by interviewing early career researchers from two UK institutes was conducted to understand and elaborate on the challenges they face in HDS education. Subsequent analysis then explored the self-reported learning strategies used by these researchers to overcome their educational challenges while shedding light on effective teaching methods and course design in HDS. A survey questionnaire was used to identify learning strategies and course design preferences in HDS among undergraduate and postgraduate learners across three HDS courses with varied modalities, designs, and academic levels. AI methods were used to uncover learners’ engagement with different resources as well as learning tactics and strategies based on clickstream data, reflecting the actual behaviours of learners. The thesis also analysed the correlation between the use of these learning tactics and strategies and students’ performance. Finally, a novel AI assistant was developed based on HDS students’ behaviours and learning process. The AI method used explainable AI and a counterfactual approach to provide suggestions that can help improve students’ performance. The results related to the first objective of this thesis indicate that students in HDS preferred a variety of resources and believed visually impressive materials enhance their understanding of HDS topics. Among these topics, machine learning emerged as the most interesting, while analysing genomic data was identified as the most difficult topic. Regarding the second objective, a range of learning tactics and strategies were identified, among which, connecting new information to prior knowledge and peer learning showed the highest correlation with student performance. The results also showed that students who engaged deeply with the material and applied different learning tactics throughout the course achieved higher grades than others. For the third objective, the novel AI assistant showed strong accuracy in predicting students’ performance, with an Area Under Curve (AUC) between 83% and 91%. The feedback generated by the AI, as evaluated by the course organiser, confirmed the validity of the suggestions. Additionally, the evaluation of the AI method on an external dataset outperformed traditional machine learning algorithms. The results of this thesis can potentially improve HDS education in the following ways. It can help develop effective courses through the appropriate design of learning materials (Objective 1). Also, evidence from Objective 2 can lead to guidelines that can help learners better manage their educational challenges. The AI model developed (Objective 3) can be used by teachers to identify students needing help and provide them with personalised suggestions according to their learning process. The evidence-based guidelines derived from this thesis have the potential to empower both learners and educators in HDS. We consider these to be significant contributions to the critical yet under-investigated field of HDS education

Similar works

Full text

thumbnail-image

Edinburgh Research Archive

redirect
Last time updated on 24/03/2025

This paper was published in Edinburgh Research Archive.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.

Licence: CC BY-NC-ND 4.0 Attribution-NonCommercial-NoDerivatives 4.0 International Deed