9 research outputs found

    Natural Language Processing in Electronic Health Records in Relation to Healthcare Decision-making: A Systematic Review

    Full text link
    Background: Natural Language Processing (NLP) is widely used to extract clinical insights from Electronic Health Records (EHRs). However, the lack of annotated data, automated tools, and other challenges hinder the full utilisation of NLP for EHRs. Various Machine Learning (ML), Deep Learning (DL) and NLP techniques are studied and compared to understand the limitations and opportunities in this space comprehensively. Methodology: After screening 261 articles from 11 databases, we included 127 papers for full-text review covering seven categories of articles: 1) medical note classification, 2) clinical entity recognition, 3) text summarisation, 4) deep learning (DL) and transfer learning architecture, 5) information extraction, 6) Medical language translation and 7) other NLP applications. This study follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. Result and Discussion: EHR was the most commonly used data type among the selected articles, and the datasets were primarily unstructured. Various ML and DL methods were used, with prediction or classification being the most common application of ML or DL. The most common use cases were: the International Classification of Diseases, Ninth Revision (ICD-9) classification, clinical note analysis, and named entity recognition (NER) for clinical descriptions and research on psychiatric disorders. Conclusion: We find that the adopted ML models were not adequately assessed. In addition, the data imbalance problem is quite important, yet we must find techniques to address this underlining problem. Future studies should address key limitations in studies, primarily identifying Lupus Nephritis, Suicide Attempts, perinatal self-harmed and ICD-9 classification

    Objective Analysis of Marker Bias in Higher Education

    Get PDF
    Marker bias has been a serious factor contributing to discrepancy in assessments. In this study we analyze one year students' results in a Business Faculty within an Australian university to understand the extent of variation induced by marker bias in multiple marker scenarios. The study shows interesting insights regarding the marking trends of a particular marker, and shows variations among markers in a particular course. The study paves the way for quantification of marker variation through objective analysis

    Introductory Engineering Mathematics Studentsā€™ Weighted Score Predictions Utilising a Novel Multivariate Adaptive Regression Spline Model

    Get PDF
    Introductory Engineering Mathematics (a skill builder for engineers) involves developing problem-solving attributes throughout the teaching period. Therefore, the prediction of studentsā€™ final course grades with continuous assessment marks is a useful toolkit for degree program educators. Predictive models are practical tools used to evaluate the effectiveness of teaching as well as assessing the studentsā€™ progression and implementing interventions for the best learning outcomes. This study develops a novel multivariate adaptive regression spline (MARS) model to predict the weighted score WS (i.e., the course grade). To construct the proposed MARS model, Introductory Engineering Mathematics performance data over five years from the University of Southern Queensland, Australia, were used to design predictive models using input predictors of online quizzes, written assignments, and examination scores. About 60% of randomised predictor grade data were applied to train the model (with 25% of the training set used for validation) and 40% to test the model. Based on the cross-correlation of inputs vs. the WS, 12 distinct combinations with single (i.e., M1ā€“M5) and multiple (M6ā€“M12) features were created to assess the influence of each on the WS with results bench-marked via a decision tree regression (DTR), kernel ridge regression (KRR), and a k-nearest neighbour (KNN) model. The influence of each predictor on WS clearly showed that online quizzes provide the least contribution. However, the MARS model improved dramatically by including written assignments and examination scores. The research demonstrates the merits of the proposed MARS model in uncovering relationships among continuous learning variables, which also provides a distinct advantage to educators in developing early intervention and moderating their teaching by predicting the performance of students ahead of final outcome for a course. The findings and future application have significant practical implications in teaching and learning interventions or planning aimed to improve graduate outcomes in undergraduate engineering program cohorts

    Fusion of Bā€mode and shear wave elastography ultrasound features for automated detection of axillary lymph node metastasis in breast carcinoma

    Get PDF
    Abstract: In this study, we evaluate and compare the diagnostic performance of ultrasound for nonā€invasive axillary lymph node (ALN) metastasis detection. The study was based on fusing shear wave elastography (SWE) and Bā€mode ultrasonography (USG) images. These images were subjected to preā€processing and feature extraction, based on biā€dimensional empirical mode decomposition and higher order spectra methods. The resulting nonlinear features were ranked according to their pā€value, which was established with Student's tā€test. The ranked features were used to train and test six classification algorithms with 10ā€fold crossā€validation. Initially, we considered Bā€mode USG images in isolation. A probabilistic neural network (PNN) classifier was able to discriminate positive from negative cases with an accuracy of 74.77% using 15 features. Subsequently, only SWE images were used and as before, the PNN classifier delivered the best result with an accuracy of 87.85% based on 47 features. Finally, we combined SWE and Bā€mode USG images. Again, the PNN classifier delivered the best result with an accuracy of 89.72% based on 71 features. These three tests indicate that SWE images contain more diagnostically relevant information when compared with Bā€mode USG. Furthermore, there is scope in fusing SWE and Bā€mode USG to improve nonā€invasive ALN metastasis detection

    A new nested ensemble technique for automated diagnosis of breast cancer

    No full text
    Nowadays, breast cancer is reported as one of most common cancers amongst women. Early detection of this cancer is an essential to aid in informing subsequent treatments. This study investigates automated breast cancer prediction using machine learning and data mining techniques. We proposed the nested ensemble approach which used the Stacking and Vote (Voting) as the classifiers combination techniques in our ensemble methods for detecting the benign breast tumors from malignant cancers. Each nested ensemble classifier contains 'Classifiers' and 'MetaClassifiers'. MetaClassifiers can have more than two different classification algorithms. In this research, we developed the two-layer nested ensemble classifiers. In our two-layer nested ensemble classifiers the MetaClassifiers have two or three different classification algorithms. We conducted the experiments on Wisconsin Diagnostic Breast Cancer (WDBC) dataset and K-fold Cross Validation technique are used for the model evaluation. We compared the proposed two-layer nested ensemble classifiers with single classifiers (i.e., BayesNet and Naive Bayes) in terms of the classification accuracy, precision, recall, F1 measure, ROC and computational times of training single and nested ensemble classifiers. We also compared our best model with previous works reported in the literatures in terms of accuracy. The results demonstrate that the proposed two-layer nested ensemble models outperformance the single classifiers and most of the previous works. Both SV-BayesNet-3-MetaClassifier and SV-Naive Bayes-3-MetaClassifier achieved accuracy 98.07% (K = 10). However, SV-Naive Bayes-3-MetaClassifier is more efficiency as it needs less time to build the model

    A survey on text classification and its applications

    No full text
    Text classification (a.k.a text categorisation) is an effective and efficient technology for information organisation and management. With the explosion of information resources on the Web and corporate intranets continues to increase, it has being become more and more important and has attracted wide attention from many different research fields. In the literature, many feature selection methods and classification algorithms have been proposed. It also has important applications in the real world. However, the dramatic increase in the availability of massive text data from various sources is creating a number of issues and challenges for text classification such as scalability issues. The purpose of this report is to give an overview of existing text classification technologies for building more reliable text classification applications, to propose a research direction for addressing the challenging problems in text mining

    A New Deep Convolutional Neural Network Model for Automated Breast Cancer Detection

    No full text
    Breast cancer is reported as one of most common malignancy amongst women in the world. Early detection of this cancer is critical to clinical and epidemiologic for aiding in informing subsequent treatments. This study investigates automated breast cancer prediction using deep learning techniques. A new 19-layer deep convolutional neural network (CNN) model for detecting the benign breast tumors from malignant cancers was proposed and implemented. The experiments on BreaKHis dataset was conducted and K-fold Cross Validation technique are used for the model evaluation. The proposed 19-layer deep CNN based classiļ¬ers compared with conventional machine learning classiļ¬er, namely Support Vector Machine (SVM) and a state-of-the-art deep learning model, namely GoogLeNet in terms of Accuracy, Area under the Receiver Operating Characteristic (ROC) Curve (AUC), the Classification Mean Absolute Error (MAE), Mean Squared Error (MSE) metrics. The results demonstrate that the proposed new model outperformed the other classiļ¬ers. The proposed model achieved an accuracy, AUC, MAE and MSE of 84.5%, 85.7%, 0.082, and 0.043, respectively

    Objective analysis of marker bias in higher education

    No full text
    Marker bias has been a serious factor contributing to discrepancy in assessments. In this study we analyze one year students' results in a Business Faculty within an Australian university to understand the extent of variation induced by marker bias in multiple marker scenarios. The study shows interesting insights regarding the marking trends of a particular marker, and shows variations among markers in a particular course. The study paves the way for quantification of marker variation through objective analysis

    EpilepsyNet: Novel automated detection of epilepsy using transformer model with EEG signals from 121 patient population

    No full text
    Background: Epilepsy is one of the most common neurological conditions globally, and the fourth most common in the United States. Recurrent non-provoked seizures characterize it and have huge impacts on the quality of life and financial impacts for affected individuals. A rapid and accurate diagnosis is essential in order to instigate and monitor optimal treatments. There is also a compelling need for the accurate interpretation of epilepsy due to the current scarcity in neurologist diagnosticians and a global inequity in access and outcomes. Furthermore, the existing clinical and traditional machine learning diagnostic methods exhibit limitations, warranting the need to create an automated system using deep learning model for epilepsy detection and monitoring using a huge database. Method: The EEG signals from 35 channels were used to train the deep learning-based transformer model named (EpilepsyNet). For each training iteration, 1-min-long data were randomly sampled from each participant. Thereafter, each 5-s epoch was mapped to a matrix using the Pearson Correlation Coefficient (PCC), such that the bottom part of the triangle was discarded and only the upper triangle of the matrix was vectorized as input data. PCC is a reliable method used to measure the statistical relationship between two variables. Based on the 5 s of data, single embedding was performed thereafter to generate a 1-dimensional array of signals. In the final stage, a positional encoding with learnable parameters was added to each correlation coefficient's embedding before being fed to the developed EpilepsyNet as input data to epilepsy EEG signals. The ten-fold cross-validation technique was used to generate the model. Results: Our transformer-based model (EpilepsyNet) yielded high classification accuracy, sensitivity, specificity and positive predictive values of 85%, 82%, 87%, and 82%, respectively. Conclusion: The proposed method is both accurate and robust since ten-fold cross-validation was employed to evaluate the performance of the model. Compared to the deep models used in existing studies for epilepsy diagnosis, our proposed method is simple and less computationally intensive. This is the earliest study to have uniquely employed the positional encoding with learnable parameters to each correlation coefficient's embedding together with the deep transformer model, using a huge database of 121 participants for epilepsy detection. With the training and validation of the model using a larger dataset, the same study approach can be extended for the detection of other neurological conditions, with a transformative impact on neurological diagnostics worldwide
    corecore