29 research outputs found

    Deep Neural Networks for Multi-Label Text Classification: Application to Coding Electronic Medical Records

    Get PDF
    Coding Electronic Medical Records (EMRs) with diagnosis and procedure codes is an essential task for billing, secondary data analyses, and monitoring health trends. Both speed and accuracy of coding are critical. While coding errors could lead to more patient-side financial burden and misinterpretation of a patient’s well-being, timely coding is also needed to avoid backlogs and additional costs for the healthcare facility. Therefore, it is necessary to develop automated diagnosis and procedure code recommendation methods that can be used by professional medical coders. The main difficulty with developing automated EMR coding methods is the nature of the label space. The standardized vocabularies used for medical coding contain over 10 thousand codes. The label space is large, and the label distribution is extremely unbalanced - most codes occur very infrequently, with a few codes occurring several orders of magnitude more than others. A few codes never occur in training dataset at all. In this work, we present three methods to handle the large unbalanced label space. First, we study how to augment EMR training data with biomedical data (research articles indexed on PubMed) to improve the performance of standard neural networks for text classification. PubMed indexes more than 23 million citations. Many of the indexed articles contain relevant information about diagnosis and procedure codes. Therefore, we present a novel method of incorporating this unstructured data in PubMed using transfer learning. Second, we combine ideas from metric learning with recent advances in neural networks to form a novel neural architecture that better handles infrequent codes. And third, we present new methods to predict codes that have never appeared in the training dataset. Overall, our contributions constitute advances in neural multi-label text classification with potential consequences for improving EMR coding

    STREAM-EVOLVING BOT DETECTION FRAMEWORK USING GRAPH-BASED AND FEATURE-BASED APPROACHES FOR IDENTIFYING SOCIAL BOTS ON TWITTER

    Get PDF
    This dissertation focuses on the problem of evolving social bots in online social networks, particularly Twitter. Such accounts spread misinformation and inflate social network content to mislead the masses. The main objective of this dissertation is to propose a stream-based evolving bot detection framework (SEBD), which was constructed using both graph- and feature-based models. It was built using Python, a real-time streaming engine (Apache Kafka version 3.2), and our pretrained model (bot multi-view graph attention network (Bot-MGAT)). The feature-based model was used to identify predictive features for bot detection and evaluate the SEBD predictions. The graph-based model was used to facilitate multiview graph attention networks (GATs) with fellowship links to build our framework for predicting account labels from streams. A probably approximately correct learning framework was applied to confirm the accuracy and confidence levels of SEBD.The results showed that the SEBD can effectively identify bots from streams and profile features are sufficient for detecting social bots. The pretrained Bot-MGAT model uses fellowship links to reveal hidden information that can aid in identifying bot accounts. The significant contributions of this study are the development of a stream based bot detection framework for detecting social bots based on a given hashtag and the proposal of a hybrid approach for feature selection to identify predictive features for identifying bot accounts. Our findings indicate that Twitter has a higher percentage of active bots than humans in hashtags. The results indicated that stream-based detection is more effective than offline detection by achieving accuracy score 96.9%. Finally, semi supervised learning (SSL) can solve the issue of labeled data in bot detection tasks

    The State of Altmetrics: A Tenth Anniversary Celebration

    Get PDF
    Altmetric’s mission is to help others understand the influence of research online.We collate what people are saying about published research in sources such as the mainstream media, policy documents, social networks, blogs, and other scholarly and non-scholarly forums to provide a more robust picture of the influence and reach of scholarly work. Altmetric works with some of the biggest publishers, funders, businesses and institutions around the world to deliver this data in an accessible and reliable format. Contents Altmetrics, Ten Years Later, Euan Adie (Altmetric (founder) & Overton) Reflections on Altmetrics, Gemma Derrick (University of Lancaster), Fereshteh Didegah (Karolinska Institutet & Simon Fraser University), Paul Groth (University of Amsterdam), Cameron Neylon (Curtin University), Jason Priem (Our Research), Shenmeng Xu (University of North Carolina at Chapel Hill), Zohreh Zahedi (Leiden University) Worldwide Awareness and Use of Altmetrics, Yin-Leng Theng (Nanyang Technological University) Leveraging Machine Learning on Altmetrics Big Data, Saeed-Ul Hassan (Information Technology University), Naif R. Aljohani (King Abdulaziz University), Timothy D. Bowman (Wayne State University) Altmetrics as Social-Spatial Sensors, Vanash M. Patel (West Hertfordshire Hospitals NHS Trust), Robin Haunschild (Max Planck Institute for Solid State Research), Lutz Bornmann (Administrative Headquarters of the Max Planck Society) Altmetric’s Fable of the Hare and the Tortoise, Mike Taylor (Digital Science) The Future of Altmetrics: A Community Vision, Liesa Ross (Altmetric), Stacy Konkiel (Altmetric

    Artificial Intelligence for Sustainability—A Systematic Review of Information Systems Literature

    Get PDF
    The booming adoption of Artificial Intelligence (AI) likewise poses benefits and challenges. In this paper, we particularly focus on the bright side of AI and its promising potential to face our society’s grand challenges. Given this potential, different studies have already conducted valuable work by conceptualizing specific facets of AI and sustainability, including reviews on AI and Information Systems (IS) research or AI and business values. Nonetheless, there is still little holistic knowledge at the intersection of IS, AI, and sustainability. This is problematic because the IS discipline, with its socio-technical nature, has the ability to integrate perspectives beyond the currently dominant technological one as well as can advance both theory and the development of purposeful artifacts. To bridge this gap, we disclose how IS research currently makes use of AI to boost sustainable development. Based on a systematically collected corpus of 95 articles, we examine sustainability goals, data inputs, technologies and algorithms, and evaluation approaches that coin the current state of the art within the IS discipline. This comprehensive overview enables us to make more informed investments (e.g., policy and practice) as well as to discuss blind spots and possible directions for future research

    The state of altmetrics: a tenth anniversary celebration

    Get PDF
    Altmetric’s mission is to help others understand the influence of research online.We collate what people are saying about published research in sources such as the mainstream media, policy documents, social networks, blogs, and other scholarly and non-scholarly forums to provide a more robust picture of the influence and reach of scholarly work. Altmetric works with some of the biggest publishers, funders, businesses and institutions around the world to deliver this data in an accessible and reliable format.ContentsAltmetrics, Ten Years Later, Euan Adie (Altmetric (founder) & Overton)Reflections on Altmetrics, Gemma Derrick (University of Lancaster), Fereshteh Didegah (Karolinska Institutet & Simon Fraser University), Paul Groth (University of Amsterdam), Cameron Neylon (Curtin University), Jason Priem (Our Research), Shenmeng Xu (University of North Carolina at Chapel Hill), Zohreh Zahedi (Leiden University)Worldwide Awareness and Use of Altmetrics, Yin-Leng Theng (Nanyang Technological University)Leveraging Machine Learning on Altmetrics Big Data, Saeed-Ul Hassan (Information Technology University), Naif R. Aljohani (King Abdulaziz University), Timothy D. Bowman (Wayne State University)Altmetrics as Social-Spatial Sensors, Vanash M. Patel (West Hertfordshire Hospitals NHS Trust), Robin Haunschild (Max Planck Institute for Solid State Research), Lutz Bornmann (Administrative Headquarters of the Max Planck Society)Altmetric’s Fable of the Hare and the Tortoise, Mike Taylor (Digital Science)The Future of Altmetrics: A Community Vision, Liesa Ross (Altmetric), Stacy Konkiel (Altmetric)https://digitalcommons.unl.edu/scholcom/170 Merit, Expertise and Measuremen
    corecore