Search CORE

31,160 research outputs found

The contribution of data mining to information science

Author: Chen SY
Liu X
Publication venue: 'Academy of Traumatology'
Publication date: 01/01/2004
Field of study

The information explosion is a serious challenge for current information institutions. On the other hand, data mining, which is the search for valuable information in large volumes of data, is one of the solutions to face this challenge. In the past several years, data mining has made a significant contribution to the field of information science. This paper examines the impact of data mining by reviewing existing applications, including personalized environments, electronic commerce, and search engines. For these three types of application, how data mining can enhance their functions is discussed. The reader of this paper is expected to get an overview of the state of the art research associated with these applications. Furthermore, we identify the limitations of current work and raise several directions for future research

CiteSeerX

Brunel University Research Archive

Discovering the Impact of Knowledge in Recommender Systems: A Comparative Study

Author: Amini Bahram
Ibrahim Roliana
Othman Mohd Shahizan
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 01/01/2011
Field of study

Recommender systems engage user profiles and appropriate filtering techniques to assist users in finding more relevant information over the large volume of information. User profiles play an important role in the success of recommendation process since they model and represent the actual user needs. However, a comprehensive literature review of recommender systems has demonstrated no concrete study on the role and impact of knowledge in user profiling and filtering approache. In this paper, we review the most prominent recommender systems in the literature and examine the impression of knowledge extracted from different sources. We then come up with this finding that semantic information from the user context has substantial impact on the performance of knowledge based recommender systems. Finally, some new clues for improvement the knowledge-based profiles have been proposed.Comment: 14 pages, 3 tables; International Journal of Computer Science & Engineering Survey (IJCSES) Vol.2, No.3, August 201

arXiv.org e-Print Archive

Crossref

Universiti Teknologi Malaysia Institutional Repository

SUBIC: A Supervised Bi-Clustering Approach for Precision Medicine

Author: Levy Phillip
Nezhad Milad Zafar
Sadati Najibesadat
Yang Kai
Zhu Dongxiao
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 26/09/2017
Field of study

Traditional medicine typically applies one-size-fits-all treatment for the entire patient population whereas precision medicine develops tailored treatment schemes for different patient subgroups. The fact that some factors may be more significant for a specific patient subgroup motivates clinicians and medical researchers to develop new approaches to subgroup detection and analysis, which is an effective strategy to personalize treatment. In this study, we propose a novel patient subgroup detection method, called Supervised Biclustring (SUBIC) using convex optimization and apply our approach to detect patient subgroups and prioritize risk factors for hypertension (HTN) in a vulnerable demographic subgroup (African-American). Our approach not only finds patient subgroups with guidance of a clinically relevant target variable but also identifies and prioritizes risk factors by pursuing sparsity of the input variables and encouraging similarity among the input variables and between the input and target variable

arXiv.org e-Print Archive

Crossref

Automated user modeling for personalized digital libraries

Author: Aihara
Angiulli
Belkin
Bezdek
Blum
Costabile
Cristianini
E. Frias-Martinez
Fausett
Ford
Friedman
G. Magoulas
Hartigan
Haykin
Jain
Kobsa
Krishnapuram
Magoulas
Manber
Mitchell
Montaner
R. Macredie
Rabiner
Ramsey
Riecken
S. Chen
Sarukkai
Tsukada
Webb
Winter
Witten
Publication venue: 'Elsevier BV'
Publication date: 01/01/2006
Field of study

Digital libraries (DL) have become one of the most typical ways of accessing any kind of digitalized information. Due to this key role, users welcome any improvements on the services they receive from digital libraries. One trend used to improve digital services is through personalization. Up to now, the most common approach for personalization in digital libraries has been user-driven. Nevertheless, the design of efficient personalized services has to be done, at least in part, in an automatic way. In this context, machine learning techniques automate the process of constructing user models. This paper proposes a new approach to construct digital libraries that satisfy user’s necessity for information: Adaptive Digital Libraries, libraries that automatically learn user preferences and goals and personalize their interaction using this information

CiteSeerX

Crossref

Birkbeck Institutional Research Online

Brunel University Research Archive

Learning Tasks for Multitask Learning: Heterogenous Patient Populations in the ICU

Author: Buolamwini Joy
Caruana Rich
Ghassemi Marzyeh
Ngufor C.
Shankar Shreya
Wang Xiang
Xu Kelvin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 07/06/2018
Field of study

Machine learning approaches have been effective in predicting adverse outcomes in different clinical settings. These models are often developed and evaluated on datasets with heterogeneous patient populations. However, good predictive performance on the aggregate population does not imply good performance for specific groups. In this work, we present a two-step framework to 1) learn relevant patient subgroups, and 2) predict an outcome for separate patient populations in a multi-task framework, where each population is a separate task. We demonstrate how to discover relevant groups in an unsupervised way with a sequence-to-sequence autoencoder. We show that using these groups in a multi-task framework leads to better predictive performance of in-hospital mortality both across groups and overall. We also highlight the need for more granular evaluation of performance when dealing with heterogeneous populations.Comment: KDD 201

arXiv.org e-Print Archive

Crossref

DSpace@MIT

Beyond A/B Testing: Sequential Randomization for Developing Interventions in Scaled Digital Learning Environments

Author: Almirall D
Brooks C
Brusilovsky P
Davis D
Dweck CS
González-Brenes JP
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 31/01/2019
Field of study

Randomized experiments ensure robust causal inference that are critical to effective learning analytics research and practice. However, traditional randomized experiments, like A/B tests, are limiting in large scale digital learning environments. While traditional experiments can accurately compare two treatment options, they are less able to inform how to adapt interventions to continually meet learners' diverse needs. In this work, we introduce a trial design for developing adaptive interventions in scaled digital learning environments -- the sequential randomized trial (SRT). With the goal of improving learner experience and developing interventions that benefit all learners at all times, SRTs inform how to sequence, time, and personalize interventions. In this paper, we provide an overview of SRTs, and we illustrate the advantages they hold compared to traditional experiments. We describe a novel SRT run in a large scale data science MOOC. The trial results contextualize how learner engagement can be addressed through inclusive culturally targeted reminder emails. We also provide practical advice for researchers who aim to run their own SRTs to develop adaptive interventions in scaled digital learning environments

arXiv.org e-Print Archive

Crossref