2,957 research outputs found
Recommended from our members
The role of human factors in stereotyping behavior and perception of digital library users: A robust clustering approach
To deliver effective personalization for digital library users, it is necessary to identify which human factors are most relevant in determining the behavior and perception of these users. This paper examines three key human factors: cognitive styles, levels of expertise and gender differences, and utilizes three individual clustering techniques: k-means, hierarchical clustering and fuzzy clustering to understand user behavior and perception. Moreover, robust clustering, capable of correcting the bias of individual clustering techniques, is used to obtain a deeper understanding. The robust clustering approach produced results that highlighted the relevance of cognitive style for user behavior, i.e., cognitive style dominates and justifies each of the robust clusters created. We also found that perception was mainly determined by the level of expertise of a user. We conclude that robust clustering is an effective technique to analyze user behavior and perception
A psychometric modeling approach to fuzzy rating data
Modeling fuzziness and imprecision in human rating data is a crucial problem
in many research areas, including applied statistics, behavioral, social, and
health sciences. Because of the interplay between cognitive, affective, and
contextual factors, the process of answering survey questions is a complex
task, which can barely be captured by standard (crisp) rating responses. Fuzzy
rating scales have progressively been adopted to overcome some of the
limitations of standard rating scales, including their inability to disentangle
decision uncertainty from individual responses. The aim of this article is to
provide a novel fuzzy scaling procedure which uses Item Response Theory trees
(IRTrees) as a psychometric model for the stage-wise latent response process.
In so doing, fuzziness of rating data is modeled using the overall rater's
pattern of responses instead of being computed using a single-item based
approach. This offers a consistent system for interpreting fuzziness in terms
of individual-based decision uncertainty. A simulation study and two empirical
applications are adopted to assess the characteristics of the proposed model
and provide converging results about its effectiveness in modeling fuzziness
and imprecision in rating data
Sentiment analytics: Lexicons construction and analysis
With the increasing amount of text data, sentiment analysis (SA) is becoming more and more important. An automated approach is needed to parse the online reviews and comments, and analyze their sentiments. Since lexicon is the most important component in SA, enhancing the quality of lexicons will improve the efficiency and accuracy of sentiment analysis. In this research, the effect of coupling a general lexicon with a specialized lexicon (for a specific domain) and its impact on sentiment analysis was presented. Two special domains and one general domain were studied. The two special domains are the petroleum domain and the biology domain. The general domain is the social network domain. The specialized lexicon for the petroleum domain was created as part of this research. The results, as expected, show that coupling a general lexicon with a specialized lexicon improves the sentiment analysis. However, coupling a general lexicon with another general lexicon does not improve the sentiment analysis --Abstract, page iii
On Cognitive Preferences and the Plausibility of Rule-based Models
It is conventional wisdom in machine learning and data mining that logical
models such as rule sets are more interpretable than other models, and that
among such rule-based models, simpler models are more interpretable than more
complex ones. In this position paper, we question this latter assumption by
focusing on one particular aspect of interpretability, namely the plausibility
of models. Roughly speaking, we equate the plausibility of a model with the
likeliness that a user accepts it as an explanation for a prediction. In
particular, we argue that, all other things being equal, longer explanations
may be more convincing than shorter ones, and that the predominant bias for
shorter models, which is typically necessary for learning powerful
discriminative models, may not be suitable when it comes to user acceptance of
the learned models. To that end, we first recapitulate evidence for and against
this postulate, and then report the results of an evaluation in a
crowd-sourcing study based on about 3.000 judgments. The results do not reveal
a strong preference for simple rules, whereas we can observe a weak preference
for longer rules in some domains. We then relate these results to well-known
cognitive biases such as the conjunction fallacy, the representative heuristic,
or the recogition heuristic, and investigate their relation to rule length and
plausibility.Comment: V4: Another rewrite of section on interpretability to clarify focus
on plausibility and relation to interpretability, comprehensibility, and
justifiabilit
Comparison of Classification rules for Predicting Personality
Personality refers to an individual’s characteristic patterns of thought, emotion and behaviour, together with the psychological mechanisms. It has been a long-term goal for psychologists to understand human personality and its impact on human behaviour. Behavior involves an interaction between a person's underlying personality traits and situational variables. Social media is a place where users present themselves to the world, revealing personal details and insights into their lives. It is evident that there is a strong correlation between users’ personality and the way they behave on online social network and a large number of studies are made on this topic. Objective of this paper is to classify the personality traits into Extraversion and Introversion. Dataset for this purpose is collected from ‘http://personality-testing.info/_rawdata/’ which stores data on personality surveys for the purposes of academic research and personal use. The dataset consists of 140 instances with 17 attributes. Three types of classification rules like J48 Decision Tree classifier, Naive Bayes Classifier and ZeroR Rules available on Weka data mining tool are used to predict Personality traits. Performances of each of these rules are measured by the values of the Precision, Recall, and Accuracy. Comparative study of these methods shows that J48 Decision tree Classifier performs better than Naive Bayes classifier and ZeroR Rule for personality traits recognition on the Social Media
Development of an Explainability Scale to Evaluate Explainable Artificial Intelligence (XAI) Methods
Explainable Artificial Intelligence (XAI) is an area of research that develops methods and techniques to make the results of artificial intelligence understood by humans. In recent years, there has been an increased demand for XAI methods to be developed due to model architectures getting more complicated and government regulations requiring transparency in machine learning models. With this increased demand has come an increased need for instruments to evaluate XAI methods. However, there are few, if none, valid and reliable instruments that take into account human opinion and cover all aspects of explainability. Therefore, this study developed an objective, human-centred questionnaire to evaluate all types of XAI methods. This questionnaire consists of 15 items: 5 items asking about the user’s background information and 10 items evaluating the explainability of the XAI method which were based on the notions of explainability. An experiment was conducted (n = 38) which got participants to evaluate one of two XAI methods using the questionnaire. The results from this experiment were used for exploratory factor analysis which showed that the 10 items related to explainability constitute one factor (Cronbach’s α = 0.81). The results were also used to gather evidence of the questionnaire’s construct validity. It is concluded that this 15-item questionnaire has one factor, has acceptable validity and reliability, and can be used to evaluate and compare XAI methods
IQ Classification via Brainwave Features: Review on Artificial Intelligence Techniques
Intelligence study is one of keystone to distinguish individual differences in cognitive psychology. Conventional psychometric tests are limited in terms of assessment time, and existence of biasness issues. Apart from that, there is still lack in knowledge to classify IQ based on EEG signals and intelligent signal processing (ISP) technique. ISP purpose is to extract as much information as possible from signal and noise data using learning and/or other smart techniques. Therefore, as a first attempt in classifying IQ feature via scientific approach, it is important to identify a relevant technique with prominent paradigm that is suitable for this area of application. Thus, this article reviews several ISP approaches to provide consolidated source of information. This in particular focuses on prominent paradigm that suitable for pattern classification in biomedical area. The review leads to selection of ANN since it has been widely implemented for pattern classification in biomedical engineering
An Intelligent Approach Using Machine Learning Techniques to Predict Flow in People
The goal of this study is to estimate the state of consciousness known as Flow, which
is associated with an optimal experience and can indicate a person’s efficiency in both personal
and professional settings. To predict Flow, we employ artificial intelligence techniques using a
set of variables not directly connected with its construct. We analyse a significant amount of data
from psychological tests that measure various personality traits. Data mining techniques support
conclusions drawn from the psychological study. We apply linear regression, regression tree, random
forest, support vector machine, and artificial neural networks. The results show that the multilayer
perceptron network is the best estimator, with an MSE of 0.007122 and an accuracy of 88.58%.
Our approach offers a novel perspective on the relationship between personality and the state of
consciousness known as Flow
Investigating the Efficacy of Algorithmic Student Modelling in Predicting Students at Risk of Failing in the Early Stages of Tertiary Education: Case study of experience based on first year students at an Institute of Technology in Ireland.
The application of data analytics to educational settings is an emerging and growing research area. Much of the published works to-date are based on ever-increasing volumes of log data that are systematically gathered in virtual learning environments as part of module delivery. This thesis took a unique approach to modelling academic performance; it is a first study to model indicators of students at risk of failing in first year of tertiary education, based on data gathered prior to commencement of first year, facilitating early engagement with at-risk students.
The study was conducted over three years, in 2010 through 2012, and was based on a sample student population (n=1,207) aged between 18 and 60 from a range of academic disciplines. Data was extracted from both student enrolment data maintained by college administration, and an online, self-reporting, learner profiling tool developed specifically for this study. The profiling tool was administered during induction sessions for students enrolling into the first year of study. Twenty-four factors relating to prior academic performance, personality, motivation, self-regulation, learning approaches, learner modality, age and gender were considered.
Eight classification algorithms were evaluated. Cross validation model accuracies based on all participants were compared with models trained on the 2010 and 2011 student cohorts, and tested on the 2012 student cohort. Best cross validation model accuracies were a Support Vector Machine (82%) and Neural Network (75%). The k-Nearest Neighbour model, which has received little attention in educational data mining studies, achieved highest model accuracy when applied to the 2012 student cohort (72%). The performance was similar to its cross validation model accuracy (72%). Model accuracies for other algorithms applied to the 2012 student cohort also compared favourably; for example Ensembles (71%), Support Vector Machine (70%) and a Decision Tree (70%).
Models of subgroups by age and by academic discipline achieved higher accuracy than models of all participants, however, a larger sample size is needed to confirm results. Progressive sampling showed a sample size \u3e 900 was required to achieve convergence of model accuracy.
Results showed that factors most predictive of academic performance in first year of study at tertiary education included age, prior academic performance and self-efficacy. Kinaesthetic modality was also indicative of students at risk of failing, a factor that has not been cited previously as a significant predictor of academic performance.
Models reported in this study show that learner profiling completed prior to commencement of first year of study yielded informative and generalisable results that identified students at risk of failing. Additionally, model accuracies were comparable to models reported elsewhere that included data collected from student activity in semester one, confirming the validity of early student profiling
- …