2,957 research outputs found

    A psychometric modeling approach to fuzzy rating data

    Full text link
    Modeling fuzziness and imprecision in human rating data is a crucial problem in many research areas, including applied statistics, behavioral, social, and health sciences. Because of the interplay between cognitive, affective, and contextual factors, the process of answering survey questions is a complex task, which can barely be captured by standard (crisp) rating responses. Fuzzy rating scales have progressively been adopted to overcome some of the limitations of standard rating scales, including their inability to disentangle decision uncertainty from individual responses. The aim of this article is to provide a novel fuzzy scaling procedure which uses Item Response Theory trees (IRTrees) as a psychometric model for the stage-wise latent response process. In so doing, fuzziness of rating data is modeled using the overall rater's pattern of responses instead of being computed using a single-item based approach. This offers a consistent system for interpreting fuzziness in terms of individual-based decision uncertainty. A simulation study and two empirical applications are adopted to assess the characteristics of the proposed model and provide converging results about its effectiveness in modeling fuzziness and imprecision in rating data

    Sentiment analytics: Lexicons construction and analysis

    Get PDF
    With the increasing amount of text data, sentiment analysis (SA) is becoming more and more important. An automated approach is needed to parse the online reviews and comments, and analyze their sentiments. Since lexicon is the most important component in SA, enhancing the quality of lexicons will improve the efficiency and accuracy of sentiment analysis. In this research, the effect of coupling a general lexicon with a specialized lexicon (for a specific domain) and its impact on sentiment analysis was presented. Two special domains and one general domain were studied. The two special domains are the petroleum domain and the biology domain. The general domain is the social network domain. The specialized lexicon for the petroleum domain was created as part of this research. The results, as expected, show that coupling a general lexicon with a specialized lexicon improves the sentiment analysis. However, coupling a general lexicon with another general lexicon does not improve the sentiment analysis --Abstract, page iii

    On Cognitive Preferences and the Plausibility of Rule-based Models

    Get PDF
    It is conventional wisdom in machine learning and data mining that logical models such as rule sets are more interpretable than other models, and that among such rule-based models, simpler models are more interpretable than more complex ones. In this position paper, we question this latter assumption by focusing on one particular aspect of interpretability, namely the plausibility of models. Roughly speaking, we equate the plausibility of a model with the likeliness that a user accepts it as an explanation for a prediction. In particular, we argue that, all other things being equal, longer explanations may be more convincing than shorter ones, and that the predominant bias for shorter models, which is typically necessary for learning powerful discriminative models, may not be suitable when it comes to user acceptance of the learned models. To that end, we first recapitulate evidence for and against this postulate, and then report the results of an evaluation in a crowd-sourcing study based on about 3.000 judgments. The results do not reveal a strong preference for simple rules, whereas we can observe a weak preference for longer rules in some domains. We then relate these results to well-known cognitive biases such as the conjunction fallacy, the representative heuristic, or the recogition heuristic, and investigate their relation to rule length and plausibility.Comment: V4: Another rewrite of section on interpretability to clarify focus on plausibility and relation to interpretability, comprehensibility, and justifiabilit

    Comparison of Classification rules for Predicting Personality

    Get PDF
    Personality refers to an individual’s characteristic patterns of thought, emotion and behaviour, together with the psychological mechanisms. It has been a long-term goal for psychologists to understand human personality and its impact on human behaviour. Behavior involves an interaction between a person's underlying personality traits and situational variables. Social media is a place where users present themselves to the world, revealing personal details and insights into their lives. It is evident that there is a strong correlation between users’ personality and the way they behave on online social network and a large number of studies are made on this topic. Objective of this paper is to classify the personality traits into Extraversion and Introversion. Dataset for this purpose is collected from ‘http://personality-testing.info/_rawdata/’ which stores data on personality surveys for the purposes of academic research and personal use. The dataset consists of 140 instances with 17 attributes. Three types of classification rules like J48 Decision Tree classifier, Naive Bayes Classifier and ZeroR Rules available on Weka data mining tool are used to predict Personality traits. Performances of each of these rules are measured by the values of the Precision, Recall, and Accuracy. Comparative study of these methods shows that J48 Decision tree Classifier performs better than Naive Bayes classifier and ZeroR Rule for personality traits recognition on the Social Media

    Development of an Explainability Scale to Evaluate Explainable Artificial Intelligence (XAI) Methods

    Get PDF
    Explainable Artificial Intelligence (XAI) is an area of research that develops methods and techniques to make the results of artificial intelligence understood by humans. In recent years, there has been an increased demand for XAI methods to be developed due to model architectures getting more complicated and government regulations requiring transparency in machine learning models. With this increased demand has come an increased need for instruments to evaluate XAI methods. However, there are few, if none, valid and reliable instruments that take into account human opinion and cover all aspects of explainability. Therefore, this study developed an objective, human-centred questionnaire to evaluate all types of XAI methods. This questionnaire consists of 15 items: 5 items asking about the user’s background information and 10 items evaluating the explainability of the XAI method which were based on the notions of explainability. An experiment was conducted (n = 38) which got participants to evaluate one of two XAI methods using the questionnaire. The results from this experiment were used for exploratory factor analysis which showed that the 10 items related to explainability constitute one factor (Cronbach’s α = 0.81). The results were also used to gather evidence of the questionnaire’s construct validity. It is concluded that this 15-item questionnaire has one factor, has acceptable validity and reliability, and can be used to evaluate and compare XAI methods

    IQ Classification via Brainwave Features: Review on Artificial Intelligence Techniques

    Get PDF
    Intelligence study is one of keystone to distinguish individual differences in cognitive psychology. Conventional psychometric tests are limited in terms of assessment time, and existence of biasness issues. Apart from that, there is still lack in knowledge to classify IQ based on EEG signals and intelligent signal processing (ISP) technique. ISP purpose is to extract as much information as possible from signal and noise data using learning and/or other smart techniques. Therefore, as a first attempt in classifying IQ feature via scientific approach, it is important to identify a relevant technique with prominent paradigm that is suitable for this area of application. Thus, this article reviews several ISP approaches to provide consolidated source of information. This in particular focuses on prominent paradigm that suitable for pattern classification in biomedical area. The review leads to selection of ANN since it has been widely implemented for pattern classification in biomedical engineering

    An Intelligent Approach Using Machine Learning Techniques to Predict Flow in People

    Get PDF
    The goal of this study is to estimate the state of consciousness known as Flow, which is associated with an optimal experience and can indicate a person’s efficiency in both personal and professional settings. To predict Flow, we employ artificial intelligence techniques using a set of variables not directly connected with its construct. We analyse a significant amount of data from psychological tests that measure various personality traits. Data mining techniques support conclusions drawn from the psychological study. We apply linear regression, regression tree, random forest, support vector machine, and artificial neural networks. The results show that the multilayer perceptron network is the best estimator, with an MSE of 0.007122 and an accuracy of 88.58%. Our approach offers a novel perspective on the relationship between personality and the state of consciousness known as Flow

    Investigating the Efficacy of Algorithmic Student Modelling in Predicting Students at Risk of Failing in the Early Stages of Tertiary Education: Case study of experience based on first year students at an Institute of Technology in Ireland.

    Get PDF
    The application of data analytics to educational settings is an emerging and growing research area. Much of the published works to-date are based on ever-increasing volumes of log data that are systematically gathered in virtual learning environments as part of module delivery. This thesis took a unique approach to modelling academic performance; it is a first study to model indicators of students at risk of failing in first year of tertiary education, based on data gathered prior to commencement of first year, facilitating early engagement with at-risk students. The study was conducted over three years, in 2010 through 2012, and was based on a sample student population (n=1,207) aged between 18 and 60 from a range of academic disciplines. Data was extracted from both student enrolment data maintained by college administration, and an online, self-reporting, learner profiling tool developed specifically for this study. The profiling tool was administered during induction sessions for students enrolling into the first year of study. Twenty-four factors relating to prior academic performance, personality, motivation, self-regulation, learning approaches, learner modality, age and gender were considered. Eight classification algorithms were evaluated. Cross validation model accuracies based on all participants were compared with models trained on the 2010 and 2011 student cohorts, and tested on the 2012 student cohort. Best cross validation model accuracies were a Support Vector Machine (82%) and Neural Network (75%). The k-Nearest Neighbour model, which has received little attention in educational data mining studies, achieved highest model accuracy when applied to the 2012 student cohort (72%). The performance was similar to its cross validation model accuracy (72%). Model accuracies for other algorithms applied to the 2012 student cohort also compared favourably; for example Ensembles (71%), Support Vector Machine (70%) and a Decision Tree (70%). Models of subgroups by age and by academic discipline achieved higher accuracy than models of all participants, however, a larger sample size is needed to confirm results. Progressive sampling showed a sample size \u3e 900 was required to achieve convergence of model accuracy. Results showed that factors most predictive of academic performance in first year of study at tertiary education included age, prior academic performance and self-efficacy. Kinaesthetic modality was also indicative of students at risk of failing, a factor that has not been cited previously as a significant predictor of academic performance. Models reported in this study show that learner profiling completed prior to commencement of first year of study yielded informative and generalisable results that identified students at risk of failing. Additionally, model accuracies were comparable to models reported elsewhere that included data collected from student activity in semester one, confirming the validity of early student profiling
    • …
    corecore