170 research outputs found

    Cognitive diagnosis modeling and applications to assessing learning

    Get PDF
    Chapter 1: Cognitive diagnosis models (CDMs) are restricted latent class models designed to assess test takers' mastery on a set of skills or attributes. With a wide range of applications in education and in psychopathology, various CDMs have been proposed and fitted to response data from different scenarios. Recently, Xu (2017) derived sufficient conditions for identifying model parameters of a restricted latent class model, which generalizes many existing CDMs. We propose a Bayesian estimation algorithm for this restricted latent class model. The model is applied to the Examination for the Certificate of Proficiency in English language assessment data (e.g., Henson et al., 2007). Chapter 2: There has been a growing interest in measuring students' growth over time. CDMs were traditionally used to measure students' skill mastery at a static time point, but recently, they have been used in many longitudinal models to track students' changes in skill acquisition over time. In this chapter, we propose a longitudinal learning model, where different kinds of skill hierarchies were considered, and the reduced-reparameterized unified model (r-RUM) or the noisty input, deterministic-``and''-gate (NIDA) model is used to measure students' skill mastery at each time point. This model is fitted to the Spatial Rotation data set (e.g., Wang et al., 2016), and different models were compared using Bayesian model comparison methods. Chapter 3: The increased popularity of computer-based testing has enabled researchers to collect various types of process data, including response times. Extensive research has been conducted on the joint modeling of response accuracy and response times. Recent research on CDMs begins to explore the relationship between speed and accuracy to understand students’ fluency of applying the mastered skills, in addition to mastery information, in a learning environment. In this chapter, we propose a mixture hidden Markov Diagnostic Classification Model framework for learning with response times and response accuracy. Such a model accounts for the heterogeneities in learning styles among students by modeling the different learning and response behaviors among subgroups. The proposed model is evaluated through a simulation study in terms of parameter recovery. Chapter 4: We introduce an R package, hmcdm, that can be used to fit several longitudinal models for learning under the cognitive diagnosis framework. The package allows users to simulate item responses (and response times if applicable) under several learning models, to fit the models using Markov Chain Monte Carlo (MCMC) methods, to compute point estimates of parameters based on the MCMC samples, and to evaluate and compare different models using Deviance Information Criterion and posterior predictive probabilities

    Evaluating NLG Evaluation Metrics: A Measurement Theory Perspective

    Full text link
    We address the fundamental challenge in Natural Language Generation (NLG) model evaluation, the design and validation of evaluation metrics. Recognizing the limitations of existing metrics and issues with human judgment, we propose using measurement theory, the foundation of test design, as a framework for conceptualizing and evaluating the validity and reliability of NLG evaluation metrics. This approach offers a systematic method for defining "good" metrics, developing robust metrics, and assessing metric performance. In this paper, we introduce core concepts in measurement theory in the context of NLG evaluation and key methods to evaluate the performance of NLG metrics. Through this framework, we aim to promote the design, evaluation, and interpretation of valid and reliable metrics, ultimately contributing to the advancement of robust and effective NLG models in real-world settings

    Modeling Learner Heterogeneity: A Mixture Learning Model With Responses and Response Times

    Get PDF
    The increased popularity of computer-based testing has enabled researchers to collect various types of process data, including test takers' reaction time to assessment items, also known as response times. In recent studies, the relationship between speed and accuracy in a learning setting was explored to understand students' fluency changes over time in applying the mastered skills in addition to skill mastery. This can be achieved by modeling the changes in response accuracy and response times throughout the learning process. We propose a mixture learning model that utilizes the response times and response accuracy. Such a model accounts for the heterogeneities in learning styles among learners and may provide instructors with valuable information, which can be used to design individualized instructions. A Bayesian modeling framework is developed for parameter estimation and the proposed model is evaluated through a simulation study and is fitted to a real data set collected from a computer-based learning system for spatial rotation skills

    Bacillus megaterium BMJBN02 induces the resistance of grapevine against downy mildew

    Get PDF
    Grape downy mildew caused by Plasmopara viticola is one of the most destructive diseases of grapes. All grape cultivars are susceptible to P. viticola. However, the resistance of grape plants could be induced in plant defense with some help of microbes. In this study, Bacillus megaterium BMJBN02 obtained from farmland soil was shown to regulate the resistance of grapevine against downy mildew. The salicylic acid (SA) content and the expression of pathogenesis-related (PR) genes of grapes under different treatments were examined using high-performance liquid chromatography-mass spectrometry (HPLC-MS) and reverse transcription- quantitative polymerase chain reaction (RT-qPCR), and it was found that SA content and the expression of PR genes could play a role in regulating the resistance of grapevine against downy mildew. The five-year plot experiment showed that the resistance effectiveness of isolate BMJBN02 was approximately equal to that of 0.1 % nicotinyl morpholine (commercial fungicide). Therefore, this study provides a valuable candidate method that uses B. megaterium BMJBN02 by regulating the resistance of grape against downy mildew for quality and yield of grape in commercial productivity

    ProcData: An R Package for Process Data Analysis

    Full text link
    Process data refer to data recorded in the log files of computer-based items. These data, represented as timestamped action sequences, keep track of respondents' response processes of solving the items. Process data analysis aims at enhancing educational assessment accuracy and serving other assessment purposes by utilizing the rich information contained in response processes. The R package ProcData presented in this article is designed to provide tools for processing, describing, and analyzing process data. We define an S3 class "proc" for organizing process data and extend generic methods summary and print for class "proc". Two feature extraction methods for process data are implemented in the package for compressing information in the irregular response processes into regular numeric vectors. ProcData also provides functions for fitting and making predictions from a neural-network-based sequence model. These functions call relevant functions in package keras for constructing and training neural networks. In addition, several response process generators and a real dataset of response processes of the climate control item in the 2012 Programme for International Student Assessment are included in the package

    Identifying student profiles in a digital mental rotation task: insights from the 2017 NAEP math assessment

    Get PDF
    Mental rotation (MR), a key aspect of spatial reasoning, is highly predictive of success in STEM fields. This study analyzed strategies employed by 27,600 eighth-grade students during a digital MR task from the 2017 National Assessment of Educational Progress (NAEP) in mathematics. Utilizing K-means cluster analysis to categorize behavioral and performance patterns, we identified four distinct profiles: Cognitive Offloaders (15% of the sample), Internal Visualizers (55%), External Visualizers (5%), and Non-Triers (25%). Cognitive Offloaders, skilled at minimizing cognitive load by eliminating incorrect options, demonstrated the highest MR accuracy rates at 45%. Internal Visualizers, relying less on digital tools and more on mental strategies, achieved robust performance with an average score of 38%. External Visualizers, despite their extensive use of assistive tools and greater time investment, scored an average of 36%. Non-Triers showed minimal engagement and correspondingly the lowest performance, averaging 29%. These findings not only underscore the diverse strategies students adopt in solving MR tasks but also emphasize the need for educational strategies that are tailored to accommodate different cognitive styles. By integrating MR training into the curriculum and enhancing teacher preparedness to support diverse learning needs, this study advocates for educational reforms to promote equitable outcomes in mathematics and broader STEM fields
    • …
    corecore