230 research outputs found

    Wide-Scale Automatic Analysis of 20 Years of ITS Research

    Get PDF
    The analysis of literature within a research domain can provide significant value during preliminary research. While literature reviews may provide an in-depth understanding of current studies within an area, they are limited by the number of studies which they take into account. Importantly, whilst publications in hot areas abound, it is not feasible for an individual or team to analyse a large volume of publications within a reasonable amount of time. Additionally, major publications which have gained a large number of citations are more likely to be included in a review, with recent or fringe publications receiving less inclusion. We provide thus an automatic methodology for the large-scale analysis of literature within the Intelligent Tutoring Systems (ITS) domain, with the aim of identifying trends and areas of research from a corpus of publications which is significantly larger than is typically presented in conventional literature reviews. We illustrate this by a novel analysis of 20 years of ITS research. The resulting analysis indicates a significant shift of the status quo of research in recent years with the advent of novel neural network architectures and the introduction of MOOCs

    Reflecting Comprehension through French Textual Complexity Factors

    No full text
    International audienceResearch efforts in terms of automatic textual complexity analysis are mainly focused on English vocabulary and few adaptations exist for other languages. Starting from a solid base in terms of discourse analysis and existing textual complexity assessment model for English, we introduce a French model trained on 200 documents extracted from school manuals pre-classified into five complexity classes. The underlying textual complexity metrics include surface, syntactic, morphological, semantic and discourse specific factors that are afterwards combined through the use of Support Vector Machines. In the end, each factor is correlated to pupil comprehension metrics scores, spanning throughout multiple classes, therefore creating a clearer perspective in terms of measurements impacting the perceived difficulty of a given text. In addition to purely quantitative surface factors, specific parts of speech and cohesion have proven to be reliable predictors of learners' comprehension level, creating nevertheless a strong background for building dependable French textual complexity models

    ReaderBench Learns Dutch: Building a Comprehensive Automated Essay Scoring System for Dutch Language

    Full text link
    Automated Essay Scoring has gained a wider applicability and usage with the integration of advanced Natural Language Processing techniques which enabled in-depth analyses of discourse in order capture the specificities of written texts. In this paper, we introduce a novel Automatic Essay Scoring method for Dutch language, built within the Readerbench framework, which encompasses a wide range of textual complexity indices, as well as an automated segmentation approach. Our method was evaluated on a corpus of 173 technical reports automatically split into sections and subsections, thus forming a hierarchical structure on which textual complexity indices were subsequently applied. The stepwise regression model explained 30.5% of the variance in students’ scores, while a Discriminant Function Analysis predicted with substantial accuracy (75.1%) whether they are high or low performance students.This study is part of the RAGE project. The RAGE project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 644187. This publication reflects only the author's view. The European Commission is not responsible for any use that may be made of the information it contains

    Incremental probabilistic Latent Semantic Analysis for video retrieval

    Get PDF
    Recent research trends in Content-based Video Retrieval have shown topic models as an effective tool to deal with the semantic gap challenge. In this scenario, this paper has a dual target: (1) it is aimed at studying how the use of different topic models (pLSA, LDA and FSTM) affects video retrieval performance; (2) a novel incremental topic model (IpLSA) is presented in order to cope with incremental scenarios in an effective and efficient way. A comprehensive comparison among these four topic models using two different retrieval systems and two reference benchmarking video databases is provided. Experiments revealed that pLSA is the best model in sparse conditions, LDA tend to outperform the rest of the models in a dense space and IpLSA is able to work properly in both cases

    Predicting Comprehension from Students’ Summaries

    No full text
    International audienceComprehension among young students represents a key component of their formation throughout the learning process. Moreover, scaffolding students as they learn to coherently link information, while organically construct- ing a solid knowledge base, is crucial to students’ development, but requires regular assessment and progress tracking. To this end, our aim is to provide an automated solution for analyzing and predicting students’ comprehension levels by extracting a combination of reading strategies and textual complexity factors from students’ summaries. Building upon previous research and enhancing it by incorporating new heuristics and factors, Support Vector Machine classification models were used to validate our assumptions that automatically identified reading strategies, together with textual complexity indices applied on students’ summaries, represent reliable estimators of comprehension
    • …
    corecore