Search CORE

19 research outputs found

Improving Retrieval-Based Question Answering with Deep Inference Models

Author: Pirtoaca George-Sebastian
Rebedea Traian
Ruseti Stefan
Publication venue
Publication date: 06/05/2019
Field of study

Question answering is one of the most important and difficult applications at the border of information retrieval and natural language processing, especially when we talk about complex science questions which require some form of inference to determine the correct answer. In this paper, we present a two-step method that combines information retrieval techniques optimized for question answering with deep learning models for natural language inference in order to tackle the multi-choice question answering in the science domain. For each question-answer pair, we use standard retrieval-based models to find relevant candidate contexts and decompose the main problem into two different sub-problems. First, assign correctness scores for each candidate answer based on the context using retrieval models from Lucene. Second, we use deep learning architectures to compute if a candidate answer can be inferred from some well-chosen context consisting of sentences retrieved from the knowledge base. In the end, all these solvers are combined using a simple neural network to predict the correct answer. This proposed two-step model outperforms the best retrieval-based solver by over 3% in absolute accuracy.Comment: 8 pages, 2 figures, 8 tables, accepted at IJCNN 201

arXiv.org e-Print Archive

Crossref

QAnswer -Enhanced Entity Matching for Question Answering over Linked Data

Author: Alexandru Mirea
Stefan Ruseti
Stefan Trausan-Matu
Traian Rebedea
Publication venue
Publication date: 01/05/2020
Field of study

Abstract. QAnswer is a question answering system that uses DBpedia as a knowledge base and converts natural language questions into a SPARQL query. In order to improve the match between entities and relations and natural language text, we make use of Wikipedia to extract lexicalizations of the DBpedia entities and then match them with the question. These entities are validated on the ontology, while missing ones can be inferred. The proposed system was tested in the QALD-5 challenge and it obtained a F1 score of 0.30, which placed QAnswer in the second position in the challenge, despite the fact that the system used only a small subset of the properties in DBpedia, due to the long extraction process

CiteSeerX

Automated essay scoring in applied games:Reducing the teacher bandwidth problem in online training

Author: Dascalu Mihai
Kurvers Hub
Ruseti Stefan
Trausan-Matu Stefan
Westera W.
Publication venue: 'Elsevier BV'
Publication date: 01/08/2018
Field of study

This paper presents a methodology for applying automated essay scoring in educational settings. The methodology was tested and validated on a dataset of 173 reports (in Dutch language) that students have created in an applied game on environmental policy. Natural Language Processing technologies from the ReaderBench framework were used to generate an extensive set of textual complexity indices for each of the reports. Afterwards, different machine learning algorithms were used to predict the scores. By combining binary classification (pass or fail) and a probabilistic model for precision, a trade-off can be made between validity of automated score prediction (precision) and the reduction of teacher workload required for manual assessment. It was found from the sample that substantial workload reduction can be achieved, while preserving high precision: allowing for a precision of 95% or higher would already reduce the teacher’s workload to 74%; lowering precision to 80% produces a workload reduction of 50%

Open University of the Netherlands Research Portal

Crossref

Enhancing Free-text Interactions in a Communication Skills Learning Environment

Author: Dascălu Mihai
Gutu-Robu Gabriel
Hulsbergen Michiel H
Jeuring J.T.
Lala R.
Ruseti Stefan
Van Dortmont Jordy
Van Geest Marcell
Publication venue: 'International Society of the Learning Sciences'
Publication date: 21/06/2019
Field of study

Learning environments frequently use gamification to enhance user interactions.Virtual characters with whom players engage in simulated conversations often employ prescripted dialogues; however, free user inputs enable deeper immersion and higher-order cognition. In our learning environment, experts developed a scripted scenario as a sequence of potential actions, and we explore possibilities for enhancing interactions by enabling users to type free inputs that are matched to the pre-scripted statements using Natural Language Processing techniques. In this paper, we introduce a clustering mechanism that provides recommendations for fine-tuning the pre-scripted answers in order to better match user inputs

Open University of the Netherlands Research Portal

Processing open text input in a scripted communication scenario:Extended abstract

Author: Alex Beatrice
Gutu Gabriel
Heemskerk F.P.M.
Jeuring J.T.
Lala Raja
Mihai Dascalu
Ruseti Stefan
Tobin Richard
Van Dortmont Jordy
Van Geest Marcell
Publication venue
Publication date: 01/11/2018
Field of study

Open University of the Netherlands Research Portal

Processing open text input in a scripted communication scenario:Extended abstract

Author: Alex Beatrice
Gutu Gabriel
Heemskerk F.P.M.
Jeuring J.T.
Lala Raja
Mihai Dascalu
Ruseti Stefan
Tobin Richard
Van Dortmont Jordy
Van Geest Marcell
Publication venue
Publication date: 01/11/2018
Field of study

Open University of the Netherlands Research Portal

RoSummary: Control Tokens for Romanian News Summarization

Author: Mihai Alexandru Niculescu
Mihai Dascalu
Stefan Ruseti
Publication venue: MDPI AG
Publication date: 01/12/2022
Field of study

Significant progress has been achieved in text generation due to recent developments in neural architectures; nevertheless, this task remains challenging, especially for low-resource languages. This study is centered on developing a model for abstractive summarization in Romanian. A corresponding dataset for summarization is introduced, followed by multiple models based on the Romanian GPT-2, on top of which control tokens were considered to specify characteristics for the generated text, namely: counts of sentences and words, token ratio, and n-gram overlap. These are special tokens defined in the prompt received by the model to indicate traits for the text to be generated. The initial model without any control tokens was assessed using BERTScore (F1 = 73.43%) and ROUGE (ROUGE-L accuracy = 34.67%). Control tokens improved the overall BERTScore to 75.42% using , while the model was influenced more by the second token specified in the prompt when performing various combinations of tokens. Six raters performed human evaluations of 45 generated summaries with different models and decoding methods. The generated texts were all grammatically correct and consistent in most cases, while the evaluations were promising in terms of main idea coverage, details, and cohesion. Paraphrasing still requires improvements as the models mostly repeat information from the reference text. In addition, we showcase an exploratory analysis of the generated summaries using one or two specific control tokens

Directory of Open Access Journals

Identifying the Structure of CSCL Conversations Using String Kernels

Author: Gabriel Gutu-Robu
Mihai Dascalu
Mihai Masala
Stefan Ruseti
Stefan Trausan-Matu
Traian Rebedea
Publication venue: 'MDPI AG'
Publication date: 20/12/2021
Field of study

Computer-Supported Collaborative Learning tools are exhibiting an increased popularity in education, as they allow multiple participants to easily communicate, share knowledge, solve problems collaboratively, or seek advice. Nevertheless, multi-participant conversation logs are often hard to follow by teachers due to the mixture of multiple and many times concurrent discussion threads, with different interaction patterns between participants. Automated guidance can be provided with the help of Natural Language Processing techniques that target the identification of topic mixtures and of semantic links between utterances in order to adequately observe the debate and continuation of ideas. This paper introduces a method for discovering such semantic links embedded within chat conversations using string kernels, word embeddings, and neural networks. Our approach was validated on two datasets and obtained state-of-the-art results on both. Trained on a relatively small set of conversations, our models relying on string kernels are very effective for detecting such semantic links with a matching accuracy larger than 50% and represent a better alternative to complex deep neural networks, frequently employed in various Natural Language Processing tasks where large datasets are available

Multidisciplinary Digital Publishing Institute