Search CORE

1,030 research outputs found

Evaluating Centering for Information Ordering Using Corpora

Author: Chris Mellish
Grosz Barbara J
Jon Oberlander
Massimo Poesio
Nikiforos Karamanis
Strube Michael
Publication venue: 'MIT Press - Journals'
Publication date: 15/10/2008
Field of study

In this article we discuss several metrics of coherence defined using centering theory and investigate the usefulness of such metrics for information ordering in automatic text generation. We estimate empirically which is the most promising metric and how useful this metric is using a general methodology applied on several corpora. Our main result is that the simplest metric (which relies exclusively on NOCB transitions) sets a robust baseline that cannot be outperformed by other metrics which make use of additional centering-based features. This baseline can be used for the development of both text-to-text and concept-to-text generation systems. </jats:p

CiteSeerX

Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers

Author: Huang Jimmy
Jahan Israt
Laskar Md Tahmid Rahman
Peng Chun
Publication venue
Publication date: 24/08/2023
Field of study

ChatGPT is a large language model developed by OpenAI. Despite its impressive performance across various tasks, no prior work has investigated its capability in the biomedical domain yet. To this end, this paper aims to evaluate the performance of ChatGPT on various benchmark biomedical tasks, such as relation extraction, document classification, question answering, and summarization. To the best of our knowledge, this is the first work that conducts an extensive evaluation of ChatGPT in the biomedical domain. Interestingly, we find based on our evaluation that in biomedical datasets that have smaller training sets, zero-shot ChatGPT even outperforms the state-of-the-art fine-tuned generative transformer models, such as BioGPT and BioBART. This suggests that ChatGPT's pre-training on large text corpora makes it quite specialized even in the biomedical domain. Our findings demonstrate that ChatGPT has the potential to be a valuable tool for various tasks in the biomedical domain that lack large annotated data.Comment: Accepted by BioNLP@ACL 202

arXiv.org e-Print Archive

Introduction: Modeling, Learning and Processing of Text-Technological Data Structures

Author: Kühnberger Kai-Uwe
Lobin Henning
Lüngen Harald
Mehler Alexander
Storrer Angelika
Witt Andreas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/12/2015
Field of study

Researchers in many disciplines, sometimes working in close cooperation, have been concerned with modeling textual data in order to account for texts as the prime information unit of written communication. The list of disciplines includes computer science and linguistics as well as more specialized disciplines like computational linguistics and text technology. What many of these efforts have in common is the aim to model textual data by means of abstract data types or data structures that support at least the semi-automatic processing of texts in any area of written communication

Publikationsserver des Instituts für Deutsche Sprache