Search CORE

8 research outputs found

Interpretable domain adaptation via optimization over the Stiefel manifold

Author: Duivesteijn Wouter
Morik Katharina
Poelitz Christian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

In domain adaptation, the goal is to find common ground between two, potentially differently distributed, data sets. By finding common concepts present in two sets of words pertaining to different domains, one could leverage the performance of a classifier for one domain for use on the other domain. We propose a solution to the domain adaptation task, by efficiently solving an optimization problem through Stochastic Gradient Descent. We provide update rules that allow us to run Stochastic Gradient Descent directly on a matrix manifold: the steps compel the solution to stay on the Stiefel manifold. This manifold encompasses projection matrices of word vectors onto low-dimensional latent feature representations, which allows us to interpret the results: the rotation magnitude of the word vector projection for a given word corresponds to the importance of that word towards making the adaptation. Beyond this interpretability benefit, experiments show that the Stiefel manifold method performs better than state-of-the-art methods

Crossref

Ghent University Academic Bibliography

InstructExcel: A Benchmark for Natural Language Instruction in Excel

Author: Baral Chitta
Chakravarthy Rasika
Mishra Swaroop
Negreanu Carina
Nouri Elnaz
Payan Justin
Poelitz Christian
Roy Subhro
Singh Mukul
Van Durme Benjamin
Publication venue
Publication date: 22/10/2023
Field of study

With the evolution of Large Language Models (LLMs) we can solve increasingly more complex NLP tasks across various domains, including spreadsheets. This work investigates whether LLMs can generate code (Excel OfficeScripts, a TypeScript API for executing many tasks in Excel) that solves Excel specific tasks provided via natural language user instructions. To do so we introduce a new large-scale benchmark, InstructExcel, created by leveraging the 'Automate' feature in Excel to automatically generate OfficeScripts from users' actions. Our benchmark includes over 10k samples covering 170+ Excel operations across 2,000 publicly available Excel spreadsheets. Experiments across various zero-shot and few-shot settings show that InstructExcel is a hard benchmark for state of the art models like GPT-4. We observe that (1) using GPT-4 over GPT-3.5, (2) providing more in-context examples, and (3) dynamic prompting can help improve performance on this benchmark.Comment: Findings of EMNLP 2023, 18 page

arXiv.org e-Print Archive

Enhancing the possibilities of corpus-based investigations: Word sense disambiguation on query results of large text corpora

Author: Christian Poelitz
Thomas Bartz
Publication venue: Association for Computational Linguistics
Publication date: 01/01/2014
Field of study

Crossref

MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL

Author: Askari Arian
Poelitz Christian
Tang Xinye
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 11/04/2025
Field of study

Self-correction in text-to-SQL is the process of prompting large language model (LLM) to revise its previously incorrectly generated SQL, and commonly relies on manually crafted self-correction guidelines by human experts that are not only labor-intensive to produce but also limited by the human ability in identifying all potential error patterns in LLM responses. We introduce MAGIC, a novel multi-agent method that automates the creation of the self-correction guideline. MAGIC uses three specialized agents: a manager, a correction, and a feedback agent. These agents collaborate on the failures of an LLM-based method on the training set to iteratively generate and refine a self-correction guideline tailored to LLM mistakes, mirroring human processes but without human involvement. Our extensive experiments show that MAGIC's guideline outperforms expert human's created ones. We empirically find out that the guideline produced by MAGIC enhances the interpretability of the corrections made, providing insights in analyzing the reason behind the failures and successes of LLMs in self-correction

Association for the Advancement of Artificial Intelligence: AAAI Publications

Evaluating the Evaluator: Measuring LLMs’ Adherence to Task Evaluation Instructions

Author: Drosos Ian
Le Vu
McKenna Nick
Murugadoss Bhuvanashree
Negreanu Carina Suzana
Parnin Chris
Poelitz Christian
Sarkar Advait
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 11/04/2025
Field of study

LLMs-as-a-judge is a recently popularized method which replaces human judgements in task evaluation with automatic evaluation using LLMs. Due to widespread use of RLHF (Reinforcement Learning from Human Feedback), state-of-the-art LLMs like GPT4 and Llama3 are expected to have strong alignment with human preferences when prompted for a quality judgement, such as the coherence of a text. While this seems beneficial, it is not clear whether the assessments by an LLM-as-a-judge constitute only an evaluation based on the instructions in the prompts, or reflect its preference for high-quality data similar to its fine-tune data. To investigate how much influence prompting the LLMs-as-a-judge has on the alignment of AI judgements to human judgements, we analyze prompts with increasing levels of instructions about the target quality of an evaluation, for several LLMs-as-a-judge. Further, we compare to a prompt-free method using model perplexity as a quality measure instead. We aggregate a taxonomy of quality criteria commonly used across state-of-the-art evaluations with LLMs and provide this as a rigorous benchmark of models as judges. Overall, we show that the LLMs-as-a-judge benefit only little from highly detailed instructions in prompts and that perplexity can sometimes align better with human judgements than prompting, especially on textual quality

Association for the Advancement of Artificial Intelligence: AAAI Publications

Evaluating the Evaluator: Measuring LLMs' Adherence to Task Evaluation Instructions

Author: Drosos Ian
Le Vu
McKenna Nick
Murugadoss Bhuvanashree
Negreanu Carina Suzana
Parnin Chris
Poelitz Christian
Sarkar Advait
Publication venue
Publication date: 16/08/2024
Field of study

LLMs-as-a-judge is a recently popularized method which replaces human judgements in task evaluation (Zheng et al. 2024) with automatic evaluation using LLMs. Due to widespread use of RLHF (Reinforcement Learning from Human Feedback), state-of-the-art LLMs like GPT4 and Llama3 are expected to have strong alignment with human preferences when prompted for a quality judgement, such as the coherence of a text. While this seems beneficial, it is not clear whether the assessments by an LLM-as-a-judge constitute only an evaluation based on the instructions in the prompts, or reflect its preference for high-quality data similar to its fine-tune data. To investigate how much influence prompting the LLMs-as-a-judge has on the alignment of AI judgements to human judgements, we analyze prompts with increasing levels of instructions about the target quality of an evaluation, for several LLMs-as-a-judge. Further, we compare to a prompt-free method using model perplexity as a quality measure instead. We aggregate a taxonomy of quality criteria commonly used across state-of-the-art evaluations with LLMs and provide this as a rigorous benchmark of models as judges. Overall, we show that the LLMs-as-a-judge benefit only little from highly detailed instructions in prompts and that perplexity can sometimes align better with human judgements than prompting, especially on textual quality

arXiv.org e-Print Archive

Leçons de métaphysique /

Author: Kant Immanuel,1724-1804(viaf)82088490
Poelitz Karl H. L.,
Tissot Joseph,1801-1876(viaf)7397744
Publication venue: Paris : Ladrange,
Publication date
Field of study

Europeana-GoogleBook

Archivsystem Ask23

Abbildungsverzeichnis, Literaturverzeichnis, Register

Author: Aertsen Jan A
Alchina Franch JosØ
Alfonso el Sabio General Estoria
American Textbook Council
Amiama Manuel A
Ampelius Lucius
Angenend Arnold
Anghiera Pietro
Anglicus Robertus
Anklam Ewa
Aquin Summa Theologica
Arellano Hoffmann
Arnoldsson Sverker
Ascoli
Aufgebauer Peter
Augustinus Aurelius
Aurillac
Autun
Bacon Roger
Bakewell Peter
Ballµn Romeo
Berelson Bernard
Bernhard Roland
Berthold von Regensburg V«
Betten Francis
Bieri Hans
Bishop Louise M
Bitterli Urs
Blumesberger Susanne
Boethius
Borah
Breen Timothy Hall
Broda Johanna
Bry Theodore De
Büschges Christian
Cajani Luigi
Capella Martianus
Carbia Rómulo
Carrasco David
Caso Alfonso
Cassiodorus Magnus Aurelius
Chaunu Pierre
Chiari Joseph
Cilleßen Wolfgang
Clayton Lawrence A
Clendinnen Inga
Cole Jeffrey Alan
Collingwood Robin George
Comestor Petrus
Committee on the Study of Teaching Materials on Inter-American Subjects
Cook
Cook Noble David
Cook Sherburne
Cosgrove Denis Edmund
Crane Eugene R
Cro Stelio
Crosby Alfred W
Córdova EfrØn
Danto Arthur
Davidson Miles
Davis
Delgado Mariano
Denevan William Maxfield
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders
Ders Christopher Brooks
Ders Cristóbal Colón
Ders Die Religion«
Ders Hg.
Ders Indias
Ders Indias
Ders Introduction«
Ders Metahistory
Ders Sprache
Ders Wissens
Ders µfica«
Dickmann Fritz
Dicuil
Dies
Dies
Dies
Dies
Dies
Dies
Dies
Dies
Dies
Donoso Anes Rafael
Duffy John
Duverger Christian
Díaz del Castillo Bernal
Eckel
Edelmayer Friedrich
Edson Evelyn
Efron Noah J
Eliade
Eltis David
Englisch Brigitte
Engstrand Iris
Erhardt Alexandra
Ertler Klaus-Dieter
EspaÇol Bouche Luis
Fazio Mariano
Fernµndez Armesto Felipe
Fernµndez Retamar Roberto
Finoccchiaro Maurice
Flohr Markus
Folayan Osagie Iyunolu
Foucault Michel
Frankl Victor
Franklin James
Franzbach Martin
Garavaglia
García Acosta Virginia
García de la Huerta Izquierdo
Gautier de Châtillon Alexandreis
Gewecke Frauke
Gibson Charles
Gillespie Susan D
Goebl Ted
Gonzalez Torres Yolotl
Grant Edward
Gründer Horst
Guerra Francisco
GutiØrrez Gustavo
Gvirtz Silvina
Gómez-Centurión Carlos JimØnez
Hanke Lewis
Harris Olivia
Hassig Ross
Heilbron John L
Hemingway Donald W
Henige David
Hermann der Lahme
Hernµndez Cuevas Juan Carlos
Heuberger Valeria
Heydenreich Titus
Hildegard von Bingen
Hinz Felix
Historical Association (Great Britain).
Hoffmann-Arellano
Holthoff-Stenger Monika
Horlacher Cornelis
Horstmann Carl
Hubert Rainer
Humboldt
Höhne Thomas
Höhne Thomas
Idrisi Abu
Iggers Georg
Irving Washington
Isidor von Sevilla
Jacobmeyer Wolfgang
Jhi Jun-Hyung
Jones Howard
Kahle Günter
Kelly Louis
Kirn Paul
Kleinert Andreas
Koch Horst
Kocka Jürgen
Konetzke Richard
Konrad von Megenberg
Krause
Kriegleder Wynfrid
Kupperman Karen Ordahl
Kurscheid Georg
Kurtz Donald
König Hans-Joachim
Lactantius Cælius Firmianus
Lafaye Jacques
Lamnek Siegfried
Landwehr Achim
Las Casas
Latini Brunetto
Lehmann Hartmut
Lemberger
Lendzian
Lienemann Wolfgang
Lindberg David
Lipschutz Alejandro
Lira GuillØn AndrØs
Litterscheid Claus
Llosada Angel
Lockhart
Lockhart James
Loewen James
Lovell William George
Lustig Wolf
Malagón Pinzón Miguel
MallØe Rainer
Maltby William
Manzano Baena Laura
Markom Christa
Martínez JosØ Luis
Marzal Manuel María
Matthew
Maurus Hrabanus
Maybaum
Mayer Franz Martin
McCaa Robert
McLaughin Mary
Medina Juan Antonio
Meissner Jochen
Menninger Annerose
MenØndez Pidal Ramón
Messinger Sandra
Michelet Jules
Milhou Alain
Mira Caballos Esteban
Montzka
Morawietz
Muldoon James
Müller Karl-Heinz
Müller-Beck Hansjürgen
Navarro Brotóns
Neff William Lee
Nepomucky Ernst
Nikolaus Kopernikus Gesamtausgabe
Nipperdey Thomas
Nolde Dorothea
Numbers Ronald
Nösselt Friedrich
Obendorfer Heinz
Obeyesekere Gananath
Otte Enrique
Oudjik
Pagden Anthony
Paine Thomas
Perreault Melanie
Petersen Thomas
Pflüger Christine
Philipp Hans
Pieper
Pieper Renate
Pietschmann Horst
Platon Timaios
Poelitz
Pokorny Hans
Powell Philip W
Prause Gerhard
Prem Hanns
Prescott William Hickling
Puetz Wilhelm
Pöggeler Franz
Quackenbos George Payn
Radkau García Verena
Ramos PØrez Demetrio
Ranke
Rech Bruno
Reinhard Wolfgang
Reinprecht Christoph
Reiss Wolfram
Restall Matthew
Riekenberg Michael
Ritzelnadel Friedrich August
Roberts Susan A
Robicsek Francis
Rosenblat Angel
Rosweyde Heribert
Russell Colin
Russell Jeffrey Burton
Russo Lucio
Rutt John Towill
Räthzel Nora
Rüsen Jörn
Sacrobosco Johannes
Saint-Lu AndrØ
Sandberg Brian
Satjukow Silke
Scelle Georges
Scheipl
Schiller Friedrich
Schmale Wolfgang
Schmid Heinz Dieter
Schmidt Peer
Schmitthenner
Schopp Georg Michael
Schröckenfuchs
Schröder Susan
Schuh Adam
Schulze Schneider Ingrid
Schulze Winfried
Schwartz Stuart B
Scottus Eriugena Johannes
Seed Patricia
Senger Hans Gerhard
Serna Arnaiz Mercedes
Shackelford Jole
Shea William
Simek Rudolf
Simonyi Kµroly
Sinatra Frank
Singer Barry
Sinobas Manuel
Sint Josef
SobrequØs Vidal
Spiegel Gabrielle Michele
Spielvogel Jackson J
Staatengeschichte III
Stackelberg Jürgen
Stahel Albert Alexander
Stahl William
Stark Rodney
Stearns Peter N
Stein Gerd
Stenzel Werner
Stoll Eva
Straub Eberhard
Straub Jürgen
Sublimis Deus«
SØjournØ Laurette
Sütterlin Berthold
Tatsch
Teistler Gisela
Theodosius Macrobius Ambrosius
Thernstrom Stephan
Torge Wolfgang
Townsend Camilla
Tupetz Theodor
Tänzler Jade-Yasmin
Uberti Fazio
Utzt Susanne
Vaca de Osma JosØ Antonio
Varela Consuelo
Vaughan Alden T
Venerabilis Beda
Verlinden Charles
Vicente Castro
Vierhaus Rudolf
Vogel Klaus Anselm
Vollet Matthias
Voltaire
Von den Brincken Anna-Dorothee
Weber Bruno
Weber Georg
Weissensteiner Fritz
Wentworth Higgins Thomas
White Hayden
Wiater Werner
Wilson
Wirth Diane E
Wirth Fremont Philip
Wohlfeil Rainer
Wolf Werner
Worcester Joseph
Woynar Karl
Wright
Zambardino Rudolph A
Zeeden Ernst Walter
Zeehe Andreas
Publication venue: 'Vandenhoeck & Ruprecht GmbH & Co, KG'
Publication date
Field of study

Crossref