DocSCAN: Unsupervised Text Classification via Learning from Neighbors
We introduce DocSCAN, a completely unsupervised text classification approach
using Semantic Clustering by Adopting Nearest-Neighbors (SCAN). For each
document, we obtain semantically informative vectors from a large pre-trained
language model. Similar documents have proximate vectors, so neighbors in the
representation space tend to share topic labels. Our learnable clustering
approach uses pairs of neighboring datapoints as a weak learning signal. The
proposed approach learns to assign classes to the whole dataset without
provided ground-truth labels. On five topic classification benchmarks, we
improve on various unsupervised baselines by a large margin. In datasets with
relatively few and balanced outcome classes, DocSCAN approaches the performance
of supervised classification. The method fails for other types of
classification, such as sentiment analysis, pointing to important conceptual
and practical differences between classifying images and texts.
Comment: in Proceedings of the 18th Conference on Natural Language Processing (KONVENS 2022), Potsdam, Germany.
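To make the neighbor-based weak supervision concrete, the following is a minimal sketch of the idea, assuming SBERT-style embeddings and a SCAN-style objective (consistency between a document and its nearest neighbors plus an entropy term for balanced clusters). The encoder choice and hyperparameters are illustrative assumptions, not the authors' released implementation.
```python
# Minimal sketch of SCAN-style unsupervised document clustering
# (illustrative only; not the official DocSCAN code).
import torch
import torch.nn.functional as F
from sklearn.neighbors import NearestNeighbors
from sentence_transformers import SentenceTransformer

def docscan_sketch(texts, n_classes=5, k=5, epochs=10):
    # 1) Semantically informative vectors from a pre-trained encoder (model choice is an assumption).
    emb_np = SentenceTransformer("all-MiniLM-L6-v2").encode(texts)
    emb = torch.tensor(emb_np, dtype=torch.float32)

    # 2) Nearest neighbors in representation space act as the weak learning signal.
    knn = NearestNeighbors(n_neighbors=k + 1).fit(emb_np)
    neighbors = torch.tensor(knn.kneighbors(emb_np, return_distance=False)[:, 1:])

    # 3) Clustering head trained to make a document and its neighbors agree,
    #    while an entropy term keeps cluster assignments balanced.
    head = torch.nn.Linear(emb.shape[1], n_classes)
    opt = torch.optim.Adam(head.parameters(), lr=1e-3)
    for _ in range(epochs):
        probs = F.softmax(head(emb), dim=1)                           # (N, C)
        agreement = (probs.unsqueeze(1) * probs[neighbors]).sum(-1)   # (N, k)
        consistency = -torch.log(agreement + 1e-8).mean()
        mean_p = probs.mean(0)
        neg_entropy = (mean_p * torch.log(mean_p + 1e-8)).sum()
        loss = consistency + 2.0 * neg_entropy
        opt.zero_grad()
        loss.backward()
        opt.step()

    # Assign a class to every document without ground-truth labels.
    return head(emb).argmax(dim=1).tolist()
```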
The Law and NLP: Bridging Disciplinary Disconnects
Legal practice is intrinsically rooted in the fabric of language, yet legal
practitioners and scholars have been slow to adopt tools from natural language
processing (NLP). At the same time, the legal system is experiencing an access
to justice crisis, which could be partially alleviated with NLP. In this
position paper, we argue that the slow uptake of NLP in legal practice is
exacerbated by a disconnect between the needs of the legal community and the
focus of NLP researchers. In a review of recent trends in the legal NLP
literature, we find limited overlap between the legal NLP community and legal
academia. Our interpretation is that some of the most popular legal NLP tasks
fail to address the needs of legal practitioners. We discuss examples of legal
NLP tasks that promise to bridge disciplinary disconnects and highlight
interesting areas for legal NLP research that remain underexplored.
Enhancing Public Understanding of Court Opinions with Automated Summarizers
Written judicial opinions are an important tool for building public trust in
court decisions, yet they can be difficult for non-experts to understand. We
present a pipeline for using an AI assistant to generate simplified summaries
of judicial opinions. These are more accessible to the public and more easily
understood by non-experts. We show in a survey experiment that the simplified
summaries help respondents understand the key features of a ruling. We discuss
how to integrate legal domain knowledge into studies using large language
models. Our results suggest a role both for AI assistants to inform the public and for lawyers to guide the process of generating accessible summaries.
Revisiting Automated Topic Model Evaluation with Large Language Models
Topic models are used to make sense of large text collections. However,
automatically evaluating topic model output and determining the optimal number
of topics both have been longstanding challenges, with no effective automated
solutions to date. This paper proposes using large language models to evaluate
such output. We find that large language models appropriately assess the
resulting topics, correlating more strongly with human judgments than existing
automated metrics. We then investigate whether we can use large language models
to automatically determine the optimal number of topics. We automatically
assign labels to documents and find that choosing the configuration with the purest labels returns reasonable values for the optimal number of topics.
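The selection rule described above can be sketched roughly as follows; `fit_topic_model` and `llm_label` stand in for an arbitrary topic model and an LLM labeling call, and the purity measure is a paraphrase of the abstract rather than the paper's exact procedure.
```python
# Sketch: choose the number of topics whose LLM-assigned document labels are purest.
# `fit_topic_model(docs, k)` and `llm_label(doc, top_words)` are hypothetical callables.
from collections import Counter

def purity(assignments):
    """Average, over topics, of the share held by each topic's most frequent LLM label."""
    scores = []
    for topic_id in {t for t, _ in assignments}:
        labels = [lbl for t, lbl in assignments if t == topic_id]
        scores.append(Counter(labels).most_common(1)[0][1] / len(labels))
    return sum(scores) / len(scores)

def choose_num_topics(docs, fit_topic_model, llm_label, candidate_ks=(10, 20, 50, 100)):
    best_k, best_score = None, -1.0
    for k in candidate_ks:
        model = fit_topic_model(docs, k)                 # hypothetical topic model (e.g. LDA)
        assignments = []
        for doc in docs:
            topic_id = model.most_likely_topic(doc)      # hypothetical accessor
            assignments.append((topic_id, llm_label(doc, model.top_words(topic_id))))
        score = purity(assignments)
        if score > best_score:
            best_k, best_score = k, score
    return best_k
```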
Enhancing large language models with climate resources
Large language models (LLMs) have significantly transformed the landscape of artificial intelligence by demonstrating their ability to generate human-like text across diverse topics. However, despite their impressive capabilities, LLMs lack recent information and often employ imprecise language, which can be detrimental in domains where accuracy is crucial, such as climate change. In this study, we make use of recent ideas to harness the potential of LLMs by viewing them as agents that access multiple sources, including databases containing recent and precise information about organizations, institutions, and companies. We demonstrate the effectiveness of our method through a prototype agent that retrieves emission data from ClimateWatch (this https URL) and leverages general Google search. By integrating these resources with LLMs, our approach overcomes the limitations associated with imprecise language and delivers more reliable and accurate information in the critical domain of climate change. This work paves the way for future advancements in LLMs and their application in domains where precision is of paramount importance.
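A minimal sketch of the agent pattern described above, assuming three injected callables: an emissions lookup against ClimateWatch, a web-search wrapper, and the LLM itself. The routing rule and prompt are illustrative, not the paper's implementation; a full agent would typically let the LLM pick the tool.
```python
# Sketch of an LLM agent grounded in external climate resources.
# `fetch_emissions`, `web_search`, and `llm` are hypothetical callables supplied by the caller.

def answer_climate_question(question: str, fetch_emissions, web_search, llm) -> str:
    # Crude tool routing; a real agent would let the LLM decide which source to consult.
    if "emission" in question.lower():
        evidence = fetch_emissions(question)      # e.g. a ClimateWatch API lookup
    else:
        evidence = web_search(question)           # e.g. a general Google search wrapper

    prompt = (
        "Answer the question using only the evidence below, citing concrete numbers "
        "where available, and say so if the evidence is insufficient.\n\n"
        f"Evidence:\n{evidence}\n\nQuestion: {question}"
    )
    return llm(prompt)
```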
Automated Fact-Checking of Climate Change Claims with Large Language Models
This paper presents Climinator, a novel AI-based tool designed to automate
the fact-checking of climate change claims. Utilizing an array of Large
Language Models (LLMs) informed by authoritative sources like the IPCC reports
and peer-reviewed scientific literature, Climinator employs an innovative
Mediator-Advocate framework. This design allows Climinator to effectively
synthesize varying scientific perspectives, leading to robust, evidence-based
evaluations. Our model demonstrates remarkable accuracy when testing claims
collected from Climate Feedback and Skeptical Science. Notably, when
integrating an advocate with a climate science denial perspective in our
framework, Climinator's iterative debate process reliably converges towards
scientific consensus, underscoring its adeptness at reconciling diverse
viewpoints into science-based, factual conclusions. While our research is
subject to certain limitations and necessitates careful interpretation, our
approach holds significant potential. We hope to stimulate further research and
encourage exploring its applicability in other contexts, including political
fact-checking and legal domains.
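The Mediator-Advocate loop can be sketched schematically as below: each advocate assesses the claim from its own evidence base, a mediator synthesizes a verdict, and the loop repeats until the verdict stops changing. The prompts, the convergence check, and the injected `llm` callable are assumptions; the actual Climinator prompts and stopping rule may differ.
```python
# Schematic Mediator-Advocate fact-checking loop (illustrative only).
# `advocates` maps a perspective name (e.g. "IPCC") to a retrieval function over its sources;
# `llm` is a hypothetical text-in/text-out model call.

def fact_check(claim: str, advocates: dict, llm, max_rounds: int = 3) -> str:
    verdict = "(no verdict yet)"
    for _ in range(max_rounds):
        arguments = []
        for name, retrieve in advocates.items():
            evidence = retrieve(claim)
            arguments.append(llm(
                f"As the {name} advocate, assess this claim using the evidence provided.\n"
                f"Claim: {claim}\nEvidence: {evidence}\nCurrent verdict: {verdict}"
            ))
        new_verdict = llm(
            f"As mediator, synthesize the advocates' assessments into an evidence-based "
            f"verdict on the claim '{claim}':\n" + "\n---\n".join(arguments)
        )
        if new_verdict == verdict:        # crude convergence check
            break
        verdict = new_verdict
    return verdict
```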
CHATREPORT: Democratizing sustainability disclosure analysis through LLM-based tools
In the face of climate change, are companies really taking substantial steps toward more sustainable operations? A comprehensive answer lies in the dense, information-rich landscape of corporate sustainability reports. However, the sheer volume and complexity of these reports make human analysis very costly. Therefore, only a few entities worldwide have the resources to analyze these reports at scale, which leads to a lack of transparency in sustainability reporting. Empowering stakeholders with LLM-based automatic analysis tools can be a promising way to democratize sustainability report analysis. However, developing such tools is challenging due to (1) the hallucination of LLMs and (2) the inefficiency of bringing domain experts into the AI development loop. In this paper, we introduce ChatReport, a novel LLM-based system to automate the analysis of corporate sustainability reports, addressing existing challenges by (1) making the answers traceable to reduce the harm of hallucination and (2) actively involving domain experts in the development loop. We make our methodology, annotated datasets, and generated analyses of 1015 reports publicly available.
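One way to obtain the traceability described above is plain retrieval-augmented answering: retrieve the most relevant report chunks, answer only from them, and return the chunk identifiers with the answer. The embedding model and the injected `llm` callable are assumptions; this is a sketch of the general pattern, not the ChatReport implementation.
```python
# Sketch of retrieval-augmented, traceable report analysis (illustrative placeholders).
import numpy as np
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("all-MiniLM-L6-v2")   # embedding model choice is an assumption

def answer_with_sources(question, report_chunks, llm, top_k=4):
    # Rank report chunks by similarity to the question.
    chunk_emb = encoder.encode(report_chunks, normalize_embeddings=True)
    q_emb = encoder.encode([question], normalize_embeddings=True)[0]
    top_ids = np.argsort(chunk_emb @ q_emb)[::-1][:top_k]

    # Answer only from the retrieved excerpts and keep their ids, so every claim is traceable.
    context = "\n".join(f"[{i}] {report_chunks[i]}" for i in top_ids)
    answer = llm(
        "Answer using only the numbered excerpts below and cite them as [id].\n\n"
        f"{context}\n\nQuestion: {question}"
    )
    return answer, [int(i) for i in top_ids]
```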
Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool
This paper introduces a novel approach to enhance Large Language Models
(LLMs) with expert knowledge to automate the analysis of corporate
sustainability reports by benchmarking them against the Task Force on Climate-related Financial Disclosures (TCFD) recommendations. Corporate
sustainability reports are crucial in assessing organizations' environmental
and social risks and impacts. However, the vast amount of information in these reports often makes human analysis too costly. As a result, only a few
entities worldwide have the resources to analyze these reports, which could
lead to a lack of transparency. While AI-powered tools can automatically
analyze the data, they are prone to inaccuracies as they lack domain-specific
expertise. To address this gap, we enhance LLMs with expert knowledge to automate the analysis of corporate sustainability reports. We
christen our tool CHATREPORT, and apply it in a first use case to assess
corporate climate risk disclosures following the TCFD recommendations.
CHATREPORT results from collaborating with experts in climate science, finance,
economic policy, and computer science, demonstrating how domain experts can be
involved in developing AI tools. We make our prompt templates, generated data,
and scores available to the public to encourage transparency.
Comment: This is a working paper.
ChatClimate: Grounding conversational AI in climate science
Large Language Models have made remarkable progress in question-answering tasks, but challenges like hallucination and outdated information persist. These issues are especially critical in domains like climate change, where timely access to reliable information is vital. One solution is granting these models access to external, scientifically accurate sources to enhance their knowledge and reliability. Here, we enhance GPT-4 by providing access to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change (IPCC AR6), the most comprehensive, up-to-date, and reliable source in this domain (refer to the 'Data Availability' section). We present our conversational AI prototype, available at www.chatclimate.ai, and demonstrate its ability to answer challenging questions in three different setups: (1) GPT-4, (2) ChatClimate, which relies exclusively on IPCC AR6 reports, and (3) Hybrid ChatClimate, which utilizes IPCC AR6 reports with in-house GPT-4 knowledge. The evaluation of answers by experts shows that the hybrid ChatClimate AI assistant provides more accurate responses, highlighting the effectiveness of our solution.
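The three setups can be sketched as variants of one prompt, where the hybrid variant lets the model supplement retrieved IPCC AR6 passages with its own parametric knowledge. `gpt4` and `retrieve_ar6` are hypothetical callables, not the ChatClimate implementation.
```python
# Sketch of the three question-answering setups described above (illustrative placeholders).

def answer(question, gpt4, retrieve_ar6, mode="hybrid"):
    if mode == "gpt4":
        return gpt4(question)                      # setup (1): no grounding
    passages = retrieve_ar6(question)              # retrieved IPCC AR6 excerpts
    if mode == "chatclimate":                      # setup (2): rely exclusively on AR6
        instruction = "Answer strictly and only from the IPCC AR6 passages below."
    else:                                          # setup (3): hybrid, AR6 plus model knowledge
        instruction = ("Prefer the IPCC AR6 passages below, supplementing them with "
                       "your own knowledge only where they are silent.")
    return gpt4(f"{instruction}\n\n{passages}\n\nQuestion: {question}")
```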