Search CORE

13,096 research outputs found

Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Author: Nathani Deepak
Pan Liangming
Saxon Michael
Wang William Yang
Wang Xinyi
Xu Wenda
Publication venue
Publication date: 29/08/2023
Field of study

Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tasks. However, their efficacy is undermined by undesired and inconsistent behaviors, including hallucination, unfaithful reasoning, and toxic content. A promising approach to rectify these flaws is self-correction, where the LLM itself is prompted or guided to fix problems in its own output. Techniques leveraging automated feedback -- either produced by the LLM itself or some external system -- are of particular interest as they are a promising way to make LLM-based solutions more practical and deployable with minimal human feedback. This paper presents a comprehensive review of this emerging class of techniques. We analyze and taxonomize a wide array of recent work utilizing these strategies, including training-time, generation-time, and post-hoc correction. We also summarize the major applications of this strategy and conclude by discussing future directions and challenges.Comment: Work in Progress. Version

arXiv.org e-Print Archive

A solution method for a two-layer sustainable supply chain distribution model

Author: Aksen
Albareda-Sambola
Alumur
Alzaman
Amador-Fontalvo
Arijit Bhattacharya
Bell
Benjaafar
Bin
Caballero
Cappanera
Chan
Chiang
Choi
Choi
Choi
Diabat
Duhamel
Erdoğan
Fleischmann
Fortes
Gendreau
Guide
Guide
Gungor
Gunnarsson
Hassanzadeh
Herrero
Hong
Hwang
Hwang
Jacobsen
Karaoglan
Kulcar
Laporte
Lee
Lin
Lin
Liu
Lopes
Madsen
Marinakis
Marinakis
Melechovský
Min
Nagy
Neto
Nguyen
Noci
Or
P.J. Byrne
Perl
Perl
Prins
Prins
Rao
Russell
Saaty
Saaty
Sahar Validi
Salimifard
Schwardt
Seuring
Srivastava
Stenger
Ting
Tuzun
Validi
Validi
Walker
Walton
Wang
Watson-Gandy
Wiedmann
Wu
Yu
Zhou
Zhu
Publication venue: 'Elsevier BV'
Publication date: 01/02/2015
Field of study

This article presents an effective solution method for a two-layer, NP-hard sustainable supply chain distribution model. A DoE-guided MOGA-II optimiser based solution method is proposed for locating a set of non-dominated solutions distributed along the Pareto frontier. The solution method allows decision-makers to prioritise the realistic solutions, while focusing on alternate transportation scenarios. The solution method has been implemented for the case of an Irish dairy processing industry׳s two-layer supply chain network. The DoE generates 6100 real feasible solutions after 100 generations of the MOGA-II optimiser which are then refined using statistical experimentation. As the decision-maker is presented with a choice of several distribution routes on the demand side of the two-layer network, TOPSIS is applied to rank the set of non-dominated solutions thus facilitating the selection of the best sustainable distribution route. The solution method characterises the Pareto solutions from disparate scenarios through numerical and statistical experimentations. A set of realistic routes from plants to consumers is derived and mapped which minimises total CO2 emissions and costs where it can be seen that the solution method outperforms existing solution methods

Crossref

DCU Online Research Access Service

University of East Anglia digital repository

University of Huddersfield Repository

Leveraging Large Language Models in Conversational Recommender Systems

Author: Ahuja Sameer
Allen David
Chen Zexi
Chu Brian
Friedman Luke
Lara Harsh
Long Changbo
Patel Ajay
Schubiner Gabriel
Sidahmed Hakim
Tan Zhenning
Tiwari Manoj
Xie Jun
Publication venue
Publication date: 16/05/2023
Field of study

A Conversational Recommender System (CRS) offers increased transparency and control to users by enabling them to engage with the system through a real-time multi-turn dialogue. Recently, Large Language Models (LLMs) have exhibited an unprecedented ability to converse naturally and incorporate world knowledge and common-sense reasoning into language understanding, unlocking the potential of this paradigm. However, effectively leveraging LLMs within a CRS introduces new technical challenges, including properly understanding and controlling a complex conversation and retrieving from external sources of information. These issues are exacerbated by a large, evolving item corpus and a lack of conversational data for training. In this paper, we provide a roadmap for building an end-to-end large-scale CRS using LLMs. In particular, we propose new implementations for user preference understanding, flexible dialogue management and explainable recommendations as part of an integrated architecture powered by LLMs. For improved personalization, we describe how an LLM can consume interpretable natural language user profiles and use them to modulate session-level context. To overcome conversational data limitations in the absence of an existing production CRS, we propose techniques for building a controllable LLM-based user simulator to generate synthetic conversations. As a proof of concept we introduce RecLLM, a large-scale CRS for YouTube videos built on LaMDA, and demonstrate its fluency and diverse functionality through some illustrative example conversations

arXiv.org e-Print Archive

Deriving query suggestions for site search

Author: Albakour
Albakour
Albakour
Baeza-Yates
Baeza-Yates
Beitzel
Belkin
Chau
Clark
Di Caro
Dorigo
Dumais
Efthimiadis
Fonseca
Gayo-Avello
Hawking
Jansen
Jansen
Jansen
Jansen
Joachims
Justeson
Kruschwitz
Kruschwitz
Kruschwitz
Kruschwitz
Kruschwitz
Manning
Marchionini
Markey
Martens
Ruthven
Silvestri
Socha
Tunkelang
Wang
White
White
Publication venue: 'Wiley'
Publication date: 01/01/2013
Field of study

Modern search engines have been moving away from simplistic interfaces that aimed at satisfying a user's need with a single-shot query. Interactive features are now integral parts of web search engines. However, generating good query modification suggestions remains a challenging issue. Query log analysis is one of the major strands of work in this direction. Although much research has been performed on query logs collected on the web as a whole, query log analysis to enhance search on smaller and more focused collections has attracted less attention, despite its increasing practical importance. In this article, we report on a systematic study of different query modification methods applied to a substantial query log collected on a local website that already uses an interactive search engine. We conducted experiments in which we asked users to assess the relevance of potential query modification suggestions that have been constructed using a range of log analysis methods and different baseline approaches. The experimental results demonstrate the usefulness of log analysis to extract query modification suggestions. Furthermore, our experiments demonstrate that a more fine-grained approach than grouping search requests into sessions allows for extraction of better refinement terms from query log files. © 2013 ASIS&T

University of Essex Research Repository

CiteSeerX

Crossref

University of Regensburg Publication Server

Open Research Online (The Open University)

CHORUS Deliverable 3.3: Vision Document - Intermediate version

Author: Boujemaa Nozha
Compañó Ramón
Dosch Christoph
Gouraud Henri
Karlgren Jussi
King Paul
Köhler Joachim
Lowen David
Nesvadba Jan
Ortgies Robert
Proidl Adolf
van der Linden Pieter
Publication venue: Chorus Project Consortium
Publication date: 01/01/2008
Field of study

The goal of the CHORUS vision document is to create a high level vision on audio-visual search engines in order to give guidance to the future R&D work in this area (in line with the mandate of CHORUS as a Coordination Action). This current intermediate draft of the CHORUS vision document (D3.3) is based on the previous CHORUS vision documents D3.1 to D3.2 and on the results of the six CHORUS Think-Tank meetings held in March, September and November 2007 as well as in April, July and October 2008, and on the feedback from other CHORUS events. The outcome of the six Think-Thank meetings will not just be to the benefit of the participants which are stakeholders and experts from academia and industry – CHORUS, as a coordination action of the EC, will feed back the findings (see Summary) to the projects under its purview and, via its website, to the whole community working in the domain of AV content search. A few subjections of this deliverable are to be completed after the eights (and presumably last) Think-Tank meeting in spring 2009

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

DISCRETIZED GEOMETRIC APPROACHES TO THE ANALYSIS OF PROTEIN STRUCTURES

Author: Holland John
Publication venue: Dartmouth Digital Commons
Publication date: 29/09/2021
Field of study

Proteins play crucial roles in a variety of biological processes. While we know that their amino acid sequence determines their structure, which in turn determines their function, we do not know why particular sequences fold into particular structures. My work focuses on discretized geometric descriptions of protein structure—conceptualizing native structure space as composed of mostly discrete, geometrically defined fragments—to better understand the patterns underlying why particular sequence elements correspond to particular structure elements. This discretized geometric approach is applied to multiple levels of protein structure, from conceptualizing contacts between residues as interactions between discrete structural elements to treating protein structures as an assembly of discrete fragments. My earlier work focused on better understanding inter-residue contacts and estimating their energies statistically. By scoring structures with energies derived from a stricter notion of contact, I show that native protein structures can be identified out of a set of decoy structures more often than when using energies derived from traditional definitions of contact and how this has implications for the evaluation of predictions that rely on structurally defined contacts for validation. Demonstrating how useful simple geometric descriptors of structure can be, I then show that these energies identify native structures on par with well-validated, detailed, atomistic energy functions. Moving to a higher level of structure, in my later work I demonstrate that discretized, geometrically defined structural fragments make good objects for the interactive assembly of protein backbones and present a software application which lets users do so. Finally, I use these fragments to generate structure-conditioned statistical energies, generalizing the classic idea of contact energies by incorporating specific structural context, enabling these energies to reflect the interaction geometries they come from. These structure-conditioned energies contain more information about native sequence preferences, correlate more highly with experimentally determined energies, and show that pairwise sequence preferences are tightly coupled to their structural context. Considered jointly, these projects highlight the degree to which protein structures and the interactions they comprise can be understood as geometric elements coming together in finely tuned ways

Dartmouth Digital Commons (Dartmouth College)

CHORUS Deliverable 2.2: Second report - identification of multi-disciplinary key issues for gap analysis toward EU multimedia search engines roadmap

Author: Bardeli Rolf
Boujemaa Nozha
Compañó Ramón
Doch Christoph
Geurts Joost
Gouraud Henri
Joly Alexis
Karlgren Jussi
King Paul
Kompatsiaris Yiannis
Köhler Joachim
Le Moine Jean-Yves
Ortgies Robert
Point Jean-Charles
Rotenberg Boris
Rudström Åsa
Schreer Oliver
Sebe Nicu
Snoek Cees
Publication venue: Chorus Project Consortium
Publication date: 01/01/2008
Field of study

After addressing the state-of-the-art during the first year of Chorus and establishing the existing landscape in multimedia search engines, we have identified and analyzed gaps within European research effort during our second year. In this period we focused on three directions, notably technological issues, user-centred issues and use-cases and socio- economic and legal aspects. These were assessed by two central studies: firstly, a concerted vision of functional breakdown of generic multimedia search engine, and secondly, a representative use-cases descriptions with the related discussion on requirement for technological challenges. Both studies have been carried out in cooperation and consultation with the community at large through EC concertation meetings (multimedia search engines cluster), several meetings with our Think-Tank, presentations in international conferences, and surveys addressed to EU projects coordinators as well as National initiatives coordinators. Based on the obtained feedback we identified two types of gaps, namely core technological gaps that involve research challenges, and “enablers”, which are not necessarily technical research challenges, but have impact on innovation progress. New socio-economic trends are presented as well as emerging legal challenges

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

Flexible Protein-Protein Docking

Author: Martin Zacharias
Sebastian Schneider
Publication venue: 'IntechOpen'
Publication date: 21/10/2011
Field of study

IntechOpen