809 research outputs found

    A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future

    Full text link
    Chain-of-thought reasoning, a cognitive process fundamental to human intelligence, has garnered significant attention in the realm of artificial intelligence and natural language processing. However, there still remains a lack of a comprehensive survey for this arena. To this end, we take the first step and present a thorough survey of this research field carefully and widely. We use X-of-Thought to refer to Chain-of-Thought in a broad sense. In detail, we systematically organize the current research according to the taxonomies of methods, including XoT construction, XoT structure variants, and enhanced XoT. Additionally, we describe XoT with frontier applications, covering planning, tool use, and distillation. Furthermore, we address challenges and discuss some future directions, including faithfulness, multi-modal, and theory. We hope this survey serves as a valuable resource for researchers seeking to innovate within the domain of chain-of-thought reasoning.Comment: 26 pages. Resources are available at https://github.com/zchuz/CoT-Reasoning-Surve

    Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

    Full text link
    Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tasks. However, their efficacy is undermined by undesired and inconsistent behaviors, including hallucination, unfaithful reasoning, and toxic content. A promising approach to rectify these flaws is self-correction, where the LLM itself is prompted or guided to fix problems in its own output. Techniques leveraging automated feedback -- either produced by the LLM itself or some external system -- are of particular interest as they are a promising way to make LLM-based solutions more practical and deployable with minimal human feedback. This paper presents a comprehensive review of this emerging class of techniques. We analyze and taxonomize a wide array of recent work utilizing these strategies, including training-time, generation-time, and post-hoc correction. We also summarize the major applications of this strategy and conclude by discussing future directions and challenges.Comment: Work in Progress. Version

    Unleashing the potential of prompt engineering in Large Language Models: a comprehensive review

    Full text link
    This paper delves into the pivotal role of prompt engineering in unleashing the capabilities of Large Language Models (LLMs). Prompt engineering is the process of structuring input text for LLMs and is a technique integral to optimizing the efficacy of LLMs. This survey elucidates foundational principles of prompt engineering, such as role-prompting, one-shot, and few-shot prompting, as well as more advanced methodologies such as the chain-of-thought and tree-of-thoughts prompting. The paper sheds light on how external assistance in the form of plugins can assist in this task, and reduce machine hallucination by retrieving external knowledge. We subsequently delineate prospective directions in prompt engineering research, emphasizing the need for a deeper understanding of structures and the role of agents in Artificial Intelligence-Generated Content (AIGC) tools. We discuss how to assess the efficacy of prompt methods from different perspectives and using different methods. Finally, we gather information about the application of prompt engineering in such fields as education and programming, showing its transformative potential. This comprehensive survey aims to serve as a friendly guide for anyone venturing through the big world of LLMs and prompt engineering

    Does Thinking in Opposites in Order to Think Differently Improve Creativity?

    Get PDF
    In this paper, we focus on the link between thinking in opposites and creativity. Thinking in opposites requires an intuitive, productive strategy, which may enhance creativity. Given the importance of creativity for the well-being of individuals and society, finding new ways to enhance it represents a valuable goal in both professional and personal contexts. We discuss the body of evidence that exists concerning the importance of the first representation of the structure of a problem to be solved, which determines the baseline representation and sets limits on the area within which a problem solver will explore. We then review a variety of interventions described in the literature on creativity and insight problem solving that were designed to overcome fixedness and encourage people to move away from stereotypical solutions. Special attention is paid to the research carried out in the context of problem solving, which provides evidence that prompting people to "think in opposites" is beneficial. We suggest that an extended investigation of the effects of this strategy in various types of tasks related to creativity is an interesting line of research to follow. We discuss the rationale supporting this claim and identify specific questions, both theoretical and methodological, for future research to address

    Current and Future Challenges in Knowledge Representation and Reasoning

    Full text link
    Knowledge Representation and Reasoning is a central, longstanding, and active area of Artificial Intelligence. Over the years it has evolved significantly; more recently it has been challenged and complemented by research in areas such as machine learning and reasoning under uncertainty. In July 2022 a Dagstuhl Perspectives workshop was held on Knowledge Representation and Reasoning. The goal of the workshop was to describe the state of the art in the field, including its relation with other areas, its shortcomings and strengths, together with recommendations for future progress. We developed this manifesto based on the presentations, panels, working groups, and discussions that took place at the Dagstuhl Workshop. It is a declaration of our views on Knowledge Representation: its origins, goals, milestones, and current foci; its relation to other disciplines, especially to Artificial Intelligence; and on its challenges, along with key priorities for the next decade

    Scalable software framework for real-time data processing in the railway environment

    Get PDF
    Background: Ticks are obligate haematophagous ectoparasites of vertebrates and frequently parasitize avian species that can carry them across continents during their long-distance migrations. Ticks may have detrimental effects on the health state of their avian hosts, which can be either directly caused by blood-draining or mediated by microbial pathogens transmitted during the blood meal. Indeed, ticks host complex microbial communities, including bacterial pathogens and symbionts. Midichloria bacteria (Rickettsiales) are widespread tick endosymbionts that can be transmitted to vertebrate hosts during the tick bite, inducing an antibody response. Their actual role as infectious/pathogenic agents is, however, unclear. Methods: We screened for Midichloria DNA African ticks and blood samples collected from trans-Saharan migratory songbirds at their arrival in Europe during spring migration. Results: Tick infestation rate was 5.7%, with most ticks belonging to the Hyalomma marginatum species complex. Over 90% of Hyalomma ticks harboured DNA of Midichloria bacteria belonging to the monophylum associated with ticks. Midichloria DNA was detected in 43% of blood samples of avian hosts. Tick-infested adult birds were significantly more likely to test positive to the presence of Midichloria DNA than non-infested adults and second-year individuals, suggesting a long-term persistence of these bacteria within avian hosts. Tick parasitism was associated with a significantly delayed timing of spring migration of avian hosts but had no significant effects on body condition, whereas blood Midichloria DNA presence negatively affected fat deposits of tick-infested avian hosts. Conclusions: Our results show that ticks effectively transfer Midichloria bacteria to avian hosts, supporting the hypothesis that they are infectious to vertebrates. Bird infection likely enhances the horizontal spread of these bacteria across haematophagous ectoparasite populations. Moreover, we showed that Midichloria and tick parasitism have detrimental non-independent effects on avian host health during migration, highlighting the complexity of interactions involving ticks, their vertebrate hosts, and tick-borne bacteria

    Individual differences and strategies for human reasoning

    Get PDF
    Theories of human reasoning have tended to assume cognitive universality, i. e. that all individuals reason in basically the same way. However, some research (e. g. that of Ford. 1995) has found evidence of individual differences in the strategies people use for syllogistic reasoning. This thesis presents a series of experiments which aimed to identify individual differences in strategies for human reasoning and investigate their nature and aetiology. Experiment 1 successfully replicated and extended Ford (1995) and provided further evidence that most individuals prefer to reason with either verbal-propositional or visuo-spatial representations. Data from verbal and written protocols showed that verbal reasoners tended to use a method of substitution whereby they obtain a value for the common term from one premise and then simply substitute it in the other premise to obtain a conclusion. Spatial reasoners, on the other hand, presented protocols which resembled Euler circles and described the syllogistic premises in terms of sets and subsets. Experiment 2 provided some further qualitative evidence about the nature of such strategies, especially the verbal reasoners, showing that within strategy variations occurred. Experiment 3 extended this line of research, identifying a strong association between verbal and spatial strategies for syllogistic reasoning and abstract and concrete strategies for transitive inference (the latter having originally been identified by Egan and Grimes- Farrow, 1982). Experiments 1-3 also showed that inter-strategic differences in accuracy are generally not observed, hence, reasoners present an outward appearance of ubiquity despite underlying differences in reasoning processes. Experiments 5 and 6 investigated individual differences in cognitive factors which may underpin strategy preference. Whilst no apparent effects of verbal and spatial ability or cognitive style were found, reasoners did appear to draw differentially on the verbal and spatial components of working memory. Confirmatory factor analysis showed that whilst verbal reasoners draw primarily on the verbal memory resource, spatial reasoners draw both on this and on spatial resource. Overall, these findings have important implications for theories of human reasoning, which need to take into account possible individual differences in strategies if they are to present a truly comprehensive account of how people reason.Economic and Social Research Counci

    Interactive Theorem Proving with Indexed Formulas

    Get PDF
    Since more than two decades research in interactive theorem proving (ITP) has attracted growing interest. The primary application domains for ITPs range from hard- and software verification tools to mathematical tutor systems. To support communication with the user in an adequate way these systems depend on calculi that allow for the construction of human understandable and readable proofs. However, most calculi that are used in current ITPs fall still short of supporting the user in an optimal way. The reason is that they enforce the user to construct proofs at a level that is far more detailed than the one that can be found in human constructed proofs. Autexier [Aut03] has recently proposed a new theorem proving framework that allows to model different logics and calculi in an uniform way. In CORE, a proof-state is always represented as a single formula that can be manipulated by the application of replacement rules that are generated from the logical context of the subformula under transformation. This approach also facilitates proof construction at the assertion level which is considered as more closely matching the level at which humans construct proofs (see for instance [Hua94]). Together with COREs window inference technique this makes CORE a potentially well suited basis for interactive theorem proving. This thesis tries to excerpt COREs potential for interactive theorem proving by mapping important concepts of the established proof system ΩMEGA to CORE. A task structure is developed to present the context of a subformula in an intuitive way to the user and to assist him in structuring proofs. The development of a method interpreter makes it possible to specify abstract inference steps declaratively and to encode proof strategies for the use in CORE. The adaptation of ΩMEGAS agent-based suggestion mechanism ΩANTS to CORE helps the user with the identification of applicable methods and replacement rules.Interaktives Theorembeweisen hat in den letzten zwei Jahrzehnten zunehmend an Bedeutung gewonnen. Die Anwendungsbereiche von Systemen mit denen sich Beweise interaktiv führen lassen reichen von der Hard- und Software Verifikation bis zu mathematischen Tutor-Systemen. Die genannten Anwendungsgebiete machen es erforderlich das der Anwender bei der Beweisführung adequat untersützt wird. Um eine entsprechende Kommunikation mit dem Benutzer zu ermöglichen verwenden interaktive Beweissysteme Kalküle, in denen Beweise in einer für den Benutzer nachvollziehbaren Art und Weise, geführt werden können. Trotzdem kann man noch nicht davon sprechen, dass interaktive Beweiser den Benutzer optimal unterstützen. Der Hauptrgund hierfür ist, dass die eingesetzten Kalküle automatisch dazu führen, dass Beweise auf einer viel detailliertern Ebene geführt werden müssen, als man typischerweise in einem mathematischen Beweis finden würde. Autexier [Aut03]hat kürzlich eine neue logische Umgebung für die Beweissuche entwickelt, welche es ermöglicht verschiedene Logiken und Kalküle einheitlich zu modellieren. In CORE ist ein Beweiszustand immer als eine einzige Formel repräsentiert, welche durch das Anwenden von Ersetzungsregeln, die aus dem Kontext einer Teilformel abgeleitet werden, transformiert werden kann. Diese Herangehensweise erleichtert es auch, Beweise auf der sogenannten Assertion-Ebene (vgl. [Hua94]) zu führen, welche allgemein als eine natürlichere Ebene für die Beweisführung angesehen wird. Zusammen mit der Window-Inferenz Technik, die von CORE unterstützt, wird stellt CORE ein System dar das potentiell als eine verbesserte Grundlage für die interaktive Beweissuche angesehen werden kann. In dieser Arbeit geht es darum, das Potential von CORE im Hinblick auf die interaktive Beweiskonstruktion auszunutzen. Dieses geschieht zum einen dadurch, dass etablierte Konzepte im Bereich des interaktiven Beweisens, wie sie auch im Beweis-System ΩMEGA verwendet werden, auf das CORE-System abgebildet werden. Desweiteren wird eine Task-Srukture entwickelt, die es zum einen ermöglicht, den logischen Kontext einer Teilformel für den Benutzer verständlich aufzubereiten und darzustellen; zum anderen untersützt sie den Anwender auch darin, Beweise strukturiert zu führen. Der Entwurf eines Methoden-Interpreters ermöglicht es, abstrakte Beweisschritte zu kodieren und im System anzuwenden. Die Anpassung des Vorschlags-Mechanismus ΩANTS an CORE stellt eine weitere Unterstützung für den Benutzer bereit, indem sie automatisch Vorschläge über mögliche Fortsetzungen eines Beweises generiert
    corecore