6 research outputs found

    SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition

    Full text link
    Distantly-Supervised Named Entity Recognition effectively alleviates the burden of time-consuming and expensive annotation in the supervised setting. But the context-free matching process and the limited coverage of knowledge bases introduce inaccurate and incomplete annotation noise respectively. Previous studies either considered only incomplete annotation noise or indiscriminately handle two types of noise with the same strategy. In this paper, we argue that the different causes of two types of noise bring up the requirement of different strategies in model architecture. Therefore, we propose the SANTA to handle these two types of noise separately with (1) Memory-smoothed Focal Loss and Entity-aware KNN to relieve the entity ambiguity problem caused by inaccurate annotation, and (2) Boundary Mixup to alleviate decision boundary shifting problem caused by incomplete annotation and a noise-tolerant loss to improve the robustness. Benefiting from our separate tailored strategies, we confirm in the experiment that the two types of noise are well mitigated. SANTA also achieves a new state-of-the-art on five public datasets.Comment: Findings of ACL202

    Distantly-Supervised Named Entity Recognition with Uncertainty-aware Teacher Learning and Student-student Collaborative Learning

    Full text link
    Distantly-Supervised Named Entity Recognition (DS-NER) effectively alleviates the burden of annotation, but meanwhile suffers from the label noise. Recent works attempt to adopt the teacher-student framework to gradually refine the training labels and improve the overall robustness. However, we argue that these teacher-student methods achieve limited performance because poor network calibration produces incorrectly pseudo-labeled samples, leading to error propagation. Therefore, we attempt to mitigate this issue by proposing: (1) Uncertainty-aware Teacher Learning that leverages the prediction uncertainty to guide the selection of pseudo-labels, avoiding the number of incorrect pseudo-labels in the self-training stage. (2) Student-student Collaborative Learning that allows the transfer of reliable labels between two student networks instead of completely relying on all pseudo-labels from its teacher. Meanwhile, this approach allows a full exploration of mislabeled samples rather than simply filtering unreliable pseudo-labeled samples. Extensive experimental results on five DS-NER datasets demonstrate that our method is superior to state-of-the-art teacher-student methods

    Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond

    Full text link
    In this study, we explore the potential of Multimodal Large Language Models (MLLMs) in improving embodied decision-making processes for agents. While Large Language Models (LLMs) have been widely used due to their advanced reasoning skills and vast world knowledge, MLLMs like GPT4-Vision offer enhanced visual understanding and reasoning capabilities. We investigate whether state-of-the-art MLLMs can handle embodied decision-making in an end-to-end manner and whether collaborations between LLMs and MLLMs can enhance decision-making. To address these questions, we introduce a new benchmark called PCA-EVAL, which evaluates embodied decision-making from the perspectives of Perception, Cognition, and Action. Additionally, we propose HOLMES, a multi-agent cooperation framework that allows LLMs to leverage MLLMs and APIs to gather multimodal information for informed decision-making. We compare end-to-end embodied decision-making and HOLMES on our benchmark and find that the GPT4-Vision model demonstrates strong end-to-end embodied decision-making abilities, outperforming GPT4-HOLMES in terms of average decision accuracy (+3%). However, this performance is exclusive to the latest GPT4-Vision model, surpassing the open-source state-of-the-art MLLM by 26%. Our results indicate that powerful MLLMs like GPT4-Vision hold promise for decision-making in embodied agents, offering new avenues for MLLM research. Code and data are open at https://github.com/pkunlp-icler/PCA-EVAL/.Comment: FMDM@NeurIPS2023, Code and data: https://github.com/pkunlp-icler/PCA-EVAL

    Human-in-the-Loop through Chain-of-Thought

    Full text link
    While the emergence of powerful language models along with Chain-of-thought prompting has made automation more and more omnipresent, it sometimes demonstrates its weakness in long-term or multi-step logical reasoning. For example, users don't always get desirable answers for complex mathematical problems without human involvement. Against this background, we present the Manual Correction System (MCS) -- a human-in-the-loop system enhanced by Chain-of-Thought prompting, which explores how manual correction of sub-logics in rationales can improve LLM's reasoning performance. Moving one step forward, considering a system with human-in-the-loop involves more than having humans improve performance but also controlling the cost. Therefore, we post a Cost-utility Analysis Model for Human-in-the-Loop systems (CAMLOP) based on classical economics theory to analyze, quantify and balance the utility and the corresponding cost. We conduct experiments of MCS and CAMLOP with twelve datasets. A significant advantage w.r.t cost and utility proves its superiority over strong baselines

    What you saw a while ago determines what you see now: The effect and temporal dynamics of awareness priming on implicit behavior

    No full text
    The perceptual content (e.g., seen a happy vs. seen a sad face) and also subjective visibility (e.g., whether the stimulus is visible or not) of a given (liminal) stimulus is influenced by the history of previously consciously experienced stimuli. This effect on subjective visibility, termed awareness priming, suggests that findings from a large body of literature on unconscious processing might be confounded by conscious awareness. However, those literatures on unconscious processing used implicit behavioral measures of unconscious processing. The challenge is only valid if previous visible stimuli (not just physically salient) also affect implicit behavior, e.g., response priming. Here, we used Continuous Flash Suppression (CFS) to probe the limits and temporal dynamics of awareness priming effect. We showed that prior conscious exposure to two Chinese words increases both visibility and discrimination accuracy, and also improves the response priming of words presented just at the visibility threshold. A correlation analysis revealed that this effect is only driven by the high visibility of the previous stimuli but not high physical saliency. Our results strongly validated the challenge from awareness priming to the literature on unconscious processing. Moreover, we found a different temporal dynamic for how previous visible exposure to a word affects current perception: previous short-term exposure (1-10 back trials) to a visible word only enhances discrimination accuracy of the same word in the current trial, whereas long-term exposure (10-30 back trials) exclusively elevates visibility. This novel finding suggests that areas higher in the processing hierarchy, with larger temporal receptive field, contribute to consciousness, while areas lower in the cortical hierarchy contribute to objective discrimination

    Manganese Oxide/Carbon Yolk–Shell Nanorod Anodes for High Capacity Lithium Batteries

    No full text
    Transition metal oxides have attracted much interest for their high energy density in lithium batteries. However, the fast capacity fading and the low power density still limit their practical implementation. In order to overcome these challenges, one-dimensional yolk–shell nanorods have been successfully constructed using manganese oxide as an example through a facile two-step sol–gel coating method. Dopamine and tetraethoxysilane are used as precursors to obtain uniform polymer coating and silica layer followed by converting into carbon shell and hollow space, respectively. As anode material for lithium batteries, the manganese oxide/carbon yolk–shell nanorod electrode has a reversible capacity of 660 mAh/g for initial cycle at 100 mA/g and exhibits excellent cyclability with a capacity of 634 mAh/g after 900 cycles at a current density of 500 mA/g. An enhanced capacity is observed during the long-term cycling process, which may be attributed to the structural integrity, the stability of solid electrolyte interphase layer, and the electrochemical actuation of the yolk–shell nanorod structure. The results demonstrate that the manganese oxide is well utilized with the one-dimensional yolk–shell structure, which represents an efficient way to realize excellent performance for practical applications
    corecore