282 research outputs found
Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models
Retrieval-augmented language models (RALMs) represent a substantial
advancement in the capabilities of large language models, notably in reducing
factual hallucination by leveraging external knowledge sources. However, the
reliability of the retrieved information is not always guaranteed. The
retrieval of irrelevant data can lead to misguided responses, and potentially
causing the model to overlook its inherent knowledge, even when it possesses
adequate information to address the query. Moreover, standard RALMs often
struggle to assess whether they possess adequate knowledge, both intrinsic and
retrieved, to provide an accurate answer. In situations where knowledge is
lacking, these systems should ideally respond with "unknown" when the answer is
unattainable. In response to these challenges, we introduces Chain-of-Noting
(CoN), a novel approach aimed at improving the robustness of RALMs in facing
noisy, irrelevant documents and in handling unknown scenarios. The core idea of
CoN is to generate sequential reading notes for retrieved documents, enabling a
thorough evaluation of their relevance to the given question and integrating
this information to formulate the final answer. We employed ChatGPT to create
training data for CoN, which was subsequently trained on an LLaMa-2 7B model.
Our experiments across four open-domain QA benchmarks show that RALMs equipped
with CoN significantly outperform standard RALMs. Notably, CoN achieves an
average improvement of +7.9 in EM score given entirely noisy retrieved
documents and +10.5 in rejection rates for real-time questions that fall
outside the pre-training knowledge scope.Comment: Preprin
A New Type of Crumb Rubber Asphalt Mixture: A Dry Process Design and Performance Evaluation
To obtain a crumb rubber asphalt mixture with excellent performance, this study combined trans-polyoctenamer rubber (TOR), crumb rubber, and other additives to establish a new type of crumb rubber (CRT). The objective of this study was to design and evaluate the road performance of the new type of crumb rubber asphalt mixture (CRTAM) with a skeleton dense texture through a dry process. First, the skeleton intrusion compact volume method was used to optimize the grading of coarse and fine aggregates, and the design of the CRTAM gradation was carried out through the same and unequal volume replacement grading method. Then, three types of road performance were analyzed: high-temperature stability, low-temperature crack resistance, and water stability. The results showed that 2% and 2.5% CRT met a low-temperature index with equal volume substitution, and the six gradations obtained by unequal volume replacement with 2% CRT complied with the requirements of a skeleton dense texture. When the substitution ratio was 1.5 and 0.5, the high-temperature performance was better. In addition, when the substitution ratio was 0.5, the flexural strain energy density was the highest and the low-temperature performance was the best. Including considerations of economic benefits, it is recommended that the CRT content be 2% and the substitution ratio be 0.5
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
Current captioning approaches tend to generate correct but "generic"
descriptions that lack real-world knowledge, e.g., named entities and
contextual information. Considering that Vision-Language Pre-Training (VLP)
models master massive such knowledge from large-scale web-harvested data, it is
promising to utilize the generalizability of VLP models to incorporate
knowledge into image descriptions. However, using VLP models faces challenges:
zero-shot inference suffers from knowledge hallucination that leads to
low-quality descriptions, but the generic bias in downstream task fine-tuning
hinders the VLP model from expressing knowledge. To address these concerns, we
propose a simple yet effective method called Knowledge-guided Replay
(K-Replay), which enables the retention of pre-training knowledge during
fine-tuning. Our approach consists of two parts: (1) a knowledge prediction
task on automatically collected replay exemplars to continuously awaken the VLP
model's memory about knowledge, thus preventing the model from collapsing into
the generic pattern; (2) a knowledge distillation constraint to improve the
faithfulness of generated descriptions hence alleviating the knowledge
hallucination. To evaluate knowledge-enhanced descriptions, we construct a
novel captioning benchmark KnowCap, containing knowledge of landmarks, famous
brands, special foods and movie characters. Experimental results show that our
approach effectively incorporates knowledge into descriptions, outperforming
strong VLP baseline by 20.9 points (78.7->99.6) in CIDEr score and 20.5
percentage points (34.0%->54.5%) in knowledge recognition accuracy. Our code
and data is available at https://github.com/njucckevin/KnowCap.Comment: Accepted at ACM Multimedia (ACMMM) 202
Spin Fluctuation Induced Linear Magnetoresistance in Ultrathin Superconducting FeSe Films
The discovery of high-temperature superconductivity in FeSe/STO has trigged
great research interest to reveal a range of exotic physical phenomena in this
novel material. Here we present a temperature dependent magnetotransport
measurement for ultrathin FeSe/STO films with different thickness and
protection layers. Remarkably, a surprising linear magnetoresistance (LMR) is
observed around the superconducting transition temperatures but absent
otherwise. The experimental LMR can be reproduced by magnetotransport
calculations based on a model of magnetic field dependent disorder induced by
spin fluctuation. Thus, the observed LMR in coexistence with superconductivity
provides the first magnetotransport signature for spin fluctuation around the
superconducting transition region in ultrathin FeSe/STO films
Legal Decision-making for Highway Automated Driving
Compliance with traffic laws is a fundamental requirement for human drivers
on the road, and autonomous vehicles must adhere to traffic laws as well.
However, current autonomous vehicles prioritize safety and collision avoidance
primarily in their decision-making and planning, which will lead to
misunderstandings and distrust from human drivers and may even result in
accidents in mixed traffic flow. Therefore, ensuring the compliance of the
autonomous driving decision-making system is essential for ensuring the safety
of autonomous driving and promoting the widespread adoption of autonomous
driving technology. To this end, the paper proposes a trigger-based layered
compliance decision-making framework. This framework utilizes the decision
intent at the highest level as a signal to activate an online violation monitor
that identifies the type of violation committed by the vehicle. Then, a
four-layer architecture for compliance decision-making is employed to generate
compliantly trajectories. Using this system, autonomous vehicles can detect and
correct potential violations in real-time, thereby enhancing safety and
building public confidence in autonomous driving technology. Finally, the
proposed method is evaluated on the DJI AD4CHE highway dataset under four
typical highway scenarios: speed limit, following distance, overtaking, and
lane-changing. The results indicate that the proposed method increases the
vehicle's overall compliance rate from 13.85% to 84.46%, while reducing the
proportion of active violations to 0%, demonstrating its effectiveness.Comment: 14 pages, 17 figure
- …