217 research outputs found
Compressing Long Context for Enhancing RAG with AMR-based Concept Distillation
Large Language Models (LLMs) have made significant strides in information
acquisition. However, their overreliance on potentially flawed parametric
knowledge leads to hallucinations and inaccuracies, particularly when handling
long-tail, domain-specific queries. Retrieval Augmented Generation (RAG)
addresses this limitation by incorporating external, non-parametric knowledge.
Nevertheless, the retrieved long-context documents often contain noisy,
irrelevant information alongside vital knowledge, negatively diluting LLMs'
attention. Inspired by the supportive role of essential concepts in
individuals' reading comprehension, we propose a novel concept-based RAG
framework with the Abstract Meaning Representation (AMR)-based concept
distillation algorithm. The proposed algorithm compresses the cluttered raw
retrieved documents into a compact set of crucial concepts distilled from the
informative nodes of AMR by referring to reliable linguistic features. The
concepts explicitly constrain LLMs to focus solely on vital information in the
inference process. We conduct extensive experiments on open-domain
question-answering datasets to empirically evaluate the proposed method's
effectiveness. The results indicate that the concept-based RAG framework
outperforms other baseline methods, particularly as the number of supporting
documents increases, while also exhibiting robustness across various backbone
LLMs. This emphasizes the distilled concepts are informative for augmenting the
RAG process by filtering out interference information. To the best of our
knowledge, this is the first work introducing AMR to enhance the RAG,
presenting a potential solution to augment inference performance with
semantic-based context compression
BiHRNet: A Binary high-resolution network for Human Pose Estimation
Human Pose Estimation (HPE) plays a crucial role in computer vision
applications. However, it is difficult to deploy state-of-the-art models on
resouce-limited devices due to the high computational costs of the networks. In
this work, a binary human pose estimator named BiHRNet(Binary HRNet) is
proposed, whose weights and activations are expressed as 1. BiHRNet
retains the keypoint extraction ability of HRNet, while using fewer computing
resources by adapting binary neural network (BNN). In order to reduce the
accuracy drop caused by network binarization, two categories of techniques are
proposed in this work. For optimizing the training process for binary pose
estimator, we propose a new loss function combining KL divergence loss with
AWing loss, which makes the binary network obtain more comprehensive output
distribution from its real-valued counterpart to reduce information loss caused
by binarization. For designing more binarization-friendly structures, we
propose a new information reconstruction bottleneck called IR Bottleneck to
retain more information in the initial stage of the network. In addition, we
also propose a multi-scale basic block called MS-Block for information
retention. Our work has less computation cost with few precision drop.
Experimental results demonstrate that BiHRNet achieves a PCKh of 87.9 on the
MPII dataset, which outperforms all binary pose estimation networks. On the
challenging of COCO dataset, the proposed method enables the binary neural
network to achieve 70.8 mAP, which is better than most tested lightweight
full-precision networks.Comment: 12 pages, 6 figure
LLaMA-E: Empowering E-commerce Authoring with Object-Interleaved Instruction Following
E-commerce authoring entails creating engaging, diverse, and targeted content
to enhance preference elicitation and retrieval experience. While Large
Language Models (LLMs) have revolutionized content generation, they often fall
short in e-commerce applications due to their limited memorization of
domain-specific features. This paper proposes LLaMA-E, the unified e-commerce
authoring models that address the contextual preferences of customers, sellers,
and platforms, the essential objects in e-commerce operation. We design the
instruction set derived from tasks of ads generation, query-enhanced product
title rewriting, product classification, purchase intent speculation, and
general e-commerce Q&A. The instruction formulation ensures the interleaved
cover of the presented and required object features, allowing the alignment of
base models to parameterise e-commerce knowledge comprehensively. The proposed
LLaMA-E models achieve state-of-the-art evaluation performance and exhibit the
advantage in zero-shot practical applications. To our knowledge, this is the
first LLM tailored to empower authoring applications with comprehensive
scenario understanding by integrating features focused on participated objects
Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework
Long-range imaging inevitably suffers from atmospheric turbulence with severe geometric distortions due to random refraction of light. The further the distance, the more severe the disturbance. Despite existing research has achieved great progress in tackling short-range turbulence, there is less attention paid to long-range turbulence with significant distortions. To address this dilemma and advance the field, we construct a large-scale real long-range atmospheric turbulence dataset (RLR-AT), including 1500 turbulence sequences spanning distances from 1 Km to 13 Km. The advantages of RLR-AT compared to existing ones: turbulence with longer-distances and higher-diversity, scenes with greater-variety and larger-scale. Moreover, most existing work adopts either registration-based or decomposition-based methods to address distortions through one-step mitigation. However, they fail to effectively handle long-range turbulence due to its significant pixel displacements. In this work, we propose a coarse-to-fine framework to handle severe distortions, which cooperates dynamic turbulence and static background priors (CDSP). On the one hand, we discover the pixel motion statistical prior of turbulence, and propose a frequency-aware reference frame for better large-scale distortion registration, greatly reducing the burden of refinement. On the other hand, we take advantage of the static prior of background, and propose a subspace-based low-rank tensor refinement model to eliminate the misalignments inevitably left by registration while well preserving details. The dynamic and static priors complement to each other, facilitating us to progressively mitigate long-range turbulence with severe distortions. Extensive experiments demonstrate that the proposed method outperforms SOTA methods on different datasets.This paper is accepted by ECCV 202
IgM kappa proliferative glomerulonephritis with monoclonal immunoglobulin deposition complicated with nocardiosis dermatitis: a case report and review of literature
RationaleMonoclonal gammopathy of renal significance (MGRS) represents a group of disorders caused by monoclonal immunoglobulin (M protein) secreted by B cells or plasma cells. Proliferative glomerulonephritis with monoclonal immunoglobulin deposition (PGNMID) is a glomerular disease and a form of MGRS. Here, we presented a rare case of a patient with IgM kappa PGNMID complicated with nocardiosis dermatitis.Patient concerns and diagnosesA 56-year-old man was admitted to the hospital because of cutaneous purpura and proteinuria. His initial pathological diagnosis indicated membranous proliferative glomerulonephritis, IgM(++), and subacute interstitial nephritis. Based on further examination, he was finally diagnosed to have IgM kappa PGNMID and subacute interstitial nephritis. After the initial diagnosis, the patient received hormonal therapy. During the treatment, nocardiosis dermatitis emerged as a complication, and the hormonal therapy was gradually reduced. The patient refused further treatment with rituximab, and his health is currently stable.OutcomesIgM kappa PGNMID complicated with nocardiosis dermatitis is an extremely rare occurrence. Laboratory examination and pathological analysis are required to confirm the diagnosis of this disorder. Timely and accurate diagnosis is essential for the appropriate treatment of PGNMID
Reconstruction of primary vertices at the ATLAS experiment in Run 1 proton–proton collisions at the LHC
This paper presents the method and performance of primary vertex reconstruction in proton–proton collision data recorded by the ATLAS experiment during Run 1 of the LHC. The studies presented focus on data taken during 2012 at a centre-of-mass energy of √s=8 TeV. The performance has been measured as a function of the number of interactions per bunch crossing over a wide range, from one to seventy. The measurement of the position and size of the luminous region and its use as a constraint to improve the primary vertex resolution are discussed. A longitudinal vertex position resolution of about 30μm is achieved for events with high multiplicity of reconstructed tracks. The transverse position resolution is better than 20μm and is dominated by the precision on the size of the luminous region. An analytical model is proposed to describe the primary vertex reconstruction efficiency as a function of the number of interactions per bunch crossing and of the longitudinal size of the luminous region. Agreement between the data and the predictions of this model is better than 3% up to seventy interactions per bunch crossing
Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector
A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and miss- ing transverse momentum are considered. The analysis uses 36.1 fb−1 of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are in- terpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour- neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross- section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour- charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements
A genome-wide association study identifies GRK5 and RASGRP1 as type 2 diabetes loci in Chinese Hans.
Substantial progress has been made in identification of type 2 diabetes (T2D) risk loci in the past few years, but our understanding of the genetic basis of T2D in ethnically diverse populations remains limited. We performed a genome-wide association study and a replication study in Chinese Hans comprising 8,569 T2D case subjects and 8,923 control subjects in total, from which 10 single nucleotide polymorphisms were selected for further follow-up in a de novo replication sample of 3,410 T2D case and 3,412 control subjects and an in silico replication sample of 6,952 T2D case and 11,865 control subjects. Besides confirming seven established T2D loci (CDKAL1, CDKN2A/B, KCNQ1, CDC123, GLIS3, HNF1B, and DUSP9) at genome-wide significance, we identified two novel T2D loci, including G-protein-coupled receptor kinase 5 (GRK5) (rs10886471: P = 7.1 × 10(-9)) and RASGRP1 (rs7403531: P = 3.9 × 10(-9)), of which the association signal at GRK5 seems to be specific to East Asians. In nondiabetic individuals, the T2D risk-increasing allele of RASGRP1-rs7403531 was also associated with higher HbA(1c) and lower homeostasis model assessment of β-cell function (P = 0.03 and 0.0209, respectively), whereas the T2D risk-increasing allele of GRK5-rs10886471 was also associated with higher fasting insulin (P = 0.0169) but not with fasting glucose. Our findings not only provide new insights into the pathophysiology of T2D, but may also shed light on the ethnic differences in T2D susceptibility
Potential of Core-Collapse Supernova Neutrino Detection at JUNO
JUNO is an underground neutrino observatory under construction in Jiangmen, China. It uses 20kton liquid scintillator as target, which enables it to detect supernova burst neutrinos of a large statistics for the next galactic core-collapse supernova (CCSN) and also pre-supernova neutrinos from the nearby CCSN progenitors. All flavors of supernova burst neutrinos can be detected by JUNO via several interaction channels, including inverse beta decay, elastic scattering on electron and proton, interactions on C12 nuclei, etc. This retains the possibility for JUNO to reconstruct the energy spectra of supernova burst neutrinos of all flavors. The real time monitoring systems based on FPGA and DAQ are under development in JUNO, which allow prompt alert and trigger-less data acquisition of CCSN events. The alert performances of both monitoring systems have been thoroughly studied using simulations. Moreover, once a CCSN is tagged, the system can give fast characterizations, such as directionality and light curve
- …
