266 research outputs found

    Benchmarking Large Language Models in Complex Question Answering Attribution using Knowledge Graphs

    Full text link
    The attribution of question answering is to provide citations for supporting generated statements, and has attracted wide research attention. The current methods for automatically evaluating the attribution, which are often based on Large Language Models (LLMs), are still inadequate, particularly in recognizing subtle differences between attributions, and complex relationships between citations and statements. To compare these attribution evaluation methods and develop new ones, we introduce a set of fine-grained categories (i.e., supportive, insufficient, contradictory and irrelevant) for measuring the attribution, and develop a Complex Attributed Question Answering (CAQA) benchmark by leveraging knowledge graphs (KGs) for automatically generating attributions of different categories to question-answer pairs. Our analysis reveals that existing evaluators perform poorly under fine-grained attribution settings and exhibit weaknesses in complex citation-statement reasoning. Our CAQA benchmark, validated with human annotations, emerges as a promising tool for selecting and developing LLM attribution evaluators.Comment: 13 pages, 5 figure

    Learn from Yesterday: A Semi-Supervised Continual Learning Method for Supervision-Limited Text-to-SQL Task Streams

    Full text link
    Conventional text-to-SQL studies are limited to a single task with a fixed-size training and test set. When confronted with a stream of tasks common in real-world applications, existing methods struggle with the problems of insufficient supervised data and high retraining costs. The former tends to cause overfitting on unseen databases for the new task, while the latter makes a full review of instances from past tasks impractical for the model, resulting in forgetting of learned SQL structures and database schemas. To address the problems, this paper proposes integrating semi-supervised learning (SSL) and continual learning (CL) in a stream of text-to-SQL tasks and offers two promising solutions in turn. The first solution Vanilla is to perform self-training, augmenting the supervised training data with predicted pseudo-labeled instances of the current task, while replacing the full volume retraining with episodic memory replay to balance the training efficiency with the performance of previous tasks. The improved solution SFNet takes advantage of the intrinsic connection between CL and SSL. It uses in-memory past information to help current SSL, while adding high-quality pseudo instances in memory to improve future replay. The experiments on two datasets shows that SFNet outperforms the widely-used SSL-only and CL-only baselines on multiple metrics.Comment: Accepted by AAAI-202

    Real-time Local Feature with Global Visual Information Enhancement

    Full text link
    Local feature provides compact and invariant image representation for various visual tasks. Current deep learning-based local feature algorithms always utilize convolution neural network (CNN) architecture with limited receptive field. Besides, even with high-performance GPU devices, the computational efficiency of local features cannot be satisfactory. In this paper, we tackle such problems by proposing a CNN-based local feature algorithm. The proposed method introduces a global enhancement module to fuse global visual clues in a light-weight network, and then optimizes the network by novel deep reinforcement learning scheme from the perspective of local feature matching task. Experiments on the public benchmarks demonstrate that the proposal can achieve considerable robustness against visual interference and meanwhile run in real time.Comment: 6 pages, 5 figures, 2 tables. Accepted by ICIEA 202

    Study on the biomechanical properties of 3D printed blended esophageal stents with different structural parameters based on patient CT

    Get PDF
    Introduction: Esophageal stenting is a widely used treatment for esophageal diseases, which can also be used for adjuvant therapy and feeding after chemotherapy for esophageal cancer. The structural parameters of the stent have a significant impact on its mechanical properties and patient comfort.Methods: In the present work, we reconstructed the esophagus model based on the patient’s computed tomography (CT) data, and designed stents with different structural parameters. We used 3D printing technology to achieve rapid production of the designed stents by using Thermoplastic polyurethane (TPU)/Poly-ε-caprolactone (PCL) blends as the materials. The mechanical properties and effects on the esophagus of polymer stents with four different structural parameters of diameter, wall thickness, length and flaring were investigated by in vitro tests of radial compression and migration of the stents, as well as by finite element simulations of the stent implantation process in the esophagus and of the stent migration process. An artificial neural network model was established to predict the radial force of the stent and the maximum equivalent stress of the esophagus during implantation based on these four structural parameters.Results: The results show that wall thickness was the structural parameter that had the greatest impact on the radial force of the stent (statistically significant, p < 0.01), and flaring was the structural parameter that had the greatest impact on the maximum equivalent stress of the esophageal wall after stent implantation (statistically significant, p < 0.01). No. 6 stent had a maximum radial force of 18.07 N, which exceeded that of commercial esophageal stents and had good mechanical properties. And the maximum equivalent force on the esophagus caused by its implantation was only 30.39 kPa, which can improve patient comfort. The predicted values of the constructed back propagation (BP) neural network model had an error of less than 10% from the true values, and the overall prediction accuracies were both above 97%, which can provide guidance for optimizing the design of the stent and for clinical research.Discussion: 3D printing technology presents a wide range of applications for the rapid fabrication of personalized TPU/PCL blend stents that are more suitable for individual patients

    MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model

    Full text link
    In the realm of data-driven AI technology, the application of open-source large language models (LLMs) in robotic task planning represents a significant milestone. Recent robotic task planning methods based on open-source LLMs typically leverage vast task planning datasets to enhance models' planning abilities. While these methods show promise, they struggle with complex long-horizon tasks, which require comprehending more context and generating longer action sequences. This paper addresses this limitation by proposing MLDT, theMulti-Level Decomposition Task planning method. This method innovatively decomposes tasks at the goal-level, task-level, and action-level to mitigate the challenge of complex long-horizon tasks. In order to enhance open-source LLMs' planning abilities, we introduce a goal-sensitive corpus generation method to create high-quality training data and conduct instruction tuning on the generated corpus. Since the complexity of the existing datasets is not high enough, we construct a more challenging dataset, LongTasks, to specifically evaluate planning ability on complex long-horizon tasks. We evaluate our method using various LLMs on four datasets in VirtualHome. Our results demonstrate a significant performance enhancement in robotic task planning, showcasing MLDT's effectiveness in overcoming the limitations of existing methods based on open-source LLMs as well as its practicality in complex, real-world scenarios

    Geochemical characteristics of dissolved heavy metals in Zhujiang River, Southwest China: spatial-temporal distribution, source, export flux estimation, and a water quality assessment

    Get PDF
    To investigate the sources and spatial-temporal distribution of dissolved heavy metals in river water, and to evaluate the water quality, a total of 162 water samples were collected from 81 key sampling points in high and low flow seasons separately in the Zhujiang River, Southwest China. Ten dissolved heavy metals (V, Cr, Mn, Co, Ni, Cu, Mo, Cd, Ba, and Pb) in the Zhujiang River water exhibit little variation at temporal scale, but vary with a significant spatial heterogeneity. Furthermore, different metals present different variation trends along the main channel of the Zhujiang River. Our results suggest that Ba (14.72 μg L−1 in low flow season and 12.50 μg L−1 in high flow season) and Cr (6.85 μg L−1 in low flow season and 7.52 μg L−1 in high flow season) are consistently the most abundant metals in the two sampling periods. According to the water quality index (WQI values ranged from 1.3 to 43.9) and health risk assessment, metals investigated in Zhujiang River are below the hazard level (all hazard index (HI) < 1). Application of statistical approaches, including correlation matrix and principal component analysis (PCA), identify three principal components that account for 61.74% of the total variance, the results conclude that the anthropogenic heavy metals (V, Cr, Ni, and Cu) are greatly impacted by the dilution effect, and the heavy metals in Zhujiang River are mainly presented a natural sources signature from the perspective of entire basin. Moreover, our results reveal that the estimated export budget of several heavy metals including V (735.6 t year−1), Cr (1,561.1 t year−1), Ni (498.2 t year−1), and Mo (118.9 t year−1) to the ocean are higher than the world average

    Direct observation of nucleation in the bulk of an opaque sample

    Get PDF
    Remarkably little is known about the physical phenomena leading to nucleation of new perfect crystals within deformed metals during annealing, in particular how and where volumes with nearly perfect lattices evolve from structures filled with dislocations, and how local variations at the micrometer length scale affect this nucleation process. We present here the first experimental measurements that relate directly nucleation of recrystallization to the local deformation microstructure in the bulk of a sample of cold rolled aluminum, further deformed locally by a hardness indentation. White beam differential aperture X-ray microscopy is used for the measurements, allowing us to map a selected gauge volume in the bulk of the sample in the deformed state, then anneal the sample and map the exact same gauge volume in the annealed state. It is found that nuclei develop at sites of high stored energy and they have crystallographic orientations from those present in the deformed state. Accordingly we suggest that for each nucleus the embryonic volume arises from a structural element contained within the voxels identified with the same orientation. Possible nucleation mechanisms are discussed and the growth potentials of the nuclei are also analyzed and discussed

    Boron neutron capture therapy induces cell cycle arrest and cell apoptosis of glioma stem/progenitor cells in vitro

    Get PDF
    BACKGROUND: Glioma stem cells in the quiescent state are resistant to clinical radiation therapy. An almost inevitable glioma recurrence is due to the persistence of these cells. The high linear energy transfer associated with boron neutron capture therapy (BNCT) could kill quiescent and proliferative cells. METHODS: The present study aimed to evaluate the effects of BNCT on glioma stem/progenitor cells in vitro. The damage induced by BNCT was assessed using cell cycle progression, apoptotic cell ratio and apoptosis-associated proteins expression. RESULTS: The surviving fraction and cell viability of glioma stem/progenitor cells were decreased compared with differentiated glioma cells using the same boronophenylalanine pretreatment and the same dose of neutron flux. BNCT induced cell cycle arrest in the G2/M phase and cell apoptosis via the mitochondrial pathway, with changes in the expression of associated proteins. CONCLUSIONS: Glioma stem/progenitor cells, which are resistant to current clinical radiotherapy, could be effectively killed by BNCT in vitro via cell cycle arrest and apoptosis using a prolonged neutron irradiation, although radiosensitivity of glioma stem/progenitor cells was decreased compared with differentiated glioma cells when using the same dose of thermal neutron exposure and boronophenylalanine pretreatment. Thus, BNCT could offer an appreciable therapeutic advantage to prevent tumor recurrence, and may become a promising treatment in recurrent glioma
    • …
    corecore