49 research outputs found

    Reverse Chain: A Generic-Rule for LLMs to Master Multi-API Planning

    While enabling large language models (LLMs) to call functions (exposed as APIs) can greatly enhance their performance, function calling remains challenging because of the complicated relations between different APIs, especially in an in-context learning setting without fine-tuning. This paper proposes a simple yet controllable target-driven approach called Reverse Chain to empower LLMs to use external APIs with prompts alone. Given that most open-source LLMs have limited tool-use or tool-planning capabilities, LLMs in Reverse Chain are employed only for simple tasks, e.g., API selection and argument completion, and a generic rule is employed to carry out controllable multi-function calling. In this generic rule, after an LLM selects a final API to handle a given task, we first ask the LLM to fill the required arguments from the user query and context. Missing arguments can be completed by letting the LLM select another API based on the API descriptions before asking the user. This process continues until the given task is completed. Extensive numerical experiments indicate an impressive capability of Reverse Chain in implementing multi-function calling. Interestingly, the experiments also reveal that the tool-use capabilities of existing LLMs, e.g., ChatGPT, can be greatly improved via Reverse Chain.
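
    The backward, target-driven rule described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the API registry, the function names, and the two stand-in functions that replace the LLM calls (`select_api`, `fill_argument`) are all hypothetical, and matching an API by the value it returns is an assumption made only to keep the example deterministic.

```python
from typing import Dict, List, Optional

# Hypothetical API registry (not from the paper): each API lists the
# arguments it requires and the value it produces.
APIS: Dict[str, dict] = {
    "book_flight": {"args": ["city_code", "date"], "returns": "booking_id"},
    "lookup_city_code": {"args": ["city_name"], "returns": "city_code"},
}


def select_api(goal: str) -> Optional[str]:
    """Stand-in for the LLM's API-selection step: pick an API that returns `goal`."""
    for name, spec in APIS.items():
        if spec["returns"] == goal:
            return name
    return None


def fill_argument(arg: str, context: Dict[str, str]) -> Optional[str]:
    """Stand-in for LLM argument completion from the user query and context."""
    return context.get(arg)


def reverse_chain(goal: str, context: Dict[str, str],
                  plan: Optional[List[str]] = None) -> List[str]:
    """Plan backward from the API that produces `goal`.

    Returns the API names in the order they would have to be called.
    Note: `context` is updated in place with placeholders for API outputs.
    """
    plan = [] if plan is None else plan
    api = select_api(goal)
    if api is None:
        # No API can supply this value, so the user would have to be asked.
        raise LookupError(f"no API produces {goal!r}; ask the user")
    for arg in APIS[api]["args"]:
        if fill_argument(arg, context) is None:
            # Missing argument: recurse to find an API that can supply it.
            reverse_chain(arg, context, plan)
            context[arg] = f"<output of {plan[-1]}>"
    plan.append(api)
    return plan


if __name__ == "__main__":
    query_context = {"city_name": "Paris", "date": "2024-05-01"}
    print(reverse_chain("booking_id", query_context))
```

    In this sketch the final API is chosen first and prerequisite APIs are discovered only when one of its arguments cannot be filled from the context, which mirrors the order of steps the abstract describes.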

    Modeling and Calibration of Gaia, Hipparcos, and Tycho-2 astrometric data for the detection of dark companions

    © 2024 The Author(s). Published by the American Astronomical Society. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY), https://creativecommons.org/licenses/by/4.0/
    Hidden within the Gaia satellite’s multiple data releases lies a valuable cache of dark companions. To facilitate the efficient and reliable detection of these companions via combined analyses involving the Gaia, Hipparcos, and Tycho-2 catalogs, we introduce an astrometric modeling framework. This method applies analytical least-squares minimization and nonlinear parameter optimization techniques to a set of common calibration sources across the different space-based astrometric catalogs. This enables us to discern the error inflation, astrometric jitter, differential parallax zero-points, and frame rotation of various catalogs relative to Gaia Data Release 3 (DR3). Our findings yield the most precise Gaia DR2 calibration parameters to date, revealing notable dependencies on magnitude and color. Intriguingly, we identify submilliarcsecond frame rotation between Gaia DR1 and DR3, along with an estimated astrometric jitter of 2.16 mas for the revised Hipparcos catalog. In a thorough comparative analysis with previous studies, we offer recommendations on calibrating and utilizing different catalogs for companion detection. Furthermore, we provide a user-friendly pipeline (https://github.com/ruiyicheng/Download_HIP_Gaia_GOST) for catalog download and bias correction, enhancing accessibility and usability within the scientific community. Peer reviewed

    Revised orbits of the two nearest Jupiters

    With its near-to-mid-infrared high contrast imaging capabilities, JWST is ushering us into a golden age of directly imaging Jupiter-like planets. As the two closest cold Jupiters, $\varepsilon$ Ind A b and $\varepsilon$ Eridani b have sufficiently wide orbits and adequate infrared emissions to be detected by JWST. To detect more Jupiter-like planets for direct imaging, we develop a GOST-based method to analyze radial velocity data and multiple Gaia data releases simultaneously. Without approximating instantaneous astrometry by catalog astrometry, this approach enables the use of multiple Gaia data releases for detection of both short-period and long-period planets. We determine a mass of $2.96_{-0.38}^{+0.41}$ $M_{\rm Jup}$ and a period of $42.92_{-4.09}^{+6.38}$ yr for $\varepsilon$ Ind A b. We also find a mass of $0.76_{-0.11}^{+0.14}$ $M_{\rm Jup}$, a period of $7.36_{-0.05}^{+0.04}$ yr, and an eccentricity of $0.26_{-0.04}^{+0.04}$ for $\varepsilon$ Eridani b. The eccentricity differs from that given by some previous solutions, probably due to the sensitivity of orbital eccentricity to noise modeling. Our work refines the constraints on the orbits and masses of the two nearest Jupiters and demonstrates the feasibility of using multiple Gaia data releases to constrain Jupiter-like planets. Comment: 14 pages, 5 figures, 4 tables, accepted for publication in MNRAS

    TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models

    Aligned large language models (LLMs) demonstrate exceptional capabilities in task-solving, following instructions, and ensuring safety. However, the continual learning aspect of these aligned LLMs has been largely overlooked. Existing continual learning benchmarks lack sufficient challenge for leading aligned LLMs, owing to both their simplicity and the models' potential exposure during instruction tuning. In this paper, we introduce TRACE, a novel benchmark designed to evaluate continual learning in LLMs. TRACE consists of 8 distinct datasets spanning challenging tasks including domain-specific tasks, multilingual capabilities, code generation, and mathematical reasoning. All datasets are standardized into a unified format, allowing for effortless automatic evaluation of LLMs. Our experiments show that after training on TRACE, aligned LLMs exhibit significant declines in both general ability and instruction-following capabilities. For example, the accuracy of llama2-chat 13B on the gsm8k dataset declined precipitously from 28.8% to 2% after training on our datasets. This highlights the challenge of finding a suitable tradeoff between achieving performance on specific tasks and preserving the original prowess of LLMs. Empirical findings suggest that tasks inherently equipped with reasoning paths contribute significantly to preserving certain capabilities of LLMs against potential declines. Motivated by this, we introduce the Reasoning-augmented Continual Learning (RCL) approach. RCL integrates task-specific cues with meta-rationales, effectively reducing catastrophic forgetting in LLMs while expediting convergence on novel tasks.

    Binary Star Evolution in Different Environments: Filamentary, Fractal, Halo and Tidal-tail Clusters

    Using membership of 85 open clusters from previous studies (Pang et al. 2021a,b, 2022b; Li et al. 2021) based on Gaia DR3 data, we identify binary candidates in the color-magnitude diagram, for systems with mass ratio q > 0.4. The binary fraction is corrected for incompleteness at different distances due to the Gaia angular resolution limit. We find a decreasing binary fraction with increasing cluster age, with substantial scatter. For clusters with a total mass > 200 $M_\odot$, the binary fraction is independent of cluster mass. The binary fraction depends strongly on stellar density. Among four types of cluster environments, the lowest-density filamentary and fractal stellar groups have the highest mean binary fraction: 23.6% and 23.2%, respectively. The mean binary fraction in tidal-tail clusters is 20.8%, and is lowest in the densest halo-type clusters: 14.8%. We find clear evidence of early disruptions of binary stars in the cluster sample. The radial binary fraction depends strongly on the cluster-centric distance across all four types of environments, with the smallest binary fraction within the half-mass radius $r_h$, and increasing towards a few $r_h$. Only hints of mass segregation are found in the target clusters. The observed amount of mass segregation is not significant enough to generate a global effect inside the target clusters. We evaluate the bias of unresolved binary systems (assuming a primary mass of 1 $M_\odot$) in 1D tangential velocity, which is 0.1-1 $\rm km\,s^{-1}$. Further studies are required to characterize the internal star cluster kinematics using Gaia proper motions.

    A novel liposomal S-propargyl-cysteine: a sustained release of hydrogen sulfide reducing myocardial fibrosis via TGF-β1/Smad pathway

    Purpose: S-propargyl-cysteine (SPRC; alternatively known as ZYZ-802) is a novel modulator of endogenous tissue H2S concentrations with known cardioprotective and anti-inflammatory effects. However, its rapid metabolism and excretion have limited its clinical application. To overcome these issues, we have developed novel liposomal carriers to deliver ZYZ-802 to cells and tissues and have characterized their physicochemical, morphological and pharmacological properties. Methods: Two liposomal formulations of ZYZ-802 were prepared by thin-layer hydration, and the morphological characteristics of each liposome system were assessed using a laser particle size analyzer and transmission electron microscopy. The entrapment efficiency and ZYZ-802 release profiles were determined following ultrafiltration centrifugation, dialysis tube and HPLC measurements. LC-MS/MS was used to evaluate the pharmacokinetic parameters and tissue distribution profiles of each formulation via measurements of plasma and tissue ZYZ-802 and H2S concentrations. Using an in vivo model of heart failure (HF), the cardioprotective effects of the liposomal carrier were determined by echocardiography, histopathology, western blot and the assessment of antioxidant and myocardial fibrosis markers. Results: Both liposomal formulations improved ZYZ-802 pharmacokinetics and optimized H2S concentrations in plasma and tissues. Liposomal ZYZ-802 showed enhanced cardioprotective effects in vivo. Importantly, liposomal ZYZ-802 could inhibit myocardial fibrosis via the inhibition of the TGF-β1/Smad signaling pathway. Conclusion: The liposomal formulations of ZYZ-802 have enhanced pharmacokinetic and pharmacological properties in vivo. This work is the first report to describe the development of liposomal formulations to improve the sustained release of H2S within tissues. Keywords: Liposome; S-Propargyl-cysteine (SPRC, ZYZ-802); Hydrogen sulfide; Heart failure; Myocardial fibrosis; TGF-β1/Smad pathway

    PgtE Enzyme of Salmonella enterica Shares the Similar Biological Roles to Plasminogen Activator (Pla) in Interacting With DEC-205 (CD205), and Enhancing Host Dissemination and Infectivity by Yersinia pestis

    Yersinia pestis, the cause of plague, is a newly evolved Gram-negative bacterium. Through the acquisition of the plasminogen activator (Pla), Y. pestis gained the means to rapidly disseminate throughout its mammalian hosts. It was suggested that Y. pestis utilizes Pla to interact with the DEC-205 (CD205) receptor on antigen-presenting cells (APCs) to initiate host dissemination and infection. However, the evolutionary origin of Pla has not been fully elucidated. The PgtE enzyme of Salmonella enterica, involved in host dissemination, shows sequence similarity with the Y. pestis Pla. In this study, we demonstrated that both Escherichia coli K-12 and Y. pestis bacteria expressing the PgtE protein were able to interact with primary alveolar macrophages and DEC-205-transfected CHO cells. The interaction between PgtE-expressing bacteria and DEC-205-expressing transfectants could be inhibited by the application of an anti-DEC-205 antibody. Moreover, PgtE-expressing Y. pestis partially regained the ability to promote host dissemination and infection. In conclusion, the DEC-205-PgtE interaction plays a role in promoting the dissemination and infection of Y. pestis, suggesting that Pla and the PgtE of S. enterica might share a common evolutionary origin. Peer reviewed

    Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector

    A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and missing transverse momentum are considered. The analysis uses 36.1 fb$^{-1}$ of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are interpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour-neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross-section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour-charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements.

    Measurement of jet fragmentation in Pb+Pb and pp collisions at $\sqrt{s_\mathrm{NN}} = 2.76$ TeV with the ATLAS detector at the LHC


    Search for single production of vector-like quarks decaying into Wb in pp collisions at $\sqrt{s} = 8$ TeV with the ATLAS detector
