534 research outputs found

    OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

    Full text link
    Nowadays, open-source large language models like LLaMA have emerged. Recent developments have incorporated supervised fine-tuning (SFT) and reinforcement learning fine-tuning (RLFT) to align these models with human goals. However, SFT methods treat all training data with mixed quality equally, while RLFT methods require high-quality pairwise or ranking-based preference data. In this study, we present a novel framework, named OpenChat, to advance open-source language models with mixed-quality data. Specifically, we consider the general SFT training data, consisting of a small amount of expert data mixed with a large proportion of sub-optimal data, without any preference labels. We propose the C(onditioned)-RLFT, which regards different data sources as coarse-grained reward labels and learns a class-conditioned policy to leverage complementary data quality information. Interestingly, the optimal policy in C-RLFT can be easily solved through single-stage, RL-free supervised learning, which is lightweight and avoids costly human preference labeling. Through extensive experiments on three standard benchmarks, our openchat-13b fine-tuned with C-RLFT achieves the highest average performance among all 13b open-source language models. Moreover, we use AGIEval to validate the model generalization performance, in which only openchat-13b surpasses the base model. Finally, we conduct a series of analyses to shed light on the effectiveness and robustness of OpenChat. Our code, data, and models are publicly available at https://github.com/imoneoi/openchat

    SKA sensitivity for possible radio emission from dark matter in Omega Centauri

    Full text link
    Omega Centauri, the largest known globular cluster in the Milky Way, is believed to be the remains of a dwarf galaxy's core. Giving its potential abundance of dark matter (DM), it is an attractive target for investigating the nature of this elusive substance in our local environment. Our study demonstrates that by observing Omega Centauri with the SKA for 1000 hours, we can detect synchrotron radio or Inverse Compton (IC) emissions from the DM annihilation products. It enables us to constrain the cross-section of DM annihilation down to ∼10−30 cm3 s−1\sim {\rm 10^{-30}~cm^3~s^{-1}} for DM mass from several GeV\rm{GeV} to 100 GeV\rm{100~GeV}, which is much stronger compared with other observations. Additionally, we explore the axion, another well-motivated DM candidate, and provide stimulated decay calculations. It turns out that the sensitivity can reach gaγγ∼10−10 GeV−1g_{\rm{a\gamma\gamma}} \sim 10^{-10} ~\rm{GeV^{-1}} for 2×10−7 eV<ma<2×10−4 eV2\times 10^{-7} ~\rm{eV} < m_a < 2\times 10^{-4} ~\rm{eV}.Comment: 19 pages, 8 figure

    Towards Congestion Management in Distribution Networks:a Dutch Case Study on Increasing Heat Pump Hosting Capacity

    Get PDF
    The current high gas prices motivate end-users to replace their gas heating with electric heat pumps. This will likely cause frequent congestion issues in low-voltage (LV) distribution grids and slow down the heat pump adoption rate. To avoid or defer the expensive and complicated grid expansion, this study shares a solution approach of a Dutch Distribution System Operator (DSO) to enable the increasing adoption of heat pumps in existing dense housing areas. Data of the DSO and a local housing company have been combined to investigate the heat pump hosting capacity on a dense urban LV feeder, including realistic data of grid topology, load and heat dynamics, and practical operating characteristics of heat pumps. Our simulation compares two control strategies: (1) individual peak shaving and (2) central optimal power flow control. We show the central optimal power flow control with end-users' thermal comfort constraints and an objective function of minimizing losses can smoothen total grid loading and lead to flat voltage profiles. This allows the approach to be robust against baseload forecast errors, while the individual peak shaving is more prone to such errors. Moreover, by simulating the strategies on the worst-case scenarios where heat pumps are allocated to end-users at the end of the feeder, we determine the individual peak shaving strategy can slightly increase the heat pump hosting capacity from 49% where no control is imposed to 51%, while the central optimal power flow control allows 100% heat pump connections without causing grid congestion. Finally, recommendations to increase the heat pump hosting capacity are given based on simulation results

    Electromagnetic Spatiotemporal Differentiators

    Full text link
    Spatiotemporal optical computing devices which could perform mathematical operations in both spatial and temporal domains can provide unprecedented measures to build efficient and real-time information processing systems. It is particularly important to realize the comprehensive functions in a compact design for better integration with electronic components. In this work, we experimentally demonstrated an analogue spatiotemporal differentiator in microwaves based on an asymmetrical metasurface which has a phase singularity in the spatiotemporal domain. We showed that this structure could give rise to a spatiotemporal transfer function required by an ideal first-order differentiator in both spatial and temporal domains by tailoring the unidirectional excitation of spoof surface plasmon polaritons (SSPPs). The spatial edge detection was performed utilizing a metallic slit and the temporal differentiation capability of the device was examined by Gaussian-like temporal pulses of different width. We further confirmed the differentiator demonstrated here could detect sharp changes of spatiotemporal pulses even with intricate profiles and theoretically estimated the resolution limits of the spatial and temporal edge detection. We also show that the pulse input after passing the spatiotemporal differentiator implemented here could carry a transverse orbital angular momentum (OAM) with a fractal topology charge which further increases the information quantity.Comment: 6 figure

    Soliton Solutions on Noncommutative Orbifold $ T^2/Z_4

    Full text link
    In this paper, we explicitly construct a series of projectors on integral noncommutative orbifold T2/Z4T^2/Z_4 by extended GHSGHS constrution. They include integration of two arbitary functions with Z4Z_4 symmetry. Our expressions possess manifest Z4Z_{4} symmetry. It is proved that the expression include all projectors with minimal trace and in their standard expansions, the eigen value functions of coefficient operators are continuous with respect to the arguments kk and qq. Based on the integral expression, we alternately show the derivative expression in terms of the similar kernal to the integral one.Since projectors correspond to soliton solutions of the field theory on the noncommutative orbifold, we thus present a series of corresponding solitons.Comment: 18 pages, no figure; referrences adde

    Review of Recent Developments in Technical Control Approaches for Voltage and Congestion Management in Distribution Networks

    Get PDF
    The increasing installation of distributed energy resources in residential households is causing frequent voltage and congestion issues in low- and medium-voltage electrical networks. To defer or avoid the costly and complicated grid expansion, technical, pricing-based, and market-based approaches have been proposed in the literature. These approaches can help distribution system operators (DSOs) exploit flexible resources to manage their grids. This study focuses on technical control approaches, which are easier to implement, and provides an up-to-date review of their developments in modeling, solution approaches, and innovative applications facilitating indirect control from DSOs. Challenges and future research directions are also discussed

    Genomic insight into diet adaptation in the biological control agent Cryptolaemus montrouzieri

    Get PDF
    Background: The ladybird beetle Cryptolaemus montrouzieri Mulsant, 1853 (Coleoptera, Coccinellidae) is used worldwide as a biological control agent. It is a predator of various mealybug pests, but it also feeds on alternative prey and can be reared on artificial diets. Relatively little is known about the underlying genetic adaptations of its feeding habits. Results: We report the first high-quality genome sequence for C. montrouzieri. We found that the gene families encoding chemosensors and digestive and detoxifying enzymes among others were significantly expanded or contracted in C. montrouzieri in comparison to published genomes of other beetles. Comparisons of diet-specific larval development, survival and transcriptome profiling demonstrated that differentially expressed genes on unnatural diets as compared to natural prey were enriched in pathways of nutrient metabolism, indicating that the lower performance on the tested diets was caused by nutritional deficiencies. Remarkably, the C. montrouzieri genome also showed a significant expansion in an immune effector gene family. Some of the immune effector genes were dramatically downregulated when larvae were fed unnatural diets. Conclusion: We suggest that the evolution of genes related to chemosensing, digestion, and detoxification but also immunity might be associated with diet adaptation of an insect predator. These findings help explain why this predatory ladybird has become a successful biological control agent and will enable the optimization of its mass rearing and use in biological control programs

    Measuring Unified Dark Matter with 3D Cosmic Shear

    Full text link
    We present parameter estimation forecasts for future 3D cosmic shear surveys for a class of Unified Dark Matter (UDM) models, where a single scalar field mimics both Dark Matter (DM) and Dark Energy (DE). These models have the advantage that they can describe the dynamics of the Universe with a single matter component providing an explanation for structure formation and cosmic acceleration. A crucial feature of the class of UDM models we use in this work is characterised by a parameter, c_inf (in units of the speed of light c=1), that is the value of the sound speed at late times, and on which structure formation depends. We demonstrate that the properties of the DM-like behaviour of the scalar field can be estimated with very high precision with large-scale, fully 3D weak lensing surveys. We found that 3D weak lensing significantly constrains c_inf, and we find minimal errors 0.00003, for the fiducial value c_inf=0.001, and 0.000026, for c_inf=0.012. Moreover, we compute the Bayesian evidence for UDM models over the LCDM model as a function of c_inf. For this purpose, we can consider the LCDM model as a UDM model with c_inf=0. We find that the expected evidence clearly shows that the survey data would unquestionably favour UDM models over the LCDM model, for the values c_inf>0.001.Comment: 12 pages, 5 figures; revisions on Fisher matrix analysis and Bayesian model selection according to the reviewer's comments; MNRAS (in press
    • …
    corecore