534 research outputs found
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data
Nowadays, open-source large language models like LLaMA have emerged. Recent
developments have incorporated supervised fine-tuning (SFT) and reinforcement
learning fine-tuning (RLFT) to align these models with human goals. However,
SFT methods treat all training data with mixed quality equally, while RLFT
methods require high-quality pairwise or ranking-based preference data. In this
study, we present a novel framework, named OpenChat, to advance open-source
language models with mixed-quality data. Specifically, we consider the general
SFT training data, consisting of a small amount of expert data mixed with a
large proportion of sub-optimal data, without any preference labels. We propose
the C(onditioned)-RLFT, which regards different data sources as coarse-grained
reward labels and learns a class-conditioned policy to leverage complementary
data quality information. Interestingly, the optimal policy in C-RLFT can be
easily solved through single-stage, RL-free supervised learning, which is
lightweight and avoids costly human preference labeling. Through extensive
experiments on three standard benchmarks, our openchat-13b fine-tuned with
C-RLFT achieves the highest average performance among all 13b open-source
language models. Moreover, we use AGIEval to validate the model generalization
performance, in which only openchat-13b surpasses the base model. Finally, we
conduct a series of analyses to shed light on the effectiveness and robustness
of OpenChat. Our code, data, and models are publicly available at
https://github.com/imoneoi/openchat
SKA sensitivity for possible radio emission from dark matter in Omega Centauri
Omega Centauri, the largest known globular cluster in the Milky Way, is
believed to be the remains of a dwarf galaxy's core. Giving its potential
abundance of dark matter (DM), it is an attractive target for investigating the
nature of this elusive substance in our local environment. Our study
demonstrates that by observing Omega Centauri with the SKA for 1000 hours, we
can detect synchrotron radio or Inverse Compton (IC) emissions from the DM
annihilation products. It enables us to constrain the cross-section of DM
annihilation down to for DM mass from several
to , which is much stronger compared with other
observations. Additionally, we explore the axion, another well-motivated DM
candidate, and provide stimulated decay calculations. It turns out that the
sensitivity can reach for
.Comment: 19 pages, 8 figure
Towards Congestion Management in Distribution Networks:a Dutch Case Study on Increasing Heat Pump Hosting Capacity
The current high gas prices motivate end-users to replace their gas heating with electric heat pumps. This will likely cause frequent congestion issues in low-voltage (LV) distribution grids and slow down the heat pump adoption rate. To avoid or defer the expensive and complicated grid expansion, this study shares a solution approach of a Dutch Distribution System Operator (DSO) to enable the increasing adoption of heat pumps in existing dense housing areas. Data of the DSO and a local housing company have been combined to investigate the heat pump hosting capacity on a dense urban LV feeder, including realistic data of grid topology, load and heat dynamics, and practical operating characteristics of heat pumps. Our simulation compares two control strategies: (1) individual peak shaving and (2) central optimal power flow control. We show the central optimal power flow control with end-users' thermal comfort constraints and an objective function of minimizing losses can smoothen total grid loading and lead to flat voltage profiles. This allows the approach to be robust against baseload forecast errors, while the individual peak shaving is more prone to such errors. Moreover, by simulating the strategies on the worst-case scenarios where heat pumps are allocated to end-users at the end of the feeder, we determine the individual peak shaving strategy can slightly increase the heat pump hosting capacity from 49% where no control is imposed to 51%, while the central optimal power flow control allows 100% heat pump connections without causing grid congestion. Finally, recommendations to increase the heat pump hosting capacity are given based on simulation results
Electromagnetic Spatiotemporal Differentiators
Spatiotemporal optical computing devices which could perform mathematical
operations in both spatial and temporal domains can provide unprecedented
measures to build efficient and real-time information processing systems. It is
particularly important to realize the comprehensive functions in a compact
design for better integration with electronic components. In this work, we
experimentally demonstrated an analogue spatiotemporal differentiator in
microwaves based on an asymmetrical metasurface which has a phase singularity
in the spatiotemporal domain. We showed that this structure could give rise to
a spatiotemporal transfer function required by an ideal first-order
differentiator in both spatial and temporal domains by tailoring the
unidirectional excitation of spoof surface plasmon polaritons (SSPPs). The
spatial edge detection was performed utilizing a metallic slit and the temporal
differentiation capability of the device was examined by Gaussian-like temporal
pulses of different width. We further confirmed the differentiator demonstrated
here could detect sharp changes of spatiotemporal pulses even with intricate
profiles and theoretically estimated the resolution limits of the spatial and
temporal edge detection. We also show that the pulse input after passing the
spatiotemporal differentiator implemented here could carry a transverse orbital
angular momentum (OAM) with a fractal topology charge which further increases
the information quantity.Comment: 6 figure
Soliton Solutions on Noncommutative Orbifold $ T^2/Z_4
In this paper, we explicitly construct a series of projectors on integral
noncommutative orbifold by extended constrution. They include
integration of two arbitary functions with symmetry. Our expressions
possess manifest symmetry. It is proved that the expression include all
projectors with minimal trace and in their standard expansions, the eigen value
functions of coefficient operators are continuous with respect to the arguments
and . Based on the integral expression, we alternately show the
derivative expression in terms of the similar kernal to the integral one.Since
projectors correspond to soliton solutions of the field theory on the
noncommutative orbifold, we thus present a series of corresponding solitons.Comment: 18 pages, no figure; referrences adde
Review of Recent Developments in Technical Control Approaches for Voltage and Congestion Management in Distribution Networks
The increasing installation of distributed energy resources in residential households is causing frequent voltage and congestion issues in low- and medium-voltage electrical networks. To defer or avoid the costly and complicated grid expansion, technical, pricing-based, and market-based approaches have been proposed in the literature. These approaches can help distribution system operators (DSOs) exploit flexible resources to manage their grids. This study focuses on technical control approaches, which are easier to implement, and provides an up-to-date review of their developments in modeling, solution approaches, and innovative applications facilitating indirect control from DSOs. Challenges and future research directions are also discussed
Genomic insight into diet adaptation in the biological control agent Cryptolaemus montrouzieri
Background: The ladybird beetle Cryptolaemus montrouzieri Mulsant, 1853 (Coleoptera, Coccinellidae) is used worldwide as a biological control agent. It is a predator of various mealybug pests, but it also feeds on alternative prey and can be reared on artificial diets. Relatively little is known about the underlying genetic adaptations of its feeding habits.
Results: We report the first high-quality genome sequence for C. montrouzieri. We found that the gene families encoding chemosensors and digestive and detoxifying enzymes among others were significantly expanded or contracted in C. montrouzieri in comparison to published genomes of other beetles. Comparisons of diet-specific larval development, survival and transcriptome profiling demonstrated that differentially expressed genes on unnatural diets as compared to natural prey were enriched in pathways of nutrient metabolism, indicating that the lower performance on the tested diets was caused by nutritional deficiencies. Remarkably, the C. montrouzieri genome also showed a significant expansion in an immune effector gene family. Some of the immune effector genes were dramatically downregulated when larvae were fed unnatural diets.
Conclusion: We suggest that the evolution of genes related to chemosensing, digestion, and detoxification but also immunity might be associated with diet adaptation of an insect predator. These findings help explain why this predatory ladybird has become a successful biological control agent and will enable the optimization of its mass rearing and use in biological control programs
Measuring Unified Dark Matter with 3D Cosmic Shear
We present parameter estimation forecasts for future 3D cosmic shear surveys
for a class of Unified Dark Matter (UDM) models, where a single scalar field
mimics both Dark Matter (DM) and Dark Energy (DE). These models have the
advantage that they can describe the dynamics of the Universe with a single
matter component providing an explanation for structure formation and cosmic
acceleration. A crucial feature of the class of UDM models we use in this work
is characterised by a parameter, c_inf (in units of the speed of light c=1),
that is the value of the sound speed at late times, and on which structure
formation depends. We demonstrate that the properties of the DM-like behaviour
of the scalar field can be estimated with very high precision with large-scale,
fully 3D weak lensing surveys. We found that 3D weak lensing significantly
constrains c_inf, and we find minimal errors 0.00003, for the fiducial value
c_inf=0.001, and 0.000026, for c_inf=0.012. Moreover, we compute the Bayesian
evidence for UDM models over the LCDM model as a function of c_inf. For this
purpose, we can consider the LCDM model as a UDM model with c_inf=0. We find
that the expected evidence clearly shows that the survey data would
unquestionably favour UDM models over the LCDM model, for the values
c_inf>0.001.Comment: 12 pages, 5 figures; revisions on Fisher matrix analysis and Bayesian
model selection according to the reviewer's comments; MNRAS (in press
- …