91 research outputs found
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
Aligning agent behaviors with diverse human preferences remains a challenging
problem in reinforcement learning (RL), owing to the inherent abstractness and
mutability of human preferences. To address these issues, we propose AlignDiff,
a novel framework that leverages RL from Human Feedback (RLHF) to quantify
human preferences, covering abstractness, and utilizes them to guide diffusion
planning for zero-shot behavior customizing, covering mutability. AlignDiff can
accurately match user-customized behaviors and efficiently switch from one to
another. To build the framework, we first establish the multi-perspective human
feedback datasets, which contain comparisons for the attributes of diverse
behaviors, and then train an attribute strength model to predict quantified
relative strengths. After relabeling behavioral datasets with relative
strengths, we proceed to train an attribute-conditioned diffusion model, which
serves as a planner with the attribute strength model as a director for
preference aligning at the inference phase. We evaluate AlignDiff on various
locomotion tasks and demonstrate its superior performance on preference
matching, switching, and covering compared to other baselines. Its capability
of completing unseen downstream tasks under human instructions also showcases
the promising potential for human-AI collaboration. More visualization videos
are released on https://aligndiff.github.io/
Dark Energy Perturbations Revisited
In this paper we study the evolution of cosmological perturbations in the
presence of dynamical dark energy, and revisit the issue of dark energy
perturbations. For a generally parameterized equation of state (EoS) such as
w_D(z) = w_0+w_1\frac{z}{1+z}, (for a single fluid or a single scalar field )
the dark energy perturbation diverges when its EoS crosses the cosmological
constant boundary w_D=-1. In this paper we present a method of treating the
dark energy perturbations during the crossing of the surface by
imposing matching conditions which require the induced 3-metric on the
hypersurface of w_D=-1 and its extrinsic curvature to be continuous. These
matching conditions have been used widely in the literature to study
perturbations in various models of early universe physics, such as Inflation,
the Pre-Big-Bang and Ekpyrotic scenarios, and bouncing cosmologies. In all of
these cases the EoS undergoes a sudden change. Through a detailed analysis of
the matching conditions, we show that \delta_D and \theta_D are continuous on
the matching hypersurface. This justifies the method used[1-4] in the numerical
calculation and data fitting for the determination of cosmological parameters.
We discuss the conditions under which our analysis is applicable.Comment: 10 pages and 1 figure
Clinical characterization and proteomic profiling of lean nonalcoholic fatty liver disease
IntroductionObesity has been historically associated with nonalcoholic fatty liver disease (NAFLD), but it can also occur in lean individuals. However, limited data is available on this special group. To investigate the clinical and proteomic characteristics of lean subjects with NAFLD, and to identify potential clinical variables and plasma proteins for diagnosing NAFLD in lean individuals, we collected clinical data from a large cohort of 2,236 subjects.MethodsDiagnosis of NAFLD relied on detecting pronounced hepatic steatosis through abdominal ultrasonography. Participants were categorized into four groups based on body mass index: overweight NAFLD, overweight control, lean NAFLD, and lean control. Plasma proteomic profiling was performed on samples from 20 subjects in each group. The lean NAFLD group was compared to both lean healthy and obese NAFLD groups across all data.Results and discussionThe results indicated that the lean NAFLD group exhibited intermediate metabolic profiles, falling between those of the lean healthy and overweight NAFLD groups. Proteomic profiling of plasma in lean subjects with or without NAFLD revealed 45 statistically significant changes in proteins, of which 37 showed high diagnostic value (AUC > 0.7) for lean NAFLD. These potential biomarkers primarily involved lipid metabolism, the immune and complement systems, and platelet degranulation. Furthermore, AFM, GSN, CFH, HGFAC, MMP2, and MMP9 have been previously associated with NAFLD or NAFLD-related factors such as liver damage, insulin resistance, metabolic syndromes, and extracellular homeostasis. Overall, lean individuals with NAFLD exhibit distinct clinical profiles compared to overweight individuals with NAFLD. Despite having worse metabolic profiles than their healthy counterparts, lean NAFLD patients generally experience milder systemic metabolic disturbances compared to obese NAFLD patients. Additionally, the plasma proteomic profile is significantly altered in lean NAFLD, highlighting the potential of differentially expressed proteins as valuable biomarkers or therapeutic targets for diagnosing and treating NAFLD in this population
SkyMath: Technical Report
Large language models (LLMs) have shown great potential to solve varieties of
natural language processing (NLP) tasks, including mathematical reasoning. In
this work, we present SkyMath, a large language model for mathematics with 13
billion parameters. By applying self-compare fine-tuning, we have enhanced
mathematical reasoning abilities of Skywork-13B-Base remarkably. On GSM8K,
SkyMath outperforms all known open-source models of similar size and has
established a new SOTA performance
Recommended from our members
Tailoring sub-3.3 angstrom ultramicropores in advanced carbon molecular sieve membranes for blue hydrogen production
Carbon molecular sieve (CMS) membranes prepared by carbonization of polymers containing strongly size-sieving ultramicropores are attractive for high-temperature gas separations. However, polymers need to be carbonized at extremely high temperatures (900° to 1200°C) to achieve sub-3.3 Å ultramicroporous channels for H2/CO2 separation, which makes them brittle and impractical for industrial applications. Here, we demonstrate that polymers can be first doped with thermolabile cross-linkers before low-temperature carbonization to retain the polymer processability and achieve superior H2/CO2 separation properties. Specifically, polybenzimidazole (PBI) is cross-linked with pyrophosphoric acid (PPA) via H bonding and proton transfer before carbonization at ≤600°C. The synergistic PPA doping and subsequent carbonization of PBI increase H2 permeability from 27 to 140 Barrer and H2/CO2 selectivity from 15 to 58 at 150°C, superior to state-of-the-art polymeric materials and surpassing Robeson’s upper bound. This study provides a facile and effective way to tailor subnanopore size and porosity in CMS membranes with desirable molecular sieving ability.
</p
Skywork: A More Open Bilingual Foundation Model
In this technical report, we present Skywork-13B, a family of large language
models (LLMs) trained on a corpus of over 3.2 trillion tokens drawn from both
English and Chinese texts. This bilingual foundation model is the most
extensively trained and openly published LLMs of comparable size to date. We
introduce a two-stage training methodology using a segmented corpus, targeting
general purpose training and then domain-specific enhancement training,
respectively. We show that our model not only excels on popular benchmarks, but
also achieves \emph{state of the art} performance in Chinese language modeling
on diverse domains. Furthermore, we propose a novel leakage detection method,
demonstrating that test data contamination is a pressing issue warranting
further investigation by the LLM community. To spur future research, we release
Skywork-13B along with checkpoints obtained during intermediate stages of the
training process. We are also releasing part of our SkyPile corpus, a
collection of over 150 billion tokens of web text, which is the largest high
quality open Chinese pre-training corpus to date. We hope Skywork-13B and our
open corpus will serve as a valuable open-source resource to democratize access
to high-quality LLMs
The genetic architecture of membranous nephropathy and its potential to improve non-invasive diagnosis
Membranous Nephropathy (MN) is a rare autoimmune cause of kidney failure. Here we report a genome-wide association study (GWAS) for primary MN in 3,782 cases and 9,038 controls of East Asian and European ancestries. We discover two previously unreported loci, NFKB1 (rs230540, OR = 1.25, P = 3.4 × 10−12) and IRF4 (rs9405192, OR = 1.29, P = 1.4 × 10−14), fine-map the PLA2R1 locus (rs17831251, OR = 2.25, P = 4.7 × 10−103) and report ancestry-specific effects of three classical HLA alleles: DRB1*1501 in East Asians (OR = 3.81, P = 2.0 × 10−49), DQA1*0501 in Europeans (OR = 2.88, P = 5.7 × 10−93), and DRB1*0301 in both ethnicities (OR = 3.50, P = 9.2 × 10−23 and OR = 3.39, P = 5.2 × 10−82, respectively). GWAS loci explain 32% of disease risk in East Asians and 25% in Europeans, and correctly re-classify 20–37% of the cases in validation cohorts that are antibody-negative by the serum anti-PLA2R ELISA diagnostic test. Our findings highlight an unusual genetic architecture of MN, with four loci and their interactions accounting for nearly one-third of the disease risk
The genetic architecture of membranous nephropathy and its potential to improve non-invasive diagnosis
Membranous Nephropathy (MN) is a rare autoimmune cause of kidney failure. Here we report a genome-wide association study (GWAS) for primary MN in 3,782 cases and 9,038 controls of East Asian and European ancestries. We discover two previously unreported loci, NFKB1 (rs230540, OR = 1.25, P = 3.4 × 10-12) and IRF4 (rs9405192, OR = 1.29, P = 1.4 × 10-14), fine-map the PLA2R1 locus (rs17831251, OR = 2.25, P = 4.7 × 10-103) and report ancestry-specific effects of three classical HLA alleles: DRB1*1501 in East Asians (OR = 3.81, P = 2.0 × 10-49), DQA1*0501 in Europeans (OR = 2.88, P = 5.7 × 10-93), and DRB1*0301 in both ethnicities (OR = 3.50, P = 9.2 × 10-23 and OR = 3.39, P = 5.2 × 10-82, respectively). GWAS loci explain 32% of disease risk in East Asians and 25% in Europeans, and correctly re-classify 20-37% of the cases in validation cohorts that are antibody-negative by the serum anti-PLA2R ELISA diagnostic test. Our findings highlight an unusual genetic architecture of MN, with four loci and their interactions accounting for nearly one-third of the disease risk
Heterogeneous Impact of Economic Policy Uncertainty on Provincial Environmental Pollution Emissions in China
With China’s proposal of carbon peak and carbon neutral goals, its trend of economic development has shifted from pursuing high-speed economic development to high-quality development. However, for the past few years, with the increasing global economic policy uncertainty, fluctuations in the world economy, especially emergent through public events such as COVID-19, affect investment and consumption, and thus indirectly affect the realization of the dual carbon target. Economic policy uncertainty plays an increasingly important role in many factors affecting environmental pollution. We conducted an empirical test on sample data, which are from 30 provinces and autonomous regions in China from 2008 to 2020, to further study the impact of economic policy uncertainty on environmental pollution emissions. We found that: (1) Economic policy uncertainty is inversely related to the emission of environmental pollution, and the consumption effect brought by economic policy uncertainty is more than the investment effect. This means that, with the economic policy uncertainty index increasing, the comprehensive index of environmental pollution emissions is lower, and the environmental pollution emissions are lower; (2) Compared with provinces with an average level of economic development, the impact of economic policy uncertainty on environmental emissions is deeper in developed provinces
- …