112 research outputs found

    Reinforcement Learning from Statistical Feedback: the Journey from AB Testing to ANT Testing

    Full text link
    Reinforcement Learning from Human Feedback (RLHF) has played a crucial role in the success of large models such as ChatGPT. RLHF is a reinforcement learning framework which combines human feedback to improve learning effectiveness and performance. However, obtaining preferences feedback manually is quite expensive in commercial applications. Some statistical commercial indicators are usually more valuable and always ignored in RLHF. There exists a gap between commercial target and model training. In our research, we will attempt to fill this gap with statistical business feedback instead of human feedback, using AB testing which is a well-established statistical method. Reinforcement Learning from Statistical Feedback (RLSF) based on AB testing is proposed. Statistical inference methods are used to obtain preferences for training the reward network, which fine-tunes the pre-trained model in reinforcement learning framework, achieving greater business value. Furthermore, we extend AB testing with double selections at a single time-point to ANT testing with multiple selections at different feedback time points. Moreover, we design numerical experiences to validate the effectiveness of our algorithm framework

    Effects of acidification on nitrification and associated nitrous oxide emission in estuarine and coastal waters

    Get PDF
    In the context of an increasing atmospheric carbon dioxide (CO2) level, acidification of estuarine and coastal waters is greatly exacerbated by land-derived nutrient inputs, coastal upwelling, and complex biogeochemical processes. A deeper understanding of how nitrifiers respond to intensifying acidification is thus crucial to predict the response of estuarine and coastal ecosystems and their contribution to global climate change. Here, we show that acidification can significantly decrease nitrification rate but stimulate generation of byproduct nitrous oxide (N2O) in estuarine and coastal waters. By varying CO2 concentration and pH independently, an expected beneficial effect of elevated CO2 on activity of nitrifiers (“CO2-fertilization” effect) is excluded under acidification. Metatranscriptome data further demonstrate that nitrifiers could significantly up-regulate gene expressions associated with intracellular pH homeostasis to cope with acidification stress. This study highlights the molecular underpinnings of acidification effects on nitrification and associated greenhouse gas N2O emission, and helps predict the response and evolution of estuarine and coastal ecosystems under climate change and human activities.publishedVersio

    Longitudinal Genomic Evolution of Conventional Papillary Thyroid Cancer With Brain Metastasis

    Get PDF
    BackgroundBrain metastasis is extremely rare but predicts dismal prognosis in papillary thyroid cancer (PTC). Dynamic evaluation of stepwise metastatic lesions was barely conducted to identify the longitudinal genomic evolution of brain metastasis in PTC.MethodChronologically resected specimen was analyzed by whole exome sequencing, including four metastatic lymph nodes (lyn 1–4) and brain metastasis lesion (BM). Phylogenetic tree was reconstructed to infer the metastatic pattern and the potential functional mutations.ResultsContrasting with lyn1, ipsilateral metastatic lesions (lyn2–4 and BM) with shared biallelic mutations of TSC2 indicated different genetic originations from multifocal tumors. Lyn 3/4, particularly lyn4 exhibited high genetic similarity with BM. Besides the similar mutational compositions and signatures, shared functional mutations (CDK4R24C, TP53R342*) were observed in lyn3/4 and BM. Frequencies of these mutations gradually increase along with the metastasis progression. Consistently, TP53 knockout and CDK4R24C introduction in PTC cells significantly decreased radioiodine uptake and increased metastatic ability.ConclusionGenomic mutations in CDK4 and TP53 during the tumor evolution may contribute to the lymph node and brain metastasis of PTC

    Potential of Core-Collapse Supernova Neutrino Detection at JUNO

    Get PDF
    JUNO is an underground neutrino observatory under construction in Jiangmen, China. It uses 20kton liquid scintillator as target, which enables it to detect supernova burst neutrinos of a large statistics for the next galactic core-collapse supernova (CCSN) and also pre-supernova neutrinos from the nearby CCSN progenitors. All flavors of supernova burst neutrinos can be detected by JUNO via several interaction channels, including inverse beta decay, elastic scattering on electron and proton, interactions on C12 nuclei, etc. This retains the possibility for JUNO to reconstruct the energy spectra of supernova burst neutrinos of all flavors. The real time monitoring systems based on FPGA and DAQ are under development in JUNO, which allow prompt alert and trigger-less data acquisition of CCSN events. The alert performances of both monitoring systems have been thoroughly studied using simulations. Moreover, once a CCSN is tagged, the system can give fast characterizations, such as directionality and light curve

    Detection of the Diffuse Supernova Neutrino Background with JUNO

    Get PDF
    As an underground multi-purpose neutrino detector with 20 kton liquid scintillator, Jiangmen Underground Neutrino Observatory (JUNO) is competitive with and complementary to the water-Cherenkov detectors on the search for the diffuse supernova neutrino background (DSNB). Typical supernova models predict 2-4 events per year within the optimal observation window in the JUNO detector. The dominant background is from the neutral-current (NC) interaction of atmospheric neutrinos with 12C nuclei, which surpasses the DSNB by more than one order of magnitude. We evaluated the systematic uncertainty of NC background from the spread of a variety of data-driven models and further developed a method to determine NC background within 15\% with {\it{in}} {\it{situ}} measurements after ten years of running. Besides, the NC-like backgrounds can be effectively suppressed by the intrinsic pulse-shape discrimination (PSD) capabilities of liquid scintillators. In this talk, I will present in detail the improvements on NC background uncertainty evaluation, PSD discriminator development, and finally, the potential of DSNB sensitivity in JUNO

    Real-time Monitoring for the Next Core-Collapse Supernova in JUNO

    Full text link
    Core-collapse supernova (CCSN) is one of the most energetic astrophysical events in the Universe. The early and prompt detection of neutrinos before (pre-SN) and during the SN burst is a unique opportunity to realize the multi-messenger observation of the CCSN events. In this work, we describe the monitoring concept and present the sensitivity of the system to the pre-SN and SN neutrinos at the Jiangmen Underground Neutrino Observatory (JUNO), which is a 20 kton liquid scintillator detector under construction in South China. The real-time monitoring system is designed with both the prompt monitors on the electronic board and online monitors at the data acquisition stage, in order to ensure both the alert speed and alert coverage of progenitor stars. By assuming a false alert rate of 1 per year, this monitoring system can be sensitive to the pre-SN neutrinos up to the distance of about 1.6 (0.9) kpc and SN neutrinos up to about 370 (360) kpc for a progenitor mass of 30M⊙M_{\odot} for the case of normal (inverted) mass ordering. The pointing ability of the CCSN is evaluated by using the accumulated event anisotropy of the inverse beta decay interactions from pre-SN or SN neutrinos, which, along with the early alert, can play important roles for the followup multi-messenger observations of the next Galactic or nearby extragalactic CCSN.Comment: 24 pages, 9 figure

    Influence of stratospheric sudden warming on the tropical intraseasonal convection

    No full text
    Madden–Julian oscillation (MJO), the dominant mode of intraseasonal variability in the tropical troposphere, has recently been shown to have a great impact on Northern Hemisphere (NH) extratropical stratosphere. But the influence of the variability in the extratropical stratosphere on MJO is seldom reported. In this study, the influence of major, mid–winter NH stratospheric sudden warmings (SSWs) on the MJO is investigated using meteorological reanalysis datasets. Our analysis reveals that SSWs also exert considerable influence on tropical intraseasonal convection. The occurrences of MJO phases 6 and 7 significantly increase during around 20 d after the onset of SSWs, corresponding to enhanced convective activity over the equatorial Central and Western Pacific. Then in the following days, the coherent eastward propagation of tropical intraseasonal convection resembles the periodic variation in a typical MJO. These results suggest that the extratropical stratosphere affects the organized tropical intraseasonal convection, and variability of the tropical intraseasonal convection related to MJO can be better grasped by taking extratropical stratospheric variability into account. Considering the complex interaction between MJO and extratropical stratosphere, further work on comprehensive understanding of the relationship between SSWs and MJO is required in future studies

    Container Ship Carbon and Fuel Estimation in Voyages Utilizing Meteorological Data with Data Fusion and Machine Learning Techniques

    No full text
    The International Maritime Organization (IMO) had made effort to reduce the ship’s energy consumption and carbon emission by optimizing the ship’s operational measures such as speed and weather routing. However, existing fuel consumption models were relatively simple without considering the quantified effect of weather conditions. In this paper, a knowledge-based ridge regression-based algorithm is presented for enabling automated fuel consumption estimation under varying weather conditions during voyages. Wind speed, wave height, ship speed, draught, AIS segment distance, and ship’s heading (HDG) are used as input to predict the fuel consumption value from the MRV report. In this work, 3 types of models are tested: AIS-based model, MRV-based model, and MRV-based normalized model. In AIS based model, weather conditions are divided into nine categories based on wind speed, wave height, and wind directions then trained separately. In MRV-based mode, the daily weather condition was used, and the MRV-normalized model used the normalized daily weather data. The proposed ridge regression models (11 models total) were tested with 4 container ships for a period of one year, and the result shows that compared to real fuel consumption, MRV-based model could achieve the best result with an average error less than 3% comparing to real MRV report

    Recent Strategies to Address Hypoxic Tumor Environments in Photodynamic Therapy

    No full text
    Photodynamic therapy (PDT) has become a promising method of cancer treatment due to its unique properties, such as noninvasiveness and low toxicity. The efficacy of PDT is, however, significantly reduced by the hypoxia tumor environments, because PDT involves the generation of reactive oxygen species (ROS), which requires the great consumption of oxygen. Moreover, the consumption of oxygen caused by PDT would further exacerbate the hypoxia condition, which leads to angiogenesis, invasion of tumors to other parts, and metastasis. Therefore, many research studies have been conducted to design nanoplatforms that can alleviate tumor hypoxia and enhance PDT. Herein, the recent progress on strategies for overcoming tumor hypoxia is reviewed, including the direct transport of oxygen to the tumor site by O2 carriers, the in situ generation of oxygen by decomposition of oxygen-containing compounds, reduced O2 consumption, as well as the regulation of tumor microenvironments. Limitations and future perspectives of these technologies to improve PDT are also discussed

    Understanding social media beyond text: A reliable practice on Twitter

    No full text
    Social media provides high-volume and real-time data, which has been broadly used in diverse applications in sales, marketing, disaster management, health surveillance, etc. However, distinguishing between noises and reliable information can be challenging, since social media, a user-generated content system, has a great number of users who update massive information every second. The rich information is not only included in the short textual content but also embedded in the images and videos. In this paper, we introduce an effective and efficient framework for event detection with social media data. The framework integrates both textual and imagery content in the hope to fully utilize the information. The approach has been demonstrated to be more accurate than the text-only approach by removing 58 (66.7%) false-positive events. The precision of event detection is improved by 6.5%. Besides, based on our analysis, we also look into the content of these images to further explore the space of social media studies. Finally, the closely related text and image from social media offer us a valuable text-image mapping, which can enable knowledge transfer between two media types
    • …
    corecore