24 research outputs found

    Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation

    Full text link
    Large Language Models (LLMs) hold immense potential to generate synthetic data of high quality and utility, which has numerous applications from downstream model training to practical data utilisation. However, contemporary models, despite their impressive capacities, consistently struggle to produce both coherent and diverse data. To address the coherency issue, we introduce contrastive expert guidance, where the difference between the logit distributions of fine-tuned and base language models is emphasised to ensure domain adherence. In order to ensure diversity, we utilise existing real and synthetic examples as negative prompts to the model. We deem this dual-pronged approach to logit reshaping as STEER: Semantic Text Enhancement via Embedding Repositioning. STEER operates at inference-time and systematically guides the LLMs to strike a balance between adherence to the data distribution (ensuring semantic fidelity) and deviation from prior synthetic examples or existing real datasets (ensuring diversity and authenticity). This delicate balancing act is achieved by dynamically moving towards or away from chosen representations in the latent space. STEER demonstrates improved performance over previous synthetic data generation techniques, exhibiting better balance between data diversity and coherency across three distinct tasks: hypothesis generation, toxic and non-toxic comment generation, and commonsense reasoning task generation. We demonstrate how STEER allows for fine-tuned control over the diversity-coherency trade-off via its hyperparameters, highlighting its versatility

    Biomimetic nanocrystalline apatite coatings synthesized by Matrix Assisted Pulsed Laser Evaporation for medical applications

    Get PDF
    tWe report the deposition by Matrix Assisted Pulsed Laser Evaporation (MAPLE) technique of biomimeticnanocrystalline apatite coatings on titanium substrates, with potential application in tissue engineering.The targets were prepared from metastable, nanometric, poorly crystalline apatite powders, analogousto mineral bone, synthesized through a biomimetic approach by double decomposition process. For thedeposition of thin films, a KrF* excimer laser source was used (λ = 248 nm, τFWHM ≤ 25 ns). The analy-ses revealed the existence, in synthesized powders, of labile non-apatitic mineral ions, associated withthe formation of a hydrated layer at the surface of the nanocrystals. The thin film analyses showedthat the structural and chemical nature of the nanocrystalline apatite was prevalently preserved. Theperpetuation of the non-apatitic environments was also observed. The study indicated that MAPLE isa suitable technique for the congruent transfer of a delicate material, such as the biomimetic hydratednanohydroxyapatite

    Metallicity gradient of the thick disc progenitor at high redshift

    Get PDF
    We have developed a novel Markov Chain Monte Carlo chemical 'painting' technique to explore possible radial and vertical metallicity gradients for the thick disc progenitor. In our analysis, we match an N-body simulation to the data from the Apache Point Observatory Galactic Evolution Experiment survey.We assume that the thick disc has a constant scaleheight and has completed its formation at an early epoch, after which time radial mixing of its stars has taken place. Under these assumptions, we find that the initial radial metallicity gradient of the thick disc progenitor should not be negative, but either flat or even positive, to explain the current negative vertical metallicity gradient of the thick disc. Our study suggests that the thick disc was built-up in an inside-out and upside-down fashion, and older, smaller and thicker populations are more metal poor. In this case, star-forming discs at different epochs of the thick disc formation are allowed to have different radial metallicity gradients, including a negative one, which helps to explain a variety of slopes observed in high-redshift disc galaxies. This scenario helps to explain the positive slope of the metallicity-rotation velocity relation observed for the Galactic thick disc. On the other hand, radial mixing flattens the slope of an existing gradient.DK and IC acknowledge the support of the UK’s Science & Technology Facilities Council (STFC Grants ST/K000977/1 and ST/N000811/1). CAP is thankful to the Spanish MINECO for funding through grant AYA2014-56359-P. LC gratefully acknowledges support from the Australian Research Council (grants DP150100250, FT160100402). RJJG acknowledges support by the DFG Research Centre SFB-881 ‘The Milky Way System’, through project A1. JH is supported by a Dunlap Fellowship at the Dunlap Institute for Astronomy & Astrophysics, funded through an endowment established by the Dunlap family and the University of Toronto

    The GALAH Survey: Chemical tagging and chrono-chemodynamics of accreted halo stars with GALAH+ DR3 and GaiaGaia eDR3

    Get PDF
    © 2021 The Author(s) Published by Oxford University Press on behalf of Royal Astronomical Society. This is the accepted manuscript version of an article which has been published in final form at https://doi.org/10.1093/mnras/stab3504Since the advent of GaiaGaia astrometry, it is possible to identify massive accreted systems within the Galaxy through their unique dynamical signatures. One such system, GaiaGaia-Sausage-Enceladus (GSE), appears to be an early "building block" given its virial mass >1010M> 10^{10}\,\mathrm{M_\odot} at infall (z13z\sim1-3). In order to separate the progenitor population from the background stars, we investigate its chemical properties with up to 30 element abundances from the GALAH+ Survey Data Release 3 (DR3). To inform our choice of elements for purely chemically selecting accreted stars, we analyse 4164 stars with low-α\alpha abundances and halo kinematics. These are most different to the Milky Way stars for abundances of Mg, Si, Na, Al, Mn, Fe, Ni, and Cu. Based on the significance of abundance differences and detection rates, we apply Gaussian mixture models to various element abundance combinations. We find the most populated and least contaminated component, which we confirm to represent GSE, contains 1049 stars selected via [Na/Fe] vs. [Mg/Mn] in GALAH+ DR3. We provide tables of our selections and report the chrono-chemodynamical properties (age, chemistry, and dynamics). Through a previously reported clean dynamical selection of GSE stars, including 30<JR / kpckms1<5530 < \sqrt{J_R~/~\mathrm{kpc\,km\,s^{-1}}} < 55, we can characterise an unprecedented 24 abundances of this structure with GALAH+ DR3. Our chemical selection allows us to prevent circular reasoning and characterise the dynamical properties of the GSE, for example mean JR / kpckms1=2614+9\sqrt{J_R~/~\mathrm{kpc\,km\,s^{-1}}} = 26_{-14}^{+9}. We find only (29±1)%(29\pm1)\% of the GSE stars within the clean dynamical selection region. Our methodology will improve future studies of accreted structures and their importance for the formation of the Milky Way.Peer reviewedFinal Accepted Versio

    Species diversity, host preference and arbovirus detection of Culicoides (Diptera: Ceratopogonidae) in south-eastern Serbia

    Get PDF
    BackgroundCulicoides (Diptera: Ceratopogonidae) is a genus of small biting midges (also known as no-see ums) that currently includes 1368 described species. They are proven or suspected vectors for important pathogens affecting animals such as bluetongue virus (BTV) and Schmallenberg virus (SBV). Currently little information is available on the species of Culicoides present in Serbia. Thus, the aim of this study was to examine species diversity, host preference and the presence of BTV and SBV RNA in Culicoides from the Stara Planina Nature Park in south-eastern Serbia.ResultsIn total 19,887 individual Culicoides were collected during three nights of trapping at two farm sites and pooled into six groups (Obsoletus group, Pulicaris group, Others group and further each group according to the blood-feeding status to freshly engorged and non-engorged). Species identification was done on subsamples of 592 individual Culicoides specimens by morphological and molecular methods (MALDI-TOF mass spectrometry and PCR/sequencing). At least 22 Culicoides species were detected. Four animal species (cow, sheep, goat and common blackbird) as well as humans were identified as hosts of Culicoides biting midges. The screening of 8291 Culicoides specimens in 99 pools for the presence of BTV and SBV RNA by reverse-transcription quantitative PCR were negative.ConclusionsThe biodiversity of Culicoides species in the natural reserve Stara Planina was high with at least 22 species present. The presence of C. imicola Kieffer was not recorded in this area. Culicoides showed opportunistic feeding behaviour as determined by host preference. The absence of SBV and BTV viral RNA correlates with the absence of clinical disease in the field during the time of sampling. These data are the direct outcome of a training programme within the Institutional Partnership Project AMSAR: Arbovirus monitoring, research and surveillance-capacity building on mosquitoes and biting midges funded by the programme SCOPES of the Swiss National Science Foundation

    The GALAH survey: chemical clocks

    Get PDF
    We present the first large-scale study that demonstrates how ages can be determined for large samples of stars through Galactic chemical evolution. Previous studies found that the elemental abundances of a star correlate directly with its age and metallicity. Using this knowledge, we derive ages for 214 577 stars in GALAH DR3 using only overall metallicities and chemical abundances. Stellar ages are estimated via the machine learning algorithm XGBoost for stars belonging to the Milky Way disc with metallicities in the range -1 < [Fe/H] < 0.5, using main-sequence turn-off stars as our training set. We find that stellar ages for the bulk of GALAH DR3 are precise to 1-2 Gyr using this method. With these ages, we replicate many recent results on the age-kinematic trends of the nearby disc, including the solar neighbourhood's age-velocity dispersion relationship and the larger global velocity dispersion relations of the disc found using Gaia and GALAH. These results show that chemical abundance variations at a given birth radius are small, and that strong chemical tagging of stars directly to birth clusters may prove difficult with our current elemental abundance precision. Our results highlight the need to measure abundances for as many nucleosynthetic production sites as possible in order to estimate reliable ages from chemistry. Our methods open a new door into studies of the kinematic structure and evolution of the disc, as ages may potentially be estimated to a precision of 1-2 Gyr for a large fraction of stars in existing spectroscopic surveys

    Inference in the Milky Way in the Gaia era

    No full text
    We employ state-of-the-art statistical inference and Machine Learning techniques to understand the formation and evolution history of our Galaxy, the Milky Way, using data from the astrometric Gaia mission and ground-based spectroscopic surveys. We first investigate the vertical metallicity gradients of five mono-age stellar populations for a sample of 18,435 dwarf stars selected from the cross-matched Tycho- Gaia Astrometric Solution (TGAS) and RAdial Velocity Experiment (RAVE) Data Release 5. We find an increasingly steeper negative vertical metallicity gradient for the older stellar populations and a steadily increasing intrinsic dispersion in metallicity with age. These results are consistent with a scenario that thin disc stars formed from a flaring thin star-forming disc. To further study the chrono-chemo- dynamical structure of the Galactic disc, we develop a Bayesian Machine Learning framework called BINGO (Bayesian INference for Galactic archaeOlogy), which is a Bayesian Neural Network trained on asteroseismic age data, to obtain accurate relative stellar age estimates with reliable uncertainties for the Apache Point Obser- vatory Galactic Evolution Experiment (APOGEE) stars. After carefully architecting a training set to minimise bias, we apply BINGO to a stellar sample consisting of 17,305 carefully selected evolved stars. We find that the outer disc follows a differ- ent chemical evolution pathway than the inner disc. The outer metal-poor stars only starting to form after the compact thick disc formation phase has completed in the inner region and the star-forming gas disc extended outwardly with a metal-poor gas accretion. Using the Gaia DR2 data, we also try to find dwarf galaxies in eight Fermi-LAT extended, unassociated, gamma-ray source fields, to test the hypothesis that they owe to dark matter annihilation. After probing previously unexplored heliocentric distances of less than 20 kpc with an extreme-deconvolution technique, we find no sign of a dwarf galaxy in any of these fields despite Gaia’s excellent astrometric accuracy

    Lung Ultrasound Is More Sensitive for Hospitalized Consolidated Pneumonia Diagnosis Compared to CXR in Children

    No full text
    Background: Pneumonia is the leading cause of death among children; thus, a correct early diagnosis would be ideal. The imagistic diagnosis still uses chest X-ray (CXR), but lung ultrasound (LUS) proves to be reliable for pneumonia diagnosis. The aim of our study was to evaluate the sensitivity and specificity of LUS compared to CXR in consolidated pneumonia. Methods: Children with clinical suspicion of bacterial pneumonia were screened by LUS for pneumonia, followed by CXR. The agreement relation between LUS and CXR regarding the detection of consolidation was evaluated by Cohen’s kappa test. Results: A total of 128 patients with clinical suspicion of pneumonia were evaluated; 74 of them were confirmed by imagery and biological inflammatory markers. The highest frequency of pneumonia was in the 0–3 years age group (37.83%). Statistical estimation of the agreement between LUS and CXR in detection of the consolidation found an almost perfect agreement, with a Cohen’s kappa coefficient of K = 0.89 ± 0.04 SD, p = 0.000. Sensitivity of LUS was superior to CXR in detection of consolidations. Conclusion: Lung ultrasound is a reliable method for the detection of pneumonia consolidation in hospitalized children, with sensitivity and specificity superior to CXR. LUS should be used for rapid and safe evaluation of child pneumonia

    Oral Glucose Tolerance Test in Patients with Cystic Fibrosis Compared to the Overweight and Obese: A Different Approach in Understanding the Results

    No full text
    (1) Background: In cystic fibrosis (CF), the oral glucose tolerance test (OGTT) is recommended from 10 years old annually to screen and diagnose cystic fibrosis-related diabetes (CFRD). Alternative OGTT characteristics (glucose curve shape, time to glucose peak, one-hour glucose value, and three-hour glucose value with the new shape curve) were studied in other populations considered at high risk for diabetes; (2) Methods: The study analyses classical and alternative OGGT characteristics from 44 children (22 CF, 22 obese without CF), mean age: 12.9 ± 2.2 years evaluated in a single-center from Romania. (3) Results: In 59.1% of children with CF, the predominant OGTT pattern was: abnormal glucose metabolism or CFRD, with a monophasic curve shape, a late peak glucose level, and 1 h glucose ≥ 155 mg/dL, showing a very different pattern compared with sex and age-matched obese children. Statistical estimation agreement between the late glucose peak (K = 0.60; p = 0.005), the 1 h glucose ≥ 155 mg/dL during OGTT (K = 0.69, p = 0.001), and the classical method of interpretation was found. (4) Conclusions: Late peak glucose and 1 h glucose level ≥ 155 mg/dL during OGTT can be used for diagnosing the early glucose metabolism alteration in children with CF

    Oral Glucose Tolerance Test in Patients with Cystic Fibrosis Compared to the Overweight and Obese: A Different Approach in Understanding the Results

    No full text
    (1) Background: In cystic fibrosis (CF), the oral glucose tolerance test (OGTT) is recommended from 10 years old annually to screen and diagnose cystic fibrosis-related diabetes (CFRD). Alternative OGTT characteristics (glucose curve shape, time to glucose peak, one-hour glucose value, and three-hour glucose value with the new shape curve) were studied in other populations considered at high risk for diabetes; (2) Methods: The study analyses classical and alternative OGGT characteristics from 44 children (22 CF, 22 obese without CF), mean age: 12.9 &plusmn; 2.2 years evaluated in a single-center from Romania. (3) Results: In 59.1% of children with CF, the predominant OGTT pattern was: abnormal glucose metabolism or CFRD, with a monophasic curve shape, a late peak glucose level, and 1 h glucose &ge; 155 mg/dL, showing a very different pattern compared with sex and age-matched obese children. Statistical estimation agreement between the late glucose peak (K = 0.60; p = 0.005), the 1 h glucose &ge; 155 mg/dL during OGTT (K = 0.69, p = 0.001), and the classical method of interpretation was found. (4) Conclusions: Late peak glucose and 1 h glucose level &ge; 155 mg/dL during OGTT can be used for diagnosing the early glucose metabolism alteration in children with CF
    corecore