1,653 research outputs found

    Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs

    Full text link
    Most NLP tasks are modeled as supervised learning and thus require labeled training data to train effective models. However, manually producing such data at sufficient quality and quantity is known to be costly and time-intensive. Current research addresses this bottleneck by exploring a novel paradigm called zero-shot learning via dataset generation. Here, a powerful LLM is prompted with a task description to generate labeled data that can be used to train a downstream NLP model. For instance, an LLM might be prompted to "generate 500 movie reviews with positive overall sentiment, and another 500 with negative sentiment." The generated data could then be used to train a binary sentiment classifier, effectively leveraging an LLM as a teacher to a smaller student model. With this demo, we introduce Fabricator, an open-source Python toolkit for dataset generation. Fabricator implements common dataset generation workflows, supports a wide range of downstream NLP tasks (such as text classification, question answering, and entity recognition), and is integrated with well-known libraries to facilitate quick experimentation. With Fabricator, we aim to support researchers in conducting reproducible dataset generation experiments using LLMs and help practitioners apply this approach to train models for downstream tasks.Comment: 3 Figures and 2 Table

    The complete genome sequence and comparative genome analysis of the high pathogenicity Yersinia enterocolitica strain 8081

    Get PDF
    The human enteropathogen, Yersinia enterocolitica, is a significant link in the range of Yersinia pathologies extending from mild gastroenteritis to bubonic plague. Comparison at the genomic level is a key step in our understanding of the genetic basis for this pathogenicity spectrum. Here we report the genome of Y. enterocolitica strain 8081 (serotype 0:8; biotype 1B) and extensive microarray data relating to the genetic diversity of the Y. enterocolitica species. Our analysis reveals that the genome of Y. enterocolitica strain 8081 is a patchwork of horizontally acquired genetic loci, including a plasticity zone of 199 kb containing an extraordinarily high density of virulence genes. Microarray analysis has provided insights into species-specific Y. enterocolitica gene functions and the intraspecies differences between the high, low, and nonpathogenic Y. enterocolitica biotypes. Through comparative genome sequence analysis we provide new information on the evolution of the Yersinia. We identify numerous loci that represent ancestral clusters of genes potentially important in enteric survival and pathogenesis, which have been lost or are in the process of being lost, in the other sequenced Yersinia lineages. Our analysis also highlights large metabolic operons in Y. enterocolitica that are absent in the related enteropathogen, Yersinia pseudotuberculosis, indicating major differences in niche and nutrients used within the mammalian gut. These include clusters directing, the production of hydrogenases, tetrathionate respiration, cobalamin synthesis, and propanediol utilisation. Along with ancestral gene clusters, the genome of Y. enterocolitica has revealed species-specific and enteropathogen-specific loci. This has provided important insights into the pathology of this bacterium and, more broadly, into the evolution of the genus. Moreover, wider investigations looking at the patterns of gene loss and gain in the Yersinia have highlighted common themes in the genome evolution of other human enteropathogens

    IDENTIFICATION OF OCCULT CEREBRAL MICROBLEEDS IN ADULTS WITH IMMUNE THROMBOCYTOPENIA

    Get PDF
    Management of symptoms and prevention of life-threatening hemorrhage in immune thrombocytopenia (ITP) must be balanced against adverse effects of therapies. Because current treatment guidelines based on platelet count are confounded by variable bleeding phenotypes, there is a need to identify new objective markers of disease severity for treatment stratification. In this cross-sectional prospective study of 49 patients with ITP and nadir platelet counts <30 × 109/L and 18 aged-matched healthy controls, we used susceptibility-weighted magnetic resonance imaging to detect cerebral microbleeds (CMBs) as a marker of occult hemorrhage. CMBs were detected using a semiautomated method and correlated with clinical metadata using multivariate regression analysis. No CMBs were detected in health controls. In contrast, lobar CMBs were identified in 43% (21 of 49) of patients with ITP; prevalence increased with decreasing nadir platelet count (0/4, ≥15 × 109/L; 2/9, 10-14 × 109/L; 4/11, 5-9 × 109/L; 15/25 <5 × 109/L) and was associated with longer disease duration (P = 7 × 10−6), lower nadir platelet count (P = .005), lower platelet count at time of neuroimaging (P = .029), and higher organ bleeding scores (P = .028). Mucosal and skin bleeding scores, number of previous treatments, age, and sex were not associated with CMBs. Occult cerebral microhemorrhage is common in patients with moderate to severe ITP. Strong associations with ITP duration may reflect CMB accrual over time or more refractory disease. Further longitudinal studies in children and adults will allow greater understanding of the natural history and clinical and prognostic significance of CMBs

    Fish-borne trematodosis: Potential risk of infection by Ascocotyle (Phagicola) longa (Heterophyidae)

    Get PDF
    AbstractOwing to the veterinary and medical importance of heterophyid trematodes, a survey on Ascocotyle (Phagicola) longa in different organs of mullets Mugil liza from Rio de Janeiro was undertaken. The prevalence of metacercariae varied greatly between different organs of the mullets: spleen (100%), heart (98%), intestine wall (97%), liver (97%), muscle (87%), stomach wall (77%), brain (47%), gonads (30%) and gall bladder (30%). The high level of the intensity of the infection in relation to different fish organs was confirmed in two experimental infections performed during the spring/summer and autumn/winter seasons when 258 and 47 adult parasites were recovered from hamsters fed only with small pieces of muscle tissue. The potential risk of infection was considered to be high in view of the high prevalence and intensity of A. (P.) longa in the muscles of mullets throughout the year. Additionally new confocal imaging of metacercariae and adults experimentally obtained, enabled for the first time the description of a short genital atrium formed by the union of uterus and ejaculatory duct

    Review on computational methods for Lyapunov functions

    Get PDF
    Lyapunov functions are an essential tool in the stability analysis of dynamical systems, both in theory and applications. They provide sufficient conditions for the stability of equilibria or more general invariant sets, as well as for their basin of attraction. The necessity, i.e. the existence of Lyapunov functions, has been studied in converse theorems, however, they do not provide a general method to compute them. Because of their importance in stability analysis, numerous computational construction methods have been developed within the Engineering, Informatics, and Mathematics community. They cover different types of systems such as ordinary differential equations, switched systems, non-smooth systems, discrete-time systems etc., and employ di_erent methods such as series expansion, linear programming, linear matrix inequalities, collocation methods, algebraic methods, set-theoretic methods, and many others. This review brings these different methods together. First, the different types of systems, where Lyapunov functions are used, are briefly discussed. In the main part, the computational methods are presented, ordered by the type of method used to construct a Lyapunov function

    The Iceland Microcontinent and a continental Greenland-Iceland-Faroe Ridge

    Get PDF
    The breakup of Laurasia to form the Northeast Atlantic Realm was the culmination of a long period of tectonic unrest extending back to the Late Palaeozoic. Breakup was prolonged and complex and disintegrated an inhomogeneous collage of cratons sutured by cross-cutting orogens. Volcanic rifted margins formed, which are blanketed by lavas and underlain variously by magma-inflated, extended continental crust and mafic high-velocity lower crust of ambiguous and probably partly continental provenance. New rifts formed by diachronous propagation along old zones of weakness. North of the Greenland-Iceland-Faroe Ridge the newly forming rift propagated south along the Caledonian suture. South of the Greenland-Iceland-Faroe Ridge it propagated north through the North Atlantic Craton along an axis displaced ~ 150 km to the west of the northern rift. Both propagators stalled where the confluence of the Nagssugtoqidian and Caledonian orogens formed a transverse barrier. Thereafter, the ~ 400-km-wide latitudinal zone between the stalled rift tips extended in a distributed, unstable manner along multiple axes of extension that frequently migrated or jumped laterally with shearing occurring between them in diffuse transfer zones. This style of deformation continues to the present day. It is the surface expression of underlying magma-assisted stretching of ductile mid- and lower continental crust which comprises the Icelandic-type lower crust that underlies the Greenland-Iceland-Faroe Ridge. This, and probably also one or more full-crustal-thickness microcontinents incorporated in the Ridge, are capped by surface lavas. The Greenland-Iceland-Faroe Ridge thus has a similar structure to some zones of seaward-dipping reflectors. The contemporaneous melt layer corresponds to the 3–10 km thick Icelandic-type upper crust plus magma emplaced in the ~ 10–30-km-thick Icelandic-type lower crust. This model can account for seismic and gravity data that are inconsistent with a gabbroic composition for Icelandic-type lower crust, and petrological data that show no reasonable temperature or source composition could generate the full ~ 40-km thickness of Icelandic-type crust observed. Numerical modeling confirms that extension of the continental crust can continue for many tens of Myr by lower-crustal flow from beneath the adjacent continents. Petrological estimates of the maximum potential temperature of the source of Icelandic lavas are up to 1450 °C, no more than ~ 100 °C hotter than MORB source. The geochemistry is compatible with a source comprising hydrous peridotite/pyroxenite with a component of continental mid- and lower crust. The fusible petrology, high source volatile contents, and frequent formation of new rifts can account for the true ~ 15–20 km melt thickness at the moderate temperatures observed. A continuous swathe of magma-inflated continental material beneath the 1200-km-wide Greenland-Iceland-Faroe Ridge implies that full continental breakup has not yet occurred at this latitude. Ongoing tectonic instability on the Ridge is manifest in long-term tectonic disequilibrium on the adjacent rifted margins and on the Reykjanes Ridge, where southerly migrating propagators that initiate at Iceland are associated with diachronous swathes of unusually thick oceanic crust. Magmatic volumes in the NE Atlantic Realm have likely been overestimated and the concept of a monogenetic North Atlantic Igneous Province needs to be reappraised. A model of complex, piecemeal breakup controlled by pre-existing structures that produces anomalous volcanism at barriers to rift propagation and distributes continental material in the growing oceans fits other oceanic regions including the Davis Strait and the South Atlantic and West Indian oceans

    IMPACT-Global Hip Fracture Audit: Nosocomial infection, risk prediction and prognostication, minimum reporting standards and global collaborative audit. Lessons from an international multicentre study of 7,090 patients conducted in 14 nations during the COVID-19 pandemic

    Get PDF
    corecore