69 research outputs found
Enhancing Large Language Model Performance To Answer Questions and Extract Information More Accurately
Large Language Models (LLMs) generate responses to questions; however, their
effectiveness is often hindered by sub-optimal quality of answers and
occasional failures to provide accurate responses to questions. To address
these challenges, a fine-tuning process is employed, involving feedback and
examples to refine models. The objective is to enhance AI models through
continuous feedback loops, utilizing metrics such as cosine similarity, LLM
evaluation and Rouge-L scores to evaluate the models. Leveraging LLMs like
GPT-3.5, GPT4ALL, and LLaMA2, and Claude, this approach is benchmarked on
financial datasets, including the FinanceBench and RAG Instruct Benchmark
Tester Dataset, illustrating the necessity of fine-tuning. The results showcase
the capability of fine-tuned models to surpass the accuracy of zero-shot LLMs,
providing superior question and answering capabilities. Notably, the
combination of fine-tuning the LLM with a process known as Retrieval Augmented
Generation (RAG) proves to generate responses with improved accuracy
A HIV-1 heterosexual transmission chain in Guangzhou, China: a molecular epidemiological study
<p>Abstract</p> <p>Background</p> <p>We conducted molecular analyses to confirm four clustering HIV-1 infections (Patient A, B, C & D) in Guangzhou, China. These cases were identified by epidemiological investigation and suspected to acquire the infection through a common heterosexual transmission chain.</p> <p>Methods</p> <p><it>Env C2V3V4 </it>region, <it>gag p17/p24 </it>junction and partial <it>pol </it>gene of HIV-1 genome from serum specimens of these infected cases were amplified by reverse transcription polymerase chain reaction (RT-PCR) and nucleotide sequenced.</p> <p>Results</p> <p>Phylogenetic analyses indicated that their viral nucleotide sequences were significantly clustered together (bootstrap value is 99%, 98% and 100% in <it>env</it>, <it>gag </it>and <it>pol </it>tree respectively). Evolutionary distance analysis indicated that their genetic diversities of <it>env</it>, <it>gag </it>and <it>pol </it>genes were significantly lower than non-clustered controls, as measured by unpaired <it>t</it>-test (<it>env </it>gene comparison: <it>p </it>< 0.005; <it>gag </it>gene comparison: <it>p </it>< 0.005; <it>pol </it>gene comparison: <it>p </it>< 0.005).</p> <p>Conclusion</p> <p>Epidemiological results and molecular analyses consistently illustrated these four cases represented a transmission chain which dispersed in the locality through heterosexual contact involving commercial sex worker.</p
Zelda Binding in the Early Drosophila melanogaster Embryo Marks Regions Subsequently Activated at the Maternal-to-Zygotic Transition
The earliest stages of development in most metazoans are driven by maternally deposited proteins and mRNAs, with widespread transcriptional activation of the zygotic genome occurring hours after fertilization, at a period known as the maternal-to-zygotic transition (MZT). In Drosophila, the MZT is preceded by the transcription of a small number of genes that initiate sex determination, patterning, and other early developmental processes; and the zinc-finger protein Zelda (ZLD) plays a key role in their transcriptional activation. To better understand the mechanisms of ZLD activation and the range of its targets, we used chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-Seq) to map regions bound by ZLD before (mitotic cycle 8), during (mitotic cycle 13), and after (late mitotic cycle 14) the MZT. Although only a handful of genes are transcribed prior to mitotic cycle 10, we identified thousands of regions bound by ZLD in cycle 8 embryos, most of which remain bound through mitotic cycle 14. As expected, early ZLD-bound regions include the promoters and enhancers of genes transcribed at this early stage. However, we also observed ZLD bound at cycle 8 to the promoters of roughly a thousand genes whose first transcription does not occur until the MZT and to virtually all of the thousands of known and presumed enhancers bound at cycle 14 by transcription factors that regulate patterned gene activation during the MZT. The association between early ZLD binding and MZT activity is so strong that ZLD binding alone can be used to identify active promoters and regulatory sequences with high specificity and selectivity. This strong early association of ZLD with regions not active until the MZT suggests that ZLD is not only required for the earliest wave of transcription but also plays a major role in activating the genome at the MZT
Prevalence of and interventions for sarcopenia in ageing adults: a systematic review. Report of the International Sarcopenia Initiative (EWGSOP and IWGS)
Objective: to examine the clinical evidence reporting the prevalence of sarcopenia and the effect of nutrition and exercise interventions from studies using the consensus definition of sarcopenia proposed by the European Working Group on Sarcopenia in Older People (EWGSOP).
Methods: PubMed and Dialog databases were searched (January 2000–October 2013) using pre-defined search terms. Prevalence studies and intervention studies investigating muscle mass plus strength or function outcome measures using the EWGSOP definition of sarcopenia, in well-defined populations of adults aged ≥50 years were selected.
Results: prevalence of sarcopenia was, with regional and age-related variations, 1–29% in community-dwelling populations, 14–33% in long-term care populations and 10% in the only acute hospital-care population examined. Moderate quality evidence suggests that exercise interventions improve muscle strength and physical performance. The results of nutrition interventions are equivocal due to the low number of studies and heterogeneous study design. Essential amino acid (EAA) supplements, including ∼2.5 g of leucine, and β-hydroxy β-methylbutyric acid (HMB) supplements, show some effects in improving muscle mass and function parameters. Protein supplements have not shown consistent benefits on muscle mass and function.
Conclusion: prevalence of sarcopenia is substantial in most geriatric settings. Well-designed, standardised studies evaluating exercise or nutrition interventions are needed before treatment guidelines can be developed. Physicians should screen for sarcopenia in both community and geriatric settings, with diagnosis based on muscle mass and function. Supervised resistance exercise is recommended for individuals with sarcopenia. EAA (with leucine) and HMB may improve muscle outcomes
In Vivo Control of CpG and Non-CpG DNA Methylation by DNA Methyltransferases
The enzymatic control of the setting and maintenance of symmetric and non-symmetric DNA methylation patterns in a particular genome context is not well understood. Here, we describe a comprehensive analysis of DNA methylation patterns generated by high resolution sequencing of hairpin-bisulfite amplicons of selected single copy genes and repetitive elements (LINE1, B1, IAP-LTR-retrotransposons, and major satellites). The analysis unambiguously identifies a substantial amount of regional incomplete methylation maintenance, i.e. hemimethylated CpG positions, with variant degrees among cell types. Moreover, non-CpG cytosine methylation is confined to ESCs and exclusively catalysed by Dnmt3a and Dnmt3b. This sequence position–, cell type–, and region-dependent non-CpG methylation is strongly linked to neighboring CpG methylation and requires the presence of Dnmt3L. The generation of a comprehensive data set of 146,000 CpG dyads was used to apply and develop parameter estimated hidden Markov models (HMM) to calculate the relative contribution of DNA methyltransferases (Dnmts) for de novo and maintenance DNA methylation. The comparative modelling included wild-type ESCs and mutant ESCs deficient for Dnmt1, Dnmt3a, Dnmt3b, or Dnmt3a/3b, respectively. The HMM analysis identifies a considerable de novo methylation activity for Dnmt1 at certain repetitive elements and single copy sequences. Dnmt3a and Dnmt3b contribute de novo function. However, both enzymes are also essential to maintain symmetrical CpG methylation at distinct repetitive and single copy sequences in ESCs
Quantitative Models of the Mechanisms That Control Genome-Wide Patterns of Transcription Factor Binding during Early Drosophila Development
Transcription factors that drive complex patterns of gene expression during animal development bind to thousands of genomic regions, with quantitative differences in binding across bound regions mediating their activity. While we now have tools to characterize the DNA affinities of these proteins and to precisely measure their genome-wide distribution in vivo, our understanding of the forces that determine where, when, and to what extent they bind remains primitive. Here we use a thermodynamic model of transcription factor binding to evaluate the contribution of different biophysical forces to the binding of five regulators of early embryonic anterior-posterior patterning in Drosophila melanogaster. Predictions based on DNA sequence and in vitro protein-DNA affinities alone achieve a correlation of ∼0.4 with experimental measurements of in vivo binding. Incorporating cooperativity and competition among the five factors, and accounting for spatial patterning by modeling binding in every nucleus independently, had little effect on prediction accuracy. A major source of error was the prediction of binding events that do not occur in vivo, which we hypothesized reflected reduced accessibility of chromatin. To test this, we incorporated experimental measurements of genome-wide DNA accessibility into our model, effectively restricting predicted binding to regions of open chromatin. This dramatically improved our predictions to a correlation of 0.6–0.9 for various factors across known target genes. Finally, we used our model to quantify the roles of DNA sequence, accessibility, and binding competition and cooperativity. Our results show that, in regions of open chromatin, binding can be predicted almost exclusively by the sequence specificity of individual factors, with a minimal role for protein interactions. We suggest that a combination of experimentally determined chromatin accessibility data and simple computational models of transcription factor binding may be used to predict the binding landscape of any animal transcription factor with significant precision
Skin infectome of patients with a tick bite history
IntroductionTicks are the most important obligate blood-feeding vectors of human pathogens. With the advance of high-throughput sequencing, more and more bacterial community and virome in tick has been reported, which seems to pose a great threat to people.MethodsA total of 14 skin specimens collected from tick-bite patients with mild to severe symptoms were analyzed through meta-transcriptomic sequencings.ResultsFour bacteria genera were both detected in the skins and ticks, including Pseudomonas, Acinetobacter, Corynebacterium and Propionibacterium, and three tick-associated viruses, Jingmen tick virus (JMTV), Bole tick virus 4 (BLTV4) and Deer tick mononegavirales-like virus (DTMV) were identified in the skin samples. Except of known pathogens such as pathogenic rickettsia, Coxiella burnetii and JMTV, we suggest Roseomonas cervicalis and BLTV4 as potential new agents amplified in the skins and then disseminated into the blood. As early as 1 day after a tick-bite, these pathogens can transmit to skins and at most four ones can co-infect in skins.DiscussionAdvances in sequencing technologies have revealed that the diversity of tick microbiome and virome goes far beyond our previous understanding. This report not only identifies three new potential pathogens in humans but also shows that the skin barrier is vital in preventing horizontal transmissions of tick-associated bacteria or virus communities to the host. It is the first research on patients’ skin infectome after a tick bite and demonstrates that more attention should be paid to the cutaneous response to prevent tick-borne illness
Robust estimation of bacterial cell count from optical density
Optical density (OD) is widely used to estimate the density of cells in liquid culture, but cannot be compared between instruments without a standardized calibration protocol and is challenging to relate to actual cell count. We address this with an interlaboratory study comparing three simple, low-cost, and highly accessible OD calibration protocols across 244 laboratories, applied to eight strains of constitutive GFP-expressing E. coli. Based on our results, we recommend calibrating OD to estimated cell count using serial dilution of silica microspheres, which produces highly precise calibration (95.5% of residuals <1.2-fold), is easily assessed for quality control, also assesses instrument effective linear range, and can be combined with fluorescence calibration to obtain units of Molecules of Equivalent Fluorescein (MEFL) per cell, allowing direct comparison and data fusion with flow cytometry measurements: in our study, fluorescence per cell measurements showed only a 1.07-fold mean difference between plate reader and flow cytometry data
- …