655 research outputs found

    Improving Retrieval-Augmented Large Language Models via Data Importance Learning

    Full text link
    Retrieval augmentation enables large language models to take advantage of external knowledge, for example on tasks like question answering and data imputation. However, the performance of such retrieval-augmented models is limited by the data quality of their underlying retrieval corpus. In this paper, we propose an algorithm based on multilinear extension for evaluating the data importance of retrieved data points. There are exponentially many terms in the multilinear extension, and one key contribution of this paper is a polynomial time algorithm that computes exactly, given a retrieval-augmented model with an additive utility function and a validation set, the data importance of data points in the retrieval corpus using the multilinear extension of the model's utility function. We further proposed an even more efficient ({\epsilon}, {\delta})-approximation algorithm. Our experimental results illustrate that we can enhance the performance of large language models by only pruning or reweighting the retrieval corpus, without requiring further training. For some tasks, this even allows a small model (e.g., GPT-JT), augmented with a search engine API, to outperform GPT-3.5 (without retrieval augmentation). Moreover, we show that weights based on multilinear extension can be computed efficiently in practice (e.g., in less than ten minutes for a corpus with 100 million elements)

    HL-1 cells express an inwardly rectifying K+ current activated via muscarinic receptors comparable to that in mouse atrial myocytes

    Get PDF
    An inwardly rectifying K^+ current is present in atrial cardiac myocytes that is activated by acetylcholine (I_{KACh}). Physiologically, activation of the current in the SA node is important in slowing the heart rate with increased parasympathetic tone. It is a paradigm for the direct regulation of signaling effectors by the Gβγ G-protein subunit. Many questions have been addressed in heterologous expression systems with less focus on the behaviour in native myocytes partly because of the technical difficulties in undertaking comparable studies in native cells. In this study, we characterise a potassium current in the atrial-derived cell line HL-1. Using an electrophysiological approach, we compare the characteristics of the potassium current with those in native atrial cells and in a HEK cell line expressing the cloned Kir3.1/3.4 channel. The potassium current recorded in HL-1 is inwardly rectifying and activated by the muscarinic agonist carbachol. Carbachol-activated currents were inhibited by pertussis toxin and tertiapin-Q. The basal current was time-dependently increased when GTP was substituted in the patch-clamp pipette by the non-hydrolysable analogue GTPγS. We compared the kinetics of current modulation in HL-1 with those of freshly isolated atrial mouse cardiomyocytes. The current activation and deactivation kinetics in HL-1 cells are comparable to those measured in atrial cardiomyocytes. Using immunofluorescence, we found GIRK4 at the membrane in HL-1 cells. Real-time RT-PCR confirms the presence of mRNA for the main G-protein subunits, as well as for M2 muscarinic and A1 adenosine receptors. The data suggest HL-1 cells are a good model to study IKAch

    A Novel Unsupervised Method to Identify Genes Important in the Anti-viral Response: Application to Interferon/Ribavirin in Hepatitis C Patients

    Get PDF
    Background: Treating hepatitis C with interferon/ribavirin results in a varied response in terms of decrease in viral titer and ultimate outcome. Marked responders have a sharp decline in viral titer within a few days of treatment initiation, whereas in other patients there is no effect on the virus (poor responders). Previous studies have shown that combination therapy modifies expression of hundreds of genes in vitro and in vivo. However, identifying which, if any, of these genes have a role in viral clearance remains challenging. Aims: The goal of this paper is to link viral levels with gene expression and thereby identify genes that may be responsible for early decrease in viral titer. Methods: Microarrays were performed on RNA isolated from PBMC of patients undergoing interferon/ribavirin therapy. Samples were collected at pre-treatment (day 0), and 1, 2, 7, 14 and 28 days after initiating treatment. A novel method was applied to identify genes that are linked to a decrease in viral titer during interferon/ribavirin treatment. The method uses the relationship between inter-patient gene expression based proximities and inter-patient viral titer based proximities to define the association between microarray gene expression measurements of each gene and viral-titer measurements. Results: We detected 36 unique genes whose expressions provide a clustering of patients that resembles viral titer based clustering of patients. These genes include IRF7, MX1, OASL and OAS2, viperin and many ISG's of unknown function. Conclusion: The genes identified by this method appear to play a major role in the reduction of hepatitis C virus during the early phase of treatment. The method has broad utility and can be used to analyze response to any group of factors influencing biological outcome such as antiviral drugs or anti-cancer agents where microarray data are available. © 2007 Brodsky et al

    Extended 2D myotube culture recapitulates postnatal fibre type plasticity

    Get PDF
    Background: The traditional problems of performing skeletal muscle cell cultures derived from mammalian or avian species are limited myotube differentiation, and transient myotube persistence which greatly restricts the ability of myotubes to undergo phenotypic maturation. We report here on a major technical breakthrough in the establishment of a simple and effective method of extended porcine myotube cultures (beyond 50 days) in two-dimension (2D) that recapitulates key features of postnatal fibre types. Results: Primary porcine muscle satellite cells (myoblasts) were isolated from the longissimus dorsi of 4 to 6 weeks old pigs for 2D cultures to optimise myotube formation, improve surface adherence and characterise myotube maturation. Over 95 % of isolated cells were myoblasts as evidenced by the expression of Pax3 and Pax7. Our relatively simple approach, based on modifications of existing surface coating reagents (Maxgel), and of proliferation and differentiation (Ultroser G) media, typically achieved by 5 days of differentiation fusion index of around 80 % manifested in an abundance of discrete myosin heavy chain (MyHC) slow and fast myotubes. There was little deterioration in myotube viability over 50 days, and the efficiency of myotube formation was maintained over seven myoblast passages. Regular spontaneous contractions of myotubes were frequently observed throughout culture. Myotubes in extended cultures were able to undergo phenotypic adaptation in response to different culture media, including the adoption of a dominant postnatal phenotype of fast-glycolytic MyHC 2x and 2b expression by about day 20 of differentiation. Furthermore, fast-glycolytic myotubes coincided with enhanced expression of the putative porcine long intergenic non-coding RNA (linc-MYH), which has recently been shown to be a key coordinator of MyHC 2b expression in vivo. Conclusions: Our revised culture protocol allows the efficient differentiation and fusion of porcine myoblasts into myotubes and their prolonged adherence to the culture surface. Furthermore, we are able to recapitulate in 2D the maturation process of myotubes to resemble postnatal fibre types which represent a major technical advance in opening access to the in vitro study of coordinated postnatal muscle gene expression

    Stoichiometry of HLA Class II-Invariant Chain Oligomers

    Get PDF
    BACKGROUND: The HLA gene complex encodes three class II isotypes, DR, DQ, and DP. HLA class II molecules are peptide receptors that present antigens for recognition by T lymphocytes. In antigen presenting cells, the assembly of matched α and β subunits to heterodimers is chaperoned by invariant chain (Ii). Ii forms a homotrimer with three binding sites for class II heterodimers. The current model of class II and Ii structure states that three αβ heterodimers bind to an Ii trimer. METHODOLOGY/PRINCIPAL FINDINGS: [corrected] We have now analyzed the composition and size of the complexes of class II and Ii using epitope tagged class II subunits and density gradient experiments. We show here that class II-Ii oligomers consist of one class II heterodimer associated with one Ii trimer, such that the DR, DQ and DP isotypes are contained within separate complexes with Ii. CONCLUSION/SIGNIFICANCE: We propose a structural model of the class II-Ii oligomer and speculate that the pentameric class II-Ii complex is bent towards the cell membrane, inhibiting the binding of additional class II heterodimers to Ii

    Profound Effect of Profiling Platform and Normalization Strategy on Detection of Differentially Expressed MicroRNAs – A Comparative Study

    Get PDF
    Adequate normalization minimizes the effects of systematic technical variations and is a prerequisite for getting meaningful biological changes. However, there is inconsistency about miRNA normalization performances and recommendations. Thus, we investigated the impact of seven different normalization methods (reference gene index, global geometric mean, quantile, invariant selection, loess, loessM, and generalized procrustes analysis) on intra- and inter-platform performance of two distinct and commonly used miRNA profiling platforms.We included data from miRNA profiling analyses derived from a hybridization-based platform (Agilent Technologies) and an RT-qPCR platform (Applied Biosystems). Furthermore, we validated a subset of miRNAs by individual RT-qPCR assays. Our analyses incorporated data from the effect of differentiation and tumor necrosis factor alpha treatment on primary human skeletal muscle cells and a murine skeletal muscle cell line. Distinct normalization methods differed in their impact on (i) standard deviations, (ii) the area under the receiver operating characteristic (ROC) curve, (iii) the similarity of differential expression. Loess, loessM, and quantile analysis were most effective in minimizing standard deviations on the Agilent and TLDA platform. Moreover, loess, loessM, invariant selection and generalized procrustes analysis increased the area under the ROC curve, a measure for the statistical performance of a test. The Jaccard index revealed that inter-platform concordance of differential expression tended to be increased by loess, loessM, quantile, and GPA normalization of AGL and TLDA data as well as RGI normalization of TLDA data.We recommend the application of loess, or loessM, and GPA normalization for miRNA Agilent arrays and qPCR cards as these normalization approaches showed to (i) effectively reduce standard deviations, (ii) increase sensitivity and accuracy of differential miRNA expression detection as well as (iii) increase inter-platform concordance. Results showed the successful adoption of loessM and generalized procrustes analysis to one-color miRNA profiling experiments

    Assessment of Metabolic Phenotypes in Patients with Non-ischemic Dilated Cardiomyopathy Undergoing Cardiac Resynchronization Therapy

    Get PDF
    Studies of myocardial metabolism have reported that contractile performance at a given myocardial oxygen consumption (MVO2) can be lower when the heart is oxidizing fatty acids rather than glucose or lactate. The objective of this study is to assess the prognostic value of myocardial metabolic phenotypes in identifying non-responders among non-ischemic dilated cardiomyopathy (NIDCM) patients undergoing cardiac resynchronization therapy (CRT). Arterial and coronary sinus plasma concentrations of oxygen, glucose, lactate, pyruvate, free fatty acids (FFA), and 22 amino acids were obtained from 19 male and 2 female patients (mean age 56 ± 16) with NIDCM undergoing CRT. Metabolite fluxes/MVO2 and extraction fractions were calculated. Flux balance analysis (FBA) was performed with MetaFluxNet 1.8 on a metabolic network of the cardiac mitochondria (189 reactions, 230 metabolites) reconstructed from mitochondrial proteomic data (615 proteins) from human heart tissue. Non-responders based on left ventricular ejection fraction (LVEF) demonstrated a greater mean FFA extraction fraction (35% ± 17%) than responders [18 ± 10%, p = 0.0098, area under the estimated ROC curve (AUC) was 0.8238, S.E. 0.1115]. Calculated adenosine triphosphate (ATP)/MVO2 using FBA correlated with change in New York Heart Association (NYHA) class (rho = 0.63, p = 0.0298; AUC = 0.8381, S.E. 0.1316). Non-responders based on both LVEF and NYHA demonstrated a greater mean FFA uptake/MVO2 (0.115 ± 0.112) than responders (0.034 ± 0.030, p = 0.0171; AUC = 0.8593, S.E. 0.0965). Myocardial FFA flux and calculated maximal ATP synthesis flux using FBA may be helpful as biomarkers in identifying non-responders among NIDCM patients undergoing CRT

    Dengue Virus Capsid Protein Binds Core Histones and Inhibits Nucleosome Formation in Human Liver Cells

    Get PDF
    Dengue virus (DENV) is a member of the Flaviviridae and a globally (re)emerging pathogen that causes serious human disease. There is no specific antiviral or vaccine for dengue virus infection. Flavivirus capsid (C) is a structural protein responsible for gathering the viral RNA into a nucleocapsid that forms the core of a mature virus particle. Flaviviral replication is known to occur in the cytoplasm yet a large portion of capsid protein localizes to the nucleus during infection. The reasons for the nuclear presences of capsid are not completely understood. Here, we expressed mature DENV C in a tandem affinity purification assay to identify potential binding partners in human liver cells. DENV C targeted the four core histones, H2A, H2B, H3 and H4. DENV C bound recombinant histones in solution and colocalized with histones in the nucleus and cytoplasm of liver cells during DENV infection. We show that DENV C acts as a histone mimic, forming heterodimers with core histones, binding DNA and disrupting nucleosome formation. We also demonstrate that DENV infection increases the amounts of core histones in livers cells, which may be a cellular response to C binding away the histone proteins. Infection with DENV additionally alters levels of H2A phosphorylation in a time-dependent manner. The interactions of C and histones add an interesting new role for the presence of C in the nucleus during DENV infection

    High-resolution analysis of multi-copy variant surface glycoprotein gene expression sites in African trypanosomes

    Get PDF
    BACKGROUND: African trypanosomes cause lethal diseases in humans and animals and escape host immune attack by switching the expression of Variant Surface Glycoprotein (VSG) genes. The expressed VSGs are located at the ends of telomeric, polycistronic transcription units known as VSG expression sites (VSG-ESs). Each cell has many VSG-ESs but only one is transcribed in bloodstream-form parasites and all of them are inactive upon transmission to the insect vector mid-gut; a subset of monocistronic metacyclic VSG-ESs are then activated in the insect salivary gland. Deep-sequence analyses have been informative but assigning sequences to individual VSG-ESs has been challenging because they each contain closely related expression-site associated genes, or ESAGs, thought to contribute to virulence. RESULTS: We utilised ART, an in silico short read simulator to demonstrate the feasibility of accurately aligning reads to VSG-ESs. Then, using high-resolution transcriptomes from isogenic bloodstream and insect-stage Lister 427 Trypanosoma brucei, we uncover increased abundance in the insect mid-gut stage of mRNAs from metacyclic VSG-ESs and of mRNAs from the unusual ESAG, ESAG10. Further, we show that the silencing associated with allelic exclusion involves repression focussed at the ends of the VSG-ESs. We also use the approach to report relative fitness costs following ESAG RNAi from a genome-scale screen. CONCLUSIONS: By assigning sequences to individual VSG-ESs we provide new insights into VSG-ES transcription control, allelic exclusion and impacts on fitness. Thus, deeper insights into the expression and function of regulated multi-gene families are more accessible than previously anticipated. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-3154-8) contains supplementary material, which is available to authorized users
    • …
    corecore