16 research outputs found

    MedEdit: Model Editing for Medical Question Answering with External Knowledge Bases

    Full text link
    Large Language Models (LLMs), although powerful in general domains, often perform poorly on domain-specific tasks like medical question answering (QA). Moreover, they tend to function as "black-boxes," making it challenging to modify their behavior. Addressing this, our study delves into model editing utilizing in-context learning, aiming to improve LLM responses without the need for fine-tuning or retraining. Specifically, we propose a comprehensive retrieval strategy to extract medical facts from an external knowledge base, and then we incorporate them into the query prompt for the LLM. Focusing on medical QA using the MedQA-SMILE dataset, we evaluate the impact of different retrieval models and the number of facts provided to the LLM. Notably, our edited Vicuna model exhibited an accuracy improvement from 44.46% to 48.54%. This work underscores the potential of model editing to enhance LLM performance, offering a practical approach to mitigate the challenges of black-box LLMs.Comment: 6 page

    A new approach for obtaining rapid uniformity in rice (Oryza sativa L.) via a 3x x 2x cross

    Get PDF
    A triploid (2n = 3x = 36) rice plant was obtained by screening a twin seedling population in which each seed germinated to two or three sprouts that were then crossed with diploid plants. One diploid plant was chosen among the various F1 progenies and developed into an F 2 population via self-pollination. Compared with the control variety Shanyou 63, this F 2 population had a stable agronomical performance in field trials, as confirmed by the F-test. The stability of the F 2 population was further substantiated by molecular analysis with simple sequence repeat markers. Specifically, of 160 markers assayed, 37 (covering all 12 chromosomes) were polymorphic between the parental lines. Testing the F 1 hybrid individually with these markers showed that each PCR product had only a single band instead of two bands from each parent. The bands were identical to either maternal (23 markers) or paternal (eight markers) bands or distinct from both parents (six markers). The amplified bands of all 60 randomly selected F 2 plants were uniform and identical to those of the F 1 hybrid. These results suggest that the F 1 plant is a non-segregating hybrid and that a stable F 2 population was obtained. This novel system provides an efficient means for shortening the cycle of hybrid rice seed production

    DeePMD-kit v2: A software package for Deep Potential models

    Full text link
    DeePMD-kit is a powerful open-source software package that facilitates molecular dynamics simulations using machine learning potentials (MLP) known as Deep Potential (DP) models. This package, which was released in 2017, has been widely used in the fields of physics, chemistry, biology, and material science for studying atomistic systems. The current version of DeePMD-kit offers numerous advanced features such as DeepPot-SE, attention-based and hybrid descriptors, the ability to fit tensile properties, type embedding, model deviation, Deep Potential - Range Correction (DPRc), Deep Potential Long Range (DPLR), GPU support for customized operators, model compression, non-von Neumann molecular dynamics (NVNMD), and improved usability, including documentation, compiled binary packages, graphical user interfaces (GUI), and application programming interfaces (API). This article presents an overview of the current major version of the DeePMD-kit package, highlighting its features and technical details. Additionally, the article benchmarks the accuracy and efficiency of different models and discusses ongoing developments.Comment: 51 pages, 2 figure

    Seismic Impedance Inversion Using a Joint Deep Learning Model Based on Convolutional Neural Network and Transformer

    No full text
    Seismic impedance is an important factor in characterizing reservoirs, so accurate seismic impedance inversion is significant in seismic exploration. However, achieving high-resolution impedance inversion has remained a complex problem due to challenges related to the unknown seismic wavelet and the frequency band limitations of the observed data. In recent years, deep learning methods such as convolutional neural network (CNN) have been successfully applied to the field of seismic impedance inversion, which can obtain higher resolution results compared with traditional inversion methods. However, limited by the size of the local receptive field, CNN is not conducive to extracting global information. In contrast, a transformer can efficiently extract long-range dependencies but relies entirely on the self-attention mechanism to compute correlations between data, which requires a lot of training data. Therefore, this article proposes a joint deep learning model based on CNN and a transformer for impedance inversion. Among them, CNN and transformer are used to learn local and global information in the data, respectively, and feature fusion through residual connection, which can improve the feature representation capability of neural networks. In addition, we train a CNN-based forward operator that can introduce information from unlabeled data into the network training to enhance the network's generalization ability and improve the stability of the inversion. Experimental results in the SEAM model and field data show that the method can predict impedance effectively and with better accuracy than classical constrained sparse spike inversion and conventional deep learning methods
    corecore