52 research outputs found

    QASnowball: An Iterative Bootstrapping Framework for High-Quality Question-Answering Data Generation

    Full text link
    Recent years have witnessed the success of question answering (QA), especially its potential to be a foundation paradigm for tackling diverse NLP tasks. However, obtaining sufficient data to build an effective and stable QA system still remains an open problem. For this problem, we introduce an iterative bootstrapping framework for QA data augmentation (named QASnowball), which can iteratively generate large-scale high-quality QA data based on a seed set of supervised examples. Specifically, QASnowball consists of three modules, an answer extractor to extract core phrases in unlabeled documents as candidate answers, a question generator to generate questions based on documents and candidate answers, and a QA data filter to filter out high-quality QA data. Moreover, QASnowball can be self-enhanced by reseeding the seed set to fine-tune itself in different iterations, leading to continual improvements in the generation quality. We conduct experiments in the high-resource English scenario and the medium-resource Chinese scenario, and the experimental results show that the data generated by QASnowball can facilitate QA models: (1) training models on the generated data achieves comparable results to using supervised data, and (2) pre-training on the generated data and fine-tuning on supervised data can achieve better performance. Our code and generated data will be released to advance further work

    On the Performance of RIS-Aided Spatial Scattering Modulation for mmWave Transmission

    Full text link
    In this paper, we investigate a state-of-the-art reconfigurable intelligent surface (RIS)-assisted spatial scattering modulation (SSM) scheme for millimeter-wave (mmWave) systems, where a more practical scenario that the RIS is near the transmitter while the receiver is far from RIS is considered. To this end, the line-of-sight (LoS) and non-LoS links are utilized in the transmitter-RIS and RIS-receiver channels, respectively. By employing the maximum likelihood detector at the receiver, the conditional pairwise error probability (CPEP) expression for the RIS-SSM scheme is derived under the two scenarios that the received beam demodulation is correct or not. Furthermore, the union upper bound of average bit error probability (ABEP) is obtained based on the CPEP expression. Finally, the derivation results are exhaustively validated by the Monte Carlo simulations.Comment: arXiv admin note: substantial text overlap with arXiv:2307.1466

    Exploring Format Consistency for Instruction Tuning

    Full text link
    Instruction tuning has emerged as a promising approach to enhancing large language models in following human instructions. It is shown that increasing the diversity and number of instructions in the training data can consistently enhance generalization performance, which facilitates a recent endeavor to collect various instructions and integrate existing instruction tuning datasets into larger collections. However, different users have their unique ways of expressing instructions, and there often exist variations across different datasets in the instruction styles and formats, i.e., format inconsistency. In this work, we study how format inconsistency may impact the performance of instruction tuning. We propose a framework called "Unified Instruction Tuning" (UIT), which calls OpenAI APIs for automatic format transfer among different instruction tuning datasets. We show that UIT successfully improves the generalization performance on unseen instructions, which highlights the importance of format consistency for instruction tuning. To make the UIT framework more practical, we further propose a novel perplexity-based denoising method to reduce the noise of automatic format transfer. We also train a smaller offline model that achieves comparable format transfer capability than OpenAI APIs to reduce costs in practice

    Adipose tissues of MPC1± mice display altered lipid metabolism-related enzyme expression levels

    Get PDF
    Mitochondrial pyruvate carrier 1 (MPC1) is a component of the MPC1/MPC2 heterodimer that facilitates the transport of pyruvate into mitochondria. Pyruvate plays a central role in carbohydrate, fatty, and amino acid catabolism. The present study examined epididymal white adipose tissue (eWAT) and intrascapular brown adipose tissue (iBAT) from MPC1± mice following 24 weeks of feeding, which indicated low energy accumulation as evidenced by low body and eWAT weight and adipocyte volume. To characterize molecular changes in energy metabolism, we analyzed the transcriptomes of the adipose tissues using RNA-Sequencing (RNA-Seq). The results showed that the fatty acid oxidation pathway was activated and several genes involved in this pathway were upregulated. Furthermore, qPCR and western blotting indicated that numerous genes and proteins that participate in lipolysis were also upregulated. Based on these findings, we propose that the energy deficiency caused by reduced MPC1 activity can be alleviated by activating the lipolytic pathway

    Risk factor analysis and construction of prediction models of gallbladder carcinoma in patients with gallstones

    Get PDF
    BackgroundGallbladder carcinoma (GBC) is a biliary tract tumor with a high mortality rate. The objectives of this study were to explore the risk factors of GBC in patients with gallstones and to establish effective screening indicators.MethodsA total of 588 patients from medical centers in two different regions of China were included in this study and defined as the internal test samples and the external validation samples, respectively. We retrospectively reviewed the differences in clinicopathologic data of the internal test samples to find the independent risk factors that affect the occurrence of GBC. Then, we constructed three different combined predictive factors (CPFs) through the weighting method, integral system, and nomogram, respectively, and named them CPF-A, CPF-B, and CPF-C sequentially. Furthermore, we evaluated these indicators through calibration and DCA curves. The ROC curve was used to analyze their diagnostic efficiency. Finally, their diagnostic capabilities were validated in the external validation samples.ResultsIn the internal test samples, the results showed that five factors, namely, age (RR = 3.077, 95% CI: 1.731-5.496), size of gallstones (RR = 13.732, 95% CI: 5.937-31.762), course of gallstones (RR = 2.438, 95% CI: 1.350-4.403), CEA (RR = 9.464, 95% CI: 3.394-26.392), and CA199 (RR = 9.605, 95% CI: 4.512-20.446), were independent risk factors for GBC in patients with gallstones. Then, we established three predictive indicators: CPF-A, CPF-B, and CPF-C. These models were further validated using bootstrapping with 1,000 repetitions. Calibration and decision curve analysis showed that the three models fit well. Meanwhile, multivariate analysis showed that CPF-B and CPF-C were independent risk factors for GBC in patients with gallstones. In addition, the validation results of the external validation samples are essentially consistent with the internal test samples.ConclusionAge (≤58.5 vs. >58.5 years), size of gallstones (≤1.95 vs. >1.95cm), course of gallstones (≤10 vs. >10 years), CEA (≤5 vs. >5 ng/ml), and CA199 (≤37 vs. >37 U/ml) are independent risk factors for GBC in patients with gallstones. When positive indicators were ≥2 among the five independent risk factors or the score of the nomogram was >82.64, the risk of GBC was high in gallstone patients

    A review of the extraction and purification methods, biological activities, and applications of active compounds in Acanthopanax senticosus

    Get PDF
    Acanthopanax senticosus (AS) is a geo-authentic crude medicinal plant that grows in China, Korea, Russia, and Japan. AS contains bioactive compounds such as eleutherosides, polysaccharides, and flavonoids. It is also a key traditional herb in the Red List of Chinese Species. AS is mainly distributed in Northeast China, specifically in Heilongjiang, Jilin, and Liaoning provinces. Its active compounds contribute to significant biological activities, including neuroprotective, antioxidant, anti-fatigue, and antitumor effects. However, the extraction methods of active compounds are complex, the extraction efficiency is poor, and the structure–activity relationship is unclear. This study focused on the nutrients in AS, including protein, carbohydrates, and lipids. Particularly, the active ingredients (eleutherosides, polysaccharides, and flavonoids) in AS and their extraction and purification methods were analyzed and summarized. The biological activities of extracts have been reviewed, and the mechanisms of anti-oxidation, antitumor, anti-inflammation, and other activities are introduced in detail. The applications of AS in various domains, such as health foods, medicines, and animal dietary supplements, are then reported. Compared with other extraction methods, ultrasonic or microwave extraction improves efficiency, yet they can damage structures. Challenges arise in the recovery of solvents and in achieving extraction efficiency when using green solvents, such as deep eutectic solvents. Improvements can be made by combining extraction methods and controlling conditions (power, temperature, and time). Bioactive molecules and related activities are exposited clearly. The applications of AS have not been widely popularized, and the corresponding functions require further development

    Recent trends in extraction, purification, structural characterization, and biological activities evaluation of Perilla frutescens (L.) Britton polysaccharide

    Get PDF
    Perilla frutescens (L.) Britton is an annual herb plant of the Perilla genus in the Labiatae family, which is commonly utilized as an edible and medicinal resource. Polysaccharides are among the major components and essential bioactive compounds of P. frutescens, which exhibit a multitude of biological activities, including antioxidant, antitumor, anti-fatigue, immunoregulation, hepatoprotective, anti-inflammatory, and lipid-lowering effects. As a natural carbohydrate, P. frutescens polysaccharide has the potential to be utilized in the development of drugs and functional materials. In this paper, we provide an overview of progress made on the extraction, purification, structural characterization, and bioactivity of polysaccharides from different parts of P. frutescens. The challenges and opportunities for research are discussed, along with the potential development prospects and future areas of focus in the study of P. frutescens polysaccharides

    Procalcitonin as a marker of sepsis and outcome in patients with neurotrauma: an observation study

    Get PDF
    BACKGROUND: Procalcitonin (PCT) is a reliable biomarker of sepsis and infection. The level of PCT associated with sepsis and infection in patients with traumatic brain injury is currently unknown. The purpose of this study was to investigate the value of PCT and C-reactive protein (CRP) as diagnostic markers of sepsis and to evaluate the prognostic value of these markers related to the severity of injury, sepsis and mortality. METHODS: 105 adult patients with neurotrauma were enrolled in this study from June 2011 to February 2013. PCT and CRP were measured at admission and 2, 3, 5 and 7 days after admission. The sepsis criteria established by American College of Chest Physicians /Society of Critical Care Medicine Consensus Conference were used to identify patients. Injury Severity Score (ISS) and Glasgow Coma Score (GCS) were used to assess the severity of the injury. All these patients were monitored for 28 days. RESULTS: At admission, the median level of PCT was consistent with the severity of brain injury as follows: mild 0.08 ng/ml (0.05 - 0.13), moderate 0.25 ng/ml (0.11 - 0.55) and severe 0.31 ng/ml (0.17 - 0.79), but the range of CRP levels varied greatly within the given severity of brain injury. Seventy-one (67.6%) patients developed sepsis. The initial levels of PCT at admission were statistically higher in patients with sepsis, compared with patients with systemic inflammatory response syndrome (SIRS), but there were no differences in the initial concentration of CRP between sepsis and SIRS. After adjusting for these parameters, multivariate logistic regression analysis revealed that PCT was an independent risk factor for septic complications (p < 0.05). The areas under the ROCs at admission for the prediction of mortality were 0.76 (p < 0.05) and 0.733 for PCT and CRP, respectively. CONCLUSIONS: Increased levels of PCT during the course of the ICU stay could be an important indicator for the early diagnosis of sepsis after neurotrauma. In addition, high serum levels of PCT in patients with neurotrauma at admission indicate an increased risk of septic complications, and the daily measurement of PCT assists in guiding antibiotic therapy in neurotrauma patients

    Strip calculations for ship oscillation coupled response in regular waves

    No full text
    [Objectives] The strip method is widely used in the sea-keeping design of ships, but the hydrodynamics are only evaluated for the mean-hull position,so heaving,pitching and rolling motions are not essentially coupled. [Methods] For the effective coupling of hull heaving,pitching and rolling motions,based on the extensive pitch angle and increased draught,and the analytical expression of the instantaneous wave surface equation under the hull coordinate system,the calculation formula of pressure distribution under the wave surface is amended under the condition that the pressure at the wave surface is zero(Smith effect). Based on the wave surface equation and pressure distribution correction formula,the calculation method for obtaining the hydrostatic force on hull sections under an instantaneous wave surface and Froude-Krylov wave excitation force is given. Inertial hydrodynamic force and damping force are calculated by empirical formulations. As such,the heaving,pitching and rolling coupling dynamic equations are derived via the time variants of the coefficients,and calculation software is developed on the basis of surface area computing technology of AutoCAD.[Results] The simulation results show very clear characters with the linear method on small wave height,the rolling amplitude frequency response very evidently shows non-linear effects for heavy seas,and the rolling-yawing can also be seen in the wave direction in the resonance region.[Conclusions] This approach can be useful for predicting sea-keeping performance in heavy seas,and the developed software may be used for evaluating sea-keeping hull forms
    • …
    corecore