36 research outputs found

    A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation

    Full text link
    Generating coherent and credible explanations remains a significant challenge in the field of AI. In recent years, researchers have delved into the utilization of entailment trees to depict explanations, which exhibit a reasoning process of how a hypothesis is deduced from the supporting facts. However, existing models often overlook the importance of generating intermediate conclusions with logical consistency from the given facts, leading to inaccurate conclusions and undermining the overall credibility of entailment trees. To address this limitation, we propose the logical pattern memory pre-trained model (LMPM). LMPM incorporates an external memory structure to learn and store the latent representations of logical patterns, which aids in generating logically consistent conclusions. Furthermore, to mitigate the influence of logically irrelevant domain knowledge in the Wikipedia-based data, we introduce an entity abstraction approach to construct the dataset for pre-training LMPM. The experimental results highlight the effectiveness of our approach in improving the quality of entailment tree generation. By leveraging logical entailment patterns, our model produces more coherent and reasonable conclusions that closely align with the underlying premises. Code and Data are released at https://github.com/YuanLi95/T5-LMPMComment: Accepted By Coling 202

    A network‐based variable selection approach for identification of modules and biomarker genes associated with end‐stage kidney disease

    Full text link
    AimsIntervention for end‐stage kidney disease (ESKD), which is associated with adverse prognoses and major economic burdens, is challenging due to its complex pathogenesis. The study was performed to identify biomarker genes and molecular mechanisms for ESKD by bioinformatics approach.MethodsUsing the Gene Expression Omnibus dataset GSE37171, this study identified pathways and genomic biomarkers associated with ESKD via a multi‐stage knowledge discovery process, including identification of modules of genes by weighted gene co‐expression network analysis, discovery of important involved pathways by Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses, selection of differentially expressed genes by the empirical Bayes method, and screening biomarker genes by the least absolute shrinkage and selection operator (Lasso) logistic regression. The results were validated using GSE70528, an independent testing dataset.ResultsThree clinically important gene modules associated with ESKD, were identified by weighted gene co‐expression network analysis. Within these modules, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses revealed important biological pathways involved in ESKD, including transforming growth factor‐β and Wnt signalling, RNA‐splicing, autophagy and chromatin and histone modification. Furthermore, Lasso logistic regression was conducted to identify five final genes, namely, CNOT8, MST4, PPP2CB, PCSK7 and RBBP4 that are differentially expressed and associated with ESKD. The accuracy of the final model in distinguishing the ESKD cases and controls was 96.8% and 91.7% in the training and validation datasets, respectively.ConclusionNetwork‐based variable selection approaches can identify biological pathways and biomarker genes associated with ESKD. The findings may inform more in‐depth follow‐up research and effective therapy.SUMMARY AT A GLANCEThis gene–gene network analysis to identify genes associated with end‐stage renal disease is an important step, albeit early, towards the discovery of biomarkers using peripheral blood cells. The findings also provide insight on disease pathophysiology at the molecular level, and hence therapeutic targets for future research.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/162799/2/nep13655.pdfhttp://deepblue.lib.umich.edu/bitstream/2027.42/162799/1/nep13655_am.pd

    GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech

    Full text link
    Cross-lingual timbre and style generalizable text-to-speech (TTS) aims to synthesize speech with a specific reference timbre or style that is never trained in the target language. It encounters the following challenges: 1) timbre and pronunciation are correlated since multilingual speech of a specific speaker is usually hard to obtain; 2) style and pronunciation are mixed because the speech style contains language-agnostic and language-specific parts. To address these challenges, we propose GenerTTS, which mainly includes the following works: 1) we elaborately design a HuBERT-based information bottleneck to disentangle timbre and pronunciation/style; 2) we minimize the mutual information between style and language to discard the language-specific information in the style embedding. The experiments indicate that GenerTTS outperforms baseline systems in terms of style similarity and pronunciation accuracy, and enables cross-lingual timbre and style generalization.Comment: Accepted by INTERSPEECH 202

    Through the Lens of Core Competency: Survey on Evaluation of Large Language Models

    Full text link
    From pre-trained language model (PLM) to large language model (LLM), the field of natural language processing (NLP) has witnessed steep performance gains and wide practical uses. The evaluation of a research field guides its direction of improvement. However, LLMs are extremely hard to thoroughly evaluate for two reasons. First of all, traditional NLP tasks become inadequate due to the excellent performance of LLM. Secondly, existing evaluation tasks are difficult to keep up with the wide range of applications in real-world scenarios. To tackle these problems, existing works proposed various benchmarks to better evaluate LLMs. To clarify the numerous evaluation tasks in both academia and industry, we investigate multiple papers concerning LLM evaluations. We summarize 4 core competencies of LLM, including reasoning, knowledge, reliability, and safety. For every competency, we introduce its definition, corresponding benchmarks, and metrics. Under this competency architecture, similar tasks are combined to reflect corresponding ability, while new tasks can also be easily added into the system. Finally, we give our suggestions on the future direction of LLM's evaluation

    Genetic characteristics of common variable immunodeficiency patients with autoimmunity

    Get PDF
    Background: The pathogenesis of common variable immunodeficiency disorder (CVID) is complex, especially when combined with autoimmunity. Genetic factors may be potential explanations for this complex situation, and whole genome sequencing (WGS) provide the basis for this potential.Methods: Genetic information of patients with CVID with autoimmunity, together with their first-degree relatives, was collected through WGS. The association between genetic factors and clinical phenotypes was studied using genetic analysis strategies such as sporadic and pedigree.Results: We collected 42 blood samples for WGS (16 CVID patients and 26 first-degree relatives of healthy controls). Through pedigree, sporadic screening strategies and low-frequency deleterious screening of rare diseases, we obtained 9,148 mutation sites, including 8,171 single-nucleotide variants (SNVs) and 977 Insertion-deletions (InDels). Finally, we obtained a total of 28 candidate genes (32 loci), of which the most common mutant was LRBA. The most common autoimmunity in the 16 patients was systematic lupus erythematosis. Through KEGG pathway enrichment, we identified the top ten signaling pathways, including “primary immunodeficiency”, “JAK-STAT signaling pathway”, and “T-cell receptor signaling pathway”. We used PyMOL to predict and analyse the three-dimensional protein structures of the NFKB1, RAG1, TIRAP, NCF2, and MYB genes. In addition, we constructed a PPI network by combining candidate mutants with genes associated with CVID in the OMIM database via the STRING database.Conclusion: The genetic background of CVID includes not only monogenic origins but also oligogenic effects. Our study showed that immunodeficiency and autoimmunity may overlap in genetic backgrounds.Clinical Trial Registration: identifier ChiCTR210004403

    Aspect-Opinion Sentiment Alignment for Cross-Domain Sentiment Analysis (Student Abstract)

    No full text
    Cross-domain sentiment analysis (SA) has recently attracted significant attention, which can effectively alleviate the problem of lacking large-scale labeled data for deep neural network based methods. However, exiting unsupervised cross-domain SA models ignore the relation between the aspect and opinion, which suffer from the sentiment transfer error problem. To solve this problem, we propose an aspect-opinion sentiment alignment SA model and extensive experiments are conducted to evaluate the effectiveness of our model

    Refined Beam Theory for Geometrically Nonlinear Pre-Twisted Structures

    No full text
    This paper proposes a novel fully nonlinear refined beam element for pre-twisted structures undergoing large deformation and finite untwisting. The present model is constructed in the twisted basis to account for the effects of geometrical nonlinearity and initial twist. Cross-sectional deformation is allowed by introducing Lagrange polynomials in the framework of a Carrera unified formulation. The principle of virtual work is applied to obtain the Green–Lagrange strain tensor and second Piola–Kirchhoff stress tensor. In the nonlinear governing formulation, expressions are given for secant and tangent matrices with linear, nonlinear, and geometrically stiffening contributions. The developed beam model could detect the coupled axial, torsional, and flexure deformations, as well as the local deformations around the point of application of the force. The maximum difference between the present deformation results and those of shell/solid finite element simulations is 6%. Compared to traditional beam theories and finite element models, the proposed method significantly reduces the computational complexity and cost by implementing constant beam elements in the twisted basis

    Refined Beam Theory for Geometrically Nonlinear Pre-Twisted Structures

    No full text
    This paper proposes a novel fully nonlinear refined beam element for pre-twisted structures undergoing large deformation and finite untwisting. The present model is constructed in the twisted basis to account for the effects of geometrical nonlinearity and initial twist. Cross-sectional deformation is allowed by introducing Lagrange polynomials in the framework of a Carrera unified formulation. The principle of virtual work is applied to obtain the Green–Lagrange strain tensor and second Piola–Kirchhoff stress tensor. In the nonlinear governing formulation, expressions are given for secant and tangent matrices with linear, nonlinear, and geometrically stiffening contributions. The developed beam model could detect the coupled axial, torsional, and flexure deformations, as well as the local deformations around the point of application of the force. The maximum difference between the present deformation results and those of shell/solid finite element simulations is 6%. Compared to traditional beam theories and finite element models, the proposed method significantly reduces the computational complexity and cost by implementing constant beam elements in the twisted basis

    Control Strategy of Doubly-Fed Induction Generator under Zero Voltage Fault of Power Grid

    No full text
    For improving the zero-voltage ride through the capability of a doubly fed induction generator in high proportion new energy grid in extreme faults, a coordinated control scheme of hardware and optimal control strategy is proposed. A high-temperature superconductive-fault current limiter suppresses stator fault current, adaptive virtual impedance control and active dynamic reactive power support control act on the back-to-back converter of wind turbines as optimal control strategies. Optimizing the control strategy without changing the controller structure is beneficial to engineering implementation. After mathematical derivation and simulation verification, the coordinated control strategy adopted in this paper can effectively avoid the rotor current and voltage exceeding the limit when the wind turbine is facing extreme faults, actively provide reactive power support for the busbar, realize zero voltage ride through and reduce the risk of high voltage failure at the point of failure. The control effect is obviously better than the traditional virtual impedance control

    Enhance Cross-Domain Aspect-Based Sentiment Analysis by Incorporating Commonsense Relational Structure (Student Abstract)

    No full text
    Aspect Based Sentiment Analysis (ABSA) aims to extract aspect terms and identify the sentiment polarities towards each extracted aspect term. Currently, syntactic information is seen as the bridge for the domain adaptation and achieves remarkable performance. However, the transferable syntactic knowledge is complex and diverse, which causes the transfer error problem in domain adaptation. In our paper, we propose a domain-shared relational structure incorporated cross-domain ABSA model. The experimental results show the effectiveness of our model
    corecore