36 research outputs found
A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation
Generating coherent and credible explanations remains a significant challenge
in the field of AI. In recent years, researchers have delved into the
utilization of entailment trees to depict explanations, which exhibit a
reasoning process of how a hypothesis is deduced from the supporting facts.
However, existing models often overlook the importance of generating
intermediate conclusions with logical consistency from the given facts, leading
to inaccurate conclusions and undermining the overall credibility of entailment
trees. To address this limitation, we propose the logical pattern memory
pre-trained model (LMPM). LMPM incorporates an external memory structure to
learn and store the latent representations of logical patterns, which aids in
generating logically consistent conclusions. Furthermore, to mitigate the
influence of logically irrelevant domain knowledge in the Wikipedia-based data,
we introduce an entity abstraction approach to construct the dataset for
pre-training LMPM. The experimental results highlight the effectiveness of our
approach in improving the quality of entailment tree generation. By leveraging
logical entailment patterns, our model produces more coherent and reasonable
conclusions that closely align with the underlying premises. Code and Data are
released at https://github.com/YuanLi95/T5-LMPMComment: Accepted By Coling 202
A networkâbased variable selection approach for identification of modules and biomarker genes associated with endâstage kidney disease
AimsIntervention for endâstage kidney disease (ESKD), which is associated with adverse prognoses and major economic burdens, is challenging due to its complex pathogenesis. The study was performed to identify biomarker genes and molecular mechanisms for ESKD by bioinformatics approach.MethodsUsing the Gene Expression Omnibus dataset GSE37171, this study identified pathways and genomic biomarkers associated with ESKD via a multiâstage knowledge discovery process, including identification of modules of genes by weighted gene coâexpression network analysis, discovery of important involved pathways by Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses, selection of differentially expressed genes by the empirical Bayes method, and screening biomarker genes by the least absolute shrinkage and selection operator (Lasso) logistic regression. The results were validated using GSE70528, an independent testing dataset.ResultsThree clinically important gene modules associated with ESKD, were identified by weighted gene coâexpression network analysis. Within these modules, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses revealed important biological pathways involved in ESKD, including transforming growth factorâβ and Wnt signalling, RNAâsplicing, autophagy and chromatin and histone modification. Furthermore, Lasso logistic regression was conducted to identify five final genes, namely, CNOT8, MST4, PPP2CB, PCSK7 and RBBP4 that are differentially expressed and associated with ESKD. The accuracy of the final model in distinguishing the ESKD cases and controls was 96.8% and 91.7% in the training and validation datasets, respectively.ConclusionNetworkâbased variable selection approaches can identify biological pathways and biomarker genes associated with ESKD. The findings may inform more inâdepth followâup research and effective therapy.SUMMARY AT A GLANCEThis geneâgene network analysis to identify genes associated with endâstage renal disease is an important step, albeit early, towards the discovery of biomarkers using peripheral blood cells. The findings also provide insight on disease pathophysiology at the molecular level, and hence therapeutic targets for future research.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/162799/2/nep13655.pdfhttp://deepblue.lib.umich.edu/bitstream/2027.42/162799/1/nep13655_am.pd
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech
Cross-lingual timbre and style generalizable text-to-speech (TTS) aims to
synthesize speech with a specific reference timbre or style that is never
trained in the target language. It encounters the following challenges: 1)
timbre and pronunciation are correlated since multilingual speech of a specific
speaker is usually hard to obtain; 2) style and pronunciation are mixed because
the speech style contains language-agnostic and language-specific parts. To
address these challenges, we propose GenerTTS, which mainly includes the
following works: 1) we elaborately design a HuBERT-based information bottleneck
to disentangle timbre and pronunciation/style; 2) we minimize the mutual
information between style and language to discard the language-specific
information in the style embedding. The experiments indicate that GenerTTS
outperforms baseline systems in terms of style similarity and pronunciation
accuracy, and enables cross-lingual timbre and style generalization.Comment: Accepted by INTERSPEECH 202
Through the Lens of Core Competency: Survey on Evaluation of Large Language Models
From pre-trained language model (PLM) to large language model (LLM), the
field of natural language processing (NLP) has witnessed steep performance
gains and wide practical uses. The evaluation of a research field guides its
direction of improvement. However, LLMs are extremely hard to thoroughly
evaluate for two reasons. First of all, traditional NLP tasks become inadequate
due to the excellent performance of LLM. Secondly, existing evaluation tasks
are difficult to keep up with the wide range of applications in real-world
scenarios. To tackle these problems, existing works proposed various benchmarks
to better evaluate LLMs. To clarify the numerous evaluation tasks in both
academia and industry, we investigate multiple papers concerning LLM
evaluations. We summarize 4 core competencies of LLM, including reasoning,
knowledge, reliability, and safety. For every competency, we introduce its
definition, corresponding benchmarks, and metrics. Under this competency
architecture, similar tasks are combined to reflect corresponding ability,
while new tasks can also be easily added into the system. Finally, we give our
suggestions on the future direction of LLM's evaluation
Genetic characteristics of common variable immunodeficiency patients with autoimmunity
Background: The pathogenesis of common variable immunodeficiency disorder (CVID) is complex, especially when combined with autoimmunity. Genetic factors may be potential explanations for this complex situation, and whole genome sequencing (WGS) provide the basis for this potential.Methods: Genetic information of patients with CVID with autoimmunity, together with their first-degree relatives, was collected through WGS. The association between genetic factors and clinical phenotypes was studied using genetic analysis strategies such as sporadic and pedigree.Results: We collected 42 blood samples for WGS (16 CVID patients and 26 first-degree relatives of healthy controls). Through pedigree, sporadic screening strategies and low-frequency deleterious screening of rare diseases, we obtained 9,148 mutation sites, including 8,171 single-nucleotide variants (SNVs) and 977 Insertion-deletions (InDels). Finally, we obtained a total of 28 candidate genes (32 loci), of which the most common mutant was LRBA. The most common autoimmunity in the 16 patients was systematic lupus erythematosis. Through KEGG pathway enrichment, we identified the top ten signaling pathways, including âprimary immunodeficiencyâ, âJAK-STAT signaling pathwayâ, and âT-cell receptor signaling pathwayâ. We used PyMOL to predict and analyse the three-dimensional protein structures of the NFKB1, RAG1, TIRAP, NCF2, and MYB genes. In addition, we constructed a PPI network by combining candidate mutants with genes associated with CVID in the OMIM database via the STRING database.Conclusion: The genetic background of CVID includes not only monogenic origins but also oligogenic effects. Our study showed that immunodeficiency and autoimmunity may overlap in genetic backgrounds.Clinical Trial Registration: identifier ChiCTR210004403
Aspect-Opinion Sentiment Alignment for Cross-Domain Sentiment Analysis (Student Abstract)
Cross-domain sentiment analysis (SA) has recently attracted significant attention, which can effectively alleviate the problem of lacking large-scale labeled data for deep neural network based methods. However, exiting unsupervised cross-domain SA models ignore the relation between the aspect and opinion, which suffer from the sentiment transfer error problem. To solve this problem, we propose an aspect-opinion sentiment alignment SA model and extensive experiments are conducted to evaluate the effectiveness of our model
Refined Beam Theory for Geometrically Nonlinear Pre-Twisted Structures
This paper proposes a novel fully nonlinear refined beam element for pre-twisted structures undergoing large deformation and finite untwisting. The present model is constructed in the twisted basis to account for the effects of geometrical nonlinearity and initial twist. Cross-sectional deformation is allowed by introducing Lagrange polynomials in the framework of a Carrera unified formulation. The principle of virtual work is applied to obtain the GreenâLagrange strain tensor and second PiolaâKirchhoff stress tensor. In the nonlinear governing formulation, expressions are given for secant and tangent matrices with linear, nonlinear, and geometrically stiffening contributions. The developed beam model could detect the coupled axial, torsional, and flexure deformations, as well as the local deformations around the point of application of the force. The maximum difference between the present deformation results and those of shell/solid finite element simulations is 6%. Compared to traditional beam theories and finite element models, the proposed method significantly reduces the computational complexity and cost by implementing constant beam elements in the twisted basis
Refined Beam Theory for Geometrically Nonlinear Pre-Twisted Structures
This paper proposes a novel fully nonlinear refined beam element for pre-twisted structures undergoing large deformation and finite untwisting. The present model is constructed in the twisted basis to account for the effects of geometrical nonlinearity and initial twist. Cross-sectional deformation is allowed by introducing Lagrange polynomials in the framework of a Carrera unified formulation. The principle of virtual work is applied to obtain the Green–Lagrange strain tensor and second Piola–Kirchhoff stress tensor. In the nonlinear governing formulation, expressions are given for secant and tangent matrices with linear, nonlinear, and geometrically stiffening contributions. The developed beam model could detect the coupled axial, torsional, and flexure deformations, as well as the local deformations around the point of application of the force. The maximum difference between the present deformation results and those of shell/solid finite element simulations is 6%. Compared to traditional beam theories and finite element models, the proposed method significantly reduces the computational complexity and cost by implementing constant beam elements in the twisted basis
Control Strategy of Doubly-Fed Induction Generator under Zero Voltage Fault of Power Grid
For improving the zero-voltage ride through the capability of a doubly fed induction generator in high proportion new energy grid in extreme faults, a coordinated control scheme of hardware and optimal control strategy is proposed. A high-temperature superconductive-fault current limiter suppresses stator fault current, adaptive virtual impedance control and active dynamic reactive power support control act on the back-to-back converter of wind turbines as optimal control strategies. Optimizing the control strategy without changing the controller structure is beneficial to engineering implementation. After mathematical derivation and simulation verification, the coordinated control strategy adopted in this paper can effectively avoid the rotor current and voltage exceeding the limit when the wind turbine is facing extreme faults, actively provide reactive power support for the busbar, realize zero voltage ride through and reduce the risk of high voltage failure at the point of failure. The control effect is obviously better than the traditional virtual impedance control
Enhance Cross-Domain Aspect-Based Sentiment Analysis by Incorporating Commonsense Relational Structure (Student Abstract)
Aspect Based Sentiment Analysis (ABSA) aims to extract aspect terms and identify the sentiment polarities towards each extracted aspect term. Currently, syntactic information is seen as the bridge for the domain adaptation and achieves remarkable performance. However, the transferable syntactic knowledge is complex and diverse, which causes the transfer error problem in domain adaptation. In our paper, we propose a domain-shared relational structure incorporated cross-domain ABSA model. The experimental results show the effectiveness of our model