Search CORE

36 research outputs found

A Logical Pattern Memory Pre-trained Model for Entailment Tree Generation

Author: Cai Yi
Ren Haopeng
Wang Jiexin
Yuan Li
Publication venue
Publication date: 10/03/2024
Field of study

Generating coherent and credible explanations remains a significant challenge in the field of AI. In recent years, researchers have delved into the utilization of entailment trees to depict explanations, which exhibit a reasoning process of how a hypothesis is deduced from the supporting facts. However, existing models often overlook the importance of generating intermediate conclusions with logical consistency from the given facts, leading to inaccurate conclusions and undermining the overall credibility of entailment trees. To address this limitation, we propose the logical pattern memory pre-trained model (LMPM). LMPM incorporates an external memory structure to learn and store the latent representations of logical patterns, which aids in generating logically consistent conclusions. Furthermore, to mitigate the influence of logically irrelevant domain knowledge in the Wikipedia-based data, we introduce an entity abstraction approach to construct the dataset for pre-training LMPM. The experimental results highlight the effectiveness of our approach in improving the quality of entailment tree generation. By leveraging logical entailment patterns, our model produces more coherent and reasonable conclusions that closely align with the underlying premises. Code and Data are released at https://github.com/YuanLi95/T5-LMPMComment: Accepted By Coling 202

arXiv.org e-Print Archive

A network‐based variable selection approach for identification of modules and biomarker genes associated with end‐stage kidney disease

Author: Fu Ping
Hong Hyokyoung G.
Li Chunyang
Li Yi
Yu Haopeng
Zeng Xiaoxi
Zhang Wei
Publication venue: 'Wiley'
Publication date: 01/10/2020
Field of study

AimsIntervention for end‐stage kidney disease (ESKD), which is associated with adverse prognoses and major economic burdens, is challenging due to its complex pathogenesis. The study was performed to identify biomarker genes and molecular mechanisms for ESKD by bioinformatics approach.MethodsUsing the Gene Expression Omnibus dataset GSE37171, this study identified pathways and genomic biomarkers associated with ESKD via a multi‐stage knowledge discovery process, including identification of modules of genes by weighted gene co‐expression network analysis, discovery of important involved pathways by Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses, selection of differentially expressed genes by the empirical Bayes method, and screening biomarker genes by the least absolute shrinkage and selection operator (Lasso) logistic regression. The results were validated using GSE70528, an independent testing dataset.ResultsThree clinically important gene modules associated with ESKD, were identified by weighted gene co‐expression network analysis. Within these modules, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses revealed important biological pathways involved in ESKD, including transforming growth factor‐β and Wnt signalling, RNA‐splicing, autophagy and chromatin and histone modification. Furthermore, Lasso logistic regression was conducted to identify five final genes, namely, CNOT8, MST4, PPP2CB, PCSK7 and RBBP4 that are differentially expressed and associated with ESKD. The accuracy of the final model in distinguishing the ESKD cases and controls was 96.8% and 91.7% in the training and validation datasets, respectively.ConclusionNetwork‐based variable selection approaches can identify biological pathways and biomarker genes associated with ESKD. The findings may inform more in‐depth follow‐up research and effective therapy.SUMMARY AT A GLANCEThis gene–gene network analysis to identify genes associated with end‐stage renal disease is an important step, albeit early, towards the discovery of biomarkers using peripheral blood cells. The findings also provide insight on disease pathophysiology at the molecular level, and hence therapeutic targets for future research.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/162799/2/nep13655.pdfhttp://deepblue.lib.umich.edu/bitstream/2027.42/162799/1/nep13655_am.pd

Crossref

Deep Blue Documents

GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech

Author: Cong Yahuan
Lin Haopeng
Liu Shichao
Ma Zejun
Ren Yi
Wang Chunfeng
Yin Xiang
Zhang Haoyu
Publication venue
Publication date: 27/06/2023
Field of study

Cross-lingual timbre and style generalizable text-to-speech (TTS) aims to synthesize speech with a specific reference timbre or style that is never trained in the target language. It encounters the following challenges: 1) timbre and pronunciation are correlated since multilingual speech of a specific speaker is usually hard to obtain; 2) style and pronunciation are mixed because the speech style contains language-agnostic and language-specific parts. To address these challenges, we propose GenerTTS, which mainly includes the following works: 1) we elaborately design a HuBERT-based information bottleneck to disentangle timbre and pronunciation/style; 2) we minimize the mutual information between style and language to discard the language-specific information in the style embedding. The experiments indicate that GenerTTS outperforms baseline systems in terms of style similarity and pronunciation accuracy, and enables cross-lingual timbre and style generalization.Comment: Accepted by INTERSPEECH 202

arXiv.org e-Print Archive

Through the Lens of Core Competency: Survey on Evaluation of Large Language Models

Author: Bai Haopeng
Chen Qiguang
Feng Zixian
Han Yi
Li Mingda
Liu Ting
Ma Longxuan
Qian Yushan
Zhang Weinan
Zhuang Ziyu
Publication venue
Publication date: 15/08/2023
Field of study

From pre-trained language model (PLM) to large language model (LLM), the field of natural language processing (NLP) has witnessed steep performance gains and wide practical uses. The evaluation of a research field guides its direction of improvement. However, LLMs are extremely hard to thoroughly evaluate for two reasons. First of all, traditional NLP tasks become inadequate due to the excellent performance of LLM. Secondly, existing evaluation tasks are difficult to keep up with the wide range of applications in real-world scenarios. To tackle these problems, existing works proposed various benchmarks to better evaluate LLMs. To clarify the numerous evaluation tasks in both academia and industry, we investigate multiple papers concerning LLM evaluations. We summarize 4 core competencies of LLM, including reasoning, knowledge, reliability, and safety. For every competency, we introduce its definition, corresponding benchmarks, and metrics. Under this competency architecture, similar tasks are combined to reflect corresponding ability, while new tasks can also be easily added into the system. Finally, we give our suggestions on the future direction of LLM's evaluation

arXiv.org e-Print Archive

Genetic characteristics of common variable immunodeficiency patients with autoimmunity

Author: Chenyang Lu
Haopeng Yu
Haopeng Yu
Pingying Qing
Ruijuan Cheng
Xue Guo
Ye Chen
Yi Liu
Yujie Li
Zhihui Liu
Zhiye Ying
Publication venue: Frontiers Media S.A.
Publication date: 01/11/2023
Field of study

Background: The pathogenesis of common variable immunodeficiency disorder (CVID) is complex, especially when combined with autoimmunity. Genetic factors may be potential explanations for this complex situation, and whole genome sequencing (WGS) provide the basis for this potential.Methods: Genetic information of patients with CVID with autoimmunity, together with their first-degree relatives, was collected through WGS. The association between genetic factors and clinical phenotypes was studied using genetic analysis strategies such as sporadic and pedigree.Results: We collected 42 blood samples for WGS (16 CVID patients and 26 first-degree relatives of healthy controls). Through pedigree, sporadic screening strategies and low-frequency deleterious screening of rare diseases, we obtained 9,148 mutation sites, including 8,171 single-nucleotide variants (SNVs) and 977 Insertion-deletions (InDels). Finally, we obtained a total of 28 candidate genes (32 loci), of which the most common mutant was LRBA. The most common autoimmunity in the 16 patients was systematic lupus erythematosis. Through KEGG pathway enrichment, we identified the top ten signaling pathways, including “primary immunodeficiency”, “JAK-STAT signaling pathway”, and “T-cell receptor signaling pathway”. We used PyMOL to predict and analyse the three-dimensional protein structures of the NFKB1, RAG1, TIRAP, NCF2, and MYB genes. In addition, we constructed a PPI network by combining candidate mutants with genes associated with CVID in the OMIM database via the STRING database.Conclusion: The genetic background of CVID includes not only monogenic origins but also oligogenic effects. Our study showed that immunodeficiency and autoimmunity may overlap in genetic backgrounds.Clinical Trial Registration: identifier ChiCTR210004403

Directory of Open Access Journals

Aspect-Opinion Sentiment Alignment for Cross-Domain Sentiment Analysis (Student Abstract)

Author: Cai Yi
Ren Haopeng
Zeng Yushi
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 28/06/2022
Field of study

Cross-domain sentiment analysis (SA) has recently attracted significant attention, which can effectively alleviate the problem of lacking large-scale labeled data for deep neural network based methods. However, exiting unsupervised cross-domain SA models ignore the relation between the aspect and opinion, which suffer from the sentiment transfer error problem. To solve this problem, we propose an aspect-opinion sentiment alignment SA model and extensive experiments are conducted to evaluate the effectiveness of our model

Association for the Advancement of Artificial Intelligence: AAAI Publications

Refined Beam Theory for Geometrically Nonlinear Pre-Twisted Structures

Author: Haopeng Liang
Yi Hu
Yong Zhao
Publication venue: MDPI AG
Publication date: 01/07/2022
Field of study

This paper proposes a novel fully nonlinear refined beam element for pre-twisted structures undergoing large deformation and finite untwisting. The present model is constructed in the twisted basis to account for the effects of geometrical nonlinearity and initial twist. Cross-sectional deformation is allowed by introducing Lagrange polynomials in the framework of a Carrera unified formulation. The principle of virtual work is applied to obtain the Green–Lagrange strain tensor and second Piola–Kirchhoff stress tensor. In the nonlinear governing formulation, expressions are given for secant and tangent matrices with linear, nonlinear, and geometrically stiffening contributions. The developed beam model could detect the coupled axial, torsional, and flexure deformations, as well as the local deformations around the point of application of the force. The maximum difference between the present deformation results and those of shell/solid finite element simulations is 6%. Compared to traditional beam theories and finite element models, the proposed method significantly reduces the computational complexity and cost by implementing constant beam elements in the twisted basis

Directory of Open Access Journals

Refined Beam Theory for Geometrically Nonlinear Pre-Twisted Structures

Author: Haopeng Liang
Yi Hu
Yong Zhao
Publication venue: 'MDPI AG'
Publication date: 06/07/2022
Field of study

Multidisciplinary Digital Publishing Institute

Control Strategy of Doubly-Fed Induction Generator under Zero Voltage Fault of Power Grid

Author: Haopeng Yi
Jiawei Hu
Lu Zhang
Yi Ye
Zelin Yang
Publication venue: MDPI AG
Publication date: 01/11/2022
Field of study

For improving the zero-voltage ride through the capability of a doubly fed induction generator in high proportion new energy grid in extreme faults, a coordinated control scheme of hardware and optimal control strategy is proposed. A high-temperature superconductive-fault current limiter suppresses stator fault current, adaptive virtual impedance control and active dynamic reactive power support control act on the back-to-back converter of wind turbines as optimal control strategies. Optimizing the control strategy without changing the controller structure is beneficial to engineering implementation. After mathematical derivation and simulation verification, the coordinated control strategy adopted in this paper can effectively avoid the rotor current and voltage exceeding the limit when the wind turbine is facing extreme faults, actively provide reactive power support for the busbar, realize zero voltage ride through and reduce the risk of high voltage failure at the point of failure. The control effect is obviously better than the traditional virtual impedance control

Directory of Open Access Journals

Enhance Cross-Domain Aspect-Based Sentiment Analysis by Incorporating Commonsense Relational Structure (Student Abstract)

Author: Cai Yi
Ren Haopeng
Wang Guohua
Zeng Yushi
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 28/06/2022
Field of study

Aspect Based Sentiment Analysis (ABSA) aims to extract aspect terms and identify the sentiment polarities towards each extracted aspect term. Currently, syntactic information is seen as the bridge for the domain adaptation and achieves remarkable performance. However, the transferable syntactic knowledge is complex and diverse, which causes the transfer error problem in domain adaptation. In our paper, we propose a domain-shared relational structure incorporated cross-domain ABSA model. The experimental results show the effectiveness of our model

Association for the Advancement of Artificial Intelligence: AAAI Publications