Search CORE

369 research outputs found

AIS: A nonlinear activation function for industrial safety engineering

Author: Gao Dong
Wang Zhenhua
Zhang Beike
Publication venue
Publication date: 27/11/2021
Field of study

In the task of Chinese named entity recognition based on deep learning, activation function plays an irreplaceable role, it introduces nonlinear characteristics into neural network, so that the fitted model can be applied to various tasks. However, the information density of industrial safety analysis text is relatively high, and the correlation and similarity between the information are large, which is easy to cause the problem of high deviation and high standard deviation of the model, no specific activation function has been designed in previous studies, and the traditional activation function has the problems of gradient vanishing and negative region, which also lead to the recognition accuracy of the model can not be further improved. To solve these problems, a novel activation function AIS is proposed in this paper. AIS is an activation function applied in industrial safety engineering, which is composed of two piecewise nonlinear functions. In the positive region, the structure combining exponential function and quadratic function is used to alleviate the problem of deviation and standard deviation, and the linear function is added to modify it, which makes the whole activation function smoother and overcomes the problem of gradient vanishing. In the negative region, the cubic function structure is used to solve the negative region problem and accelerate the convergence of the model. Based on the deep learning model of BERT-BiLSTM-CRF, the performance of AIS is evaluated. The results show that, compared with other activation functions, AIS overcomes the problems of gradient vanishing and negative region, reduces the deviation of the model, speeds up the model fitting, and improves the extraction ability of the model for industrial entities

arXiv.org e-Print Archive

Molecular Characterization of Isoniazid and Rifampin Resistance of Mycobacterium tuberculosis Clinical Isolates from Malatya, Turkey

Author: Aktas Elif
Durmaz Riza
Yang Dong
Yang Zhenhua
Publication venue: 'Mary Ann Liebert Inc'
Publication date: 01/06/2005
Field of study

Molecular characterization of drug resistance of Mycobacterium tuberculosis strains of different origins can generate information useful for developing molecular methods that are widely applicable for rapid drug resistance detection. Using DNA sequencing and allele-specific polymerase chain reaction (AS-PCR), we investigated genetic mutations associated with isoniazid (INH) and rifampin (RIF) resistance among 29 drug-resistant clinical isolates of M. tuberculosis collected from Malatya, Turkey, including 19 multi-drug-resistant (MDR) isolates. Point mutations were detected at codons 531, 516, 526, and 513 of the RNA polymerase β- subunit gene (rpoB) in 10 (47.6%), five (23.8%), three (14.3%), and three (14.3%) of the 21 RIF-resistant isolates, respectively. Of the five isolates having mutations in codon 516, three also had mutations at codon 527; one had a concurrent mutation at codon 572. Mutations at codon 315 of the catalase-peroxidase-encoding gene (katG) were found in 17 (63.0%) of the 27 INH-resistant isolates. Interestingly, the katG codon 315 mutation was observed at a much higher frequency in MDR isolates than in INH-mono-resistant isolates (∼79% vs. 25%). This study provided the first molecular characterization of INH and RIF resistance of M. tuberculosis clinical isolates from Eastern Turkey, and extended our knowledge of molecular basis of M. tuberculosis drug resistance.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/63182/1/mdr.2005.11.94.pd

Deep Blue Documents

Recommendation with User Active Disclosing Willingness

Author: Chen Xu
Dai Quanyu
Dong Zhenhua
Wang Lei
Publication venue
Publication date: 25/10/2022
Field of study

Recommender system has been deployed in a large amount of real-world applications, profoundly influencing people's daily life and production.Traditional recommender models mostly collect as comprehensive as possible user behaviors for accurate preference estimation. However, considering the privacy, preference shaping and other issues, the users may not want to disclose all their behaviors for training the model. In this paper, we study a novel recommendation paradigm, where the users are allowed to indicate their "willingness" on disclosing different behaviors, and the models are optimized by trading-off the recommendation quality as well as the violation of the user "willingness". More specifically, we formulate the recommendation problem as a multiplayer game, where the action is a selection vector representing whether the items are involved into the model training. For efficiently solving this game, we design a tailored algorithm based on influence function to lower the time cost for recommendation quality exploration, and also extend it with multiple anchor selection vectors.We conduct extensive experiments to demonstrate the effectiveness of our model on balancing the recommendation quality and user disclosing willingness

arXiv.org e-Print Archive

Why KDAC? A general activation function for knowledge discovery

Author: Gao Dong
Liu Fanglin
Liu Haozhe
Wang Zhenhua
Publication venue
Publication date: 23/05/2022
Field of study

Deep learning oriented named entity recognition (DNER) has gradually become the paradigm of knowledge discovery, which greatly promotes domain intelligence. However, the current activation function of DNER fails to treat gradient vanishing, no negative output or non-differentiable existence, which may impede knowledge exploration caused by the omission and incomplete representation of latent semantics. To break through the dilemma, we present a novel activation function termed KDAC. Detailly, KDAC is an aggregation function with multiple conversion modes. The backbone of the activation region is the interaction between exponent and linearity, and the both ends extend through adaptive linear divergence, which surmounts the obstacle of gradient vanishing and no negative output. Crucially, the non-differentiable points are alerted and eliminated by an approximate smoothing algorithm. KDAC has a series of brilliant properties, including nonlinear, stable near-linear transformation and derivative, as well as dynamic style, etc. We perform experiments based on BERT-BiLSTM-CNN-CRF model on six benchmark datasets containing different domain knowledge, such as Weibo, Clinical, E-commerce, Resume, HAZOP and People's daily. The evaluation results show that KDAC is advanced and effective, and can provide more generalized activation to stimulate the performance of DNER. We hope that KDAC can be exploited as a promising activation function to devote itself to the construction of knowledge.Comment: Accepted by Neurocomputin

arXiv.org e-Print Archive

Preferential gene expression in the limbus of the vervet monkey

Author: Deng Sophie X.
Ding Zhenhua
Dong Jun
Liu Jason
Publication venue: Molecular Vision
Publication date: 01/01/2008
Field of study

PurposeTo elucidate the unique molecular factors and biological processes that are differentially expressed in the limbal stem cell microenvironment by comparing directly to that of its immediate adjacent structures, the cornea and conjunctiva.MethodsTotal RNA was isolated and amplified from the limbus, cornea, and conjunctiva. A gene expression profile of each tissue type was obtained by using microarray technique. The transcripts in which the expression level was at least twofold higher than that in the other two tissue types were identified. The expression levels of selected genes were confirmed by quantitative reverse transcription polymerase chain reaction (QRT-PCR). Protein expression of selected genes were confirmed by an immunohistochemistry study in normal human ocular tissue.ResultsThere were 186 preferentially overexpressed transcripts in the limbus in direct comparison to that in the cornea and conjunctiva. Many signature genes in the cornea and conjunctiva were among the preferentially expressed transcripts obtained by the microarray data. In addition, a significant number of new genes were identified, and the expression level of all nine selected genes was verified by QRT-PCR. Protein expression levels of keratin 13, tenascin c, homeodomain only protein (HOP), and TP53 apoptosis effector (PERP) were confirmed in human ocular tissues. Functional analysis of the preferentially expressed genes in the limbus reviewed that melanin metabolism and cell-cell adhesion were among the noticeable biological processes. Chromosomal distribution of the overexpressed genes in the limbus was disproportional to that of all known human genes.ConclusionsThese findings may shed light on the unique molecular components and regulation of limbal stem cells and their niche

PubMed Central

eScholarship - University of California

Law Article-Enhanced Legal Case Matching: a Causal Learning Approach

Author: Dong Zhenhua
Sun Zhongxiang
Wen Ji-Rong
Xu Jun
Zhang Xiao
Publication venue
Publication date: 26/04/2023
Field of study

Legal case matching, which automatically constructs a model to estimate the similarities between the source and target cases, has played an essential role in intelligent legal systems. Semantic text matching models have been applied to the task where the source and target legal cases are considered as long-form text documents. These general-purpose matching models make the predictions solely based on the texts in the legal cases, overlooking the essential role of the law articles in legal case matching. In the real world, the matching results (e.g., relevance labels) are dramatically affected by the law articles because the contents and the judgments of a legal case are radically formed on the basis of law. From the causal sense, a matching decision is affected by the mediation effect from the cited law articles by the legal cases, and the direct effect of the key circumstances (e.g., detailed fact descriptions) in the legal cases. In light of the observation, this paper proposes a model-agnostic causal learning framework called Law-Match, under which the legal case matching models are learned by respecting the corresponding law articles. Given a pair of legal cases and the related law articles, Law-Match considers the embeddings of the law articles as instrumental variables (IVs), and the embeddings of legal cases as treatments. Using IV regression, the treatments can be decomposed into law-related and law-unrelated parts, respectively reflecting the mediation and direct effects. These two parts are then combined with different weights to collectively support the final matching prediction. We show that the framework is model-agnostic, and a number of legal case matching models can be applied as the underlying models. Comprehensive experiments show that Law-Match can outperform state-of-the-art baselines on three public datasets.Comment: 10 pages accepted by SIGIR202

arXiv.org e-Print Archive

Less Is Better: Unweighted Data Subsampling via Influence Function

Author: Dong Zhenhua
He Xiuqiang
Huang Shao-Lun
Wang Zifeng
Zhu Hong
Publication venue
Publication date: 03/02/2020
Field of study

In the time of Big Data, training complex models on large-scale data sets is challenging, making it appealing to reduce data volume for saving computation resources by subsampling. Most previous works in subsampling are weighted methods designed to help the performance of subset-model approach the full-set-model, hence the weighted methods have no chance to acquire a subset-model that is better than the full-set-model. However, we question that how can we achieve better model with less data? In this work, we propose a novel Unweighted Influence Data Subsampling (UIDS) method, and prove that the subset-model acquired through our method can outperform the full-set-model. Besides, we show that overly confident on a given test set for sampling is common in Influence-based subsampling methods, which can eventually cause our subset-model's failure in out-of-sample test. To mitigate it, we develop a probabilistic sampling scheme to control the worst-case risk over all distributions close to the empirical distribution. The experiment results demonstrate our methods superiority over existed subsampling methods in diverse tasks, such as text classification, image classification, click-through prediction, etc.Comment: AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction

Author: Cai Guohao
Dong Zhenhua
Li Yuru
Mao Kelong
Su Liangcai
Zhu Jieming
Publication venue
Publication date: 29/05/2023
Field of study

Click-through rate (CTR) prediction is one of the fundamental tasks for online advertising and recommendation. While multi-layer perceptron (MLP) serves as a core component in many deep CTR prediction models, it has been widely recognized that applying a vanilla MLP network alone is inefficient in learning multiplicative feature interactions. As such, many two-stream interaction models (e.g., DeepFM and DCN) have been proposed by integrating an MLP network with another dedicated network for enhanced CTR prediction. As the MLP stream learns feature interactions implicitly, existing research focuses mainly on enhancing explicit feature interactions in the complementary stream. In contrast, our empirical study shows that a well-tuned two-stream MLP model that simply combines two MLPs can even achieve surprisingly good performance, which has never been reported before by existing work. Based on this observation, we further propose feature gating and interaction aggregation layers that can be easily plugged to make an enhanced two-stream MLP model, FinalMLP. In this way, it not only enables differentiated feature inputs but also effectively fuses stream-level interactions across two streams. Our evaluation results on four open benchmark datasets as well as an online A/B test in our industrial system show that FinalMLP achieves better performance than many sophisticated two-stream CTR models. Our source code will be available at MindSpore/models.Comment: Accepted by AAAI 2023. Code available at https://xpai.github.io/FinalML

arXiv.org e-Print Archive

Ranolazine recruits muscle microvasculature and enhances insulin action in rats: Ranolazine, microvasculature and insulin action

Author: Cao Wenhong
Chai Weidong
Dong Zhenhua
Fu Zhuo
Liu Zhenqi
Zhao Lina
Publication venue
Publication date: 01/01/2013
Field of study

Ranolazine, an anti-anginal compound, has been shown to significantly improve glycaemic control in large-scale clinical trials, and short-term ranolazine treatment is associated with an improvement in myocardial blood flow. As microvascular perfusion plays critical roles in insulin delivery and action, we aimed to determine if ranolazine could improve muscle microvascular blood flow, thereby increasing muscle insulin delivery and glucose use. Overnight-fasted, anaesthetized Sprague-Dawley rats were used to determine the effects of ranolazine on microvascular recruitment using contrast-enhanced ultrasound, insulin action with euglycaemic hyperinsulinaemic clamp, and muscle insulin uptake using 125I-insulin. Ranolazine's effects on endothelial nitric oxide synthase (eNOS) phosphorylation, cAMP generation and endothelial insulin uptake were determined in cultured endothelial cells. Ranolazine-induced myographical changes in tension were determined in isolated distal saphenous artery. Ranolazine at therapeutically effective dose significantly recruited muscle microvasculature by increasing muscle microvascular blood volume (∼2-fold, P < 0.05) and increased insulin-mediated whole body glucose disposal (∼30%, P= 0.02). These were associated with an increased insulin delivery into the muscle (P < 0.04). In cultured endothelial cells, ranolazine increased eNOS phosphorylation and cAMP production without affecting endothelial insulin uptake. In ex vivo studies, ranolazine exerted a potent vasodilatatory effect on phenylephrine pre-constricted arterial rings, which was partially abolished by endothelium denudement. In conclusion, ranolazine treatment vasodilatates pre-capillary arterioles and increases microvascular perfusion, which are partially mediated by endothelium, leading to expanded microvascular endothelial surface area available for nutrient and hormone exchanges and resulting in increased muscle delivery and action of insulin. Whether these actions contribute to improved glycaemic control in patients with insulin resistance warrants further investigation

Carolina Digital Repository