Search CORE

139 research outputs found

A Comparative Study on Regularization Strategies for Embedding-based Neural Networks

Author: Chen Yunchuan
Jin Zhi
Li Ge
Lu Yangyang
Mou Lili
Peng Hao
Publication venue
Publication date: 01/01/2015
Field of study

This paper aims to compare different regularization strategies to address a common phenomenon, severe overfitting, in embedding-based neural networks for NLP. We chose two widely studied neural models and tasks as our testbed. We tried several frequently applied or newly proposed regularization strategies, including penalizing weights (embeddings excluded), penalizing embeddings, re-embedding words, and dropout. We also emphasized on incremental hyperparameter tuning, and combining different regularizations. The results provide a picture on tuning hyperparameters for neural NLP models.Comment: EMNLP '1

arXiv.org e-Print Archive

Crossref

OPTIMIZATION UNDER STOCHASTIC ENVIRONMENT

Author: Li Yunchuan
Publication venue
Publication date: 01/01/2020
Field of study

Stochastic optimization (SO) is extensively studied in various fields, such as control engineering, operations research, and computer science. It has found wide applications ranging from path planning (civil engineering) and tool-life testing (industrial engineering) to Go-playing artificial intelligence (computer science). However, SO is usually a hard problem primarily because of the added complexity from random variables. The objective of this research is to investigate three types of SO problems: single-stage SO, multi-stage SO and fast real-time parameter estimation under stochastic environment.\par We first study the single-stage optimization problem. We propose Direct Gradient Augmented Response Surface Methodology (DiGARSM), a new sequential first-order method for optimizing a stochastic function. In this approach, gradients of the objective function with respect to the desired parameters are utilized in addition to response measurements. We intend to establish convergence of the proposed method, as well as traditional approaches which do not use gradients. We expect an improvement in convergence speed with the added derivative information. \par Second, we analyze a tree search problem with an underlying Markov decision process. Unlike traditional tree search algorithms where the goal is to maximize the cumulative reward in the learning process, the proposed method aims at identifying the best action at the root that achieves the highest reward. A new tree algorithm based on ranking and selection is proposed. The selection policy at each node aims at maximizing the probability of correctly selecting the best action. \par The third topic is motivated by problems arising in neuroscience, specifically, a Maximum Likelihood (ML) parameter estimation of linear models with noise-corrupted observations. We developed an optimization algorithm designed for non-convex, linear state-space model parameter estimation. The ML estimation is carried out by the Expectation-Maximization algorithm, which iteratively updates parameter estimates based on the previous estimates. Since the likelihood surface is in general non-convex, a model-based global optimization method called Model Reference Adaptive Search (MRAS) is applied

Digital Repository at the University of Maryland

Classifying Relations via Long Short Term Memory Networks along Shortest Dependency Path

Author: Chen Yunchuan
Jin Zhi
Li Ge
Mou Lili
Peng Hao
Yan Xu
Publication venue
Publication date: 01/01/2015
Field of study

Relation classification is an important research arena in the field of natural language processing (NLP). In this paper, we present SDP-LSTM, a novel neural network to classify the relation of two entities in a sentence. Our neural architecture leverages the shortest dependency path (SDP) between two entities; multichannel recurrent neural networks, with long short term memory (LSTM) units, pick up heterogeneous information along the SDP. Our proposed model has several distinct features: (1) The shortest dependency paths retain most relevant information (to relation classification), while eliminating irrelevant words in the sentence. (2) The multichannel LSTM networks allow effective information integration from heterogeneous sources over the dependency paths. (3) A customized dropout strategy regularizes the neural network to alleviate overfitting. We test our model on the SemEval 2010 relation classification task, and achieve an

F_1

-score of 83.7\%, higher than competing methods in the literature.Comment: EMNLP '1

arXiv.org e-Print Archive

CiteSeerX

Crossref

Determinants of price fluctuations in the electricity market: a study with PCA and NARDL models

Author: Joseph D. Cursio
Kun Li
Yunchuan Sun
Zizheng Zhu
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2019
Field of study

In the modern electricity markets, negative prices and spike prices coexist as a pair of opposite economic phenomena. This study investigates how these extreme prices play as the determinants to drive price fluctuations in the electricity market. We construct a two-stage analysis including a principal component analysis (PCA) and a nonlinear autoregressive distributed lags model (NARDL). We apply this analytical method to the wholesale Pennsylvania, New Jersey and Maryland (PJM) electricity market. We find that according to PCA, in the individual transmission lines, spike prices are determinants with largest explanatory power to the variation of prices, while according to NARDL, from the standpoint of the overall market, negative prices have a larger potential effect on both the real-time market and the forward market. These results are valuable and contributive to managers and operators in the electricity markets for policy decision making

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Combination model of heterogeneous data for security measurement

Author: Dong Liju
Dong Xiuze
Guo Yunchuan
Khan Arshad
Li Fenghua
Publication venue: 'Sociological Research Online'
Publication date: 01/01/2019
Field of study

Measuring security is a core step for guaranteeing security of network and information systems. Due to massiveness and heterogeneity of measurement data, it is difficult toclassify and combinethem on demand. In thispaper, consideringimplication relationship of metrics, we propose a combination model and combination policy for security measurement. Several examples demonstrate the effectiveness of our model

ZENODO

Directory of Open Access Journals

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Research Online

ARPHA OAI-PMH Endpoint

ARPHA Preprints

Combination Model of Heterogeneous Data for Security Measurement

Author: Arshad Khan
Fenghua Li
Liju Dong
Xiuze Dong
Yunchuan Guo
Publication venue: 'Verlag der Technischen Universitat Graz'
Publication date: 01/03/2019
Field of study

Measuring security is a core step for guaranteeing security of network and information systems. Due to massiveness and heterogeneity of measurement data, it is difficult to classify and combine them on demand. In this paper, considering implication relationship of metrics, we propose a combination model and combination policy for security measurement. Several examples demonstrate the effectiveness of our model

Directory of Open Access Journals

Comparison of Diagnostic Performance of Three-Dimensional Positron Emission Mammography versus Whole Body Positron Emission Tomography in Breast Cancer

Author: Dong Dai
Jin Zhang
Juntian Liu
Lin Li
Man Wang
Peifang Liu
Wenchao Ma
Wengui Xu
Xiaoyue Gu
Xiuyu Song
Yunchuan Ma
Yusheng Su
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2017
Field of study

Objective. To compare the diagnostic performance of three-dimensional (3D) positron emission mammography (PEM) versus whole body positron emission tomography (WBPET) for breast cancer. Methods. A total of 410 women with normal breast or benign or highly suspicious malignant tumors were randomized at 1 : 1 ratio to undergo 3D-PEM followed by WBPET or WBPET followed by 3D-PEM. Lumpectomy or mastectomy was performed on eligible participants after the scanning. Results. The sensitivity and specificity of 3D-PEM were 92.8% and 54.5%, respectively. WBPET showed a sensitivity of 95.7% and specificity of 56.8%. After exclusion of the patients with lesions beyond the detecting range of the 3D-PEM instrument, 3D-PEM showed higher sensitivity than WBPET (97.0% versus 95.5%, P = 0.913), particularly for small lesions (<1 cm) (72.0% versus 60.0%, P = 0.685). Conclusions. The 3D-PEM appears more sensitive to small lesions than WBPET but may fail to detect lesions that are beyond the detecting range. This study was approved by the Ethics Committee (E2012052) at the Tianjin Medical University Cancer Institute and Hospital (Tianjin, China). The instrument positron emission mammography (PEMi) was approved by China State Food and Drug Administration under the registration number 20153331166

Crossref

Directory of Open Access Journals

The efficacy of upfront craniocerebral radiotherapy and epidermal growth factor receptor-tyrosine kinase inhibitors in patients with epidermal growth factor receptor-positive non-small cell lung cancer with brain metastases

Author: Hongling Lu
Jianxi Zhou
Kui Fan
Li Xiao
Xiaoming Yin
Yingnan Zhou
Yunchuan Sun
Publication venue: Frontiers Media S.A.
Publication date: 01/01/2024
Field of study

The present study aims to investigate the therapeutic value of third-generation epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKIs) combined with cranial radiotherapy (RT) in patients with EGFR-positive non-small cell lung cancer (NSCLC) and brain metastases (BMs).MethodologyThis is a retrospective study that involved 213 patients with EGFR-NSCLC and BMs, with the patients divided into two groups: the upfront cranial RT (ucRT) group (n = 96) and the non-ucRT group (n = 117). All patients were administered with osimertinib, and those in the ucRT group also underwent RT. The overall survival (OS), progression-free survival (PFS) and intracranial PFS (IPFS) of the two groups were compared.ResultsThe ucRT group manifested a markedly higher IPFS than the non-ucRT group (29.65 months vs 21.8 months; P < 0.0001). The subgroup analysis revealed that patients with oligometastases (OLOGO-BMs; 1–3 BMs) demonstrated a notably longer OS (44.5 months vs 37.3 months; P < 0.0001), PFS (32.3 months vs 20.8 months; P = 0.6884) and IPFS (37.8 months vs 22.1 months; P < 0.0001) in the ucRT group than in the non-ucRT group. However, for patients with multiple BMs, there was no significant difference in OS (27.3 months vs 34.4 months; P = 0.0710) and PFS (13.7 months vs 13.2 months; P = 0.0516) between the ucRT group and the non-ucRT group; the ucRT group exhibited a higher IPFS (26.4 months vs 21.35 months; P = 0.0028). Cox’s multivariate analysis of patients with OLOGO-BM indicated that the use of ucRT was linked to a better OS (heart rate [HR] = 0.392; 95% confidence interval [CI]: 0.178–0.863; P = 0.020) and PFS (HR = 0.558; 95% CI: 0.316–0.986; P = 0.044).ConclusionUpfront cerebral cranial stereotactic radiosurgery can improve outcomes in EGFR-positive patients with NSCLC and OLOGO-BM. However, for patients with multiple BMs, the preferable strategy may be pre-treatment with EGFR-TKIs

Directory of Open Access Journals