Search CORE

113 research outputs found

Stepwise Lithospheric Delamination Leads to Pulsed Cenozoic Uplifts of Central Tien Shan

Author: Haixiang Zhao
Xiaohui Yuan
Zhouchuan Huang
Publication venue: 'GeoScienceWorld'
Publication date: 01/12/2022
Field of study

AbstractThe Tien Shan provides an ideal site to study mechanism of intracontinental orogeny due to distant effect of Indo-Asian collision. We investigate lithospheric structures, in particular the lithosphere-asthenosphere boundary (LAB), of Central Tien Shan (CTS) using S wave receiver functions. The results show distinct structures across the orogen. Under the southern CTS, the LAB is shallower than that of the Tarim Basin; a 50 km vertical offset implies that part of the lithosphere has been delaminated. Under the middle CTS, two phases of negative velocity gradient are obtained, which may indicate a new LAB and an ongoing delamination underneath. Under the northern CTS and Kazakh Shield northward, the lithosphere is stable although the LAB inclines southward slightly. The two periods of lithospheric delamination under the southern and middle CTS account well for pulsed uplifts of the Tien Shan at ~11-8 Ma and ~5-0 Ma, respectively

Directory of Open Access Journals

Index tracking model, downside risk and non-parametric kernel estimation

Author: Huang Jinbo
Li Yong
Yao Haixiang
Publication venue: 'Elsevier BV'
Publication date: 01/07/2018
Field of study

In this paper, we propose an index tracking model with the conditional value-at-risk (CVaR) constraint based on a non-parametric kernel (NPK) estimation framework. In theory, we demonstrate that the index tracking model with the CVaR constraint is a convex optimization problem. We then derive NPK estimators for tracking errors and CVaR, and thereby construct the NPK index tracking model. Monte Carlo simulations show that the NPK method outperforms the linear programming (LP) method in terms of estimation accuracy. In addition, the NPK method can enhance computational efficiency when the sample size is large. Empirical tests show that the NPK method can effectively control downside risk and obtain higher excess returns, in both bearish and bullish market environments

University of Queensland eSpace

Efficient Black-box Checking of Snapshot Isolation in Databases

Author: Basin David
Chen Zhenge
Huang Kaile
Li Haixiang
Liu Si
Pan Anqun
Wei Hengfeng
Publication venue
Publication date: 24/10/2023
Field of study

Snapshot isolation (SI) is a prevalent weak isolation level that avoids the performance penalty imposed by serializability and simultaneously prevents various undesired data anomalies. Nevertheless, SI anomalies have recently been found in production cloud databases that claim to provide the SI guarantee. Given the complex and often unavailable internals of such databases, a black-box SI checker is highly desirable. In this paper we present PolySI, a novel black-box checker that efficiently checks SI and provides understandable counterexamples upon detecting violations. PolySI builds on a novel characterization of SI using generalized polygraphs (GPs), for which we establish its soundness and completeness. PolySI employs an SMT solver and also accelerates SMT solving by utilizing the compact constraint encoding of GPs and domain-specific optimizations for pruning constraints. As demonstrated by our extensive assessment, PolySI successfully reproduces all of 2477 known SI anomalies, detects novel SI violations in three production cloud databases, identifies their causes, outperforms the state-of-the-art black-box checkers under a wide range of workloads, and can scale up to large-sized workloads.Comment: 20 pages, 15 figures, accepted by PVLD

arXiv.org e-Print Archive

Predicting no-show medical appointments using machine learning

Author: A Aggarwal
A Estabrooks
C Cortes
DL Vargas
F Mallor
G Haixiang
H He
I Mohammadi
L Nanni
LA Nuti
M Bécue-Bertaut
NV Chawla
P Branco
P Kheirkhah
S Arora
S Belciug
V López
W Han
XY Liu
Y Huang
Y Huang
Y Sun
Y Wang
Y Xiang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/12/2019
Field of study

Health care centers face many issues due to the limited availability of resources, such as funds, equipment, beds, physicians, and nurses. Appointment absences lead to a waste of hospital resources as well as endangering patient health. This fact makes unattended medi- cal appointments both socially expensive and economically costly. This research aimed to build a predictive model to identify whether an appointment would be a no-show or not in order to reduce its consequences. This paper proposes a multi-stage framework to build an accurate predictor that also tackles the imbalanced property that the data exhibits. The first stage includes dimensionality reduction to compress the data into its most important components. The second stage deals with the imbalanced nature of the data. Different machine learning algorithms were used to build the classifiers in the third stage. Various evaluation metrics are also discussed and an evaluation scheme that fits the problem at hand is described. The work presented in this paper will help decision makers at health care centers to implement effective strategies to reduce the number of no-shows

Crossref

Irish Universities

DCU Online Research Access Service

An insight into imbalanced Big Data classification: outcomes and challenges

Author: A Fernández
A Fernández
A Thusoo
B Krawczyk
C Bunkhumpornpat
CP Chen
D Lyubimov
E Elsebakhi
E Ramentol
F Hu
F Hu
G Haixiang
GEAPA Batista
GM Weiss
H He
H Yu
I Triguero
I Triguero
J Alcalá-Fdez
J Dean
J Huang
J Li
JA Sáez
JM Tomczak
K Kambatla
L Rokach
M Galar
M Galar
M Wasikowski
NV Chawla
NV Chawla
PC Zikopoulos
R Baeza-Yates
R Barandela
R Blagus
RC Prati
S Alshomrani
S Barua
S Elhag
S Kamal
S Owen
S Río
S Río
S-H Park
T Jo
T White
V García
V López
V López
V López
X Meng
X Wu
Y Guo
Y Sun
Y-S Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Big Data applications are emerging during the last years, and researchers from many disciplines are aware of the high advantages related to the knowledge extraction from this type of problem. However, traditional learning approaches cannot be directly applied due to scalability issues. To overcome this issue, the MapReduce framework has arisen as a “de facto” solution. Basically, it carries out a “divide-and-conquer” distributed procedure in a fault-tolerant way to adapt for commodity hardware. Being still a recent discipline, few research has been conducted on imbalanced classification for Big Data. The reasons behind this are mainly the difficulties in adapting standard techniques to the MapReduce programming style. Additionally, inner problems of imbalanced data, namely lack of data and small disjuncts, are accentuated during the data partitioning to fit the MapReduce programming style. This paper is designed under three main pillars. First, to present the first outcomes for imbalanced classification in Big Data problems, introducing the current research state of this area. Second, to analyze the behavior of standard pre-processing techniques in this particular framework. Finally, taking into account the experimental results obtained throughout this work, we will carry out a discussion on the challenges and future directions for the topic.This work has been partially supported by the Spanish Ministry of Science and Technology under Projects TIN2014-57251-P and TIN2015-68454-R, the Andalusian Research Plan P11-TIC-7765, the Foundation BBVA Project 75/2016 BigDaPTOOLS, and the National Science Foundation (NSF) Grant IIS-1447795

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Springer - Publisher Connector

Repositorio Institucional Universidad de Granada

Rotor end factors for 2-D FEA of induction motors with smooth or slitted solid rotor

Author: Haixiang Cao
Shanming Wang
Yuguang Sun
Ziguo Huang
Publication venue: China Electrotechnical Society
Publication date: 01/06/2017
Field of study

Modifying the equivalent rotor resistivity with rotor end factor in 2-dimension (2-D) finite element analysis (FEA) is an effective way to analyze the 3-dimension (3-D) solid rotor problems. For the smooth solid rotor, five different rotor end factors are discussed and compared with each other. It is theoretically clarified that the resistivity of rotor in 2-D FEA should be multiplied by the square of rotor end factors to take the 3-D end effect of solid rotor into account. For the slitted solid rotor, an improved rotor end factor is proposed based on the equivalent area algorithm of eddy currents in rotor, since the end factors of smooth solid rotor are not applicable. Finally, the time-harmonic finite element method (FEM) combined with the rotor end factor is applied to analyze the performance of solid rotor induction motor. The tested and computed results are in good agreement, which proves the effectiveness of rotor end factor for the simplication of 3-D solid rotor problems

Directory of Open Access Journals

Safety risk evaluations of deep foundation construction schemes based on imbalanced data sets

Author: Gong Peisong
Guo Haixiang
Guo Shengyu
Huang Yuanyue
Publication venue: 'Vilnius Gediminas Technical University'
Publication date: 20/04/2020
Field of study

Safety risk evaluations of deep foundation construction schemes are important to ensure safety. However, the amount of knowledge on these evaluations is large, and the historical data of deep foundation engineering is imbalanced. Some adverse factors influence the quality and efficiency of evaluations using traditional manual evaluation tools. Machine learning guarantees the quality of imbalanced data classifications. In this study, three strategies are proposed to improve the classification accuracy of imbalanced data sets. First, data set information redundancy is reduced using a binary particle swarm optimization algorithm. Then, a classification algorithm is modified using an Adaboost-enhanced support vector machine classifier. Finally, a new classification evaluation standard, namely, the area under the ROC curve, is adopted to ensure the classifier to be impartial to the minority. A transverse comparison experiment using multiple classification algorithms shows that the proposed integrated classification algorithm can overcome difficulties associated with correctly classifying minority samples in imbalanced data sets. The algorithm can also improve construction safety management evaluations, relieve the pressure from the lack of experienced experts accompanying rapid infrastructure construction, and facilitate knowledge reuse in the field of architecture, engineering, and construction

VGTU Journals (Vilnius Gediminas Technical University - Vilnius Tech)

中国語への翻字における関連語抽出の効果

Author: Fujii Atsushi
Huang HaiXiang
藤井敦
黄海湘
Publication venue
Publication date: 03/10/2009
Field of study

Institutional Repositories DataBase (IRDB)

A Hybrid Method for Generation of Typical Meteorological Years for Different Climates of China

Author: Guoqiang Sun
Haixiang Zang
Jing Huang
Miaomiao Wang
Zhinong Wei
Publication venue: MDPI AG
Publication date: 01/12/2016
Field of study

Since a representative dataset of the climatological features of a location is important for calculations relating to many fields, such as solar energy system, agriculture, meteorology and architecture, there is a need to investigate the methodology for generating a typical meteorological year (TMY). In this paper, a hybrid method with mixed treatment of selected results from the Danish method, the Festa-Ratto method, and the modified typical meteorological year method is proposed to determine typical meteorological years for 35 locations in six different climatic zones of China (Tropical Zone, Subtropical Zone, Warm Temperate Zone, Mid Temperate Zone, Cold Temperate Zone and Tibetan Plateau Zone). Measured weather data (air dry-bulb temperature, air relative humidity, wind speed, pressure, sunshine duration and global solar radiation), which cover the period of 1994–2015, are obtained and applied in the process of forming TMY. The TMY data and typical solar radiation data are investigated and analyzed in this study. It is found that the results of the hybrid method have better performance in terms of the long-term average measured data during the year than the other investigated methods. Moreover, the Gaussian process regression (GPR) model is recommended to forecast the monthly mean solar radiation using the last 22 years (1994–2015) of measured data

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals