104 research outputs found
Evolutionary structure learning algorithm for Bayesian network and penalized mutual information metric
This paper formulates the problem of learning Bayesian network structures from data as determining the structure that best approximates the probability distribution indicated by the data. A new metric, Penalized Mutual Information metric, is proposed, and a evolutionary algorithm is designed to search for the best structure among alternatives. The experimental results show that this approach is reliable and promising.<br /
An examination on the performance of MML causal induction
This paper presents an examination report on the performance of the improved MML based causal model discovery algorithm. In this paper, We firstly describe our improvement to the causal discovery algorithm which introduces a new encoding scheme for measuring the cost of describing the causal structure. Stiring function is also applied to further simplify the computational complexity and thus works more efficiently. It is followed by a detailed examination report on the performance of our improved discovery algorithm. The experimental results of the current version of the discovery system show that: (l) the current version is capable of discovering what discovered by previous system; (2) current system is capable of discovering more complicated causal networks with large number of variables; (3) the new version works more efficiently compared with the previous version in terms of time complexity
An efficient multiple harmonic balance method for computing quasi-periodic responses of nonlinear systems
Quasi-periodic responses composed of multiple base frequencies widely exist
in science and engineering problems. The multiple harmonic balance (MHB) method
is one of the most commonly used approaches for such problems. However, it is
limited by low-order estimations due to complex symbolic operations in
practical uses. Many variants have been developed to improve the MHB method,
among which the time domain MHB-like methods are regarded as crucial
improvements because of their high efficiency and simple derivation. But there
is still one main drawback remaining to be addressed. The time domain MHB-like
methods negatively suffer from non-physical solutions, which have been shown to
be caused by aliasing (mixtures of the high-order into the low-order
harmonics). Inspired by the collocation-based harmonic balancing framework
recently established by our group, we herein propose a reconstruction multiple
harmonic balance (RMHB) method to reconstruct the conventional MHB method using
discrete time domain collocations. Our study shows that the relation between
the MHB and time domain MHB-like methods is determined by an aliasing matrix,
which is non-zero when aliasing occurs. On this basis, a conditional
equivalence is established to form the RMHB method. Three numerical examples
demonstrate that this new method is more robust and efficient than the
state-of-the-art methods.Comment: 25 pages,12 figures, and 5 tables. Accepted manuscrip
Ensemble parameter estimation for graphical models
Parameter Estimation is one of the key issues involved in the discovery of graphical models from data. Current state of the art methods have demonstrated their abilities in different kind of graphical models. In this paper, we introduce ensemble learning into the process of parameter estimation, and examine ensemble parameter estimation methods for different kind of graphical models under complete data set and incomplete data set. We provide experimental results which show that ensemble method can achieve an improved result over the base parameter estimation method in terms of accuracy. In addition, the method is amenable to parallel or distributed processing, which is an important characteristic for data mining in large data sets.<br /
Discovering linear causal model from incomplete data
One common drawback in algorithms for learning Linear Causal Models is that they can not deal with incomplete data set. This is unfortunate since many real problems involve missing data or even hidden variable. In this paper, based on multiple imputation, we propose a three-step process to learn linear causal models from incomplete data set. Experimental results indicate that this algorithm is better than the single imputation method (EM algorithm) and the simple list deletion method, and for lower missing rate, this algorithm can even find models better than the results from the greedy learning algorithm MLGS working in a complete data set. In addition, the method is amenable to parallel or distributed processing, which is an important characteristic for data mining in large data sets.<br /
Abnormal behavior detection for early warning of terrorist attack
Many terrorist attacks are accomplished by bringing explosive devices hidden in ordinary-looking objects to public places. In such case, it is almost impossible to distinguish a terrorist from ordinary people just from the isolated appearance. However, valuable clues might be discovered through analyzing a series of actions of the same person. Abnormal behaviors of object fetching, deposit, or exchange in public places might indicate potential attacks. Based on the widely equipped CCTV surveillance systems at the entrance of many public places, this paper proposes an algorithm to detect such abnormal behaviors for early warning of terrorist attack.<br /
A Simple Time Domain Collocation Method to Precisely Search for the Periodic Orbits of Satellite Relative Motion
A numerical approach for obtaining periodic orbits of satellite relative motion is proposed, based on using the time domain collocation (TDC) method to search for the periodic solutions of an exact J2 nonlinear relative model. The initial conditions for periodic relative orbits of the Clohessy-Wiltshire (C-W) equations or Tschauner-Hempel (T-H) equations can be refined with this approach to generate nearly bounded orbits. With these orbits, a method based on the least-squares principle is then proposed to generate projected closed orbit (PCO), which is a reference for the relative motion control. Numerical simulations reveal that the presented TDC searching scheme is effective and simple, and the projected closed orbit is very fuel saving
Linear causal model discovery using the MML criterion
Determining the causal structure of a domain is a key task in the area of Data Mining and Knowledge Discovery.The algorithm proposed by Wallace et al. [15] has demonstrated its strong ability in discovering Linear Causal Models from given data sets. However, some experiments showed that this algorithm experienced difficulty in discovering linear relations with small deviation, and it occasionally gives a negative message length, which should not be allowed. In this paper, a more efficient and precise MML encoding scheme is proposed to describe the model structure and the nodes in a Linear Causal Model. The estimation of different parameters is also derived. Empirical results show that the new algorithm outperformed the previous MML-based algorithm in terms of both speed and precision. <br /
A classifier graph based recurring concept detection and prediction approach
It is common in real-world data streams that previously seen concepts will reappear, which suggests a unique kind of concept drift, known as recurring concepts. Unfortunately, most of existing algorithms do not take full account of this case. Motivated by this challenge, a novel paradigm was proposed for capturing and exploiting recurring concepts in data streams. It not only incorporates a distribution-based change detector for handling concept drift but also captures recurring concept by storing recurring concepts in a classifier graph. The possibility of detecting recurring drifts allows reusing previously learnt models and enhancing the overall learning performance. Extensive experiments on both synthetic and real-world data streams reveal that the approach performs significantly better than the state-of-the-art algorithms, especially when concepts reappear
- …