Search CORE

202,303 research outputs found

Processing techniques development, volume 3. Part 2: Data preprocessing and information extraction techniques

Author: Anuta P. E.
Landgrebe D. A.
Siegel H. J.
Swain P. H.
Publication venue
Publication date
Field of study

There are no author-identified significant results in this report

NASA Technical Reports Server

PReaCH: A Fast Lightweight Reachability Index using Pruning and Contraction Hierarchies

Author: Merz Florian
Sanders Peter
Publication venue
Publication date: 01/01/2014
Field of study

We develop the data structure PReaCH (for Pruned Reachability Contraction Hierarchies) which supports reachability queries in a directed graph, i.e., it supports queries that ask whether two nodes in the graph are connected by a directed path. PReaCH adapts the contraction hierarchy speedup techniques for shortest path queries to the reachability setting. The resulting approach is surprisingly simple and guarantees linear space and near linear preprocessing time. Orthogonally to that, we improve existing pruning techniques for the search by gathering more information from a single DFS-traversal of the graph. PReaCH-indices significantly outperform previous data structures with comparable preprocessing cost. Methods with faster queries need significantly more preprocessing time in particular for the most difficult instances

arXiv.org e-Print Archive

CiteSeerX

Crossref

The effect of data preprocessing on the performance of artificial neural networks techniques for classification problems

Author: Atomi Walid Hasen
Publication venue
Publication date: 01/12/2012
Field of study

The artificial neural network (ANN) has recently been applied in many areas, such as medical, biology, financial, economy, engineering and so on. It is known as an excellent classifier of nonlinear input and output numerical data. Improving training efficiency of ANN based algorithm is an active area of research and numerous papers have been reviewed in the literature. The performance of Multi-layer Perceptron (MLP) trained with back-propagation artificial neural network (BP-ANN) method is highly influenced by the size of the data-sets and the data-preprocessing techniques used. This work analyzes the advantages of using pre-processing datasets using different techniques in order to improve the ANN convergence. Specifically Min-Max, Z-Score and Decimal Scaling Normalization preprocessing techniques were evaluated. The simulation results showed that the computational efficiency of ANN training process is highly enhanced when coupled with different preprocessing techniques

UTHM Institutional Repository

Altimeter waveform software design

Author: Brown G. S.
Hayne G. S.
Miller L. S.
Publication venue
Publication date
Field of study

Techniques are described for preprocessing raw return waveform data from the GEOS-3 radar altimeter. Topics discussed include: (1) general altimeter data preprocessing to be done at the GEOS-3 Data Processing Center to correct altimeter waveform data for temperature calibrations, to convert between engineering and final data units and to convert telemetered parameter quantities to more appropriate final data distribution values: (2) time "tagging" of altimeter return waveform data quantities to compensate for various delays, misalignments and calculational intervals; (3) data processing procedures for use in estimating spacecraft attitude from altimeter waveform sampling gates; and (4) feasibility of use of a ground-based reflector or transponder to obtain in-flight calibration information on GEOS-3 altimeter performance

NASA Technical Reports Server

PCA 4 DCA: The Application Of Principal Component Analysis To The Dendritic Cell Algorithm

Author: Aickelin Uwe
Greensmith Julie
Gu Feng
Oates Robert
Publication venue
Publication date: 01/01/2009
Field of study

As one of the newest members in the field of artificial immune systems (AIS), the Dendritic Cell Algorithm (DCA) is based on behavioural models of natural dendritic cells (DCs). Unlike other AIS, the DCA does not rely on training data, instead domain or expert knowledge is required to predetermine the mapping between input signals from a particular instance to the three categories used by the DCA. This data preprocessing phase has received the criticism of having manually over-?tted the data to the algorithm, which is undesirable. Therefore, in this paper we have attempted to ascertain if it is possible to use principal component analysis (PCA) techniques to automatically categorise input data while still generating useful and accurate classication results. The integrated system is tested with a biometrics dataset for the stress recognition of automobile drivers. The experimental results have shown the application of PCA to the DCA for the purpose of automated data preprocessing is successful.Comment: 6 pages, 4 figures, 3 tables, (UKCI 2009

arXiv.org e-Print Archive

CiteSeerX

Nottingham eTheses

Crossref

University of Melbourne Institutional Repository

Training Process Reduction Based On Potential Weights Linear Analysis To Accelarate Back Propagation Network

Author: Asadi Roya
Mustapha Norwati
Sulaiman Nasir
Publication venue
Publication date: 01/07/2009
Field of study

Learning is the important property of Back Propagation Network (BPN) and finding the suitable weights and thresholds during training in order to improve training time as well as achieve high accuracy. Currently, data pre-processing such as dimension reduction input values and pre-training are the contributing factors in developing efficient techniques for reducing training time with high accuracy and initialization of the weights is the important issue which is random and creates paradox, and leads to low accuracy with high training time. One good data preprocessing technique for accelerating BPN classification is dimension reduction technique but it has problem of missing data. In this paper, we study current pre-training techniques and new preprocessing technique called Potential Weight Linear Analysis (PWLA) which combines normalization, dimension reduction input values and pre-training. In PWLA, the first data preprocessing is performed for generating normalized input values and then applying them by pre-training technique in order to obtain the potential weights. After these phases, dimension of input values matrix will be reduced by using real potential weights. For experiment results XOR problem and three datasets, which are SPECT Heart, SPECTF Heart and Liver disorders (BUPA) will be evaluated. Our results, however, will show that the new technique of PWLA will change BPN to new Supervised Multi Layer Feed Forward Neural Network (SMFFNN) model with high accuracy in one epoch without training cycle. Also PWLA will be able to have power of non linear supervised and unsupervised dimension reduction property for applying by other supervised multi layer feed forward neural network model in future work.Comment: 11 pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS 2009, ISSN 1947 5500, Impact factor 0.42

arXiv.org e-Print Archive

Universiti Putra Malaysia Institutional Repository