173 research outputs found
Techniques for Complex Analysis of Contemporary Data
Contemporary data objects are typically complex, semi-structured, or unstructured at all. Besides, objects are also related to form a network. In such a situation, data analysis requires not only the traditional attribute-based access but also access based on similarity as well as data mining operations. Though tools for such operations do exist, they usually specialise in operation and are available for specialized data structures supported by specific computer system environments. In contrary, advance analyses are obtained by application of several elementary access operations which in turn requires expert knowledge in multiple areas. In this paper, we propose a unification platform for various data analytical operators specified as a general-purpose analytical system ADAMiSS. An extensible data-mining and similarity-based set of operators over a common versatile data structure allow the recursive application of heterogeneous operations, thus allowing the definition of complex analytical processes, necessary to solve the contemporary analytical tasks. As a proof-of-concept, we present results that were obtained by our prototype implementation on two real-world data collections: the Twitter Higg's boson and the Kosarak datasets
Review of the mathematical foundations of data fusion techniques in surface metrology
The recent proliferation of engineered surfaces, including freeform and structured surfaces, is challenging current metrology techniques. Measurement using multiple sensors has been proposed to achieve enhanced benefits, mainly in terms of spatial frequency bandwidth, which a single sensor cannot provide. When using data from different sensors, a process of data fusion is required and there is much active research in this area. In this paper, current data fusion methods and applications are reviewed, with a focus on the mathematical foundations of the subject. Common research questions in the fusion of surface metrology data are raised and potential fusion algorithms are discussed
Coherent quantum LQG control
Based on a recently developed notion of physical realizability for quantum
linear stochastic systems, we formulate a quantum LQG optimal control problem
for quantum linear stochastic systems where the controller itself may also be a
quantum system and the plant output signal can be fully quantum. Such a control
scheme is often referred to in the quantum control literature as "coherent
feedback control.'' It distinguishes the present work from previous works on
the quantum LQG problem where measurement is performed on the plant and the
measurement signals are used as input to a fully classical controller with no
quantum degrees of freedom. The difference in our formulation is the presence
of additional non-linear and linear constraints on the coefficients of the
sought after controller, rendering the problem as a type of constrained
controller design problem. Due to the presence of these constraints our problem
is inherently computationally hard and this also distinguishes it in an
important way from the standard LQG problem. We propose a numerical procedure
for solving this problem based on an alternating projections algorithm and, as
initial demonstration of the feasibility of this approach, we provide fully
quantum controller design examples in which numerical solutions to the problem
were successfully obtained. For comparison, we also consider the case of
classical linear controllers that use direct or indirect measurements, and show
that there exists a fully quantum linear controller which offers an improvement
in performance over the classical ones.Comment: 25 pages, 1 figure, revised and corrected version (mainly to Section
8). To be published in Automatica, Journal of IFAC, 200
Tiling array data analysis: a multiscale approach using wavelets
<p>Abstract</p> <p>Background</p> <p>Tiling array data is hard to interpret due to noise. The wavelet transformation is a widely used technique in signal processing for elucidating the true signal from noisy data. Consequently, we attempted to denoise representative tiling array datasets for ChIP-chip experiments using wavelets. In doing this, we used specific wavelet basis functions, <it>Coiflets</it>, since their triangular shape closely resembles the expected profiles of true ChIP-chip peaks.</p> <p>Results</p> <p>In our wavelet-transformed data, we observed that noise tends to be confined to small scales while the useful signal-of-interest spans multiple large scales. We were also able to show that wavelet coefficients due to non-specific cross-hybridization follow a log-normal distribution, and we used this fact in developing a thresholding procedure. In particular, wavelets allow one to set an unambiguous, absolute threshold, which has been hard to define in ChIP-chip experiments. One can set this threshold by requiring a similar confidence level at different length-scales of the transformed signal. We applied our algorithm to a number of representative ChIP-chip data sets, including those of Pol II and histone modifications, which have a diverse distribution of length-scales of biochemical activity, including some broad peaks.</p> <p>Conclusions</p> <p>Finally, we benchmarked our method in comparison to other approaches for scoring ChIP-chip data using spike-ins on the ENCODE Nimblegen tiling array. This comparison demonstrated excellent performance, with wavelets getting the best overall score.</p
OSCA: a comprehensive open-access system of analysis of posterior capsular opacification
BACKGROUND: This paper presents and tests a comprehensive computerised system of analysis of digital images of posterior capsule opacification (PCO). It updates and expands significantly on a previous presentation to include facilities for selecting user defined central areas and for registering and subsequent merging of images for artefact removal. Also, the program is compiled and thus eliminates the need for specialised additional software. The system is referred to in this paper as the open-access systematic capsule assessment (OSCA). The system is designed to be evidence based, objective and openly available, improving on current systems of analysis. METHODS: Principal features of the OSCA system of analysis are discussed. Flash artefacts are automatically located in two PCO images and the images merged to produce a composite free from these artefacts. For this to be possible the second image has to be manipulated with a registration technique to bring it into alignment with the first. Further image processing and analysis steps use a location-sensitive entropy based texture analysis of PCO. Validity of measuring PCO progression of the whole new system is assessed along with visual significance of scores. Reliability of the system is assessed. RESULTS: Analysis of PCO by the system shows ability to detect early progression of PCO, as well as detection of more visually significant PCO. Images with no clinical PCO produce very low scores in the analysis. Reliability of the system of analysis is demonstrated. CONCLUSION: This system of PCO analysis is evidence-based, objective and clinically useful. It incorporates flash detection and removal as well as location sensitive texture analysis. It provides features and benefits not previously available to most researchers or clinicians. Substantial evidence is provided for this system's validity and reliability
Recommended from our members
Optimum design of reinforced concrete retaining walls with the flower pollination algorithm
The flower pollination algorithm (FPA) is anefficient metaheuristicoptimizationalgorithm mimickingthe pollinationprocessof flowering species. In this study, FPA is applied, for first time, to the optimum design of reinforced concrete (RC) cantilever retaining walls. It is foundthat FPA offers important savings with respect to conventional design approachesand that it outperformsgenetic algorithm (GA)andthe particle swarm optimization (PSO) algorithm in this designproblem.Furthermore, parameter tuning reveals that the best FPA performance is achieved for switch probability values ranging between 0.4 and 0.7, a population size of 20 individualsand aLĂ©vy flightstep sizescale factor of 0.5. Finally, parametric optimum designs show that theoptimumcost of RC retaining walls increases rapidly with the wallheight and smoothly with the magnitude of surcharge loadin
A novel selection of optimal statistical features in the DWPT domain for discrimination of ictal and seizure-free electroencephalography signals
Properly determining the discriminative features which characterize the inherent behaviors of electroencephalography (EEG) signals remains a great challenge for epileptic seizure detection. In this present study, a novel feature selection scheme based on the discrete wavelet packet decomposition and cuckoo search algorithm (CSA) was proposed. The normal as well as epileptic EEG recordings were frst decomposed into various frequency bands by means of wavelet packet decomposition, and subsequently, statistical features at all developed nodes in the wavelet packet decomposition tree were derived. Instead of using the complete set of the extracted features to construct a wavelet neural networks-based classifer, an optimal feature subset that maximizes the predictive competence of the classifer was selected by using the CSA. Experimental results on the publicly available benchmarks demonstrated that the proposed feature subset selection scheme achieved promising recognition accuracies of 98.43â100%, and the results were statistically signifcant using z-test with p value <0.0001
Effects of Local and Landscape Factors on Population Dynamics of a Cotton Pest
BACKGROUND: Many polyphagous pests sequentially use crops and uncultivated habitats in landscapes dominated by annual crops. As these habitats may contribute in increasing or decreasing pest density in fields of a specific crop, understanding the scale and temporal variability of source and sink effects is critical for managing landscapes to enhance pest control. METHODOLOGY/PRINCIPAL FINDINGS: We evaluated how local and landscape characteristics affect population density of the western tarnished plant bug, Lygus hesperus (Knight), in cotton fields of the San Joaquin Valley in California. During two periods covering the main window of cotton vulnerability to Lygus attack over three years, we examined the associations between abundance of six common Lygus crops, uncultivated habitats and Lygus population density in these cotton fields. We also investigated impacts of insecticide applications in cotton fields and cotton flowering date. Consistent associations observed across periods and years involved abundances of cotton and uncultivated habitats that were negatively associated with Lygus density, and abundance of seed alfalfa and cotton flowering date that were positively associated with Lygus density. Safflower and forage alfalfa had variable effects, possibly reflecting among-year variation in crop management practices, and tomato, sugar beet and insecticide applications were rarely associated with Lygus density. Using data from the first two years, a multiple regression model including the four consistent factors successfully predicted Lygus density across cotton fields in the last year of the study. CONCLUSIONS/SIGNIFICANCE: Our results show that the approach developed here is appropriate to characterize and test the source and sink effects of various habitats on pest dynamics and improve the design of landscape-level pest management strategies
Disease-associated alleles in genome-wide association studies are enriched for derived low frequency alleles relative to HapMap and neutral expectations
<p>Abstract</p> <p>Background</p> <p>Genome-wide association studies give insight into the genetic basis of common diseases. An open question is whether the allele frequency distributions and ancestral vs. derived states of disease-associated alleles differ from the rest of the genome. Characteristics of disease-associated alleles can be used to increase the yield of future studies.</p> <p>Methods</p> <p>The set of all common disease-associated alleles found in genome-wide association studies prior to January 2010 was analyzed and compared with HapMap and theoretical null expectations. In addition, allele frequency distributions of different disease classes were assessed. Ages of HapMap and disease-associated alleles were also estimated.</p> <p>Results</p> <p>The allele frequency distribution of HapMap alleles was qualitatively similar to neutral expectations. However, disease-associated alleles were more likely to be low frequency derived alleles relative to null expectations. 43.7% of disease-associated alleles were ancestral alleles. The mean frequency of disease-associated alleles was less than randomly chosen CEU HapMap alleles (0.394 vs. 0.610, after accounting for probability of detection). Similar patterns were observed for the subset of disease-associated alleles that have been verified in multiple studies. SNPs implicated in genome-wide association studies were enriched for young SNPs compared to randomly selected HapMap loci. Odds ratios of disease-associated alleles tended to be less than 1.5 and varied by frequency, confirming previous studies.</p> <p>Conclusions</p> <p>Alleles associated with genetic disease differ from randomly selected HapMap alleles and neutral expectations. The evolutionary history of alleles (frequency and ancestral vs. derived state) influences whether they are implicated in genome-wide assocation studies.</p
Effect of variable transmission rate on the dynamics of HIV in sub-Saharan Africa
<p>Abstract</p> <p>Background</p> <p>The cause of the high HIV prevalence in sub-Saharan Africa is incompletely understood, with heterosexual penile-vaginal transmission proposed as the main mechanism. Heterosexual HIV transmission has been estimated to have a very low probability; but effects of cofactors that vary in space and time may substantially alter this pattern.</p> <p>Methods</p> <p>To test the effect of individual variation in the HIV infectiousness generated by co-infection, we developed and analyzed a mathematical sexual network model that simulates the behavioral components of a population from Malawi, as well as the dynamics of HIV and the co-infection effect caused by other infectious diseases, including herpes simplex virus type-2, gonorrhea, syphilis and malaria.</p> <p>Results</p> <p>The analysis shows that without the amplification effect caused by co-infection, no epidemic is generated, and HIV prevalence decreases to extinction. But the model indicates that an epidemic can be generated by the amplification effect on HIV transmission caused by co-infection.</p> <p>Conclusion</p> <p>The simulated sexual network demonstrated that a single value for HIV infectivity fails to describe the dynamics of the epidemic. Regardless of the low probability of heterosexual transmission per sexual contact, the inclusion of individual variation generated by transient but repeated increases in HIV viral load associated with co-infections may provide a biological basis for the accelerated spread of HIV in sub-Saharan Africa. Moreover, our work raises the possibility that the natural history of HIV in sub-Saharan Africa cannot be fully understood if individual variation in infectiousness is neglected.</p
- âŠ