
    Adaptive pattern recognition by mini-max neural networks as a part of an intelligent processor

    In this decade and into the 21st century, NASA will fly missions including Space Station and the Earth-related planetary sciences. To support these missions, a high degree of sophistication in machine automation and an increasing data-processing throughput are necessary. Meeting these challenges requires intelligent machines designed to support the necessary automation in remote and hazardous space environments. There are two approaches to designing these intelligent machines. One is the knowledge-based expert-system approach, namely AI. The other is a non-rule approach based on parallel and distributed computing for adaptive fault tolerance, namely Neural or Natural Intelligence (NI). The union of AI and NI is the solution to the problem stated above. The NI segment of this unit extracts features automatically by applying Cauchy simulated annealing to a mini-max cost energy function. Features discovered by NI can then be passed to the AI system for further processing, and vice versa. This exchange increases reliability, for AI can follow the NI-formulated algorithm exactly and can provide the contextual knowledge base as constraints for neurocomputing. The mini-max cost function that solves for the unknown features can furthermore yield a top-down architectural design of neural networks by means of a Taylor series expansion of the cost function. A typical mini-max cost function has the sample variance of each class in the numerator and the separation of the class centers in the denominator. Thus, when the total cost energy is minimized, the conflicting goals of intraclass clustering and interclass segregation are achieved simultaneously.
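    The mini-max cost described above can be illustrated with a small sketch. Everything below — the toy data, the one-dimensional projection, and the annealing schedule — is a hypothetical stand-in, not the paper's actual formulation: the cost puts the summed within-class sample variance in the numerator and the squared separation of the projected class centers in the denominator, and is minimized by simulated annealing with Cauchy-distributed steps.

```python
import math
import random

random.seed(0)

# Two toy 2-D classes standing in for real feature data (hypothetical)
class_a = [(random.gauss(0.0, 0.5), random.gauss(0.0, 0.5)) for _ in range(30)]
class_b = [(random.gauss(3.0, 0.5), random.gauss(1.0, 0.5)) for _ in range(30)]

def project(points, w):
    return [w[0] * x + w[1] * y for x, y in points]

def variance(vals):
    m = sum(vals) / len(vals)
    return sum((v - m) ** 2 for v in vals) / len(vals)

def minimax_cost(w):
    a, b = project(class_a, w), project(class_b, w)
    intra = variance(a) + variance(b)                      # numerator
    sep = (sum(a) / len(a) - sum(b) / len(b)) ** 2 + 1e-9  # denominator
    return intra / sep

def cauchy_anneal(cost, w, steps=2000, t0=1.0):
    """Minimize cost by annealing with Cauchy-distributed candidate steps."""
    best_w, best_c = w, cost(w)
    cur_w, cur_c = best_w, best_c
    for k in range(1, steps + 1):
        t = t0 / k  # fast (Cauchy) cooling schedule
        cand = tuple(wi + t * math.tan(math.pi * (random.random() - 0.5))
                     for wi in cur_w)
        c = cost(cand)
        # accept improvements always, worse moves with Boltzmann probability
        if c < cur_c or random.random() < math.exp(-(c - cur_c) / max(t, 1e-12)):
            cur_w, cur_c = cand, c
            if c < best_c:
                best_w, best_c = cand, c
    return best_w, best_c

w0 = (1.0, 0.0)
w_opt, c_opt = cauchy_anneal(minimax_cost, w0)
print(c_opt <= minimax_cost(w0))
```

    Minimizing this ratio pursues both goals named in the abstract at once: small intraclass variance and large interclass separation.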

    Text Classification Aided by Clustering: a Literature Review


    ESTSS—energy system time series suite: a declustered, application-independent, semi-artificial load profile benchmark set

    This paper introduces a univariate, application-independent set of load profiles or time series derived from real-world energy system data. The generation involved a two-step process: manifolding the initial dataset through signal processors to increase diversity and heterogeneity, followed by a declustering process that removes data redundancy. The study employed common feature engineering and machine learning techniques: the time series are transformed into a normalized feature space, followed by dimensionality reduction via hierarchical clustering and optimization. The resulting dataset is uniformly distributed across multiple feature space dimensions while retaining the time and frequency domain characteristics typical of energy system time series. This data serves various purposes, including algorithm testing, uncovering functional relationships between time series features and system performance, and training machine learning models. Two case studies demonstrate these claims: one focused on the suitability of hybrid energy storage systems and the other on quantifying the onsite hydrogen supply cost in green hydrogen production sites. The declustering algorithm, although a by-product of this study, shows promise for further scientific exploration. The data and source code are openly accessible, providing a robust platform for future comparative studies. This work also offers smaller subsets for computationally intensive research. Data and source code can be found at https://github.com/s-guenther/estss and https://zenodo.org/records/10213145.
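    The declustering idea — keep a subset of profiles spread as uniformly as possible over a normalized feature space — can be sketched with greedy farthest-point selection. The toy profiles, the three features, and the selection rule below are all illustrative assumptions, not the ESTSS implementation:

```python
import math
import random

random.seed(1)

# Hypothetical weekly load profiles (168 hourly values) with varying
# phase and noise level, standing in for the real energy time series
profiles = [[math.sin(2 * math.pi * t / 24 + p) + random.gauss(0, a)
             for t in range(168)]
            for p in (0.0, 0.1, 1.5, 3.0) for a in (0.05, 0.1, 0.5)]

def features(ts):
    """Three simple illustrative features: mean, std, peak-to-peak."""
    m = sum(ts) / len(ts)
    sd = math.sqrt(sum((v - m) ** 2 for v in ts) / len(ts))
    return (m, sd, max(ts) - min(ts))

def normalize(rows):
    """Min-max scale each feature column to [0, 1]."""
    cols = list(zip(*rows))
    lo = [min(c) for c in cols]
    span = [(max(c) - min(c)) or 1.0 for c in cols]
    return [tuple((v - l) / s for v, l, s in zip(r, lo, span)) for r in rows]

feats = normalize([features(p) for p in profiles])

def decluster(points, k):
    """Greedy farthest-point selection: redundant near-duplicates are
    skipped, so the k survivors cover the feature space evenly."""
    chosen = [0]
    while len(chosen) < k:
        gap = lambda i: min(math.dist(points[i], points[j]) for j in chosen)
        chosen.append(max((i for i in range(len(points))
                           if i not in chosen), key=gap))
    return sorted(chosen)

subset = decluster(feats, 4)
print(len(subset))
```

    The real pipeline additionally optimizes toward a uniform distribution across the feature dimensions; this sketch only captures the redundancy-removal intuition.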

    Identifying Heavy-Flavor Jets Using Vectors of Locally Aggregated Descriptors

    Jets of collimated particles serve a multitude of purposes in high energy collisions. Recently, studies of jet interaction with the quark-gluon plasma (QGP) created in high energy heavy ion collisions have drawn growing interest, particularly towards understanding partonic energy loss in the QGP medium and its related modifications of the jet shower and fragmentation. Since the QGP is a colored medium, the extent of jet quenching and, consequently, the transport properties of the medium are expected to be sensitive to fundamental properties of the jets, such as the flavor of the parton that initiates the jet. Identifying the jet flavor enables an extraction of the mass dependence in jet-QGP interactions. We present a novel approach to tagging heavy-flavor jets at collider experiments that utilizes the information contained within jet constituents via the JetVLAD model architecture. We show the performance of this model in proton-proton collisions at a center-of-mass energy of √s = 200 GeV as characterized by common metrics, and showcase its ability to extract a high-purity heavy-flavor jet sample at various jet momenta and realistic production cross-sections, including a brief discussion of the impact of out-of-time pile-up. Such studies open new opportunities for future high-purity heavy-flavor measurements at jet energies accessible at current and future collider experiments.
    Comment: 18 pages, 6 figures, and 3 tables. Accepted by JINS

    Conduits of Intratumor Heterogeneity: Centrosome Amplification, Centrosome Clustering and Mitotic Frequency

    Tumor initiation and progression depend on the acquisition and accumulation of multiple driver mutations that activate and fuel oncogenic pathways and deactivate tumor suppressor networks. This complex continuum of non-stochastic genetic changes, accompanied by error-prone mitoses, largely explains why tumors are a mosaic of different cells. Contrary to the long-held notion that tumors are dominated by genetically identical cells, tumors often contain many different subsets of cells that are remarkably diverse and distinct. The extent of this intratumor heterogeneity has bewildered cancer biologists and clinicians alike, as it partly illuminates why most cancer treatments fail. Unsurprisingly, no "wonder" drug is yet available that can target all the different sub-populations, including rare clones, and win the war on cancer. Breast tumors harbor an enormous degree of intratumor heterogeneity, both within primary and metastatic lesions. This revelation calls into question large clinical endeavors such as the Human Genome Project that have sequenced a single biopsy from a large tumor mass, precluding the realization that a single tumor mass comprises cells with a variety of genotypic compositions. It is also becoming recognized that intratumor clonal heterogeneity underlies therapeutic resistance. Thus, to comprehend the clinical behavior and therapeutic management of tumors, it is imperative to recognize and understand how intratumor heterogeneity arises. To this end, my research proposes to study two main cellular traits of tumors that can be quantitatively evaluated as "surrogates" for tumor heterogeneity at various stages of the disease: (a) centrosome amplification and clustering, and (b) mitotic frequency.
    This study aims at interrogating how a collaborative interplay of these "vehicles" supports the tumor's evolutionary agenda, and how we can glean prognostic and predictive information from an accurate determination of these cellular traits.

    Numerical Experiments with Support Vector Machines

    The report presents a series of numerical experiments on the application of Support Vector Machines to two-class spatial data classification. The main attention is paid to the variability of the results as the hyperparameters change: the bandwidth of the radial basis function kernel and the C parameter. Training error, testing error, and the number of support vectors are plotted against the hyperparameters. The number of support vectors is minimal at the optimal solution. Two real case studies are considered: Cd contamination in Lake Leman, and radionuclide soil contamination in the Briansk region. Structural analysis (variography) is used to describe the spatial patterns obtained and to monitor the performance of the SVM.
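    A hyperparameter sweep of this kind can be sketched with scikit-learn. The synthetic dataset, the grid values, and the bookkeeping below are illustrative assumptions, not the report's actual setup: for each (gamma, C) pair we record training error, testing error, and the number of support vectors.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))                          # synthetic 2-D "spatial" data
y = (X[:, 0] ** 2 + X[:, 1] ** 2 > 1.0).astype(int)    # two-class, non-linear boundary
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

results = []
for gamma in (0.01, 0.1, 1.0, 10.0):   # RBF kernel bandwidth
    for C in (0.1, 1.0, 10.0):          # regularization parameter
        clf = SVC(kernel="rbf", gamma=gamma, C=C).fit(X_tr, y_tr)
        results.append({
            "gamma": gamma, "C": C,
            "train_err": 1.0 - clf.score(X_tr, y_tr),
            "test_err": 1.0 - clf.score(X_te, y_te),
            "n_sv": int(clf.support_vectors_.shape[0]),
        })

# the report's observation: the support-vector count tends to dip
# near the hyperparameters with the lowest testing error
best = min(results, key=lambda r: r["test_err"])
print(best["gamma"], best["C"], best["n_sv"])
```

    Plotting `train_err`, `test_err`, and `n_sv` over the (gamma, C) grid reproduces the kind of surfaces the report examines.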

    A unified framework for detecting groups and application to shape recognition

    A unified a contrario detection method is proposed to solve three classical problems in clustering analysis. The first is to evaluate the validity of a cluster candidate. The second is that meaningful clusters can contain or be contained in other meaningful clusters, so a rule is needed to define locally optimal clusters by inclusion. The third is the definition of a correct merging rule between meaningful clusters, deciding whether they should remain separate or be merged. The motivation for this theory is shape recognition. Matching algorithms usually compute correspondences between more or less local features (called shape elements) in the images to be compared. This paper groups matching shape elements into spatially coherent shapes. Each pair of matching shape elements indeed leads to a unique transformation (a similarity or affine map). As an application, the present theory on the choice of the right clusters is used to group these shape elements into shapes by detecting clusters in the transformation space.
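    The clustering-in-transformation-space idea can be sketched as follows. The transforms, the distance metric, and the merging threshold below are hypothetical, and the a contrario meaningfulness computation itself is omitted: each match contributes one point (scale, angle, tx, ty) in transformation space, and matches belonging to the same shape land close together.

```python
import math
from itertools import combinations

# Each tuple is a hypothetical similarity transform (scale, angle, tx, ty)
# computed from one pair of matching shape elements
matches = [
    (1.00, 0.10, 5.0, 3.0),    # three mutually consistent matches:
    (1.02, 0.12, 5.1, 2.9),    # they agree on one transformation,
    (0.98, 0.09, 4.9, 3.1),    # i.e. one coherent shape
    (2.50, 1.50, -20.0, 8.0),  # an inconsistent outlier match
]

def dist(a, b):
    # crude Euclidean metric in transformation space; real work
    # would weight scale, angle, and translation axes differently
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def group(transforms, eps=0.5):
    """Single-linkage grouping: transforms closer than eps merge."""
    groups = [[t] for t in transforms]
    merged = True
    while merged:
        merged = False
        for g1, g2 in combinations(groups, 2):
            if any(dist(a, b) < eps for a in g1 for b in g2):
                g1.extend(g2)
                groups.remove(g2)
                merged = True
                break
    return groups

clusters = group(matches)
print(len(clusters))
```

    The paper's contribution is deciding, via the a contrario framework, which such groups are meaningful and when to merge them; this sketch shows only the geometric grouping step.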

    Simulated Annealing

    The book contains 15 chapters presenting recent contributions of top researchers working with Simulated Annealing (SA). Although it represents only a small sample of the research activity on SA, the book will certainly serve as a valuable tool for researchers interested in getting involved in this multidisciplinary field. In fact, one of its salient features is that it is highly multidisciplinary in terms of application areas, assembling experts from the fields of Biology, Telecommunications, Geology, Electronics, and Medicine.