
    OWA operators in regression problems

    We consider an application of fuzzy logic connectives to statistical regression. We replace the standard least squares, least absolute deviation, and maximum likelihood criteria with an ordered weighted averaging (OWA) function of the residuals. Depending on the choice of the weights, we obtain the standard regression problems, high-breakdown robust methods (least median, least trimmed squares, and trimmed likelihood methods), as well as new formulations. We present various approaches to the numerical solution of such regression problems. OWA-based regression is particularly useful in the presence of outliers, and we illustrate the performance of the new methods on several instances of linear regression problems with multiple outliers.
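    A minimal sketch of the idea, assuming squared residuals and a generic derivative-free optimizer (the function names and weight choices below are illustrative, not the paper's solvers):

```python
import numpy as np
from scipy.optimize import minimize

def owa_objective(beta, X, y, w):
    """OWA of the squared residuals: residuals are sorted largest-first
    and combined with the weight vector w."""
    r2 = (y - X @ beta) ** 2
    return np.sort(r2)[::-1] @ w

def fit_owa_regression(X, y, w, beta0=None):
    """Minimize the OWA criterion over the coefficients; Nelder-Mead is
    used here because trimmed/median-type objectives are non-smooth."""
    beta0 = np.zeros(X.shape[1]) if beta0 is None else beta0
    return minimize(lambda b: owa_objective(b, X, y, w), beta0,
                    method="Nelder-Mead").x

# Example weight choices for n observations (largest residual gets w[0]):
n = 50
w_ols = np.full(n, 1.0 / n)            # equal weights -> least squares
w_lts = np.zeros(n)                    # least trimmed squares:
w_lts[n // 2:] = 1.0 / (n - n // 2)    #   ignore the largest half of residuals
```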

    Constraints preserving genetic algorithm for learning fuzzy measures with an application to ontology matching

    Both the fuzzy measure and integral have been widely studied for multi-source information fusion. A number of researchers have proposed optimization techniques to learn a fuzzy measure from training data. In part, this task is difficult because the fuzzy measure can have a large number of free parameters (2^N − 2 for N sources) and many (monotonicity) constraints. In this paper, a new genetic algorithm approach to constraint-preserving optimization of the fuzzy measure is presented for the task of learning and fusing different ontology matching results. Preliminary results are presented to show the stability of the learning algorithm and its effectiveness compared to existing approaches.
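    A minimal sketch of one way a mutation operator can preserve monotonicity by construction (illustrative only; the paper's genetic operators are more involved):

```python
import itertools
import random

def lattice(n):
    """All proper, non-empty subsets of the n sources; g(empty set) = 0 and
    g(full set) = 1 are held fixed, leaving 2^n - 2 free variables."""
    return [frozenset(c) for k in range(1, n)
            for c in itertools.combinations(range(n), k)]

def mutate_monotone(g, n, node):
    """Constraint-preserving mutation: resample g[node] uniformly inside the
    interval bounded by its immediate subsets (below) and supersets (above),
    so monotonicity can never be violated."""
    lower = max(g.get(node - {i}, 0.0) for i in node)
    upper = min(g.get(node | {j}, 1.0) for j in range(n) if j not in node)
    g[node] = random.uniform(lower, upper)
    return g

# Hypothetical usage for 3 sources: start from an additive measure, then mutate one node.
g = {s: len(s) / 3 for s in lattice(3)}
mutate_monotone(g, 3, frozenset({0, 1}))
```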

    EXPLAINABLE FEATURE- AND DECISION-LEVEL FUSION

    Information fusion is the process of aggregating knowledge from multiple data sources to produce more consistent, accurate, and useful information than any one individual source can provide. In general, there are three primary sources of data/information: humans, algorithms, and sensors. Typically, objective data---e.g., measurements---arise from sensors. Using these data sources, applications such as computer vision and remote sensing have long been applying fusion at different levels (signal, feature, decision, etc.). Furthermore, the daily advancement of engineering technologies like smart cars, which operate in complex and dynamic environments using multiple sensors, is raising both the demand for and complexity of fusion. There is a great need to discover new theories to combine and analyze heterogeneous data arising from one or more sources. The work collected in this dissertation addresses the problem of feature- and decision-level fusion. Specifically, this work focuses on fuzzy Choquet integral (ChI)-based data fusion methods. Most mathematical approaches for data fusion have focused on combining inputs under the assumption of independence between them. However, there are often rich interactions (e.g., correlations) between inputs that should be exploited. The ChI is a powerful aggregation tool that is capable of modeling these interactions. Consider the fusion of m sources, where there are 2^m unique subsets (interactions); the ChI is capable of learning the worth of each of these possible source subsets. However, the complexity of fuzzy integral-based methods grows quickly, as the number of trainable parameters for the fusion of m sources scales as 2^m. Hence, a large amount of training data is required to avoid over-fitting. This work addresses the over-fitting problem of ChI-based data fusion with novel regularization strategies. These regularization strategies alleviate the issue of over-fitting while training with limited data and also enable the user to consciously push the learned methods toward a predefined, or perhaps known, structure. Also, the existing methods for training the ChI for decision- and feature-level data fusion involve quadratic programming (QP). The QP-based approach for learning ChI-based data fusion solutions has a high space complexity, which has limited the practical application of ChI-based data fusion methods to six or fewer input sources. To address the space complexity issue, this work introduces an online training algorithm for learning the ChI. The online method is an iterative gradient descent approach that processes one observation at a time, enabling the applicability of ChI-based data fusion to higher dimensional data sets. In many real-world data fusion applications, it is imperative to have an explanation or interpretation. This may include providing information on what was learned, what the worth of individual sources is, why a decision was reached, what evidence process(es) were used, and what confidence the system has in its decision. However, most existing machine learning solutions for data fusion are black boxes, e.g., deep learning. In this work, we designed methods and metrics that help answer these questions of interpretation, and we also developed visualization methods that help users better understand the machine learning solution and its behavior for different instances of data.
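    For reference, a minimal sketch of the ChI aggregation step itself (the fuzzy-measure values below are made up; the dissertation's regularized and online training procedures are not shown):

```python
import numpy as np

def choquet_integral(h, g):
    """Discrete Choquet integral of input vector h with respect to fuzzy
    measure g (a dict mapping frozensets of source indices to their worth,
    with g(full set) = 1). Sources are visited largest input first; each
    contributes its input times the measure increment of the growing coalition."""
    total, prev, coalition = 0.0, 0.0, set()
    for idx in np.argsort(h)[::-1]:
        coalition.add(idx)
        g_now = g[frozenset(coalition)]
        total += h[idx] * (g_now - prev)
        prev = g_now
    return total

# Hypothetical 3-source fusion:
g = {frozenset([0]): 0.4, frozenset([1]): 0.3, frozenset([2]): 0.5,
     frozenset([0, 1]): 0.6, frozenset([0, 2]): 0.8, frozenset([1, 2]): 0.7,
     frozenset([0, 1, 2]): 1.0}
print(choquet_integral(np.array([0.7, 0.2, 0.9]), g))   # -> 0.70
```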

    Framework for optimizing intelligence collection requirements

    In the military, typical mission execution goes through cycles of intelligence collection and action planning phases. For complex operations where many parameters affect the outcomes of the mission, several steps of intelligence collection may be taken before the optimal Course of Action is actually carried out. Human analytics suggests the steps of: (1) anticipating plausible futures, (2) determining information requirements, and (3) optimizing the choice of feasible and cost-effective intelligence collection requirements. This work formalizes this process by developing a decision support tool to determine the information requirements needed to differentiate critical plausible futures, and by formulating a mixed integer programming problem to trade off the feasibility and benefits of intelligence collection requirements. Course of Action planning has been widely studied in the military domain, but mostly in an abstract fashion. Intelligence collection, while intuitively aiming at reducing uncertainties, should ultimately produce optimal outcomes for mission success. Building on previous efforts, this work studies the effect of plausible futures estimated from current adversary activities. A set of differentiating event attributes is derived for each set of high-impact futures, forming a candidate collection requirement action. The candidate collection requirement actions are then used as inputs to a mixed integer programming formulation, which optimizes the plausible future mission state subject to timing and cost constraints. The plausible future mission state is estimated by assuming that the collection requirement actions can potentially avert the damage that adversary future activities might cause. A case study demonstrates several use cases for the overall framework.
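    A toy sketch of the kind of mixed integer program involved, written as a simple knapsack-style selection under cost and timing budgets (the paper's formulation over plausible futures is richer; all data below are made up):

```python
from pulp import LpProblem, LpMaximize, LpVariable, lpSum

# Hypothetical data: each candidate collection requirement action has an
# expected averted impact, a cost, and a collection time.
impact = [9.0, 7.5, 4.0, 6.0]
cost = [3.0, 2.5, 1.0, 2.0]
hours = [2.0, 4.0, 1.0, 3.0]
budget, horizon = 5.0, 6.0
n = len(impact)

prob = LpProblem("collection_requirements", LpMaximize)
x = [LpVariable(f"x{i}", cat="Binary") for i in range(n)]
prob += lpSum(impact[i] * x[i] for i in range(n))             # maximize averted damage
prob += lpSum(cost[i] * x[i] for i in range(n)) <= budget     # cost constraint
prob += lpSum(hours[i] * x[i] for i in range(n)) <= horizon   # timing constraint
prob.solve()
print([i for i in range(n) if x[i].value() == 1])             # selected actions
```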

    Finding a Collective Set of Items: From Proportional Multirepresentation to Group Recommendation

    We consider the following problem: there is a set of items (e.g., movies) and a group of agents (e.g., passengers on a plane); each agent has some intrinsic utility for each of the items. Our goal is to pick a set of K items that maximizes the total derived utility of all the agents (i.e., in our example we are to pick the K movies to put on the plane's entertainment system). However, the actual utility that an agent derives from a given item is only a fraction of its intrinsic one, and this fraction depends on how the agent ranks the item among the chosen, available ones. We provide a formal specification of the model and give concrete examples and settings where it is applicable. We show that the problem is hard in general, but we give a number of tractability results for its natural special cases.
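    A minimal sketch of the model under illustrative names, where a vector of positional fractions (alpha) scales each agent's j-th best chosen item; the brute-force search over K-item sets only reflects the general-case hardness, not the paper's tractable special cases:

```python
import itertools

def set_utility(S, util, alpha):
    """Total derived utility of item set S: each agent ranks the chosen items
    by intrinsic utility, and its j-th ranked item contributes a fraction
    alpha[j] of that intrinsic utility."""
    total = 0.0
    for agent in util:                     # util[a][i]: agent a's utility for item i
        ranked = sorted((agent[i] for i in S), reverse=True)
        total += sum(a * u for a, u in zip(alpha, ranked))
    return total

def best_set(util, K, alpha):
    """Brute force over all K-item sets (exponential in general)."""
    items = range(len(util[0]))
    return max(itertools.combinations(items, K),
               key=lambda S: set_utility(S, util, alpha))

# Hypothetical instance: 3 agents, 4 movies, second-ranked pick counts half.
util = [[5, 1, 3, 0], [0, 4, 4, 1], [2, 2, 5, 1]]
print(best_set(util, K=2, alpha=[1.0, 0.5]))   # -> (0, 2)
```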

    Efficient Data Driven Multi Source Fusion

    Data/information fusion is an integral component of many existing and emerging applications; e.g., remote sensing, smart cars, Internet of Things (IoT), and Big Data, to name a few. While fusion aims to achieve better results than what any one individual input can provide, the challenge is often to determine the underlying mathematics of aggregation suitable for an application. In this dissertation, I focus on the following three aspects of aggregation: (i) efficient data-driven learning and optimization, (ii) extensions and new aggregation methods, and (iii) feature- and decision-level fusion for machine learning with applications to signal and image processing. The Choquet integral (ChI), a powerful nonlinear aggregation operator, is a parametric way (with respect to the fuzzy measure (FM)) to generate a wealth of aggregation operators. The FM has 2^N variables and N(2^(N−1)) monotonicity constraints for N inputs. As a result, learning the ChI parameters from data quickly becomes impractical for most applications. Herein, I propose a scalable learning procedure (which is linear with respect to training sample size) for the ChI that identifies and optimizes only data-supported variables. As such, the computational complexity of the learning algorithm is proportional to the complexity of the solver used. This method also includes an imputation framework to obtain scalar values for data-unsupported (aka missing) variables and a compression algorithm (lossy or lossless) for the learned variables. I also propose a genetic algorithm (GA) to optimize the ChI for non-convex, multi-modal, and/or analytical objective functions. This algorithm introduces two operators that automatically preserve the constraints; therefore, there is no need to explicitly enforce the constraints as is required by traditional GAs. In addition, this algorithm provides an efficient representation of the search space with the minimal set of vertices. Furthermore, I study different strategies for extending the fuzzy integral to missing data, and I propose a goal programming framework to aggregate inputs from heterogeneous sources for ChI learning. Last, my work in remote sensing involves visual-clustering-based band group selection and Lp-norm multiple kernel learning based feature-level fusion in hyperspectral image processing to enhance pixel-level classification.
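    A minimal sketch of the "data-supported variables" idea that keeps the learning linear in the number of training samples: each sample's sort order only touches a chain of nested source subsets, so only the union of those chains ever needs to be optimized (function name and the final example are illustrative, not the dissertation's code):

```python
import numpy as np

def data_supported_variables(X):
    """For each sample, sorting its m inputs visits a chain of m nested source
    subsets; only those fuzzy-measure variables are touched. The union over the
    training set gives the variables a scalable learner actually optimizes."""
    supported = set()
    for h in X:
        coalition = set()
        for idx in np.argsort(h)[::-1]:
            coalition.add(idx)
            supported.add(frozenset(coalition))
    return supported

# e.g., 100 samples over 10 sources touch at most 1000 of the 1023 subsets
X = np.random.rand(100, 10)
print(len(data_supported_variables(X)), "of", 2 ** 10 - 1)
```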

    A risk-aversion approach for the Multiobjective Stochastic Programming problem

    Multiobjective stochastic programming is a field well suited to tackle problems arising in emergencies, given that uncertainty and multiple objectives are usually present in such problems. A new concept of solution is proposed in this work, especially designed for risk-averse solutions. A linear programming model is presented to obtain such a solution.