705 research outputs found

    Metaheuristic Algorithms in Artificial Intelligence with Applications to Bioinformatics, Biostatistics, Ecology and, the Manufacturing Industries

    Full text link
    Nature-inspired metaheuristic algorithms are important components of artificial intelligence, and are increasingly used across disciplines to tackle various types of challenging optimization problems. We apply a newly proposed nature-inspired metaheuristic algorithm called competitive swarm optimizer with mutated agents (CSO-MA) and demonstrate its flexibility and out-performance relative to its competitors in a variety of optimization problems in the statistical sciences. In particular, we show the algorithm is efficient and can incorporate various cost structures or multiple user-specified nonlinear constraints. Our applications include (i) finding maximum likelihood estimates of parameters in a single cell generalized trend model to study pseudotime in bioinformatics, (ii) estimating parameters in a commonly used Rasch model in education research, (iii) finding M-estimates for a Cox regression in a Markov renewal model and (iv) matrix completion to impute missing values in a two compartment model. In addition we discuss applications to (v) select variables optimally in an ecology problem and (vi) design a car refueling experiment for the auto industry using a logistic model with multiple interacting factors

    SUPPLY CHAIN NETWORK DESIGN: RISK-AVERSE VS. RISK-NEUTRAL DECISION MAKING

    Get PDF
    Recent events, such as the Heparin tragedy, highlight the necessity for designers and planners of supply chain networks to consider the risk of disruptions in spite of their low probability of occurrence. One effective way to hedge against supply chain network disruptions is to have a robustly designed supply chain network. This involves strategic decisions, such as choosing which markets to serve, which suppliers to source from, the location of plants, the types of facilities to use, and tactical decisions, such as production and capacity allocation. In this dissertation, we focus on models for designing supply chain networks that are resilient to disruptions. We consider two types of decision making policies. A risk-neutral decision making policy is based on the cost minimization approach, and the decision-maker defines the set of decisions that minimize expected cost. We also consider a risk-averse policy wherein rather than selecting facilities that minimize expected cost, the decision-maker uses a Conditional Value-at-Risk approach to measure and quantify risk. However, such network design problems belong to class of NP hard problems. Accordingly, we develop efficient heuristic algorithms and metaheuristic approaches to obtain acceptable solutions to these types of problems in reasonable runtimes so that the decision making process is facilitated with at most a moderate reduction in solution quality. Finally, we perform statistical analyses (e.g., logistic regression) to assess the likelihood of selection for each facility. These models allow us to identify the factors that impact facility selection in both the risk-neutral and risk-averse policies

    Optimizing Logistic Regression Coefficients for Discrimination and Calibration Using Estimation of Distribution Algorithms.

    Get PDF
    Logistic regression is a simple and efficient supervised learning algorithm for estimating the probability of an outcome or class variable. In spite of its simplicity, logistic regression has shown very good performance in a range of fields. It is widely accepted in a range of fields because its results are easy to interpret. Fitting the logistic regression model usually involves using the principle of maximum likelihood. The Newton–Raphson algorithm is the most common numerical approach for obtaining the coefficients maximizing the likelihood of the data. This work presents a novel approach for fitting the logistic regression model based on estimation of distribution algorithms (EDAs), a tool for evolutionary computation. EDAs are suitable not only for maximizing the likelihood, but also for maximizing the area under the receiver operating characteristic curve (AUC). Thus, we tackle the logistic regression problem from a double perspective: likelihood-based to calibrate the model and AUC-based to discriminate between the different classes. Under these two objectives of calibration and discrimination, the Pareto front can be obtained in our EDA framework. These fronts are compared with those yielded by a multiobjective EDA recently introduced in the literature

    Assisted specification of discrete choice models

    Get PDF
    Determining appropriate utility specifications for discrete choice models is time-consuming and prone to errors. With the availability of larger and larger datasets, as the number of possible specifications exponentially grows with the number of variables under consideration, the analysts need to spend increasing amounts of time on searching for good models through trial-and-error, while expert knowledge is required to ensure these models are sound. This paper proposes an algorithm that aims at assisting modelers in their search. Our approach translates the task into a multi-objective combinatorial optimization problem and makes use of a variant of the variable neighborhood search algorithm to generate sets of promising model specifications. We apply the algorithm both to semi-synthetic data and to real mode choice datasets as a proof of concept. The results demonstrate its ability to provide relevant insights in reasonable amounts of time so as to effectively assist the modeler in developing interpretable and powerful models

    Predicting failure in the commercial banking industry

    Get PDF
    The ability to predict bank failure has become much more important since the mortgage foreclosure crisis began in 2007. The model proposed in this study uses proxies for the regulatory standards embodied in the so-called CAMELS rating system, as well as several local or national economic variables to produce a model that is robust enough to forecast bank failure for the entire commercial bank industry in the United States. This model is able to predict failure (survival) accurately for commercial banks during both the Savings and Loan crisis and the mortgage foreclosure crisis. Other important results include the insignificance of several factors proposed in the literature, including total assets, real price of energy, currency ratio and the interest rate spread.bank failure; banking crises; CAMELS ratings
    corecore