196,505 research outputs found

    Categorical Data

    Get PDF
    A very brief survey of regression for categorical data. Categorical outcome (or discrete outcome or qualitative response) regression models are models for a discrete dependent variable recording in which of two or more categories an outcome of interest lies. For binary data (two categories) probit and logit models or semiparametric methods are used. For multinomial data (more than two categories) that are unordered, common models are multinomial and conditional logit, nested logit, multinomial probit, and random parameters logit. The last two models are estimated using simulation or Bayesian methods. For ordered data, standard multinomial models are ordered logit and probit, or count models are used if ordered discrete data are actually a count.binary data, multinomial, logit, probit, count data

    On the Equivalence of Location Choice Models: Conditional Logit, Nested Logit and Poisson

    Get PDF
    It is well understood that the two most popular empirical models of location choice - conditional logit and Poisson - return identical coefficient estimates when the regressors are not individual specific. We show that these two models differ starkly in terms of their implied predictions. The conditional logit model represents a zero-sum world, in which one region's gain is the other regions' loss. In contrast, the Poisson model implies a positive-sum economy, in which one region's gain is no other region's loss. We also show that all intermediate cases can be represented as a nested logit model with a single outside option. The nested logit turns out to be a linear combination of the conditional logit and Poisson models. Conditional logit and Poisson elasticities mark the polar cases and can therefore serve as boundary values in applied research.firm location, residential choice, conditional logit, nested logit, Poisson count model

    Multinomial Logit Models with Implicit Variable Selection

    Get PDF
    Multinomial logit models which are most commonly used for the modeling of unordered multi-category responses are typically restricted to the use of few predictors. In the high-dimensional case maximum likelihood estimates frequently do not exist. In this paper we are developing a boosting technique called multinomBoost that performs variable selection and fits the multinomial logit model also when predictors are high-dimensional. Since in multicategory models the effect of one predictor variable is represented by several parameters one has to distinguish between variable selection and parameter selection. A special feature of the approach is that, in contrast to existing approaches, it selects variables not parameters. The method can distinguish between mandatory predictors and optional predictors. Moreover, it adapts to metric, binary, nominal and ordinal predictors. Regularization within the algorithm allows to include nominal and ordinal variables which have many categories. In the case of ordinal predictors the order information is used. The performance of the boosting technique with respect to mean squared error, prediction error and the identification of relevant variables is investigated in a simulation study. For two real life data sets the results are also compared with the Lasso approach which selects parameters

    Specification(s) of Nested Logit Models

    Get PDF
    The nested logit model has become an important tool for the empirical analysis of discrete outcomes. There is some confusion about its specification of the outcome probabilities. Two major variants show up in the literature. This paper compares both and finds that one of them (called random utility maximization nested logit, RUMNL) is preferable in most situations. Since the command nlogit of Stata 7.0 implements the other variant (called non-normalized nested logit, NNNL), an implementation of RUMNL called nlogitrum is introduced. Numerous examples support and illustrate the differences of both specifications.

    Angler Heterogeneity and the Species-Specific Demand for Marine Recreational Fishing

    Get PDF
    In this study we assess the viability of single-species recreation demand models given commonly available data sets. Using the 2000 MRFSS southeast intercept data combined with the economic add-on, we determine that the MRFSS data will support only a few species-specific recreation demand models. Considering species of management interest in the southeast, we focus on dolphin, king mackerel, red snapper and red drum. We examine single-species recreational fishing behavior using random utility models of demand. We explore mixed logit (i.e., random parameter) logit and finite mixture (i.e., latent class logit) models for dealing with angler heterogeneity. We compare these to the commonly used conditional and nested logit models in terms of the value of catching (and keeping) one additional fish. Mixed logit models illustrate that the value of catch can be highly heterogeneous and, in some cases, can include both positive and negative values. The finite mixture model generates value estimates that were some times strikingly different than conditional, nested and mixed logit models. Preference heterogeneity is significant within the MRFSS data. We find evidence that single-species models outperform multiple species models and recreational values differ. Key Words: marine recreational fishing, single-species demand, preference heterogeneity models

    Variable Selection in General Multinomial Logit Models

    Get PDF
    The use of the multinomial logit model is typically restricted to applications with few predictors, because in high-dimensional settings maximum likelihood estimates tend to deteriorate. In this paper we are proposing a sparsity-inducing penalty that accounts for the special structure of multinomial models. In contrast to existing methods, it penalizes the parameters that are linked to one variable in a grouped way and thus yields variable selection instead of parameter selection. We develop a proximal gradient method that is able to efficiently compute stable estimates. In addition, the penalization is extended to the important case of predictors that vary across response categories. We apply our estimator to the modeling of party choice of voters in Germany including voter-specific variables like age and gender but also party-specific features like stance on nuclear energy and immigration
    corecore