Search CORE

542,502 research outputs found

Essays on structural changes in high dimensional econometric models

Author: Wang Fa
Publication venue: SURFACE at Syracuse University
Publication date: 01/05/2016
Field of study

This dissertation consists of three essays on estimating and testing structural changes in high dimensional econometrics models. These essays are based on three working papers joint with Prof. Badi Baltagi and Prof. Chihwa Kao. The first essay considers estimating the date of a single common change in the regression coefficients of a heterogeneous large N and large T panel data model with or without strong cross- sectional dependence. The second essay considers estimating a high dimensional factor model with an unknown number of latent factors and a single common change in the number of factors and/or factor loadings. The third essay considers estimating a high dimensional factor model with an unknown number of latent factors and multiple common changes in the number of factors and/or factor loadings, and also testing procedures to detect the presence and number of structural changes. The first essay studies the asymptotic properties of the least squares estimator of the common change point in large heterogeneous panel data models under various sets of conditions on the change magnitude and N-T ratio, allowing N and T to go to infinity jointly. Consistency and limiting distribution are established under general conditions. A general Hajek-Renyi inequality is introduced to calculate the order of the expectation of sup-type terms. Both weak and strong cross-sectional dependence are considered. In the former case the least squares estimator is consistent as the number of subjects tends to infinity while in the latter case a two step estimator is proposed and consistency can be recovered once estimated factors are used to control the cross-sectional dependence. The limiting distribution is derived allowing the error process to be serially dependent and heteroskedastic of unknown form, and inference can be made based on the simulated distribution. The second essay tackles the identification and estimation of a high dimensional factor model with unknown number of latent factors and a single common break in the number of factors and/or factor loadings. Since the factors are unobservable, the change point estimator is based on the second moments of the estimated pseudo factors. This essay shows that the estimation error of the proposed estimator is bounded in probability as N and T go to infinity jointly. This essay also shows that the proposed estimator has a high degree of robustness to misspecification of the number of pseudo factors. With the estimated change point plugged in, consistency of the estimated number of pre and post- break factors and convergence rate of the estimated pre and post-break factor space are then established under fairly general assumptions. Finite sample performance of the proposed estimators is investigated using Monte Carlo experiments. The third essay considers high dimensional factor models with multiple common structural changes. Based on the second moments of the estimated pseudo factors, both joint and sequential estimation of the change points are considered. The estimation error of both estimators is bounded in probability as the cross-sectional dimension N and the time dimension T go to infinity jointly. The measurement error contained in the estimated pseudo factors has no effect on the asymptotic properties of the estimated change points as N and T go to infinity jointly, and no N-T ratio condition is needed. The estimated change points are plugged in to estimate the number of factors and the factor space in each regime. Although the estimated change points are inconsistent, using them asymptotically has no effect on subsequent estimation. This essay also proposes (i) tests for the null of no change versus the alternative of l changes and (ii) tests for the null of l changes versus the alternative of l + 1 changes. These tests allow us to make inference on the presence and number of structural changes. Simulation results show good performance of the proposed estimation and testing procedures

Syracuse University Research Facility and Collaborative Environment

Recommended from our members

Optimizing Data-Intensive Computing with Efficient Configuration Tuning

Author: Fekry Ayat
Publication venue: University of Cambridge
Publication date: 30/07/2021
Field of study

As the complexity of distributed analytics systems evolves over time, more configuration parameters get exposed for tuning. While these numerous parameters allow users more control over how their workloads are executed, this flexibility comes at a cost, since finding the right configurations for such systems in a cost-effective way becomes challenging. In practice, several factors contribute to the complexity of tuning the configuration of those systems: the large configuration space, the diversity of the served workloads (each workload possibly requiring a different resource allocation strategy to run optimally), and the dynamic characteristics of these systems’ environment (e.g., increase in input data size, changes in the allocation of resources). Paradoxically, existing solutions for workload tuning either assume static tuning environment or workloads that are inexpensive to run (i.e. requiring hundreds of execution samples). Recently, Bayesian Optimisation (BO) strategies have been applied as a solution to enable efficient autotuning. They build a probabilistic model incrementally to predict the impact of the parameters on performance using a small number of execution samples. The incrementally constructed BO model is used to guide the tuning process and accelerate convergence to a near-optimal configuration. Unfortunately, for distributed analytics systems, the configuration space is too large to construct a good model using traditional BO, which fails to provide quick convergence in high dimensional configuration space. I argue that cost-effective tuning strategies can only be developed when taking into account: the frequent changes that can happen in the analytics workload/environment, the amortization of tuning costs and how this influences tuning profitability, the high dimensionality of configuration space and the need to cater for diverse workloads. To tackle these challenges, I propose Tuneful, an efficient configuration tuning framework for such expensive to tune systems. It works efficiently both initially (when little data is available) as well as later (as more tuning knowledge is acquired). It starts with learning workload-specific influential parameters incrementally and tunes those only, then when more tuning knowledge becomes available, it detects similarity across workloads and utilizes multitask BO to share the tuning knowledge across similar workloads. I show how augmenting the BO approach with parameters’ significance and workload similarity characteristics enables an efficient configuration tuning in high dimensional configuration space. Over diverse analytics workloads, this significantly accelerates both configuration tuning and cost amortization, saving search time by 2.7-3.7X at median compared to the-state-of-the-art approaches

Apollo (Cambridge)

Recommended from our members

Finding High-Dimensional D-OptimalDesigns for Logistic Models via Differential Evolution

Author: Tan Kay Chen
Wong Weng Kee
Xu Jianxin
Xu Weinan
Publication venue: eScholarship, University of California
Publication date: 31/01/2020
Field of study

D-optimal designs are frequently used in controlled experiments to obtain the most accurateestimate of model parameters at minimal cost. Finding them can be a challenging task, especially whenthere are many factors in a nonlinear model. As the number of factors becomes large and interact withone another, there are many more variables to optimize and the D-optimal design problem becomes highdimensionaland non-separable. Consequently, premature convergence issues arise. Candidate solutions gettrapped in local optima and the classical gradient-based optimization approaches to search for the D-optimaldesigns rarely succeed. We propose a specially designed version of differential evolution (DE) which is arepresentative gradient-free optimization approach to solve such high-dimensional optimization problems.The proposed specially designed DE uses a new novelty-based mutation strategy to explore the variousregions in the search space. The exploration of the regions will be carried out differently from the previouslyexplored regions and the diversity of the population can be preserved. The proposed novelty-based mutationstrategy is collaborated with two common DE mutation strategies to balance exploration and exploitationat the early or medium stage of the evolution. Additionally, we adapt the control parameters of DE as theevolution proceeds. Using logistic models with several factors on various design spaces as examples, oursimulation results show our algorithm can find D-optimal designs efficiently and the algorithm outperformsits competitors. As an application, we apply our algorithm and re-design a 10-factor car refueling experimentwith discrete and continuous factors and selected pairwise interactions. Our proposed algorithm was able toconsistently outperform the other algorithms and find a more efficient D-optimal design for the problem

eScholarship - University of California

Sum-of-Squares approach to feedback control of laminar wake flows

Author: Chernyshenko
Cordier
Davide Lasagna
Deqing Huang
Goulart
Holmes
Jasak
Lehmann
Lu
Löfberg
Monkewitz
Noack
Owen R. Tutty
Peifer
Sergei Chernyshenko
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 12/10/2016
Field of study

A novel nonlinear feedback control design methodology for incompressible fluid flows aiming at the optimisation of long-time averages of flow quantities is presented. It applies to reduced-order finite-dimensional models of fluid flows, expressed as a set of first-order nonlinear ordinary differential equations with the right-hand side being a polynomial function in the state variables and in the controls. The key idea, first discussed in Chernyshenko et al. 2014, Philos. T. Roy. Soc. 372(2020), is that the difficulties of treating and optimising long-time averages of a cost are relaxed by using the upper/lower bounds of such averages as the objective function. In this setting, control design reduces to finding a feedback controller that optimises the bound, subject to a polynomial inequality constraint involving the cost function, the nonlinear system, the controller itself and a tunable polynomial function. A numerically tractable approach to the solution of such optimisation problems, based on Sum-of-Squares techniques and semidefinite programming, is proposed. To showcase the methodology, the mitigation of the fluctuation kinetic energy in the unsteady wake behind a circular cylinder in the laminar regime at Re=100, via controlled angular motions of the surface, is numerically investigated. A compact reduced-order model that resolves the long-term behaviour of the fluid flow and the effects of actuation, is derived using Proper Orthogonal Decomposition and Galerkin projection. In a full-information setting, feedback controllers are then designed to reduce the long-time average of the kinetic energy associated with the limit cycle. These controllers are then implemented in direct numerical simulations of the actuated flow. Control performance, energy efficiency, and physical control mechanisms identified are analysed. Key elements, implications and future work are discussed

arXiv.org e-Print Archive

Southampton (e-Prints Soton)

Crossref

Spiral - Imperial College Digital Repository

Action-Conditional Video Prediction using Deep Networks in Atari Games

Author: Guo Xiaoxiao
Lee Honglak
Lewis Richard
Oh Junhyuk
Singh Satinder
Publication venue
Publication date: 21/12/2015
Field of study

Motivated by vision-based reinforcement learning (RL) problems, in particular Atari games from the recent benchmark Aracade Learning Environment (ALE), we consider spatio-temporal prediction problems where future (image-)frames are dependent on control variables or actions as well as previous frames. While not composed of natural scenes, frames in Atari games are high-dimensional in size, can involve tens of objects with one or more objects being controlled by the actions directly and many other objects being influenced indirectly, can involve entry and departure of objects, and can involve deep partial observability. We propose and evaluate two deep neural network architectures that consist of encoding, action-conditional transformation, and decoding layers based on convolutional neural networks and recurrent neural networks. Experimental results show that the proposed architectures are able to generate visually-realistic frames that are also useful for control over approximately 100-step action-conditional futures in some games. To the best of our knowledge, this paper is the first to make and evaluate long-term predictions on high-dimensional video conditioned by control inputs.Comment: Published at NIPS 2015 (Advances in Neural Information Processing Systems 28

arXiv.org e-Print Archive

CiteSeerX