Robust Orthogonal Complement Principal Component Analysis
Recently, the robustification of principal component analysis has attracted
considerable attention from statisticians, engineers and computer scientists. In
this work we study the type of outliers that are not necessarily apparent in
the original observation space but can seriously affect the principal subspace
estimation. Based on a mathematical formulation of such transformed outliers, a
novel robust orthogonal complement principal component analysis (ROC-PCA) is
proposed. The framework combines the popular sparsity-enforcing and low-rank
regularization techniques to deal with row-wise outliers as well as
element-wise outliers. A non-asymptotic oracle inequality guarantees the
accuracy and high breakdown performance of ROC-PCA in finite samples. To tackle
the computational challenges, an efficient algorithm is developed on the basis
of Stiefel manifold optimization and iterative thresholding. Furthermore, a
batch variant is proposed to significantly reduce the cost in ultra-high
dimensions. The paper also points out a pitfall in the common practice of SVD
reduction in robust PCA. Experiments show the effectiveness and efficiency of
ROC-PCA on both synthetic and real data.
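The abstract above distinguishes row-wise outliers from element-wise outliers and mentions iterative thresholding. As a rough illustration only, the sketch below contrasts the two kinds of shrinkage on a small residual matrix. The choice of soft-thresholding, the function names, and the threshold `lam` are assumptions made for this example; the abstract does not state which thresholding rule ROC-PCA uses, and the Stiefel manifold step is not shown.

```python
import math

def soft_threshold(x, lam):
    """Shrink a single value toward zero by lam (assumed rule, not from the paper)."""
    return math.copysign(max(abs(x) - lam, 0.0), x)

def elementwise_threshold(matrix, lam):
    """Treat every entry separately: suits isolated, element-wise outliers."""
    return [[soft_threshold(v, lam) for v in row] for row in matrix]

def rowwise_threshold(matrix, lam):
    """Shrink each row by its Euclidean norm: a row is kept or zeroed as a whole."""
    out = []
    for row in matrix:
        norm = math.sqrt(sum(v * v for v in row))
        # Rows whose norm is at most lam are set entirely to zero.
        scale = max(1.0 - lam / norm, 0.0) if norm > 0 else 0.0
        out.append([scale * v for v in row])
    return out

# Hypothetical residuals: the first row is small noise, the second is an outlying row.
residuals = [[0.1, -0.2], [3.0, 4.0]]
row_sparse = rowwise_threshold(residuals, 1.0)
entry_sparse = elementwise_threshold(residuals, 0.5)
```

With the row-wise rule the small first row is zeroed out entirely while the large second row survives in shrunken form, which is the behavior one would want when whole observations are contaminated.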
Doctor of Philosophy
Wildfire is a common hazard in the western U.S. that can cause significant loss of life and property. When a fire approaches a community and becomes a threat to the residents, emergency managers need to take into account both fire behavior and the expected response of the threatened population to warnings before they issue protective action recommendations to the residents at risk. In wildfire evacuation practices, incident commanders use prominent geographic features (e.g., rivers, roads, and ridgelines) as trigger points, such that when a fire crosses a feature, the selected protective action recommendation will be issued to the residents at risk. This dissertation examines the dynamics of evacuation timing by coupling wildfire spread modeling, trigger modeling, reverse geocoding, and traffic simulation to model wildfire evacuation as a coupled human-environmental system. This dissertation is composed of three manuscripts. In the first manuscript, wildfire simulation and household-level trigger modeling are coupled to stage evacuation warnings. This work presents a bottom-up approach to constructing evacuation warning zones and is characterized by fine-grain, data-driven spatial modeling. The results in this work will help improve our understanding and representation of the spatiotemporal dynamics in wildfire evacuation timing and warnings. The second manuscript integrates trigger modeling and reverse geocoding to extract and select prominent geographic features along the boundary of a trigger buffer. A case study using a global gazetteer GeoNames demonstrates the potential value of the proposed method in facilitating communications in real-world evacuation practice. This work also sheds light on using reverse geocoding in other environmental modeling applications. The third manuscript explores the spatiotemporal dynamics behind evacuation timing by coupling fire and traffic simulation models. The proposed method sets wildfire evacuation triggers based on the estimated evacuation times using agent-based traffic simulation and could potentially be used in evacuation planning. In summary, this dissertation enriches existing trigger modeling approaches by coupling fire simulation, reverse geocoding, and traffic simulation. A framework for modeling wildfire evacuation as a coupled human-environmental system using triggers is proposed. Moreover, this dissertation also attempts to advocate and promote open science in wildfire evacuation modeling by using open data and software tools in different phases of modeling and simulation.
Group Iterative Spectrum Thresholding for Super-Resolution Sparse Spectral Selection
Recently, sparsity-based algorithms have been proposed for super-resolution
spectrum estimation. However, to achieve adequately high resolution in
real-world signal analysis, the dictionary atoms have to be close to each other
in frequency, thereby resulting in a coherent design. The popular convex
compressed sensing methods break down in the presence of high coherence and large
noise. We propose a new regularization approach to handle model collinearity
and obtain parsimonious frequency selection simultaneously. It takes advantage
of the pairing structure of sine and cosine atoms in the frequency dictionary.
A probabilistic spectrum screening is also developed for fast computation in
high dimensions. A data-resampling version of the high-dimensional Bayesian
Information Criterion is used to determine the regularization parameters.
Experiments show the efficacy and efficiency of the proposed algorithms in
challenging situations with small sample size, high frequency resolution, and
low signal-to-noise ratio.
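The abstract above notes that a fine frequency grid makes the dictionary coherent. The sketch below illustrates that point under stated assumptions: it builds a dictionary with one cosine and one sine atom per candidate frequency and computes the mutual coherence, meaning the largest absolute normalized inner product between two distinct atoms. The sample count, the two grids, and all function names are chosen for this example and do not come from the paper.

```python
import math

def paired_dictionary(n_samples, freqs):
    """One (cosine, sine) atom pair per candidate frequency, sampled at t = 0..n-1."""
    atoms = []
    for f in freqs:
        atoms.append([math.cos(2 * math.pi * f * t) for t in range(n_samples)])
        atoms.append([math.sin(2 * math.pi * f * t) for t in range(n_samples)])
    return atoms

def coherence(atoms):
    """Largest absolute normalized inner product between two distinct atoms."""
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))
    # Assumes no atom is identically zero (true for the grids used below).
    norms = [math.sqrt(dot(a, a)) for a in atoms]
    best = 0.0
    for i in range(len(atoms)):
        for j in range(i + 1, len(atoms)):
            best = max(best, abs(dot(atoms[i], atoms[j])) / (norms[i] * norms[j]))
    return best

n = 32
# Coarse grid: frequencies k/n, for which the sampled atoms are mutually orthogonal.
coarse = coherence(paired_dictionary(n, [k / n for k in range(1, 16)]))
# Fine grid: spacing ten times smaller than 1/n, as a super-resolution setup would need.
fine = coherence(paired_dictionary(n, [0.1 + k / 320 for k in range(5)]))
```

On the coarse grid the coherence is zero up to rounding error, while on the fine grid neighboring atoms are strongly correlated. This is the collinearity that the abstract says causes convex compressed sensing methods to break down.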
OSU Multimodal Machine Translation System Report
This paper describes Oregon State University's submissions to the shared
WMT'17 task "multimodal translation task I". In this task, all the sentence
pairs are image captions in different languages. The key difference between
this task and conventional machine translation is that we have corresponding
images as additional information for each sentence pair. In this paper, we
introduce a simple but effective system which takes an image shared between
different languages and feeds it into both the encoding and decoding sides. We
report our system's performance for English-French and English-German on the
Flickr30K (in-domain) and MSCOCO (out-of-domain) datasets. Our system achieves
the best performance in TER for English-German on the MSCOCO dataset.
Comment: 5, WMT 201