Two Geometric Results regarding Hölder-Brascamp-Lieb Inequalities, and Two Novel Algorithms for Low-Rank Approximation
Broadly speaking, this thesis investigates mathematical questions motivated by computer science. The topics involved include communication-avoiding algorithms, classical analysis, convex geometry, and low-rank matrix approximation. In total, the thesis consists of four self-contained sections, each adapted from papers the author has been a part of.

The first two sections are both motivated by the Brascamp-Lieb inequalities, which are also often referred to as Hölder-Brascamp-Lieb inequalities. These inequalities have featured prominently in recent theoretical computer science work, due to connections to geometric complexity theory, harmonic analysis, communication avoidance, and many other areas. Moreover, work generalizing the inequalities in various ways, such as to nonlinear versions, has been impactful in the study of differential equations.

Section 1 studies the application of Hölder-Brascamp-Lieb (HBL) inequalities to the design of communication-optimal algorithms. In particular, it describes optimal tiling (blocking) strategies for nested loops that lack data dependencies and exhibit affine memory access patterns. The problem roughly amounts to maximizing the volume of an object given that some of its linear images have bounded volume. The methods used are algorithmic.

Another reason for interest in these inequalities is that they are an interesting test case for non-convex optimization techniques. The optimal constant for a particular instance of the inequality is given by solving a non-convex optimization problem that is nevertheless highly structured. Of particular relevance to this thesis is that it can be formulated as a geodesically convex problem over the manifold of positive definite matrices of determinant 1. Even using the methods of Section 1, the procedure is not necessarily polynomial time, and this motivates further study of geodesic convexity.

This led to the work of Section 2, which discusses a notion of halfspace for Hadamard manifolds that is natural in the context of convex optimization. For this notion of halfspace, we generalize a classic result of Grünbaum, which itself is a corollary of Helly's theorem. Namely, given a probability distribution on the manifold, there is a point at which every halfspace based at that point contains at least a 1/(n+1) fraction of the mass, n being the dimension of the manifold. As an application, the gradient oracle complexity of geodesically convex optimization is polynomial in the parameters defining the problem; in particular, it is polynomial in -log(epsilon), where epsilon is the desired error. This is a step toward resolving the open question of whether such an algorithm exists.

The remaining two sections of the thesis present a different research direction: randomized numerical linear algebra. Numerical linear algebra has long been an important part of scientific computing. Due to the current trend of increasing matrix sizes and the growing importance of fast, approximate solutions in industry, randomized methods are quickly increasing in popularity. Sections 3 and 4 of this thesis aim to show that randomized low-rank approximation algorithms satisfy many of the properties of classical rank-revealing factorizations.

Section 3 introduces a Generalized Randomized QR decomposition (RURV) that may be applied to arbitrary products of matrices and their inverses, without needing to explicitly compute the products or inverses.
This factorization is a critical part of a communication-optimal spectral divide-and-conquer algorithm for the nonsymmetric eigenvalue problem. We establish that this randomized QR factorization satisfies the strong rank-revealing properties, and we formally prove its stability, making it suitable for use in applications. Finally, we present numerical experiments demonstrating that our theoretical bounds capture the empirical behavior of the factorization.

Section 4 concerns a Generalized LU Factorization (GLU) for low-rank matrix approximation. We relate it to past approaches and extensively analyze its approximation properties. The resulting deterministic guarantees are combined with sketching ensembles satisfying Johnson-Lindenstrauss properties to give complete bounds. Particularly good performance is shown for the subsampled randomized Hadamard transform (SRHT) ensemble. Moreover, the factorization is shown to unify and generalize many past algorithms, and it helps to explain the effect of sketching on the growth factor during Gaussian elimination.
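For orientation, the inequality driving Sections 1 and 2 is usually stated as follows (this is the standard formulation from the literature, not necessarily the thesis's own notation):

\[
\int_{\mathbb{R}^n} \prod_{j=1}^{m} f_j(B_j x)^{p_j} \, dx \;\le\; C \prod_{j=1}^{m} \left( \int_{\mathbb{R}^{n_j}} f_j \right)^{p_j},
\]

where each \(B_j : \mathbb{R}^n \to \mathbb{R}^{n_j}\) is a linear map, the exponents satisfy \(p_j \ge 0\), and the \(f_j\) are nonnegative; computing the smallest valid constant \(C\) is the structured non-convex (geodesically convex) optimization problem referred to above.

The abstract likewise does not spell out the RURV construction. The following minimal NumPy sketch shows the basic randomized URV factorization from the earlier literature (a Haar-distributed orthogonal V obtained from the QR of a Gaussian matrix), not the thesis's generalized version for products of matrices and inverses; the function and variable names are mine:

```python
import numpy as np

def rurv(A, rng=None):
    """Basic randomized URV: A = U @ R @ V with U, V orthogonal and R
    upper triangular. V is Haar-distributed (QR of a Gaussian matrix)."""
    rng = np.random.default_rng() if rng is None else rng
    n = A.shape[1]
    G = rng.standard_normal((n, n))
    V, _ = np.linalg.qr(G)          # Haar-distributed orthogonal V
    U, R = np.linalg.qr(A @ V.T)    # A V^T = U R, hence A = U R V
    return U, R, V

# sanity check: the factors reassemble A
A = np.random.default_rng(0).standard_normal((6, 6))
U, R, V = rurv(A)
assert np.allclose(U @ R @ V, A)
```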
Robust Learning and Classification of Low-Dimensional Spaces: Sparse and Low-Rank Representations
Thesis (Ph.D.), Seoul National University Graduate School: Department of Electrical and Computer Engineering, February 2017. Advisor: Songhwai Oh.

Learning a subspace structure based on sparse or low-rank representation has gained much attention and has been widely used over the past decade in the machine learning, signal processing, computer vision, and robotics literature to model a wide range of natural phenomena. Sparse representation is a powerful tool for high-dimensional data such as images, where the goal is to represent or compress cumbersome data using a few representative samples. Low-rank representation is a generalization of sparse representation to two-dimensional (matrix) data. Behind these successful outcomes, many efforts have been made to learn sparse or low-rank representations efficiently. However, existing approaches are still inefficient for complex data structures and lack robustness in the presence of various noises, including outliers and missing data, because many existing algorithms relax the ideal optimization problem to a tractable one without considering computational and memory complexities. Thus, it is important to use a good representation algorithm which is efficiently solvable and robust against unwanted corruptions. In this dissertation, our main goal is to develop algorithms for learning representations that are both robust and efficient under noisy environments.
As for sparse representation, most optimization problems are relaxed to convex ones based on surrogate measures, such as the l1-norm, to resolve the computational intractability and high noise sensitivity of the original sparse representation problem based on the l0-norm. However, if the system of interest, apart from the sparsity measure, is inherently nonconvex, then using a convex sparsity measure may not be the best choice. From this perspective, we propose desirable criteria for a good nonconvex sparsity measure and suggest a corresponding family of measures. The proposed family admits a simple measure that enables efficient computation and embraces the benefits of both the l0- and l1-norms; most importantly, its gradient vanishes slowly, unlike that of the l0-norm, which is suitable from an optimization perspective.
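The SVG family itself is defined in Chapter 3 rather than in the abstract. As an illustration of the kind of measure these criteria point to (an example of my choosing, not the thesis's definition), a classical nonconvex surrogate with exactly the stated properties is the exponential penalty

\[
\phi_\gamma(x) \;=\; \sum_i \left(1 - e^{-\gamma |x_i|}\right), \qquad \gamma > 0,
\]

which tends to \(\|x\|_0\) pointwise as \(\gamma \to \infty\) and satisfies \(\phi_\gamma(x)/\gamma \to \|x\|_1\) as \(\gamma \to 0\), while its derivative with respect to \(|x_i|\), namely \(\gamma e^{-\gamma |x_i|}\), is strictly positive and decays smoothly instead of vanishing outright.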
For low-rank representation, we first present an efficient l1-norm-based low-rank matrix approximation algorithm using the proposed alternating rectified gradient methods to solve an l1-norm minimization problem, since conventional algorithms are very slow at solving the l1-norm-based alternating minimization problem. The proposed methods seek an optimal direction under a constraint that limits the search domain, avoiding the difficulty that arises from the ambiguity in representing the two optimization variables. We extend this to an algorithm with an explicit smoothness regularizer and an orthogonality constraint for better efficiency, and solve it within the augmented Lagrangian framework. To obtain a more stable solution with flexible rank estimation in the presence of heavy corruptions, we present a new solution based on the elastic-net regularization of singular values, which admits a faster algorithm than existing rank minimization methods, avoids heavy operations, and is more stable than state-of-the-art low-rank approximation algorithms due to its strong convexity. As a result, the proposed method leads to a holistic approach that enables both rank minimization and bilinear factorization. Moreover, extending the previous methods from unstructured matrices, we apply recent advances in rank minimization to structured matrices for robust kernel subspace estimation under noisy scenarios.
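The abstract leaves the elastic-net regularization of singular values implicit. One standard way to realize it (a minimal sketch under my assumed objective, not necessarily the thesis's FactEN formulation) is the proximal operator of \(\lambda_1 \|\cdot\|_* + \tfrac{\lambda_2}{2}\|\cdot\|_F^2\), which soft-thresholds the singular values and then scales them; the strong convexity contributed by \(\lambda_2 > 0\) is the source of the stability mentioned above:

```python
import numpy as np

def elastic_net_sv_prox(Y, lam1, lam2):
    """Closed-form minimizer of
       0.5*||D - Y||_F^2 + lam1*||D||_* + 0.5*lam2*||D||_F^2:
    soft-threshold singular values by lam1, then scale by 1/(1 + lam2)."""
    U, s, Vt = np.linalg.svd(Y, full_matrices=False)
    s_new = np.maximum(s - lam1, 0.0) / (1.0 + lam2)
    return (U * s_new) @ Vt

# example: recovering a rank-5 matrix from noisy observations
rng = np.random.default_rng(0)
L = rng.standard_normal((50, 5)) @ rng.standard_normal((5, 40))
D = elastic_net_sv_prox(L + 0.1 * rng.standard_normal((50, 40)),
                        lam1=1.0, lam2=0.1)
print(np.linalg.matrix_rank(D))   # typically close to 5
```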
Last but not least, we extend the low-rank approximation problem, which assumes a single subspace, to one whose data lie in a union of multiple subspaces, a setting closely related to subspace clustering. While many recent studies are based on sparse or low-rank representation, the grouping effect among similar samples has rarely been considered together with the sparse or low-rank representation. Thus, we propose robust group subspace clustering algorithms based on sparse and low-rank representation with explicit subspace grouping. To resolve the fundamental computational-complexity issue of existing subspace clustering algorithms, we suggest a fully scalable low-rank subspace clustering approach, which achieves linear complexity in the number of samples. Extensive experimental results on various applications, including computer vision and robotics, using benchmark and real-world data sets verify that our suggested solutions to the existing issues of sparse and low-rank representations are considerably robust, effective, and practically applicable.
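For context on the pipeline these chapters build on, here is a minimal sketch of classical sparse subspace clustering (self-expressive coding followed by spectral clustering, as surveyed in the dissertation's related-work chapter); it is the baseline recipe, not the proposed group or scalable algorithms, and all names are mine:

```python
import numpy as np
from sklearn.linear_model import Lasso
from sklearn.cluster import SpectralClustering

def sparse_subspace_clustering(X, n_clusters, alpha=0.01):
    """Cluster the n columns of X (d x n) drawn from a union of subspaces."""
    d, n = X.shape
    C = np.zeros((n, n))
    for i in range(n):
        idx = [j for j in range(n) if j != i]       # exclude x_i itself
        lasso = Lasso(alpha=alpha, fit_intercept=False, max_iter=5000)
        lasso.fit(X[:, idx], X[:, i])               # x_i ~ X_{-i} c, c sparse
        C[idx, i] = lasso.coef_
    W = np.abs(C) + np.abs(C).T                     # symmetric affinity
    return SpectralClustering(n_clusters=n_clusters,
                              affinity="precomputed").fit_predict(W)
```

Building the n-by-n affinity C sample by sample is precisely the scalability bottleneck that the linear-complexity approach of Chapter 7 is designed to avoid.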
1 Introduction
1.1 Main Challenges
1.2 Organization of the Dissertation
2 Related Work
2.1 Sparse Representation
2.2 Low-Rank Representation
2.2.1 Low-rank matrix approximation
2.2.2 Robust principal component analysis
2.3 Subspace Clustering
2.3.1 Sparse subspace clustering
2.3.2 Low-rank subspace clustering
2.3.3 Scalable subspace clustering
2.4 Gaussian Process Regression
3 Efficient Nonconvex Sparse Representation
3.1 Analysis of the l0-norm approximation
3.1.1 Notations
3.1.2 Desirable criteria for a nonconvex measure
3.1.3 A representative family of measures: SVG
3.2 The Proposed Nonconvex Sparsity Measure
3.2.1 Choosing a simple one among the SVG family
3.2.2 Relationships with other sparsity measures
3.2.3 More analysis on SVG
3.2.4 Learning sparse representations via SVG
3.3 Experimental Results
3.3.1 Evaluation for nonconvex sparsity measures
3.3.2 Low-rank approximation of matrices
3.3.3 Sparse coding
3.3.4 Subspace clustering
3.3.5 Parameter Analysis
3.4 Summary
4 Robust Fixed Low-Rank Representations
4.1 The Alternating Rectified Gradient Method for l1 Minimization
4.1.1 l1-ARGA as an approximation method
4.1.2 l1-ARGD as a dual method
4.1.3 Experimental results
4.2 Smooth Regularized Fixed-Rank Representation
4.2.1 Robust orthogonal matrix factorization (ROMF)
4.2.2 Rank estimation for ROMF (ROMF-RE)
4.2.3 Experimental results
4.3 Structured Low-Rank Representation
4.3.1 Kernel subspace learning
4.3.2 Structured kernel subspace learning in GPR
4.3.3 Experimental results
4.4 Summary
5 Robust Lower-Rank Subspace Representations
5.1 Elastic-Net Subspace Representation
5.2 Robust Elastic-Net Subspace Learning
5.2.1 Problem formulation
5.2.2 Algorithm: FactEN
5.3 Joint Subspace Estimation and Clustering
5.3.1 Problem formulation
5.3.2 Algorithm: ClustEN
5.4 Experiments
5.4.1 Subspace learning problems
5.4.2 Subspace clustering problems
5.5 Summary
6 Robust Group Subspace Representations
6.1 Group Subspace Representation
6.2 Group Sparse Representation (GSR)
6.2.1 GSR with noisy data
6.2.2 GSR with corrupted data
6.3 Group Low-Rank Representation (GLR)
6.3.1 GLR with noisy or corrupted data
6.4 Experimental Results
6.5 Summary
7 Scalable Low-Rank Subspace Clustering
7.1 Incremental Affinity Representation
7.2 End-to-End Scalable Subspace Clustering
7.2.1 Robust incremental summary representation
7.2.2 Efficient affinity construction
7.2.3 An end-to-end scalable learning pipeline
7.2.4 Nonlinear extension for SLR
7.3 Experimental Results
7.3.1 Synthetic data
7.3.2 Motion segmentation
7.3.3 Face clustering
7.3.4 Handwritten digits clustering
7.3.5 Action clustering
7.4 Summary
8 Conclusion and Future Work
Appendices
A Derivations of the LRA Problems
B Proof of Lemma 1
C Proof of Proposition 1
D Proof of Theorem 1
E Proof of Theorem 2
F Proof of Theorems in Chapter 6
F.1 Proof of Theorem 3
F.2 Proof of Theorem 4
F.3 Proof of Theorem 5
G Proof of Theorems in Chapter 7
G.1 Proof of Theorem 6
G.2 Proof of Theorem 7
Bibliography
Abstract (in Korean)
A literature survey of low-rank tensor approximation techniques
In recent years, low-rank tensor approximation has been established as a new tool in scientific computing for addressing large-scale linear and multilinear algebra problems that would be intractable by classical techniques. This survey attempts to give a literature overview of current developments in this area, with an emphasis on function-related tensors …
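For orientation, the two most common notions of low tensor rank in this literature (standard definitions, not notation from this particular survey) are the CP and Tucker formats; for a third-order tensor,

\[
\mathcal{T} \;\approx\; \sum_{r=1}^{R} a_r \otimes b_r \otimes c_r \quad \text{(CP, rank } R\text{)},
\qquad
\mathcal{T} \;\approx\; \mathcal{G} \times_1 U_1 \times_2 U_2 \times_3 U_3 \quad \text{(Tucker)},
\]

reducing storage for an \(n \times n \times n\) tensor from \(O(n^3)\) to roughly \(O(nR)\) or \(O(nr + r^3)\) respectively, which is what makes otherwise intractable large-scale problems feasible.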