426 research outputs found
Object Discovery via Cohesion Measurement
Color and intensity are two important components in an image. Usually, groups
of image pixels, which are similar in color or intensity, are an informative
representation for an object. They are therefore particularly suitable for
computer vision tasks, such as saliency detection and object proposal
generation. However, image pixels, which share a similar real-world color, may
be quite different since colors are often distorted by intensity. In this
paper, we reinvestigate the affinity matrices originally used in image
segmentation methods based on spectral clustering. A new affinity matrix, which
is robust to color distortions, is formulated for object discovery. Moreover, a
Cohesion Measurement (CM) for object regions is also derived based on the
formulated affinity matrix. Based on the new Cohesion Measurement, a novel
object discovery method is proposed to discover objects latent in an image by
utilizing the eigenvectors of the affinity matrix. Then we apply the proposed
method to both saliency detection and object proposal generation. Experimental
results on several evaluation benchmarks demonstrate that the proposed CM based
method has achieved promising performance for these two tasks.Comment: 14 pages, 14 figure
์ด๋ ๋ฌผ์ฒด ๊ฐ์ง ๋ฐ ๋ถ์ง ์์ ๋ณต์์ ์ฐ๊ตฌ
ํ์๋
ผ๋ฌธ (๋ฐ์ฌ) -- ์์ธ๋ํ๊ต ๋ํ์ : ์์ฐ๊ณผํ๋ํ ์๋ฆฌ๊ณผํ๋ถ, 2021. 2. ๊ฐ๋ช
์ฃผ.Robust principal component analysis(RPCA), a method used to decom-
pose a matrix into the sum of a low-rank matrix and a sparse matrix, has
been proven e๏ฌective in modeling the static background of videos. However,
because a dynamic background cannot be represented by a low-rank matrix,
measures additional to the RPCA are required. In this thesis, we propose
masked RPCA to process backgrounds containing moving textures. First-
order Marcov random ๏ฌeld (MRF) is used to generate a mask that roughly
labels moving objects and backgrounds. To estimate the background, the
rank minimization process is then applied with the mask multiplied. During
the iteration, the background rank increases as the object mask expands,
and the weight of the rank constraint term decreases, which increases the
accuracy of the background. We compared the proposed method with state-
of-art, end-to-end methods to demonstrate its advantages.
Subsequently, we suggest novel dedusting method based on dust-optimized
transmission map and deep image prior. This method consists of estimating
atmospheric light and transmission in that order, which is similar to dark
channel prior-based dehazing methods. However, existing atmospheric light
estimating methods widely used in dehazing schemes give an overly bright
estimation, which results in unrealistically dark dedusting results. To ad-
dress this problem, we propose a segmentation-based method that gives new
estimation in atmospheric light. Dark channel prior based transmission map
with new atmospheric light gives unnatural intensity ordering and zero value
at low transmission regions. Therefore, the transmission map is re๏ฌned by
scattering model based transformation and dark channel adaptive non-local
total variation (NLTV) regularization. Parameter optimizing steps with deep
image prior(DIP) gives the ๏ฌnal dedusting result.๊ฐ๊ฑด ์ฃผ์ฑ๋ถ ๋ถ์์ ๋ฐฐ๊ฒฝ ๊ฐ์ฐ์ ํตํ ๋์์์ ์ ๊ฒฝ ์ถ์ถ์ ๋ฐฉ๋ฒ์ผ๋ก ์ด
์ฉ๋์ด์์ผ๋, ๋์ ๋ฐฐ๊ฒฝ์์ ๊ณ์ํ๋ ฌ๋กํํ๋ ์์๊ธฐ๋๋ฌธ์๋์ ๋ฐฐ๊ฒฝ
๊ฐ์ฐ์์ฑ๋ฅ์ ํ๊ณ๋ฅผ๊ฐ์ง๊ณ ์์๋ค. ์ฐ๋ฆฌ๋์ ๊ฒฝ๊ณผ๋ฐฐ๊ฒฝ์๊ตฌ๋ถํ๋์ผ๊ณ๋ง
๋ฅด์ฝํ์ฐ์๋ฅผ๋์
ํด์ ์ ๋ฐฐ๊ฒฝ์๋ํ๋ด๋ํญ๊ณผ๊ณฑํ๊ณ ์ด๊ฒ์์ด์ฉํ์๋ก
์ดํํ์๊ฐ๊ฑด์ฃผ์ฑ๋ถ๋ถ์์์ ์ํ์ฌ๋์ ๋ฐฐ๊ฒฝ๊ฐ์ฐ๋ฌธ์ ๋ฅผํด๊ฒฐํ๋ค. ํด๋น
์ต์ํ๋ฌธ์ ๋๋ฐ๋ณต์ ์ธ๊ต์ฐจ์ต์ ํ๋ฅผํตํ์ฌํด๊ฒฐํ๋ค. ์ด์ด์๋๊ธฐ์ค์๋ฏธ์ธ
๋จผ์ง์์ํด์ค์ผ๋์์์๋ณต์ํ๋ค. ์์๋ถํ ๊ณผ์ํ์ฑ๋๊ฐ์ ์๊ธฐ๋ฐํ์ฌ
๊น์ด์ง๋๋ฅผ๊ตฌํ๊ณ , ๋น๊ตญ์์ด๋ณ๋์ต์ํ๋ฅผํตํ์ฌ์ ์ ํ๋ค. ์ดํ๊น์์์
๊ฐ์ ์๊ธฐ๋ฐํ์์์์ฑ๊ธฐ๋ฅผํตํ์ฌ์ต์ข
์ ์ผ๋ก๋ณต์๋์์์๊ตฌํ๋ค. ์คํ์
ํตํ์ฌ์ ์๋๋ฐฉ๋ฒ์๋ค๋ฅธ๋ฐฉ๋ฒ๋ค๊ณผ๋น๊ตํ๊ณ ์ง์ ์ธ์ธก๋ฉด๊ณผ์์ ์ธ์ธก๋ฉด๋ชจ
๋์์์ฐ์ํจ์ํ์ธํ๋ค.Abstract i
1 Introduction 1
1.1 Moving Object Detection In Dynamic Backgrounds 1
1.2 Image Dedusting 2
2 Preliminaries 4
2.1 Moving Object Detection In Dynamic Backgrounds 4
2.1.1 Literature review 5
2.1.2 Robust principal component analysis(RPCA) and their application status 7
2.1.3 Graph cuts and ฮฑ-expansion algorithm 14
2.2 Image Dedusting 16
2.2.1 Image dehazing methods 16
2.2.2 Dust model 18
2.2.3 Non-local total variation(NLTV) 19
3 Dynamic Background Subtraction With Masked RPCA 21
3.1 Motivation 21
3.1.1 Motivation of background modeling 21
3.1.2 Mask formulation 23
3.1.3 Model 24
3.2 Optimization 25
3.2.1 L-Subproblem 25
3.2.2 Lห-Subproblem 26
3.2.3 M-Subproblem 27
3.2.4 p-Subproblem 28
3.2.5 Adaptive parameter control 28
3.2.6 Convergence 29
3.3 Experimental results 31
3.3.1 Benchmark Algorithms And Videos 31
3.3.2 Implementation 32
3.3.3 Evaluation 32
4 Deep Image Dedusting With Dust-Optimized Transmission Map 41
4.1 Transmission estimation 41
4.1.1 Atmospheric light estimation 41
4.1.2 Transmission estimation 43
4.2 Scene radiance recovery 47
4.3 Experimental results 51
4.3.1 Implementation 51
4.3.2 Evaluation 52
5 Conclusion 58
Abstract (in Korean) 69
Acknowledgement (in Korean) 70Docto
On landmark selection and sampling in high-dimensional data analysis
In recent years, the spectral analysis of appropriately defined kernel
matrices has emerged as a principled way to extract the low-dimensional
structure often prevalent in high-dimensional data. Here we provide an
introduction to spectral methods for linear and nonlinear dimension reduction,
emphasizing ways to overcome the computational limitations currently faced by
practitioners with massive datasets. In particular, a data subsampling or
landmark selection process is often employed to construct a kernel based on
partial information, followed by an approximate spectral analysis termed the
Nystrom extension. We provide a quantitative framework to analyse this
procedure, and use it to demonstrate algorithmic performance bounds on a range
of practical approaches designed to optimize the landmark selection process. We
compare the practical implications of these bounds by way of real-world
examples drawn from the field of computer vision, whereby low-dimensional
manifold structure is shown to emerge from high-dimensional video data streams.Comment: 18 pages, 6 figures, submitted for publicatio
- โฆ