5,073 research outputs found
High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables
In this work we address the problem of approximating high-dimensional data
with a low-dimensional representation. We make the following contributions. We
propose an inverse regression method which exchanges the roles of input and
response, such that the low-dimensional variable becomes the regressor, and
which is tractable. We introduce a mixture of locally-linear probabilistic
mapping model that starts with estimating the parameters of inverse regression,
and follows with inferring closed-form solutions for the forward parameters of
the high-dimensional regression problem of interest. Moreover, we introduce a
partially-latent paradigm, such that the vector-valued response variable is
composed of both observed and latent entries, thus being able to deal with data
contaminated by experimental artifacts that cannot be explained with noise
models. The proposed probabilistic formulation could be viewed as a
latent-variable augmentation of regression. We devise expectation-maximization
(EM) procedures based on a data augmentation strategy which facilitates the
maximum-likelihood search over the model parameters. We propose two
augmentation schemes and we describe in detail the associated EM inference
procedures that may well be viewed as generalizations of a number of EM
regression, dimension reduction, and factor analysis algorithms. The proposed
framework is validated with both synthetic and real data. We provide
experimental evidence that our method outperforms several existing regression
techniques
Fairness in Face Presentation Attack Detection
Face presentation attack detection (PAD) is critical to secure face
recognition (FR) applications from presentation attacks. FR performance has
been shown to be unfair to certain demographic and non-demographic groups.
However, the fairness of face PAD is an understudied issue, mainly due to the
lack of appropriately annotated data. To address this issue, this work first
presents a Combined Attribute Annotated PAD Dataset (CAAD-PAD) by combining
several well-known PAD datasets where we provide seven human-annotated
attribute labels. This work then comprehensively analyses the fairness of a set
of face PADs and its relation to the nature of training data and the
Operational Decision Threshold Assignment (ODTA) on different data groups by
studying four face PAD approaches on our CAAD-PAD. To simultaneously represent
both the PAD fairness and the absolute PAD performance, we introduce a novel
metric, namely the Accuracy Balanced Fairness (ABF). Extensive experiments on
CAAD-PAD show that the training data and ODTA induce unfairness on gender,
occlusion, and other attribute groups. Based on these analyses, we propose a
data augmentation method, FairSWAP, which aims to disrupt the identity/semantic
information and guide models to mine attack cues rather than attribute-related
information. Detailed experimental results demonstrate that FairSWAP generally
enhances both the PAD performance and the fairness of face PAD
A mixed reality telepresence system for collaborative space operation
This paper presents a Mixed Reality system that results from the integration of a telepresence system and an application to improve collaborative space exploration. The system combines free viewpoint video with immersive projection technology to support non-verbal communication, including eye gaze, inter-personal distance and facial expression. Importantly, these can be interpreted together as people move around the simulation, maintaining natural social distance. The application is a simulation of Mars, within which the collaborators must come to agreement over, for example, where the Rover should land and go.
The first contribution is the creation of a Mixed Reality system supporting contextualization of non-verbal communication. Tw technological contributions are prototyping a technique to subtract a person from a background that may contain physical objects and/or moving images, and a light weight texturing method for multi-view rendering which provides balance in terms of visual and temporal quality. A practical contribution is the demonstration of pragmatic approaches to sharing space between display systems of distinct levels of immersion. A research tool contribution is a system that allows comparison of conventional authored and video based reconstructed avatars, within an environment that encourages exploration and social interaction. Aspects of system quality, including the communication of facial expression and end-to-end latency are reported
Biometric Systems
Because of the accelerating progress in biometrics research and the latest nation-state threats to security, this book's publication is not only timely but also much needed. This volume contains seventeen peer-reviewed chapters reporting the state of the art in biometrics research: security issues, signature verification, fingerprint identification, wrist vascular biometrics, ear detection, face detection and identification (including a new survey of face recognition), person re-identification, electrocardiogram (ECT) recognition, and several multi-modal systems. This book will be a valuable resource for graduate students, engineers, and researchers interested in understanding and investigating this important field of study
Recommended from our members
The role of HG in the analysis of temporal iteration and interaural correlation
์์ ๊ธฐ๋ฐ ๋์ผ์ธ ํ๋ณ์ ์ํ ๋ถ๋ถ ์ ํฉ ํ์ต
ํ์๋
ผ๋ฌธ (๋ฐ์ฌ)-- ์์ธ๋ํ๊ต ๋ํ์ : ๊ณต๊ณผ๋ํ ์ ๊ธฐยท์ ๋ณด๊ณตํ๋ถ, 2019. 2. ์ด๊ฒฝ๋ฌด.Person re-identification is a problem of identifying the same individuals among the persons captured from different cameras. It is a challenging problem because the same person captured from non-overlapping cameras usually shows dramatic appearance change due to the viewpoint, pose, and illumination changes. Since it is an essential tool for many surveillance applications, various research directions have been exploredhowever, it is far from being solved.
The goal of this thesis is to solve person re-identification problem under the surveillance system. In particular, we focus on two critical components: designing 1) a better image representation model using human poses and 2) a better training method using hard sample mining. First, we propose a part-aligned representation model which represents an image as the bilinear pooling between appearance and part maps. Since the image similarity is independently calculated from the locations of body parts, it addresses the body part misalignment issue and effectively distinguishes different people by discriminating fine-grained local differences. Second, we propose a stochastic hard sample mining method that exploits class information to generate diverse and hard examples to use for training. It efficiently explores the training samples while avoiding stuck in a small subset of hard samples, thereby effectively training the model. Finally, we propose an integrated system that combines the two approaches, which is benefited from both components. Experimental results show that the proposed method works robustly on five datasets with diverse conditions and its potential extension to the more general conditions.๋์ผ์ธ ํ๋ณ๋ฌธ์ ๋ ๋ค๋ฅธ ์นด๋ฉ๋ผ๋ก ์ดฌ์๋ ๊ฐ๊ฐ์ ์์์ ์ฐํ ๋ ์ฌ๋์ด ๊ฐ์ ์ฌ๋์ธ์ง ์ฌ๋ถ๋ฅผ ํ๋จํ๋ ๋ฌธ์ ์ด๋ค. ์ด๋ ๊ฐ์์นด๋ฉ๋ผ์ ๋ณด์์ ๊ด๋ จ๋ ๋ค์ํ ์์ฉ ๋ถ์ผ์์ ์ค์ํ ๋๊ตฌ๋ก ํ์ฉ๋๊ธฐ ๋๋ฌธ์ ์ต๊ทผ๊น์ง ๋ง์ ์ฐ๊ตฌ๊ฐ ์ด๋ฃจ์ด์ง๊ณ ์๋ค. ๊ทธ๋ฌ๋ ๊ฐ์ ์ฌ๋์ด๋๋ผ๋ ์๊ฐ, ์ฅ์, ์ดฌ์ ๊ฐ๋, ์กฐ๋ช
์ํ๊ฐ ๋ค๋ฅธ ํ๊ฒฝ์์ ์ฐํ๋ฉด ์์๋ง๋ค ๋ณด์ด๋ ๋ชจ์ต์ด ๋ฌ๋ผ์ง๋ฏ๋ก ํ๋ณ์ ์๋ํํ๊ธฐ ์ด๋ ต๋ค๋ ๋ฌธ์ ๊ฐ ์๋ค.
๋ณธ ๋
ผ๋ฌธ์์๋ ์ฃผ๋ก ๊ฐ์์นด๋ฉ๋ผ ์์์ ๋ํด์, ๊ฐ ์์์์ ์๋์ผ๋ก ์ฌ๋์ ๊ฒ์ถํ ํ์ ๊ฒ์ถํ ๊ฒฐ๊ณผ๋ค์ด ์๋ก ๊ฐ์ ์ฌ๋์ธ์ง ์ฌ๋ถ๋ฅผ ํ๋จํ๋ ๋ฌธ์ ๋ฅผ ํ๊ณ ์ ํ๋ค. ์ด๋ฅผ ์ํด 1) ์ด๋ค ๋ชจ๋ธ์ด ์์์ ์ ํํํ ๊ฒ์ธ์ง 2) ์ฃผ์ด์ง ๋ชจ๋ธ์ ์ด๋ป๊ฒ ์ ํ์ต์ํฌ์ ์์์ง ๋ ๊ฐ์ง ์ง๋ฌธ์ ๋ํด์ ์ฐ๊ตฌํ๋ค. ๋จผ์ ๋ฒกํฐ ๊ณต๊ฐ ์์์์ ๊ฑฐ๋ฆฌ๊ฐ ์ด๋ฏธ์ง ์์์ ๋์๋๋ ํํธ๋ค ์ฌ์ด์ ์๊น์ ์ฐจ์ด์ ํฉ๊ณผ ๊ฐ์์ง๋๋ก ํ๋ ๋งคํ ํจ์๋ฅผ ์ค๊ณํจ์ผ๋ก์จ ๊ฒ์ถ๋ ์ฌ๋๋ค ์ฌ์ด์ ์ ์ฒด ๋ถ๋ถ๋ณ๋ก ์๊น์๋ฅผ ๋น๊ต๋ฅผ ํตํด ํจ๊ณผ์ ์ธ ํ๋ณ์ ๊ฐ๋ฅํ๊ฒ ํ๋ ๋ชจ๋ธ์ ์ ์ํ๋ค. ๋๋ฒ์งธ๋ก ํ์ต ๊ณผ์ ์์ ํด๋์ค ์ ๋ณด๋ฅผ ํ์ฉํด์ ์ ์ ๊ณ์ฐ๋์ผ๋ก ์ด๋ ค์ด ์์๋ฅผ ๋ง์ด ๋ณด๋๋ก ํจ์ผ๋ก์จ ํจ๊ณผ์ ์ผ๋ก ํจ์์ ํ๋ผ๋ฏธํฐ๋ฅผ ํ์ตํ๋ ๋ฐฉ๋ฒ์ ์ ์ํ๋ค. ์ต์ข
์ ์ผ๋ก๋ ๋ ์์๋ฅผ ๊ฒฐํฉํด์ ์๋ก์ด ๋์ผ์ธ ํ๋ณ ์์คํ
์ ์ ์ํ๊ณ ์ ํ๋ค. ๋ณธ ๋
ผ๋ฌธ์์๋ ์คํ๊ฒฐ๊ณผ๋ฅผ ํตํด ์ ์ํ๋ ๋ฐฉ๋ฒ์ด ๋ค์ํ ํ๊ฒฝ์์ ๊ฐ์ธํ๊ณ ํจ๊ณผ์ ์ผ๋ก ๋์ํจ์ ์ฆ๋ช
ํ์๊ณ ๋ณด๋ค ์ผ๋ฐ์ ์ธ ํ๊ฒฝ์ผ๋ก์ ํ์ฅ ๊ฐ๋ฅ์ฑ๋ ํ์ธ ํ ์ ์์ ๊ฒ์ด๋ค.Abstract i
Contents ii
List of Tables v
List of Figures vii
1. Introduction 1
1.1 Part-Aligned Bilinear Representations . . . . . . . . . . . . . . . . . 3
1.2 Stochastic Class-Based Hard Sample Mining . . . . . . . . . . . . . 4
1.3 Integrated System for Person Re-identification . . . . . . . . . . . . . 5
2. Part-Aligned Bilinear Represenatations 6
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.3 Our Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.3.1 Two-Stream Network . . . . . . . . . . . . . . . . . . . . . . 10
2.3.2 Bilinear Pooling . . . . . . . . . . . . . . . . . . . . . . . . 12
2.3.3 Loss . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.4 Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.4.1 Part-Aware Image Similarity . . . . . . . . . . . . . . . . . . 13
2.4.2 Relationship to the Baseline Models . . . . . . . . . . . . . . 15
2.4.3 Decomposition of Appearance and Part Maps . . . . . . . . . 15
2.4.4 Part-Alignment Effects on Reducing Misalignment Issue . . . 19
2.5 Implementation Details . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.6 Experiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
2.6.1 Datasets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
2.6.2 Evaluation Metrics . . . . . . . . . . . . . . . . . . . . . . . 23
2.6.3 Comparison with the Baselines . . . . . . . . . . . . . . . . . 24
2.6.4 Comparison with State-of-the-Art Methods . . . . . . . . . . 25
2.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
3. Stochastic Class-Based Hard Sample Mining 35
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
3.2 Related Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
3.3 Deep Metric Learning with Triplet Loss . . . . . . . . . . . . . . . . 40
3.3.1 Triplet Loss . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
3.3.2 Efficient Learning with Triplet Loss . . . . . . . . . . . . . . 41
3.4 Batch Construction for Metric Learning . . . . . . . . . . . . . . . . 42
3.4.1 Neighbor Class Mining by Class Signatures . . . . . . . . . . 42
3.4.2 Batch Construction . . . . . . . . . . . . . . . . . . . . . . . 44
3.4.3 Scalable Extension to the Number of Classes . . . . . . . . . 50
3.5 Loss . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.6 Feature Extractor . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.7 Experiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.7.1 Datasets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.7.2 Implementation Details . . . . . . . . . . . . . . . . . . . . . 55
3.7.3 Evaluation Metrics . . . . . . . . . . . . . . . . . . . . . . . 56
3.7.4 Effect of the Stochastic Hard Example Mining . . . . . . . . 59
3.7.5 Comparison with the Existing Methods on Image Retrieval
Datasets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
3.8 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .70
4. Integrated System for Person Re-identification 71
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
4.2 Hard Positive Mining . . . . . . . . . . . . . . . . . . . . . . . . . . 72
4.3 Integrated System for Person Re-identification . . . . . . . . . . . . . 75
4.4 Experiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
4.4.1 Comparison with the baselines . . . . . . . . . . . . . . . . . 75
4.4.2 Comparison with the existing works . . . . . . . . . . . . . . 80
4.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
5.Conclusion 83
5.1 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
5.2 Future Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
Abstract (In Korean) 94Docto
A Survey on Evolutionary Computation for Computer Vision and Image Analysis: Past, Present, and Future Trends
Computer vision (CV) is a big and important field
in artificial intelligence covering a wide range of applications.
Image analysis is a major task in CV aiming to extract, analyse
and understand the visual content of images. However, imagerelated
tasks are very challenging due to many factors, e.g., high
variations across images, high dimensionality, domain expertise
requirement, and image distortions. Evolutionary computation
(EC) approaches have been widely used for image analysis with
significant achievement. However, there is no comprehensive
survey of existing EC approaches to image analysis. To fill
this gap, this paper provides a comprehensive survey covering
all essential EC approaches to important image analysis tasks
including edge detection, image segmentation, image feature
analysis, image classification, object detection, and others. This
survey aims to provide a better understanding of evolutionary
computer vision (ECV) by discussing the contributions of different
approaches and exploring how and why EC is used for
CV and image analysis. The applications, challenges, issues, and
trends associated to this research field are also discussed and
summarised to provide further guidelines and opportunities for
future research
Change blindness: eradication of gestalt strategies
Arrays of eight, texture-defined rectangles were used as stimuli in a one-shot change blindness (CB) task where there was a 50% chance that one rectangle would change orientation between two successive presentations separated by an interval. CB was eliminated by cueing the target rectangle in the first stimulus, reduced by cueing in the interval and unaffected by cueing in the second presentation. This supports the idea that a representation was formed that persisted through the interval before being 'overwritten' by the second presentation (Landman et al, 2003 Vision Research 43149โ164]. Another possibility is that participants used some kind of grouping or Gestalt strategy. To test this we changed the spatial position of the rectangles in the second presentation by shifting them along imaginary spokes (by ยฑ1 degree) emanating from the central fixation point. There was no significant difference seen in performance between this and the standard task [F(1,4)=2.565, p=0.185]. This may suggest two things: (i) Gestalt grouping is not used as a strategy in these tasks, and (ii) it gives further weight to the argument that objects may be stored and retrieved from a pre-attentional store during this task
A machine-learning-based surrogate model of Mars' thermal evolution
Constraining initial conditions and parameters of mantle convection for a planet often requires running several hundred computationally expensive simulations in order to find those matching certain โobservablesโ, such as crustal thickness, duration of volcanism, or radial contraction. A lower fidelity alternative is to use 1-D evolution models based on scaling laws that parametrize convective heat transfer. However, this approach is often limited in the amount of physics that scaling laws can accurately represent (e.g. temperature and pressure-dependent rheologies or mineralogical phase transitions can only be marginally simulated). We leverage neural networks to build a surrogate model that can predict the entire evolution (0โ4.5 Gyr) of the 1-D temperature profile of a Mars-like planet for a wide range of values of five different parameters: reference viscosity, activation energy and activation volume of diffusion creep, enrichment factor of heat-producing elements in the crust and initial temperature of the mantle. The neural network we evaluate and present here has been trained from a subset of โผ10โ000 evolution simulations of Mars ran on a 2-D quarter-cylindrical grid, from which we extracted laterally averaged 1-D temperature profiles. The temperature profiles predicted by this trained network match those of an unseen batch of 2-D simulations with an average accuracy of 99.7per centโ
- โฆ