Search CORE

1,512 research outputs found

A global-to-local model for the representation of human faces

Author: Knothe Reinhard
Publication venue
Publication date: 01/01/2009
Field of study

In the context of face modeling and face recognition, statistical models are widely used for the representation and modeling of surfaces. Most of these models are obtained by computing Principal Components Analysis (PCA) on a set of representative examples. These models represent novel faces poorly due to their holistic nature (i.e.\ each component has global support), and they suffer from overfitting when used for generalization from partial information. In this work, we present a novel analysis method that breaks the objects up into modes based on spatial frequency. The high-frequency modes are segmented into regions with respect to specific features of the object. After computing PCA on these segments individually, a hierarchy of global and local components gradually decreasing in size of their support is combined into a linear statistical model, hence the name, Global-to-Local model (G2L). We apply our methodology to build a novel G2L model of 3D shapes of human heads. Both the representation and the generalization capabilities of the models are evaluated and compared in a standardized test, and it is demonstrated that the G2L model performs better compared to traditional holistic PCA models. Furthermore, both models are used to reconstruct the 3D shape of faces from a single photograph. A novel adaptive fitting method is presented that estimates the model parameters using a multi-resolution approach. The model is first fitted to contours extracted from the image. In a second stage, the contours are kept fixed and the remaining flexibility of the model is fitted to the input image. This makes the method fast (30 sec on a standard PC), efficient, and accurate

edoc

Style transfer for headshot portraits

Author: Barnes Connelly
Durand Fredo
Freeman William T.
Paris Sylvain
Shih YiChang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/07/2014
Field of study

Headshot portraits are a popular subject in photography but to achieve a compelling visual style requires advanced skills that a casual photographer will not have. Further, algorithms that automate or assist the stylization of generic photographs do not perform well on headshots due to the feature-specific, local retouching that a professional photographer typically applies to generate such portraits. We introduce a technique to transfer the style of an example headshot photo onto a new one. This can allow one to easily reproduce the look of renowned artists. At the core of our approach is a new multiscale technique to robustly transfer the local statistics of an example portrait onto a new one. This technique matches properties such as the local contrast and the overall lighting direction while being tolerant to the unavoidable differences between the faces of two different people. Additionally, because artists sometimes produce entire headshot collections in a common style, we show how to automatically find a good example to use as a reference for a given portrait, enabling style transfer without the user having to search for a suitable example for each input. We demonstrate our approach on data taken in a controlled environment as well as on a large set of photos downloaded from the Internet. We show that we can successfully handle styles by a variety of different artists.Quanta Computer (Firm)Adobe System

DSpace@MIT

A survey of visual preprocessing and shape representation techniques

Author: Olshausen Bruno A.
Publication venue
Publication date
Field of study

Many recent theories and methods proposed for visual preprocessing and shape representation are summarized. The survey brings together research from the fields of biology, psychology, computer science, electrical engineering, and most recently, neural networks. It was motivated by the need to preprocess images for a sparse distributed memory (SDM), but the techniques presented may also prove useful for applying other associative memories to visual pattern recognition. The material of this survey is divided into three sections: an overview of biological visual processing; methods of preprocessing (extracting parts of shape, texture, motion, and depth); and shape representation and recognition (form invariance, primitives and structural descriptions, and theories of attention)

NASA Technical Reports Server

Mini Kirsch Edge Detection and Its Sharpening Effect

Author: Sia Jeremy Yik Xian
Sia Joyce Sin Yin
Tan Tian Swee
Tiong Matthias Foh Thye
Yahya Azli Bin
Publication venue: IAES Indonesia Section
Publication date: 30/03/2021
Field of study

In computer vision, edge detection is a crucial step in identifying the objects’ boundaries in an image. The existing edge detection methods function in either spatial domain or frequency domain, fail to outline the high continuity boundaries of the objects. In this work, we modified four-directional mini Kirsch edge detection kernels which enable full directional edge detection. We also introduced the novel involvement of the proposed method in image sharpening by adding the resulting edge map onto the original input image to enhance the edge details in the image. From the edge detection performance tests, our proposed method acquired the highest true edge pixels and true non-edge pixels detection, yielding the highest accuracy among all the comparing methods. Moreover, the sharpening effect offered by our proposed framework could achieve a more favorable visual appearance with a competitive score of peak signal-to-noise ratio and structural similarity index value compared to the most widely used unsharp masking and Laplacian of Gaussian sharpening methods. The edges of the sharpened image are further enhanced could potentially contribute to better boundary tracking and higher segmentation accuracy

Indonesian Journal of Electrical Engineering and Informatics (IJEEI)

Hybrid LSTM and Encoder-Decoder Architecture for Detection of Image Forgeries

Author: Bappy Jawadul H.
Manjunath B. S.
Nataraj Lakshmanan
Roy-Chowdhury Amit K.
Simons Cody
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 06/03/2019
Field of study

With advanced image journaling tools, one can easily alter the semantic meaning of an image by exploiting certain manipulation techniques such as copy-clone, object splicing, and removal, which mislead the viewers. In contrast, the identification of these manipulations becomes a very challenging task as manipulated regions are not visually apparent. This paper proposes a high-confidence manipulation localization architecture which utilizes resampling features, Long-Short Term Memory (LSTM) cells, and encoder-decoder network to segment out manipulated regions from non-manipulated ones. Resampling features are used to capture artifacts like JPEG quality loss, upsampling, downsampling, rotation, and shearing. The proposed network exploits larger receptive fields (spatial maps) and frequency domain correlation to analyze the discriminative characteristics between manipulated and non-manipulated regions by incorporating encoder and LSTM network. Finally, decoder network learns the mapping from low-resolution feature maps to pixel-wise predictions for image tamper localization. With predicted mask provided by final layer (softmax) of the proposed architecture, end-to-end training is performed to learn the network parameters through back-propagation using ground-truth masks. Furthermore, a large image splicing dataset is introduced to guide the training process. The proposed method is capable of localizing image manipulations at pixel level with high precision, which is demonstrated through rigorous experimentation on three diverse datasets

arXiv.org e-Print Archive

eScholarship - University of California