Printed texture guided color feature fusion for impressionism style rendering of oil paintings.
As a major branch of Non-Photorealistic Rendering (NPR), image stylization mainly uses computer algorithms to render a photo as an artistic painting. Recent work has shown that extracting style information, such as stroke texture and color, from the target style image is the key to image stylization. Given these stroke texture and color characteristics, a new stroke rendering method is proposed. By fully considering the tonal characteristics and the representative colors of the original oil painting, it can fit the tone of the original oil painting into the stylized image while keeping the artist's creative effect. Experiments have validated the efficacy of the proposed model in comparison to three state-of-the-art methods. The method is best suited to the works of pointillist painters with a relatively uniform style, especially natural scenes; otherwise, the results can be less satisfactory.
Higher-Order Representations for Visual Recognition
In this thesis, we present a simple and effective architecture called Bilinear Convolutional Neural Networks (B-CNNs). These networks represent an image as a pooled outer product of features derived from two CNNs and capture localized feature interactions in a translationally invariant manner. B-CNNs generalize classical orderless texture-based image models such as bag-of-visual-words and Fisher vector representations. However, unlike prior work, they can be trained in an end-to-end manner. In the experiments, we demonstrate that these representations generalize well to novel domains by fine-tuning and achieve excellent results on fine-grained, texture and scene recognition tasks. The visualization of fine-tuned convolutional filters shows that the models are able to capture highly localized attributes. We present a texture synthesis framework that allows us to visualize the pre-images of fine-grained categories and the invariances that are captured by these models.
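The pooled outer product at the heart of B-CNNs can be sketched in a few lines. Below is a minimal NumPy illustration, paired with the signed square-root and L2 normalization commonly applied to bilinear descriptors; it is a schematic of the pooling step only, not the thesis's exact architecture, and the feature dimensions are arbitrary:

```python
import numpy as np

def bilinear_pool(fa, fb):
    """Pool the outer product of two CNN feature maps over spatial locations.

    fa: (H*W, Da) features from stream A
    fb: (H*W, Db) features from stream B
    Returns a (Da*Db,) image descriptor, invariant to where in the
    image the feature interactions occur (translational invariance).
    """
    b = fa.T @ fb / fa.shape[0]              # (Da, Db) averaged outer product
    b = b.flatten()
    b = np.sign(b) * np.sqrt(np.abs(b))      # signed square-root normalization
    return b / (np.linalg.norm(b) + 1e-12)   # L2 normalization

# toy example: 49 spatial locations, 8- and 16-dim feature streams
fa = np.random.randn(49, 8)
fb = np.random.randn(49, 16)
desc = bilinear_pool(fa, fb)
```

Because the outer products are summed over all locations, the descriptor captures co-occurrences of features from the two streams while discarding their spatial positions, which is what makes the representation orderless like bag-of-visual-words.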
To enhance the discriminative power of B-CNN representations, we investigate normalization techniques for rescaling the importance of individual features during aggregation. Spectral normalization scales the spectrum of the covariance matrix obtained after bilinear pooling and offers a significant improvement. However, the computation involves a singular value decomposition, which is not efficient on modern GPUs. We present an iteration-based approximation of the matrix square root, along with its gradients, to speed up the computation, and study its effect on fine-tuning deep neural networks. Another approach is democratic aggregation, which aims to equalize the contributions of individual feature vectors to the final pooled image descriptor. This achieves a comparable improvement and, unlike spectral normalization, can be approximated in a low-dimensional embedding, making it better suited to aggregating higher-dimensional features. We demonstrate that the two approaches are closely related, and we discuss their trade-off between performance and efficiency.
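Iterative approximations of the matrix square root are commonly implemented with the coupled Newton-Schulz iteration, which uses only matrix multiplications and is therefore GPU-friendly. The following NumPy sketch illustrates that general technique under the assumption of a symmetric positive-definite input; the thesis's version may differ in details such as scaling, iteration count, and gradient computation:

```python
import numpy as np

def newton_schulz_sqrt(A, num_iters=15):
    """Approximate the square root of a symmetric positive-definite matrix
    with the coupled Newton-Schulz iteration (matrix multiplies only, no SVD)."""
    n = A.shape[0]
    norm = np.linalg.norm(A)       # pre-scale so the iteration converges
    Y = A / norm
    Z = np.eye(n)
    I = np.eye(n)
    for _ in range(num_iters):
        T = 0.5 * (3.0 * I - Z @ Y)
        Y = Y @ T                  # Y converges to sqrt(A / norm)
        Z = T @ Z                  # Z converges to the inverse square root
    return Y * np.sqrt(norm)       # undo the pre-scaling

# sanity check on a small, well-conditioned SPD matrix
rng = np.random.default_rng(0)
X = rng.standard_normal((16, 16))
A = X @ X.T + 16 * np.eye(16)
S = newton_schulz_sqrt(A)
```

Pre-scaling by the Frobenius norm keeps the spectrum inside the iteration's convergence region; the inverse square root Z comes out as a free byproduct, which is convenient for computing gradients.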
An Empirical Comparison of Different Machine Learning Algorithms
Humans have long used sketching to visualize and narrate the aesthetics of the world, and with the advent of touch devices and augmented technologies it has attracted growing attention in recent years. Recognizing free-hand sketches is an extremely cumbersome and challenging task due to their abstract qualities and lack of visual cues. Most previous work identifies objects in real photographic images using neural networks, rather than in the more abstract sketched depictions of the same objects. This research compares the performance of different machine learning algorithms and their learned internal representations on sketch classification. It examines several well-known machine learning models, studies both legacy and newer datasets, and classifies new sketches with various classifiers, including support vector machines and deep neural networks. The classifiers achieve remarkable results, but their accuracy on sketch images still lags behind.
A Computational Approach to Hand Pose Recognition in Early Modern Paintings
Hands represent an important aspect of pictorial narration but have rarely been addressed as an object of study in art history and digital humanities. Although hand gestures play a significant role in conveying emotions, narratives, and cultural symbolism in the context of visual art, a comprehensive terminology for the classification of depicted hand poses is still lacking. In this article, we present the process of creating a new annotated dataset of pictorial hand poses. The dataset is based on a collection of European early modern paintings, from which hands are extracted using human pose estimation (HPE) methods. The hand images are then manually annotated based on art historical categorization schemes. From this categorization, we introduce a new classification task and perform a series of experiments using different types of features, including our newly introduced 2D hand keypoint features, as well as existing neural network-based features. This classification task represents a new and complex challenge due to the subtle and contextually dependent differences between depicted hands. The presented computational approach to hand pose recognition in paintings represents an initial attempt to tackle this challenge, which could potentially advance the use of HPE methods on paintings, as well as foster new research on the understanding of hand gestures in art.
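The abstract does not specify how its 2D hand keypoint features are constructed, so the following is a purely hypothetical illustration of one common way to turn detected keypoints into a fixed-length pose descriptor: normalized pairwise distances over the 21 hand landmarks that standard HPE pipelines output. Every choice here (21 landmarks, centering, pairwise distances) is an assumption for illustration, not the authors' method:

```python
import numpy as np

def keypoint_features(kps):
    """Hypothetical pose descriptor: translation- and scale-normalized
    pairwise distances between 2D hand keypoints.

    kps: (21, 2) array of x/y keypoint coordinates
    Returns a (210,) vector (21 choose 2 pairwise distances).
    """
    kps = np.asarray(kps, dtype=float)
    center = kps.mean(axis=0)
    scale = np.linalg.norm(kps - center, axis=1).max() + 1e-9
    kps = (kps - center) / scale               # remove translation and scale
    i, j = np.triu_indices(len(kps), k=1)      # all unordered keypoint pairs
    return np.linalg.norm(kps[i] - kps[j], axis=1)

feats = keypoint_features(np.random.rand(21, 2))
```

A descriptor like this is invariant to where the hand appears in the painting and to its size, which matters when hands are cropped from canvases of very different scales.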
DiffStyler: Controllable Dual Diffusion for Text-Driven Image Stylization
Despite the impressive results of arbitrary image-guided style transfer methods, text-driven image stylization has recently been proposed for transferring a natural image into a stylized one according to textual descriptions of the target style provided by the user. Unlike previous image-to-image transfer approaches, text-guided stylization provides users with a more precise and intuitive way to express the desired style. However, the huge discrepancy between cross-modal inputs and outputs makes it challenging to conduct text-driven image stylization in a typical feed-forward CNN pipeline. In this paper, we present DiffStyler, which is built on diffusion models: cross-modal style information can be easily integrated as guidance at each step of the diffusion process. In particular, we use a dual diffusion processing architecture to control the balance between the content and style of the diffused results. Furthermore, we propose a learnable noise, derived from the content image, on which the reverse denoising process is based, enabling the stylization results to better preserve the structural information of the content image. We validate the proposed DiffStyler against the baseline methods through extensive qualitative and quantitative experiments.