19,771 research outputs found
Computational Aesthetics for Fashion
The online fashion industry is growing fast and with it, the need for advanced systems able to automatically solve different tasks in an accurate way. With the rapid advance of digital technologies, Deep Learning has played an important role in Computational Aesthetics, an interdisciplinary area that tries to bridge fine art, design, and computer science. Specifically, Computational Aesthetics aims to automatize human aesthetic judgments with computational methods. In this thesis, we focus on three applications of computer vision in fashion, and we discuss how Computational Aesthetics helps solve them accurately
Re-ID done right: towards good practices for person re-identification
Training a deep architecture using a ranking loss has become standard for the
person re-identification task. Increasingly, these deep architectures include
additional components that leverage part detections, attribute predictions,
pose estimators and other auxiliary information, in order to more effectively
localize and align discriminative image regions. In this paper we adopt a
different approach and carefully design each component of a simple deep
architecture and, critically, the strategy for training it effectively for
person re-identification. We extensively evaluate each design choice, leading
to a list of good practices for person re-identification. By following these
practices, our approach outperforms the state of the art, including more
complex methods with auxiliary components, by large margins on four benchmark
datasets. We also provide a qualitative analysis of our trained representation
which indicates that, while compact, it is able to capture information from
localized and discriminative regions, in a manner akin to an implicit attention
mechanism
UniHCP: A Unified Model for Human-Centric Perceptions
Human-centric perceptions (e.g., pose estimation, human parsing, pedestrian
detection, person re-identification, etc.) play a key role in industrial
applications of visual models. While specific human-centric tasks have their
own relevant semantic aspect to focus on, they also share the same underlying
semantic structure of the human body. However, few works have attempted to
exploit such homogeneity and design a general-propose model for human-centric
tasks. In this work, we revisit a broad range of human-centric tasks and unify
them in a minimalist manner. We propose UniHCP, a Unified Model for
Human-Centric Perceptions, which unifies a wide range of human-centric tasks in
a simplified end-to-end manner with the plain vision transformer architecture.
With large-scale joint training on 33 human-centric datasets, UniHCP can
outperform strong baselines on several in-domain and downstream tasks by direct
evaluation. When adapted to a specific task, UniHCP achieves new SOTAs on a
wide range of human-centric tasks, e.g., 69.8 mIoU on CIHP for human parsing,
86.18 mA on PA-100K for attribute prediction, 90.3 mAP on Market1501 for ReID,
and 85.8 JI on CrowdHuman for pedestrian detection, performing better than
specialized models tailored for each task.Comment: Accepted for publication at the IEEE/CVF Conference on Computer
Vision and Pattern Recognition 2023 (CVPR 2023
Towards Optimal Free Trade Agreement Utilization through Deep Learning Techniques
In recent years, deep learning based methods achieved new state of the art in various domains such as image recognition, speech recognition and natural language processing. However, in the context of tax and customs, the amount of existing applications of artificial intelligence and more specifically deep learning is limited. In this paper, we investigate the potentials of deep learning techniques to improve the Free Trade Agreement (FTA) utilization of trade transactions. We show that supervised learning models can be trained to decide on the basis of transaction characteristics such as import country, export country, product type, etc. whether FTA can be utilized. We apply a specific architecture with multiple embeddings to efficiently capture the dynamics of tabular data. The experiments were evaluated on real-world data generated by Enterprise Resource Planning (ERP) systems of an international chemical and consumer goods company
- …