4 research outputs found
GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models
The rapid advancement in image generation models has predominantly been
driven by diffusion models, which have demonstrated unparalleled success in
generating high-fidelity, diverse images from textual prompts. Despite their
success, diffusion models encounter substantial challenges in the domain of
image editing, particularly in executing disentangled edits-changes that target
specific attributes of an image while leaving irrelevant parts untouched. In
contrast, Generative Adversarial Networks (GANs) have been recognized for their
success in disentangled edits through their interpretable latent spaces. We
introduce GANTASTIC, a novel framework that takes existing directions from
pre-trained GAN models-representative of specific, controllable attributes-and
transfers these directions into diffusion-based models. This novel approach not
only maintains the generative quality and diversity that diffusion models are
known for but also significantly enhances their capability to perform precise,
targeted image edits, thereby leveraging the best of both worlds.Comment: Project page: https://gantastic.github.i
VecGAN: Image-to-Image Translation with Interpretable Latent Directions
We propose VecGAN, an image-to-image translation framework for facial
attribute editing with interpretable latent directions. Facial attribute
editing task faces the challenges of precise attribute editing with
controllable strength and preservation of the other attributes of an image. For
this goal, we design the attribute editing by latent space factorization and
for each attribute, we learn a linear direction that is orthogonal to the
others. The other component is the controllable strength of the change, a
scalar value. In our framework, this scalar can be either sampled or encoded
from a reference image by projection. Our work is inspired by the latent space
factorization works of fixed pretrained GANs. However, while those models
cannot be trained end-to-end and struggle to edit encoded images precisely,
VecGAN is end-to-end trained for image translation task and successful at
editing an attribute while preserving the others. Our extensive experiments
show that VecGAN achieves significant improvements over state-of-the-arts for
both local and global edits.Comment: ECCV 202
Hanri Benazus ve Yahudi azınlıkların Türkiye Cumhuriyeti'ne uyum süreci
Ankara : İhsan Doğramacı Bilkent Üniversitesi İktisadi, İdari ve Sosyal Bilimler Fakültesi, Tarih Bölümü, 2018.This work is a student project of the Department of History, Faculty of Economics, Administrative and Social Sciences, İhsan Doğramacı Bilkent University.The History of Turkey course (HIST200) is a requirement for all Bilkent undergraduates. It is designed to encourage students to work in groups on projects concerning any topic of their choice that relates to the history of Turkey. It is designed as an interactive course with an emphasis on research and the objective of investigating events, chronologically short historical periods, as well as historic representations. Students from all departments prepare and present final projects for examination by a committee, with 10 projects chosen to receive awards.Includes bibliographical references (pages 21-22).by Yasemin Başaran Doğan