309,627 research outputs found
LayoutDM: Discrete Diffusion Model for Controllable Layout Generation
Controllable layout generation aims at synthesizing plausible arrangement of
element bounding boxes with optional constraints, such as type or position of a
specific element. In this work, we try to solve a broad range of layout
generation tasks in a single model that is based on discrete state-space
diffusion models. Our model, named LayoutDM, naturally handles the structured
layout data in the discrete representation and learns to progressively infer a
noiseless layout from the initial input, where we model the layout corruption
process by modality-wise discrete diffusion. For conditional generation, we
propose to inject layout constraints in the form of masking or logit adjustment
during inference. We show in the experiments that our LayoutDM successfully
generates high-quality layouts and outperforms both task-specific and
task-agnostic baselines on several layout tasks.Comment: To be published in CVPR2023, project page:
https://cyberagentailab.github.io/layout-dm
GBLD: A Formal Model for Layout Description and Generation
In this paper, we introduce a layout description and generation model, GBLD, based on the notions and elements of L-systems and context-free grammars. Our layout model is compatible with geometric layout formats, such as GDSII or CIF. However, it is more powerful and more concise. The layouts represented by GBLD are sizeable, parameterised, and can incorporate design rules. GBLD has the potential to be used as a format for analog layout templates, analog layout retargeting, as well as the final layout format
Design of an analog/digital truly random number generator
An analog-digital system is presented for the generation of truly random (aperiodic) digital sequences. This model is based on a very simple piecewise-linear discrete map which is suitable for implementation using monolithic analog sampled-data techniques. Simulation results are given illustrating the optimum choice of the model parameters. Circuit implementations are reported for the discrete map using both switched-capacitor (SC) and switched-current (SI) techniques. The layout of a SI prototype in a 3-μm n-well double-polysilicon double-metal technology is included
A Parse-Then-Place Approach for Generating Graphic Layouts from Textual Descriptions
Creating layouts is a fundamental step in graphic design. In this work, we
propose to use text as the guidance to create graphic layouts, i.e.,
Text-to-Layout, aiming to lower the design barriers. Text-to-Layout is a
challenging task, because it needs to consider the implicit, combined, and
incomplete layout constraints from text, each of which has not been studied in
previous work. To address this, we present a two-stage approach, named
parse-then-place. The approach introduces an intermediate representation (IR)
between text and layout to represent diverse layout constraints. With IR,
Text-to-Layout is decomposed into a parse stage and a place stage. The parse
stage takes a textual description as input and generates an IR, in which the
implicit constraints from the text are transformed into explicit ones. The
place stage generates layouts based on the IR. To model combined and incomplete
constraints, we use a Transformer-based layout generation model and carefully
design a way to represent constraints and layouts as sequences. Besides, we
adopt the pretrain-then-finetune strategy to boost the performance of the
layout generation model with large-scale unlabeled layouts. To evaluate our
approach, we construct two Text-to-Layout datasets and conduct experiments on
them. Quantitative results, qualitative analysis, and user studies demonstrate
the effectiveness of our approach.Comment: Accepted by ICCV202
Dolfin: Diffusion Layout Transformers without Autoencoder
In this paper, we introduce a novel generative model, Diffusion Layout
Transformers without Autoencoder (Dolfin), which significantly improves the
modeling capability with reduced complexity compared to existing methods.
Dolfin employs a Transformer-based diffusion process to model layout
generation. In addition to an efficient bi-directional (non-causal joint)
sequence representation, we further propose an autoregressive diffusion model
(Dolfin-AR) that is especially adept at capturing rich semantic correlations
for the neighboring objects, such as alignment, size, and overlap. When
evaluated against standard generative layout benchmarks, Dolfin notably
improves performance across various metrics (fid, alignment, overlap, MaxIoU
and DocSim scores), enhancing transparency and interoperability in the process.
Moreover, Dolfin's applications extend beyond layout generation, making it
suitable for modeling geometric structures, such as line segments. Our
experiments present both qualitative and quantitative results to demonstrate
the advantages of Dolfin
End-to-End Optimization of Scene Layout
We propose an end-to-end variational generative model for scene layout
synthesis conditioned on scene graphs. Unlike unconditional scene layout
generation, we use scene graphs as an abstract but general representation to
guide the synthesis of diverse scene layouts that satisfy relationships
included in the scene graph. This gives rise to more flexible control over the
synthesis process, allowing various forms of inputs such as scene layouts
extracted from sentences or inferred from a single color image. Using our
conditional layout synthesizer, we can generate various layouts that share the
same structure of the input example. In addition to this conditional generation
design, we also integrate a differentiable rendering module that enables layout
refinement using only 2D projections of the scene. Given a depth and a
semantics map, the differentiable rendering module enables optimizing over the
synthesized layout to fit the given input in an analysis-by-synthesis fashion.
Experiments suggest that our model achieves higher accuracy and diversity in
conditional scene synthesis and allows exemplar-based scene generation from
various input forms.Comment: CVPR 2020 (Oral). Project page: http://3dsln.csail.mit.edu
- …