Tesis Doctoral inédita leída en la Universidad Autónoma de Madrid, Escuela Politécnica Superior, Departamento de Tecnología Electrónica y de las Comunicaciones. Fecha de Lectura: 20-03-2025Esta Tesis tiene embargado el acceso al texto completo hasta el 20-09-2026In recent years, the introduction of machine learning in computer vision methods has increased
the demand for data that can be used to train them. Labeling data can be an expensive process,
particularly for computer vision tasks such as semantic segmentation, where images have pixel-level
labels for classification. For this reason, synthetic data plays a crucial role in the field of computer vision
by offering data at scale, with automatic, perfect labels without the costs associated with data collection
and labeling, enabling researchers and developers to create large-scale datasets with precise annotations,
that cover a wide range of scenarios, including rare or dangerous situations that would be impractical to
capture otherwise. Additionally, synthetic data helps mitigate privacy concerns, as it does not involve
real individuals or sensitive information.
However, synthetic data also has some notable drawbacks: A reduced variability compared to the
real world, along with inherent differences between synthetic and real images, can substantially reduce
the transferability and generalization of models trained exclusively on synthetic data.
To address these challenges, this thesis explores different approaches to synthetic data generation
and its applications in computer vision. It examines different techniques for creating synthetic datasets,
starting with hand-crafted algorithms to generate synthetic data for semantic segmentation. Then, it
explores how to adapt and exploit different simulation tools with multiple purposes: from generating a
novel semantic segmentation dataset to discussing alternate methods to generate data that can produce
more general models and finally introducing a novel simulation tool designed from scratch for generating
spacecraft imagery with navigation tasks in mind. This work closes with a proposal on how to effectively
utilize generative artificial intelligence to complete datasets for training computer vision models with
novel, previously unseen classes, and investigates how these methods can be integrated into existing
workflows to enhance model performance and generalizationThe work was supported by the Spanish Government(HVD project PID2021-125051OB-I00 and SEGA-CV project TED2021-131643A-I00
Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.