Estrategias para la generación y explotación de datos de entrenamiento sintéticos en visión artificial a nivel de píxel

Abstract

Tesis Doctoral inédita leída en la Universidad Autónoma de Madrid, Escuela Politécnica Superior, Departamento de Tecnología Electrónica y de las Comunicaciones. Fecha de Lectura: 20-03-2025Esta Tesis tiene embargado el acceso al texto completo hasta el 20-09-2026In recent years, the introduction of machine learning in computer vision methods has increased the demand for data that can be used to train them. Labeling data can be an expensive process, particularly for computer vision tasks such as semantic segmentation, where images have pixel-level labels for classification. For this reason, synthetic data plays a crucial role in the field of computer vision by offering data at scale, with automatic, perfect labels without the costs associated with data collection and labeling, enabling researchers and developers to create large-scale datasets with precise annotations, that cover a wide range of scenarios, including rare or dangerous situations that would be impractical to capture otherwise. Additionally, synthetic data helps mitigate privacy concerns, as it does not involve real individuals or sensitive information. However, synthetic data also has some notable drawbacks: A reduced variability compared to the real world, along with inherent differences between synthetic and real images, can substantially reduce the transferability and generalization of models trained exclusively on synthetic data. To address these challenges, this thesis explores different approaches to synthetic data generation and its applications in computer vision. It examines different techniques for creating synthetic datasets, starting with hand-crafted algorithms to generate synthetic data for semantic segmentation. Then, it explores how to adapt and exploit different simulation tools with multiple purposes: from generating a novel semantic segmentation dataset to discussing alternate methods to generate data that can produce more general models and finally introducing a novel simulation tool designed from scratch for generating spacecraft imagery with navigation tasks in mind. This work closes with a proposal on how to effectively utilize generative artificial intelligence to complete datasets for training computer vision models with novel, previously unseen classes, and investigates how these methods can be integrated into existing workflows to enhance model performance and generalizationThe work was supported by the Spanish Government(HVD project PID2021-125051OB-I00 and SEGA-CV project TED2021-131643A-I00

Similar works

Full text

thumbnail-image

Biblos-e Archivo

redirect
Last time updated on 10/08/2025

This paper was published in Biblos-e Archivo.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.