17 research outputs found

    Real-Time Identification of Artifacts: Synthetic Data for AI Model

    Get PDF
    The collections represent the constitutive element and the raison d'รชtre of each museum. Their management , care and dissemination are therefore a task of primary importance for every museum. Applying new Artificial Intelligence technologies in this area could lead to new initiatives. However, the development of certain tools requires structured and labeled datasets for the training phases which are not always easily available. The proposed contribution is within the domain of the construction of specific datasets with low budget tools and explores the results of a first step in this direction by testing algorithms for the recognition and labeling of heritage objects. The developed workflow is part of a first prototype that could be used both in heritage dissemination or gamification applications, and for use in heritage research tools

    Occluded object detection using deep learning by image synthesis with hands

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ •๋ณด๊ณตํ•™๋ถ€, 2021. 2. ์ดํ˜์žฌ.์ตœ๊ทผ ๋”ฅ ๋Ÿฌ๋‹์„ ์ด์šฉํ•œ ๋ฌผ์ฒด ์ธ์‹์„ ํ†ตํ•œ ๋‹ค์–‘ํ•œ ์‘์šฉ ์‹œ์Šคํ…œ ๊ฐœ๋ฐœ์ด ํ™œ๋ฐœํ•˜๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์Šค๋งˆํŠธ ๋ƒ‰์žฅ๊ณ  ์žฌ๊ณ  ๊ด€๋ฆฌ๋ฅผ ๋ชฉ์ ์œผ๋กœ ํ•˜์—ฌ, ๋ƒ‰์žฅ๊ณ  ๋‚ด์™ธ๋ถ€๋กœ ์ด๋™ํ•˜๋Š” ์† ์•ˆ์˜ ์ƒํ’ˆ ์ธ์‹๋ฅ ์„ ์ฆ๊ฐ€์‹œํ‚ค๊ธฐ ์œ„ํ•œ ํšจ์œจ์ ์ธ ํ•™์Šต๋ฐ์ดํ„ฐ ์ƒ์„ฑ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค. ๋ƒ‰์žฅ๊ณ  ๋‚ด๋ถ€์˜ ๊ณ ์ •๋œ ์นด๋ฉ”๋ผ ๊ด€์ ์—์„œ ์†์— ์˜ํ•ด ์˜ฎ๊ฒจ์ง€๋Š” ์ƒํ’ˆ์„ ์ธ์‹ํ•  ๋•Œ, ๋ช‡ ๊ฐ€์ง€ ์–ด๋ ค์›€์ด ์žˆ๋‹ค. ์ด๋™ํ•˜๋Š” ์œ„์น˜์— ๋”ฐ๋ฅธ ์ƒํ’ˆ์˜ ํฌ๊ธฐ, ๋ชจ์–‘ ๋“ฑ์˜ ๋‹ค์–‘์„ฑ์ด ์ƒ๊ธฐ๋ฉฐ, ์ƒํ’ˆ์ด ํ•ญ์ƒ ์†์— ์žกํ˜€์ ธ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ๊ฐ€๋ ค์ง(Occlusion)์ด ๋ฐœ์ƒํ•œ๋‹ค. ๋ƒ‰์žฅ๊ณ  ๋‚ด๋ถ€์™€ ์™ธ๋ถ€ ํ™˜๊ฒฝ์ด ๋‹ค๋ฅด๊ธฐ ๋•Œ๋ฌธ์— ๋ฐฐ๊ฒฝ์˜ ๋‹ค์–‘์„ฑ ๋ฌธ์ œ๋„ ์ƒ๊ธด๋‹ค. ์ด๋Ÿฌํ•œ ๋ชจ๋“  ๊ฒฝ์šฐ๋ฅผ ๊ณ ๋ คํ•˜์—ฌ ์ˆ˜์ž‘์—…์œผ๋กœ ํ•™์Šต๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๊ฒƒ์€ ๋„ˆ๋ฌด ๋งŽ์€ ์‹œ๊ฐ„๊ณผ ๋…ธ๋™๋ ฅ์ด ์†Œ๋ชจ๋œ๋‹ค. ํ•˜์ง€๋งŒ, ๋ฐฐ๊ฒฝ์ด ์ œ๊ฑฐ๋œ ์† ์ด๋ฏธ์ง€๋ฅผ ๋ฌผ์ฒด์™€ ํ•ฉ์„ฑํ•œ ํ›„, ๋‹ค์–‘ํ•œ ๋ฐฐ๊ฒฝ์„ ์ž…ํ˜€์„œ ์ƒํ’ˆ์— ๋Œ€ํ•œ ๋ผ๋ฒจ๋ง ๋ฐ์ดํ„ฐ๋ฅผ ์ž๋™์œผ๋กœ ์ƒ์„ฑํ•œ๋‹ค๋ฉด, ๋งŽ์€ ์‹œ๊ฐ„๊ณผ ๋…ธ๋™๋ ฅ ์†Œ๋ชจ ์—†์ด๋„ ์œ„์˜ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ๋‹ค. ์ˆ˜์ž‘์—…์œผ๋กœ ๋ผ๋ฒจ๋ง์„ ํ•œ ์†Œ๋Ÿ‰์˜ ํ•™์Šต๋ฐ์ดํ„ฐ์˜ ๊ฒฝ์šฐ, 52.03%์˜ mean Average Precision(mAP)๋ฅผ ๋ณด์ธ๋‹ค. ์šฐ๋ฆฌ๊ฐ€ ์ œ์•ˆํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜์—ฌ ํ•™์Šต๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•œ ํ›„, ์ˆ˜์ž‘์—…์œผ๋กœ ๋ผ๋ฒจ๋งํ•œ ์†Œ๋Ÿ‰์˜ ํ•™์Šต๋ฐ์ดํ„ฐ์™€ ํ•ฉ์ณ์„œ ํ•™์Šตํ•œ ๊ฒฐ๊ณผ, ์ˆ˜์ž‘์—…์œผ๋กœ ๋ผ๋ฒจ๋งํ•˜๋Š” ๋…ธ๋™ ์—†์ด๋„ 87.29%์˜ mAP๋ฅผ ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๋˜ํ•œ, ์† ์ด๋ฏธ์ง€ ์—†์ด ํ•ฉ์„ฑํ•œ ๊ฒฝ์šฐ์™€ ๋น„๊ตํ–ˆ์„ ๋•Œ, 5.47%์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋ฉฐ, Z. Zhong et al์ด ์ œ์•ˆํ•œ random erasing ๊ธฐ๋ฒ•๊ณผ ๋น„๊ตํ•˜์—ฌ 7.21%์˜ ์„ฑ๋Šฅ ์ฆ๊ฐ€๋ฅผ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค[22].Recently, various application systems have been developed for object detection based on depp learning. For the purpose of inventory management of smart refrigerators, this paper proposes an efficient method for generating training datasets that improves detection rate of products held by the hand when being placed in or removed from a refrigerator. In this case, there are several issues for detecting a product. The size and appearance of the target product change as the position of the product. Occlusion problems occur since products are partially covered by the hand. The difference between the interior and exterior backgrounds of the refrigerator makes it difficult to recognize objects. Considering above problems, generating the datasets manually requires significant time and effort. To solve this problem, we create synthetic images and labels by merging images of objects, images of hands, and various backgrounds. The detector learned with a few manually labeled dataset gives 52.03% mAP. When adding the dataset of synthetic images with hands to a few manually labeled dataset, the detector obtains improved mAP of 87.29% without the labor of manual labeling. We achieved a higher performance by 5.47% compared to image synthesis without hands, and by 7.21% compared to Z. Zhong et als random erasing technique.์ œ 1 ์žฅ ์„œ ๋ก  1 1.1 ์—ฐ๊ตฌ์˜ ๋ฐฐ๊ฒฝ 1 1.2 ์—ฐ๊ตฌ์˜ ๋‚ด์šฉ 3 1.3 ๋…ผ๋ฌธ ๊ตฌ์„ฑ 3 ์ œ 2 ์žฅ ๊ด€๋ จ ์—ฐ๊ตฌ 5 2.1 ๋”ฅ ๋Ÿฌ๋‹ ๋ชจ๋ธ ๊ด€๋ จ ์—ฐ๊ตฌ 5 2.2 ์ด๋ฏธ์ง€ ํ•ฉ์„ฑ ๊ด€๋ จ ์—ฐ๊ตฌ(Image synthesis) 7 2.3 ์ž๋™ ๋ผ๋ฒจ๋ง ๊ด€๋ จ ์—ฐ๊ตฌ 8 2.4 ๊ฐ€๋ ค์ง ๊ด€๋ จ ์—ฐ๊ตฌ(Occlusion) 9 ์ œ 3 ์žฅ ์ œ์•ˆ ๋ฐฉ๋ฒ• ๋ฐ ๊ตฌํ˜„ 12 3.1 ๋ฌธ์ œ์  ์ •์˜ 12 3.2 ์ œ์•ˆ ๋ฐฉ๋ฒ• 16 3.3 ํ•ฉ์„ฑ ๊ณผ์ •(Synthesis process) 19 ์ œ 4 ์žฅ ์‹ค ํ—˜ 28 4.1 ์‹คํ—˜ ํ™˜๊ฒฝ ๋ฐ ๊ตฌ์„ฑ 28 4.2 ์‹คํ—˜ ๊ฒฐ๊ณผ 34 ์ œ 5 ์žฅ ๊ฒฐ ๋ก  42 ์ฐธ๊ณ ๋ฌธํ—Œ 44 Abstract 50Maste

    Synthetic Examples Improve Generalization for Rare Classes

    Get PDF
    The ability to detect and classify rare occurrences in images has important applications - for example, counting rare and endangered species when studying biodiversity, or detecting infrequent traffic scenarios that pose a danger to self-driving cars. Few-shot learning is an open problem: current computer vision systems struggle to categorize objects they have seen only rarely during training, and collecting a sufficient number of training examples of rare events is often challenging and expensive, and sometimes outright impossible. We explore in depth an approach to this problem: complementing the few available training images with ad-hoc simulated data.Our testbed is animal species classification, which has a real-world long-tailed distribution. We present two natural world simulators, and analyze the effect of different axes of variation in simulation, such as pose, lighting, model, and simulation method, and we prescribe best practices for efficiently incorporating simulated data for real-world performance gain. Our experiments reveal that synthetic data can considerably reduce error rates for classes that are rare, that as the amount of simulated data is increased, accuracy on the target class improves, and that high variation of simulated data provides maximum performance gain

    Automated production of synthetic point clouds of truss bridges for semantic and instance segmentation using deep learning models

    Get PDF
    The cost of obtaining large volumes of bridge data with technologies like laser scanners hinders the training of deep learning models. To address this, this paper introduces a new method for creating synthetic point clouds of truss bridges and demonstrates the effectiveness of a deep learning approach for semantic and instance segmentation of these point clouds. The method generates point clouds by specifying the dimensions and components of the bridge, resulting in high variability in the generated dataset. A deep learning model is trained using the generated point clouds, which is an adapted version of JSNet. The accuracy of the results surpasses previous heuristic methods. The proposed methodology has significant implications for the development of automated inspection and monitoring systems for truss bridges. Furthermore, the success of the deep learning approach suggests its potential for semantic and instance segmentation of complex point clouds beyond truss bridges.Agencia Estatal de Investigaciรณn | Ref. PID2021-124236OB-C33Agencia Estatal de Investigaciรณn | Ref. RYC2021-033560-IUniversidade de Vigo/CISU

    An Adaptive Early Stopping Technique for DenseNet169-Based Knee Osteoarthritis Detection Model

    Get PDF
    Knee osteoarthritis (OA) detection is an important area of research in health informatics that aims to improve the accuracy of diagnosing this debilitating condition. In this paper, we investigate the ability of DenseNet169, a deep convolutional neural network architecture, for knee osteoarthritis detection using X-ray images. We focus on the use of the DenseNet169 architecture and propose an adaptive early stopping technique that utilizes gradual cross-entropy loss estimation. The proposed approach allows for the efficient selection of the optimal number of training epochs, thus preventing overfitting. To achieve the goal of this study, the adaptive early stopping mechanism that observes the validation accuracy as a threshold was designed. Then, the gradual cross-entropy (GCE) loss estimation technique was developed and integrated to the epoch training mechanism. Both adaptive early stopping and GCE were incorporated into the DenseNet169 for the OA detection model. The performance of the model was measured using several metrics including accuracy, precision, and recall. The obtained results were compared with those obtained from the existing works. The comparison shows that the proposed model outperformed the existing solutions in terms of accuracy, precision, recall, and loss performance, which indicates that the adaptive early stopping coupled with GCE improved the ability of DenseNet169 to accurately detect knee OA
    corecore