4 research outputs found

    Fruit Shop Tool: Fruit Classification and Recognition using Deep Learning

    Get PDF
    Fruit image classification and recognition is a challenging application of computer vision. The computer vision system is used to recognize a fruit based on artificial neural networks. Deep neural network is widely used for various classification problems. In this paper Convolutional Neural Network (CNN) is used to recognize the fruits. The dataset contains 1877 images of ten categories which are used for the experimental purpose. CNN is constructed with sixteen layers which are used to extract the features from images and Support Vector Machine (SVM) classifier is used for classification. The proposed system has the classification accuracy of 99.2% and the recognition accuracy of 99.02%

    Occluded object detection using deep learning by image synthesis with hands

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ์ „๊ธฐยท์ •๋ณด๊ณตํ•™๋ถ€, 2021. 2. ์ดํ˜์žฌ.์ตœ๊ทผ ๋”ฅ ๋Ÿฌ๋‹์„ ์ด์šฉํ•œ ๋ฌผ์ฒด ์ธ์‹์„ ํ†ตํ•œ ๋‹ค์–‘ํ•œ ์‘์šฉ ์‹œ์Šคํ…œ ๊ฐœ๋ฐœ์ด ํ™œ๋ฐœํ•˜๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์Šค๋งˆํŠธ ๋ƒ‰์žฅ๊ณ  ์žฌ๊ณ  ๊ด€๋ฆฌ๋ฅผ ๋ชฉ์ ์œผ๋กœ ํ•˜์—ฌ, ๋ƒ‰์žฅ๊ณ  ๋‚ด์™ธ๋ถ€๋กœ ์ด๋™ํ•˜๋Š” ์† ์•ˆ์˜ ์ƒํ’ˆ ์ธ์‹๋ฅ ์„ ์ฆ๊ฐ€์‹œํ‚ค๊ธฐ ์œ„ํ•œ ํšจ์œจ์ ์ธ ํ•™์Šต๋ฐ์ดํ„ฐ ์ƒ์„ฑ ๋ฐฉ๋ฒ•์„ ์ œ์•ˆํ•œ๋‹ค. ๋ƒ‰์žฅ๊ณ  ๋‚ด๋ถ€์˜ ๊ณ ์ •๋œ ์นด๋ฉ”๋ผ ๊ด€์ ์—์„œ ์†์— ์˜ํ•ด ์˜ฎ๊ฒจ์ง€๋Š” ์ƒํ’ˆ์„ ์ธ์‹ํ•  ๋•Œ, ๋ช‡ ๊ฐ€์ง€ ์–ด๋ ค์›€์ด ์žˆ๋‹ค. ์ด๋™ํ•˜๋Š” ์œ„์น˜์— ๋”ฐ๋ฅธ ์ƒํ’ˆ์˜ ํฌ๊ธฐ, ๋ชจ์–‘ ๋“ฑ์˜ ๋‹ค์–‘์„ฑ์ด ์ƒ๊ธฐ๋ฉฐ, ์ƒํ’ˆ์ด ํ•ญ์ƒ ์†์— ์žกํ˜€์ ธ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ๊ฐ€๋ ค์ง(Occlusion)์ด ๋ฐœ์ƒํ•œ๋‹ค. ๋ƒ‰์žฅ๊ณ  ๋‚ด๋ถ€์™€ ์™ธ๋ถ€ ํ™˜๊ฒฝ์ด ๋‹ค๋ฅด๊ธฐ ๋•Œ๋ฌธ์— ๋ฐฐ๊ฒฝ์˜ ๋‹ค์–‘์„ฑ ๋ฌธ์ œ๋„ ์ƒ๊ธด๋‹ค. ์ด๋Ÿฌํ•œ ๋ชจ๋“  ๊ฒฝ์šฐ๋ฅผ ๊ณ ๋ คํ•˜์—ฌ ์ˆ˜์ž‘์—…์œผ๋กœ ํ•™์Šต๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๊ฒƒ์€ ๋„ˆ๋ฌด ๋งŽ์€ ์‹œ๊ฐ„๊ณผ ๋…ธ๋™๋ ฅ์ด ์†Œ๋ชจ๋œ๋‹ค. ํ•˜์ง€๋งŒ, ๋ฐฐ๊ฒฝ์ด ์ œ๊ฑฐ๋œ ์† ์ด๋ฏธ์ง€๋ฅผ ๋ฌผ์ฒด์™€ ํ•ฉ์„ฑํ•œ ํ›„, ๋‹ค์–‘ํ•œ ๋ฐฐ๊ฒฝ์„ ์ž…ํ˜€์„œ ์ƒํ’ˆ์— ๋Œ€ํ•œ ๋ผ๋ฒจ๋ง ๋ฐ์ดํ„ฐ๋ฅผ ์ž๋™์œผ๋กœ ์ƒ์„ฑํ•œ๋‹ค๋ฉด, ๋งŽ์€ ์‹œ๊ฐ„๊ณผ ๋…ธ๋™๋ ฅ ์†Œ๋ชจ ์—†์ด๋„ ์œ„์˜ ๋ฌธ์ œ๋ฅผ ํ•ด๊ฒฐํ•  ์ˆ˜ ์žˆ๋‹ค. ์ˆ˜์ž‘์—…์œผ๋กœ ๋ผ๋ฒจ๋ง์„ ํ•œ ์†Œ๋Ÿ‰์˜ ํ•™์Šต๋ฐ์ดํ„ฐ์˜ ๊ฒฝ์šฐ, 52.03%์˜ mean Average Precision(mAP)๋ฅผ ๋ณด์ธ๋‹ค. ์šฐ๋ฆฌ๊ฐ€ ์ œ์•ˆํ•˜๋Š” ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜์—ฌ ํ•™์Šต๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•œ ํ›„, ์ˆ˜์ž‘์—…์œผ๋กœ ๋ผ๋ฒจ๋งํ•œ ์†Œ๋Ÿ‰์˜ ํ•™์Šต๋ฐ์ดํ„ฐ์™€ ํ•ฉ์ณ์„œ ํ•™์Šตํ•œ ๊ฒฐ๊ณผ, ์ˆ˜์ž‘์—…์œผ๋กœ ๋ผ๋ฒจ๋งํ•˜๋Š” ๋…ธ๋™ ์—†์ด๋„ 87.29%์˜ mAP๋ฅผ ๋‹ฌ์„ฑํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. ๋˜ํ•œ, ์† ์ด๋ฏธ์ง€ ์—†์ด ํ•ฉ์„ฑํ•œ ๊ฒฝ์šฐ์™€ ๋น„๊ตํ–ˆ์„ ๋•Œ, 5.47%์˜ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ๋ณด์—ฌ์ฃผ์—ˆ์œผ๋ฉฐ, Z. Zhong et al์ด ์ œ์•ˆํ•œ random erasing ๊ธฐ๋ฒ•๊ณผ ๋น„๊ตํ•˜์—ฌ 7.21%์˜ ์„ฑ๋Šฅ ์ฆ๊ฐ€๋ฅผ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค[22].Recently, various application systems have been developed for object detection based on depp learning. For the purpose of inventory management of smart refrigerators, this paper proposes an efficient method for generating training datasets that improves detection rate of products held by the hand when being placed in or removed from a refrigerator. In this case, there are several issues for detecting a product. The size and appearance of the target product change as the position of the product. Occlusion problems occur since products are partially covered by the hand. The difference between the interior and exterior backgrounds of the refrigerator makes it difficult to recognize objects. Considering above problems, generating the datasets manually requires significant time and effort. To solve this problem, we create synthetic images and labels by merging images of objects, images of hands, and various backgrounds. The detector learned with a few manually labeled dataset gives 52.03% mAP. When adding the dataset of synthetic images with hands to a few manually labeled dataset, the detector obtains improved mAP of 87.29% without the labor of manual labeling. We achieved a higher performance by 5.47% compared to image synthesis without hands, and by 7.21% compared to Z. Zhong et als random erasing technique.์ œ 1 ์žฅ ์„œ ๋ก  1 1.1 ์—ฐ๊ตฌ์˜ ๋ฐฐ๊ฒฝ 1 1.2 ์—ฐ๊ตฌ์˜ ๋‚ด์šฉ 3 1.3 ๋…ผ๋ฌธ ๊ตฌ์„ฑ 3 ์ œ 2 ์žฅ ๊ด€๋ จ ์—ฐ๊ตฌ 5 2.1 ๋”ฅ ๋Ÿฌ๋‹ ๋ชจ๋ธ ๊ด€๋ จ ์—ฐ๊ตฌ 5 2.2 ์ด๋ฏธ์ง€ ํ•ฉ์„ฑ ๊ด€๋ จ ์—ฐ๊ตฌ(Image synthesis) 7 2.3 ์ž๋™ ๋ผ๋ฒจ๋ง ๊ด€๋ จ ์—ฐ๊ตฌ 8 2.4 ๊ฐ€๋ ค์ง ๊ด€๋ จ ์—ฐ๊ตฌ(Occlusion) 9 ์ œ 3 ์žฅ ์ œ์•ˆ ๋ฐฉ๋ฒ• ๋ฐ ๊ตฌํ˜„ 12 3.1 ๋ฌธ์ œ์  ์ •์˜ 12 3.2 ์ œ์•ˆ ๋ฐฉ๋ฒ• 16 3.3 ํ•ฉ์„ฑ ๊ณผ์ •(Synthesis process) 19 ์ œ 4 ์žฅ ์‹ค ํ—˜ 28 4.1 ์‹คํ—˜ ํ™˜๊ฒฝ ๋ฐ ๊ตฌ์„ฑ 28 4.2 ์‹คํ—˜ ๊ฒฐ๊ณผ 34 ์ œ 5 ์žฅ ๊ฒฐ ๋ก  42 ์ฐธ๊ณ ๋ฌธํ—Œ 44 Abstract 50Maste
    corecore