1,525 research outputs found
Advancements in Image Classification using Convolutional Neural Network
Convolutional Neural Network (CNN) is the state-of-the-art for image
classification task. Here we have briefly discussed different components of
CNN. In this paper, We have explained different CNN architectures for image
classification. Through this paper, we have shown advancements in CNN from
LeNet-5 to latest SENet model. We have discussed the model description and
training details of each model. We have also drawn a comparison among those
models.Comment: 9 pages, 15 figures, 3 Tables. Submitted to 2018 Fourth International
Conference on Research in Computational Intelligence and Communication
Networks(ICRCICN 2018
Advanced Capsule Networks via Context Awareness
Capsule Networks (CN) offer new architectures for Deep Learning (DL)
community. Though its effectiveness has been demonstrated in MNIST and
smallNORB datasets, the networks still face challenges in other datasets for
images with distinct contexts. In this research, we improve the design of CN
(Vector version) namely we expand more Pooling layers to filter image
backgrounds and increase Reconstruction layers to make better image
restoration. Additionally, we perform experiments to compare accuracy and speed
of CN versus DL models. In DL models, we utilize Inception V3 and DenseNet V201
for powerful computers besides NASNet, MobileNet V1 and MobileNet V2 for small
and embedded devices. We evaluate our models on a fingerspelling alphabet
dataset from American Sign Language (ASL). The results show that CNs perform
comparably to DL models while dramatically reducing training time. We also make
a demonstration and give a link for the purpose of illustration.Comment: 12 page
Using Capsule Networks for Image and Speech Recognition Problems
abstract: In recent years, conventional convolutional neural network (CNN) has achieved outstanding performance in image and speech processing applications. Unfortunately, the pooling operation in CNN ignores important spatial information which is an important attribute in many applications. The recently proposed capsule network retains spatial information and improves the capabilities of traditional CNN. It uses capsules to describe features in multiple dimensions and dynamic routing to increase the statistical stability of the network.
In this work, we first use capsule network for overlapping digit recognition problem. We evaluate the performance of the network with respect to recognition accuracy, convergence and training time per epoch. We show that capsule network achieves higher accuracy when training set size is small. When training set size is larger, capsule network and conventional CNN have comparable recognition accuracy. The training time per epoch for capsule network is longer than conventional CNN because of the dynamic routing algorithm. An analysis of the GPU timing shows that adjusting the capsule structure can help decrease the time complexity of the dynamic routing algorithm significantly.
Next, we design a capsule network for speech recognition, specifically, overlapping word recognition. We use both capsule network and conventional CNN to recognize 2 overlapping words in speech files created from 5 word classes. We show that capsule network achieves a considerably higher recognition accuracy (96.92%) compared to conventional CNN (85.19%). Our results show that capsule network recognizes overlapping word by recognizing each individual word in the speech. We also verify the scalability of capsule network by increasing the number of word classes from 5 to 10. Capsule network still shows a high recognition accuracy of 95.42% in case of 10 words while the accuracy of conventional CNN decreases sharply to 73.18%.Dissertation/ThesisMasters Thesis Electrical Engineering 201
Brain Tumor Classification Using Hit-or-Miss Capsule Layers
The job of classifying or annotating brain tumors from MRI images can be time-consuming and difficult, even for radiologists. To increase the survival chances of a patient, medical practitioners desire a means for quick and accurate diagnosis. While datasets like CIFAR, ImageNet, and SVHN have tens of thousands, hundreds of thousands, or millions of samples, an MRI dataset may not have the same luxury of receiving accurate labels for each image containing a tumor. This work covers three models that classify brain tumors using a combination of convolutional neural networks and of the concept of capsule layers. Each network utilizes a hit-or-miss capsule layer to relate classes to capsule vectors in a one-to-one relationship. Additionally, this work proposes the use of deep active learning for picking the samples that can give the best model, PSP-HitNet, the most information when adding mini-batches of unlabeled data into the master, labeled training dataset. By using an uncertainty estimated querying strategy, PSP-HitNet approaches the best validation accuracy possible within the first 12-24% of added data from the unlabeled dataset, whereas random choosing takes until 30-50% of the unlabeled to reach the same performance
- …