Search CORE

1,665 research outputs found

Assessing hyper parameter optimization and speedup for convolutional neural networks

Author: A.Krizhevsky
D. L.Tutorial
E.Bochinski
E.Real
J.Bergstra
J.Deng
K.He
L.Xie
N.Srivastava
S.Ioffe
T.Domhan
W. Y.Lee
Z.Zhong
Publication venue: 'IGI Global'
Publication date: 01/01/2020
Field of study

The increased processing power of graphical processing units (GPUs) and the availability of large image datasets has fostered a renewed interest in extracting semantic information from images. Promising results for complex image categorization problems have been achieved using deep learning, with neural networks comprised of many layers. Convolutional neural networks (CNN) are one such architecture which provides more opportunities for image classification. Advances in CNN enable the development of training models using large labelled image datasets, but the hyper parameters need to be specified, which is challenging and complex due to the large number of parameters. A substantial amount of computational power and processing time is required to determine the optimal hyper parameters to define a model yielding good results. This article provides a survey of the hyper parameter search and optimization methods for CNN architectures

LSBU Research Open

Crossref

ResearchOnline@GCU

Pairwise Confusion for Fine-Grained Visual Classification

Author: A Dubey
GJ Székely
J Krause
KK Singh
Maolin Liu
N Zhang
S Kullback
Y Souri
Y Zhang
Publication venue
Publication date: 25/07/2018
Field of study

Fine-Grained Visual Classification (FGVC) datasets contain small sample sizes, along with significant intra-class variation and inter-class similarity. While prior work has addressed intra-class variation using localization and segmentation techniques, inter-class similarity may also affect feature learning and reduce classification performance. In this work, we address this problem using a novel optimization procedure for the end-to-end neural network training on FGVC tasks. Our procedure, called Pairwise Confusion (PC) reduces overfitting by intentionally {introducing confusion} in the activations. With PC regularization, we obtain state-of-the-art performance on six of the most widely-used FGVC datasets and demonstrate improved localization ability. {PC} is easy to implement, does not need excessive hyperparameter tuning during training, and does not add significant overhead during test time.Comment: Camera-Ready version for ECCV 201

arXiv.org e-Print Archive

Crossref

Recommended from our members

DISEASE OF LUNG INFECTION DETECTION USING CNN MODEL -BAYESIAN OPTIMIZATION

Author: gutha poojitha
Publication venue: CSUSB ScholarWorks
Publication date: 01/12/2023
Field of study

Auscultation plays a role, in diagnosing and identifying diseases during examinations. However, it requires training and expertise, for application. This study aims to tackle this challenge by introducing a model that categorizes respiratory sounds into eight groups: URTI, Healthy, Asthma, COPD, LRTI, Bronchiectasis, Pneumonia, and Bronchiolitis. To achieve this categorization the study utilizes a Convolutional Neural Network (CNN) model that has been optimized using techniques. The dataset used in the study consists of 920 audio samples obtained from 126 patients with durations ranging from 10 to 90 seconds. Impressively, the model demonstrates a noteworthy 83% validation accuracy and an impressive 86% training accuracy, highlighting its robust and effective performance. To enhance user interaction and facilitate result visualization, the research team has developed a user-friendly interface using Flask, HTML, and CSS. This interface provides healthcare professionals and other stakeholders with the means to access and interpret the results of the experimental analysis. Overall, this research marks a significant stride in making respiratory sound analysis more accessible and accurate, thus contributing to improved disease diagnosis and patient care

CSUSB ScholarWorks

Medical Internet-of-Things Based Breast Cancer Diagnosis Using Hyperparameter-Optimized Neural Networks

Author: Damaševičius Robertas
Douglas Mychal
Maskeliunas Rytis
Misra Sanjay
Ogundokun Roseline Oluwaseun
Publication venue: 'MDPI AG'
Publication date: 01/01/2022
Field of study

In today’s healthcare setting, the accurate and timely diagnosis of breast cancer is critical for recovery and treatment in the early stages. In recent years, the Internet of Things (IoT) has experienced a transformation that allows the analysis of real-time and historical data using artificial intelligence (AI) and machine learning (ML) approaches. Medical IoT combines medical devices and AI applications with healthcare infrastructure to support medical diagnostics. The current state-of-the-art approach fails to diagnose breast cancer in its initial period, resulting in the death of most women. As a result, medical professionals and researchers are faced with a tremendous problem in early breast cancer detection. We propose a medical IoT-based diagnostic system that competently identifies malignant and benign people in an IoT environment to resolve the difficulty of identifying early-stage breast cancer. The artificial neural network (ANN) and convolutional neural network (CNN) with hyperparameter optimization are used for malignant vs. benign classification, while the Support Vector Machine (SVM) and Multilayer Perceptron (MLP) were utilized as baseline classifiers for comparison. Hyperparameters are important for machine learning algorithms since they directly control the behaviors of training algorithms and have a significant effect on the performance of machine learning models. We employ a particle swarm optimization (PSO) feature selection approach to select more satisfactory features from the breast cancer dataset to enhance the classification performance using MLP and SVM, while grid-based search was used to find the best combination of the hyperparameters of the CNN and ANN models. The Wisconsin Diagnostic Breast Cancer (WDBC) dataset was used to test the proposed approach. The proposed model got a classification accuracy of 98.5% using CNN, and 99.2% using ANN.publishedVersio

Multidisciplinary Digital Publishing Institute

HIØ Brage

NORA - Norwegian Open Research Archives