Search CORE

9 research outputs found

Improving Classification in Single and Multi-View Images

Author: Salman Hadi Kanaan Hadi
Publication venue: ScholarWorks@UARK
Publication date: 01/05/2023
Field of study

Image classification is a sub-field of computer vision that focuses on identifying objects within digital images. In order to improve image classification we must address the following areas of improvement: 1) Single and Multi-View data quality using data pre-processing techniques. 2) Enhancing deep feature learning to extract alternative representation of the data. 3) Improving decision or prediction of labels. This dissertation presents a series of four published papers that explore different improvements of image classification. In our first paper, we explore the Siamese network architecture to create a Convolution Neural Network based similarity metric. We learn the priority features that differentiate two given input images. The metric proposed achieves state-of-the-art Fβ measure. In our second paper, we explore multi-view data classification. We investigate the application of Generative Adversarial Networks GANs on Multi-view data image classification and few-shot learning. Experimental results show that our method outperforms state-of-the-art research. In our third paper, we take on the challenge of improving ResNet backbone model. For this task, we focus on improving channel attention mechanisms. We utilize Discrete Wavelet Transform compression to address the channel representation problem. Experimental results on ImageNet shows that our method outperforms baseline SENet-34 and SOTA FcaNet-34 at no extra computational cost. In our fourth paper, we investigate further the potential of orthogonalization of filters for extraction of diverse information for channel attention. We prove that using only random constant orthogonal filters is sufficient enough to achieve good channel attention. We test our proposed method using ImageNet, Places365, and Birds datasets for image classification, MS-COCO for object detection, and instance segmentation tasks. Our method outperforms FcaNet, and WaveNet and achieves the state-of-the-art results

ScholarWorks@UARK

Improving Classification in Single and Multi-View Images

Author: Salman Hadi Kanaan Hadi
Publication venue: ScholarWorks@UARK
Publication date: 01/05/2023
Field of study

UARK (University of Arkansas )

Towards Robust, Interpretable and Scalable Visual Representations

Author: Li Ang
Publication venue
Publication date: 01/01/2017
Field of study

Visual representation is one of the central problems in computer vision. The essential problem is to develop a unified representation that effectively encodes both visual appearance and spatial information so that it can be easily applied to various vision applications such as face recognition, image matching, and multimodal image retrieval. Along with the history of computer vision research, there are four major levels of visual representations, i.e., geometric, low-level, mid-level and high-level. The dissertation comprises four works studying effective visual representations in the four different levels. Multiple approaches are proposed with the aim of improving the robustness, interpretability, and scalability of visual representations. Geometric features are effective in matching images under spatial transformations however their performance is sensitive to the noises. In the first part, we propose to model the uncertainty of geometric representation based on line segments and propose to equip these features with uncertainty modeling so that they could be robustly applied in the image-based geolocation application. We study in the second part the robustness of feature encoding to noisy keypoints. We show that traditional feature encoding is sensitive to background or noisy features. We propose the Selective Encoding framework which learns the relevance distribution of each codeword and incorporate such information with the original codebook model. Our approach is more robust to the localization errors or uncertainty in the active face authentication application. The mission of visual understanding is to express and describe the image content which is essentially relating images to human language. That typically involves finding a common representation inferable from both domains of data. In the third part, we propose a framework to extract a mid-level spatial representation directly from language descriptions and match such spatial layouts to the detected object bounding boxes for retrieving indoor scene images from user text queries. Modern high-level visual features are typically learned from supervised datasets, whose scalability is largely limited by the requirement of dedicated human annotation. In the last part, we propose to learn visual representations from large-scale weakly supervised data for a large number of natural language-based concepts, i.e., n-gram phrases. We propose the differentiable Jelinek-Mercer smoothing loss and train a deep convolutional neural network from images with associated user comments. We show that the learned model can predict a large number of phrase-based concepts from images, can be effectively applied to image-caption applications and transfers well to other visual recognition datasets

Digital Repository at the University of Maryland

Recent Developments in Smart Healthcare

Author: Tie Qiu (Ed.)
Wenbing Zhao (Ed.)
Xiong Luo (Ed.)
Publication venue: MDPI - Multidisciplinary Digital Publishing Institute
Publication date: 01/01/2018
Field of study

Medicine is undergoing a sector-wide transformation thanks to the advances in computing and networking technologies. Healthcare is changing from reactive and hospital-centered to preventive and personalized, from disease focused to well-being centered. In essence, the healthcare systems, as well as fundamental medicine research, are becoming smarter. We anticipate significant improvements in areas ranging from molecular genomics and proteomics to decision support for healthcare professionals through big data analytics, to support behavior changes through technology-enabled self-management, and social and motivational support. Furthermore, with smart technologies, healthcare delivery could also be made more efficient, higher quality, and lower cost. In this special issue, we received a total 45 submissions and accepted 19 outstanding papers that roughly span across several interesting topics on smart healthcare, including public health, health information technology (Health IT), and smart medicine

Directory of Open Access Books (DOAB)

Understanding Quantum Technologies 2022

Author: Ezratty Olivier
Publication venue
Publication date: 27/10/2022
Field of study

Understanding Quantum Technologies 2022 is a creative-commons ebook that provides a unique 360 degrees overview of quantum technologies from science and technology to geopolitical and societal issues. It covers quantum physics history, quantum physics 101, gate-based quantum computing, quantum computing engineering (including quantum error corrections and quantum computing energetics), quantum computing hardware (all qubit types, including quantum annealing and quantum simulation paradigms, history, science, research, implementation and vendors), quantum enabling technologies (cryogenics, control electronics, photonics, components fabs, raw materials), quantum computing algorithms, software development tools and use cases, unconventional computing (potential alternatives to quantum and classical computing), quantum telecommunications and cryptography, quantum sensing, quantum technologies around the world, quantum technologies societal impact and even quantum fake sciences. The main audience are computer science engineers, developers and IT specialists as well as quantum scientists and students who want to acquire a global view of how quantum technologies work, and particularly quantum computing. This version is an extensive update to the 2021 edition published in October 2021.Comment: 1132 pages, 920 figures, Letter forma

arXiv.org e-Print Archive

Recommended from our members

Investigations into distributed artificial intelligence techniques for design with applications to instruments

Author: Khoui H.
Publication venue
Publication date
Field of study

This Thesis is concerned with the application of Distributed Artificial Intelligence techniques for the design of instruments. In this thesis it is argued that, the early stages of the design process can be automated by the use of Distributed Artificial Intelligence systems that are contractual in their communication and control. A Distributed Problem Solver is proposed, and implemented, for the purpose of conceptual design of instruments. The system consists of a community of knowledge-based agents, with expertise on design of instrument sub-systems. The agents, use a task-sharing form of cooperation for dynamic problem decomposition and sub-problem distribution phases of the design problem solving. New design concepts are generated by suitable combination of partial solutions. To incorporate learning capabilities into our Distributed Problem Solver, we have proposed the use of Classifier System Modules as inductive knowledge-based agents. The application of Classifier Systems and Genetic Algorithms in the context of a number of concrete instrument design problems is investigated. A normalized formulation is applied to the multi-modal design optimization of a Linear Variable Differential Transformer. A number of important proposals for the application of classifier systems to the design automation of instruments are detailed. In particular, an implemented classifier system is used for the purpose of design heuristic extraction for corrugated diaphragms, using a set of dimensionless curves. In this application, the classifier system has produced a set of useful design heuristics by direct interactions with the specified mathematical model

City Research Online

Mental-State Estimation, 1987

Author: Comstock J. Raymond, Jr.
Publication venue
Publication date
Field of study

Reports on the measurement and evaluation of the physiological and mental state of operators are presented

NASA Technical Reports Server

Shortest Route at Dynamic Location with Node Combination-Dijkstra Algorithm

Author: Fitro Achmad
Kusumaningrum Retno
Suryono Suryono
Publication venue
Publication date: 01/10/2018
Field of study

Abstract— Online transportation has become a basic requirement of the general public in support of all activities to go to work, school or vacation to the sights. Public transportation services compete to provide the best service so that consumers feel comfortable using the services offered, so that all activities are noticed, one of them is the search for the shortest route in picking the buyer or delivering to the destination. Node Combination method can minimize memory usage and this methode is more optimal when compared to A* and Ant Colony in the shortest route search like Dijkstra algorithm, but can’t store the history node that has been passed. Therefore, using node combination algorithm is very good in searching the shortest distance is not the shortest route. This paper is structured to modify the node combination algorithm to solve the problem of finding the shortest route at the dynamic location obtained from the transport fleet by displaying the nodes that have the shortest distance and will be implemented in the geographic information system in the form of map to facilitate the use of the system. Keywords— Shortest Path, Algorithm Dijkstra, Node Combination, Dynamic Location (key words

Politeknik NSC Surabay Repository

A hierarchical object-based approach for urban land-use classification from remote sensing data

Author: Zhan Q.
Publication venue: S.n.
Publication date
Field of study

Wageningen University & Research Publications