296 research outputs found

    Retinex theory for color image enhancement: A systematic review

    This paper presents a short but comprehensive review of Retinex. Retinex theory aims to explain human color perception. In addition, its derivation for modifying the reflectance components has introduced effective approaches to image contrast enhancement. This review covers the classical theory of Retinex and addresses the advanced and improved Retinex techniques proposed in the literature. The strengths and weaknesses of each technique are discussed and compared. An optimum parameter needs to be determined to define the image degradation level. Determining this parameter would help quantify the amount of adjustment in the Retinex theory. Thus, a robust framework for modifying the reflectance component of the Retinex theory can be developed to enhance the overall quality of color images.
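
    As a rough illustration of the reflectance idea mentioned in this abstract, the sketch below applies a single-scale Retinex step to a 1-D intensity signal: the illumination is estimated with a Gaussian blur and removed in the log domain. This is a generic textbook formulation, not a method taken from the reviewed paper; the function names, the `sigma` and `eps` parameters, and the edge-replication choice are all assumptions made for the example.

```python
import math

def gaussian_kernel(sigma, radius):
    """Normalized 1-D Gaussian weights over [-radius, radius]."""
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def smooth(signal, kernel):
    """Convolve a 1-D signal with the kernel, replicating edge samples."""
    radius = len(kernel) // 2
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)  # clamp index at the borders
            acc += w * signal[j]
        out.append(acc)
    return out

def single_scale_retinex(intensity, sigma=2.0, eps=1e-6):
    """Log-domain reflectance estimate: log(I) - log(blurred I)."""
    radius = int(3 * sigma)
    illumination = smooth(intensity, gaussian_kernel(sigma, radius))
    return [math.log(v + eps) - math.log(l + eps)
            for v, l in zip(intensity, illumination)]

# Hypothetical scanline: a dark region followed by a brighter one.
scanline = [20.0, 22.0, 21.0, 60.0, 62.0, 61.0, 63.0, 60.0]
reflectance = single_scale_retinex(scanline)
# The same scene under twice the light gives (almost) the same output.
reflectance_brighter = single_scale_retinex([2.0 * v for v in scanline])
```

    Because the estimate is a difference of logarithms, a global change in lighting level largely cancels out, which is the property that makes this family of methods attractive for contrast enhancement.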

    Race classification using gaussian-based weight K-nn algorithm for face recognition

    One of the greatest challenges in facial recognition systems is recognizing faces across different races and illumination conditions. Chromaticity is an essential factor in facial recognition: it indicates the intensity of the color in a pixel and can vary greatly depending on the lighting conditions. The race classification scheme proposed in this paper, a Gaussian-based weighted K-Nearest Neighbor (K-NN) classifier, is very sensitive to illumination intensity. The main idea is first to identify the minority class instances in the training data and then generalize them to a Gaussian function as a concept for the minority class, combining the K-NN algorithm with the Gaussian formula for race classification. In this paper, image processing is divided into two phases. The first is the preprocessing phase, which comprises three operations: auto contrast balance, noise reduction, and auto color balancing. The second is the face processing phase, which contains six steps: face detection, illumination normalization, feature extraction, skin segmentation, race classification, and face recognition. Two datasets are used: the FERET dataset, whose images involve illumination variations, and the Caltech dataset, whose images contain noise.
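
    One common way to combine K-NN with a Gaussian function is to let each neighbor vote with a weight that decays with its distance from the query. The sketch below shows that generic distance-weighted variant; it is not necessarily the scheme used in the paper, and the function name, the `sigma` bandwidth, the toy 2-D features, and the labels "A" and "B" are all hypothetical.

```python
import math
from collections import defaultdict

def gaussian_weighted_knn(train, query, k=3, sigma=1.0):
    """Classify `query` by Gaussian-weighted votes of its k nearest neighbors.

    `train` is a list of (feature_vector, label) pairs.
    """
    # Sort training samples by Euclidean distance to the query.
    neighbors = sorted((math.dist(x, query), label) for x, label in train)
    votes = defaultdict(float)
    for distance, label in neighbors[:k]:
        # Closer neighbors contribute more to their class score.
        votes[label] += math.exp(-(distance ** 2) / (2.0 * sigma ** 2))
    return max(votes, key=votes.get)

# Toy data: class "A" is the minority (2 samples), class "B" has 3 samples.
train = [
    ((0.0, 0.0), "A"), ((0.2, 0.1), "A"),
    ((1.0, 1.0), "B"), ((1.1, 0.9), "B"), ((0.9, 1.2), "B"),
]
near_a = gaussian_weighted_knn(train, (0.3, 0.3), k=5, sigma=0.5)
near_b = gaussian_weighted_knn(train, (1.0, 1.0), k=5, sigma=0.5)
```

    With `k=5`, an unweighted majority vote would always return "B" on this data, since "B" has more samples. The Gaussian weighting lets the two nearby "A" samples outvote the three distant "B" samples for the first query.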

    Learning as a Nonlinear Line of Attraction for Pattern Association, Classification and Recognition

    This dissertation presents the development of a mathematical model for learning a nonlinear line of attraction, in contrast to the conventional recurrent neural network model in which the memory is stored in an attractive fixed point at a discrete location in state space. A nonlinear line of attraction is the encapsulation of attractive fixed points scattered in state space as an attractive nonlinear line, describing patterns with similar characteristics as a family of patterns. It is usually of prime importance to guarantee the convergence of the dynamics of the recurrent network for associative learning and recall. We propose to alter this picture. That is, if the brain remembers by converging to the state representing familiar patterns, it should also diverge from such states when presented with an unknown encoded representation of a visual image. The conception of the dynamics of the nonlinear line attractor network to operate between stable and unstable states is the second contribution of this dissertation research. These criteria can be used to circumvent the plasticity-stability dilemma by using the unstable state as an indicator to create a new line for an unfamiliar pattern. This novel learning strategy utilizes stability (convergence) and instability (divergence) criteria of the designed dynamics to induce self-organizing behavior. The self-organizing behavior of the nonlinear line attractor model can manifest complex dynamics in an unsupervised manner. The third contribution of this dissertation is the introduction of the concept of a manifold of color perception. The fourth contribution is the development of a nonlinear dimensionality reduction technique that embeds a set of related observations into a low-dimensional space using the result attained by the learned memory matrices of the nonlinear line attractor network. The development of a system for affective state computation is also presented. This system is capable of extracting the user's mental state in real time using a low-cost computer. It is successfully interfaced with an advanced learning environment for human-computer interaction.

    Enhanced Augmented Reality Framework for Sports Entertainment Applications

    Augmented Reality (AR) superimposes virtual information on real-world data, such as displaying useful information on videos or images of a scene. This dissertation presents an Enhanced AR (EAR) framework for displaying useful information on images of a sports game. The challenge in such applications is robust object detection and recognition, which is even harder in strong sunlight. We address the phenomenon where a captured image is degraded by strong sunlight. The developed framework includes an image enhancement technique to improve the accuracy of subsequent player and face detection. The image enhancement is followed by player detection, face detection, recognition of players, and display of players' personal information. First, an algorithm based on Multi-Scale Retinex (MSR) is proposed for image enhancement. For player and face detection, we use an adaptive boosting algorithm with Haar-like features for both feature selection and classification. The player face recognition algorithm uses adaptive boosting with LDA for feature selection and a nearest neighbor classifier for classification. The framework can be deployed in any sport where a viewer captures images. Displaying player-specific information enhances the end-user experience. Detailed experiments are performed on 2096 diverse images captured using a digital camera and a smartphone. The images contain players in different poses, expressions, and illuminations. The player face recognition module requires players' faces to be frontal or within ±35° of pose variation. The work demonstrates the great potential of computer vision based approaches for future development of AR applications. COMSATS Institute of Information Technology

    NON-INVASIVE IMAGE ENHANCEMENT OF COLOUR RETINAL FUNDUS IMAGES FOR A COMPUTERISED DIABETIC RETINOPATHY MONITORING AND GRADING SYSTEM

    Diabetic Retinopathy (DR) is a sight-threatening complication of diabetes mellitus that affects the retina. The pathologies of DR can be monitored by analysing colour fundus images. However, the low and varied contrast between retinal vessels and the background in colour fundus images remains an impediment to visual analysis, in particular when analysing tiny retinal vessels and capillary networks. To circumvent this problem, fundus fluorescein angiography (FFA), which improves the image contrast, is used. Unfortunately, it is an invasive procedure (injection of contrast dyes) that leads to other physiological problems and in the worst case may cause death. The objective of this research is to develop a non-invasive digital image enhancement scheme that can overcome the problem of varied and low contrast in colour fundus images, so that the contrast produced is comparable to that of the invasive fluorescein method, without introducing noise or artefacts. The developed image enhancement algorithm (called RETICA) is incorporated into a newly developed computerised DR system (called RETINO) that is capable of monitoring and grading DR severity using colour fundus images. RETINO grades DR severity into five stages, namely No DR, Mild Non-Proliferative DR (NPDR), Moderate NPDR, Severe NPDR and Proliferative DR (PDR), by enhancing the quality of the digital colour fundus image in the macular region using RETICA and analysing the enlargement of the foveal avascular zone (FAZ), a region devoid of retinal vessels in the macular region. The importance of this research is to improve image quality in order to increase the accuracy, sensitivity and specificity of DR diagnosis, and to enable DR grading through either direct observation or a computer-assisted diagnosis system.

    Quality Enhancement for Underwater Images using Various Image Processing Techniques: A Survey

    Underwater images are essential for identifying the activity of underwater objects. They play a vital role in exploring and utilizing aquatic resources. Underwater images exhibit low contrast, various kinds of noise, and object imbalance due to the lack of light intensity. CNN-based deep learning approaches have improved low-resolution underwater photos during the last decade. Nevertheless, those techniques still have problems, such as high MSE and poor PSNR and SSIM scores. Using different experimental analyses, various methods are studied that effectively treat different distorted underwater scenes and improve contrast and color deviation compared to other algorithms. In terms of the color richness of the resulting images and the execution time, the latest algorithm still has deficiencies. In future work, the structure of our algorithm will be further adjusted to shorten the execution time, and optimization of the color compensation method under different color deviations will also be a focus of future research. With the wide application of underwater vision in different scientific research fields, underwater image enhancement can play an increasingly significant role in image processing for underwater research and underwater archaeology. Most of the target images of the current algorithms are shallow-water images. When an artificial light source is added to deep-water images, the raw images contain more diverse noise, and image enhancement faces more challenges. This study therefore investigates the numerous existing systems used for quality enhancement of underwater images using various image processing techniques. We identify various gaps and challenges in current systems and outline how this research can be extended for future improvement. The aim of this overview is to define the future problem statement, extend this research, and overcome the challenges faced by previous researchers. It also aims to improve accuracy by reducing MSE and increasing PSNR.
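
    For readers unfamiliar with the two metrics named in this abstract, the sketch below computes MSE and PSNR between a reference image and an enhanced image, both given as flat lists of pixel values. The function names, the 8-bit peak value of 255, and the sample pixel values are assumptions made for the example and do not come from the survey.

```python
import math

def mse(reference, test):
    """Mean squared error between two equally sized pixel sequences."""
    if len(reference) != len(test):
        raise ValueError("images must have the same number of pixels")
    return sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)

def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB; infinite for identical images."""
    err = mse(reference, test)
    if err == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / err)

# Hypothetical 4-pixel example.
reference = [10, 20, 30, 40]
enhanced = [12, 18, 33, 40]
error = mse(reference, enhanced)      # (4 + 4 + 9 + 0) / 4 = 4.25
quality = psnr(reference, enhanced)   # about 41.85 dB
```

    Lower MSE and higher PSNR both indicate that the processed image is closer to the reference, which is why the two goals are usually stated together.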