5 research outputs found

    Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

    In this paper, we present a novel approach to layer-wise weight initialization of deep neural networks using Linear Discriminant Analysis (LDA). Typically, the weights of a deep neural network are initialized with random values, by greedy layer-wise pre-training (usually as a Deep Belief Network or as an auto-encoder), or by re-using the layers from another network (transfer learning). Hence, either many training epochs are needed before meaningful weights are learned, or a rather similar dataset is required to seed the fine-tuning in transfer learning. In this paper, we describe how to turn an LDA into either a neural layer or a classification layer. We analyze the initialization technique on historical documents. First, we show that an LDA-based initialization is quick and leads to a very stable initialization. Furthermore, for the task of layout analysis at pixel level, we investigate the effectiveness of LDA-based initialization and show that it outperforms state-of-the-art random weight initialization methods. Comment: 5 page
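The abstract does not spell out how an LDA becomes a neural layer, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes a plain multi-class LDA whose projection matrix is used as the weights of a linear layer and whose bias centers the data; the function name `lda_layer_init` and the regularization constant are hypothetical choices made for illustration.

```python
import numpy as np

def lda_layer_init(X, y, n_components, reg=1e-6):
    """Return (W, b) so that z = X @ W + b is an LDA projection of X.

    X: (n_samples, n_features) inputs, y: (n_samples,) integer labels.
    Hypothetical helper: a sketch, not the method from the paper.
    """
    d = X.shape[1]
    mean_all = X.mean(axis=0)
    S_w = np.zeros((d, d))  # within-class scatter
    S_b = np.zeros((d, d))  # between-class scatter
    for c in np.unique(y):
        Xc = X[y == c]
        mu_c = Xc.mean(axis=0)
        centered = Xc - mu_c
        S_w += centered.T @ centered
        diff = (mu_c - mean_all)[:, None]
        S_b += len(Xc) * (diff @ diff.T)
    # Small ridge term (assumed) keeps S_w positive definite.
    S_w += reg * np.eye(d)
    # Solve S_b v = lambda * S_w v by whitening with the Cholesky factor of S_w.
    L = np.linalg.cholesky(S_w)
    L_inv = np.linalg.inv(L)
    evals, evecs = np.linalg.eigh(L_inv @ S_b @ L_inv.T)
    order = np.argsort(evals)[::-1][:n_components]
    W = L_inv.T @ evecs[:, order]  # columns are discriminant directions
    b = -mean_all @ W              # bias centers the projected data
    return W, b
```

The returned `W` and `b` could then be copied into the first linear layer of a network before gradient training starts; at most (number of classes minus one) components carry discriminative information.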

    RGB-NIR image categorization with prior knowledge transfer

    Recent developments in image categorization, especially scene categorization, show that the combination of standard visible RGB image data and near-infrared (NIR) image data performs better than RGB-only image data. However, the size of RGB-NIR image collections is often limited due to the difficulty of acquisition. With limited data, it is difficult to extract effective features using common deep learning networks. It is observed that humans are able to learn prior knowledge from other tasks or from a good mentor, which helps to solve learning problems with limited training samples. Inspired by this observation, we propose a novel training methodology for introducing prior knowledge into a deep architecture, which allows us to bypass the burdensome labeling of the large quantities of image data that deep learning requires. First, transfer learning is adopted to learn single-modal features from a large source database, such as ImageNet. Then, a knowledge distillation method is explored to fuse the RGB and NIR features. Finally, a global optimization method is employed to fine-tune the entire network. The experimental results on two RGB-NIR datasets demonstrate the effectiveness of our proposed approach in comparison with state-of-the-art multi-modal image categorization methods. https://deepblue.lib.umich.edu/bitstream/2027.42/146762/1/13640_2018_Article_388.pd
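The abstract names knowledge distillation without giving its exact form, so the sketch below shows only the generic soft-target formulation (temperature-scaled softmax plus a KL divergence between teacher and student outputs). It is an assumption that something of this shape applies here; how the paper uses distillation to fuse RGB and NIR features is not stated, and the function names and default temperature are hypothetical.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over the last axis, with a temperature."""
    z = logits / temperature
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Mean KL(teacher || student) on temperature-softened distributions.

    Scaled by temperature**2, a common convention that keeps gradient
    magnitudes comparable across temperatures. Generic sketch only.
    """
    p_t = softmax(teacher_logits, temperature)
    p_s = softmax(student_logits, temperature)
    eps = 1e-12  # guards against log(0)
    kl = np.sum(p_t * (np.log(p_t + eps) - np.log(p_s + eps)), axis=-1)
    return float(temperature ** 2 * kl.mean())
```

In a training loop this term would typically be added to the ordinary classification loss on the labeled samples, with a weighting factor chosen by validation.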

    Multimodal Optical Diagnostics of the Microhaemodynamics in Upper and Lower Limbs

    The introduction of optical non-invasive diagnostic methods into clinical practice can substantially advance the detection of early microcirculatory disorders in patients with different diseases. This paper is devoted to the development and application of an optical non-invasive diagnostic approach for detecting microcirculatory and metabolic disorders, and evaluating their severity, in rheumatic diseases and diabetes mellitus. The proposed methods include the joint use of laser Doppler flowmetry, absorption spectroscopy and fluorescence spectroscopy in combination with functional tests. This technique showed high diagnostic value for the detection of disturbances in peripheral microhaemodynamics. These methods have been successfully tested as additional diagnostic techniques in the fields of rheumatology and endocrinology. The sensitivity and specificity of the proposed diagnostic procedures have been evaluated.

    Online local learning algorithms for linear discriminant analysis

    Online local learning algorithms are proposed for a laterally-connected single-layer neural network that performs linear discriminant analysis. A convergence proof is provided for the algorithm based on Hebbian learning. The algorithms are simulated and applied to the face recognition problem. (C) 2004 Elsevier B.V. All rights reserved.