4,049 research outputs found

    The Importance of the Instantaneous Phase in Detecting Faces with Convolutional Neural Networks

    Full text link
    Convolutional Neural Networks (CNN) have provided new and accurate methods for processing digital images and videos. Yet, training CNNs is extremely demanding in terms of computational resources. Also, for specific applications, the standard use of transfer learning also tends to require far more resources than what may be needed. Furthermore, the final systems tend to operate as black boxes that are difficult to interpret. The current thesis considers the problem of detecting faces from the AOLME video dataset. The AOLME dataset consists of a large video collection of group interactions that are recorded in unconstrained classroom environments. For the thesis, still image frames were extracted at every minute from 18 24-minute videos. Then, each video frame was divided into 9x5 blocks with 50x50 pixels each. For each of the 19440 blocks, the percentage of face pixels was set as ground truth. Face detection was then defined as a regression problem for determining the face pixel percentage for each block. For testing different methods, 12 videos were used for training and validation. The remaining 6 videos were used for testing. The thesis examines the impact of using the instantaneous phase for the AOLME block-based face detection application. For comparison, the thesis compares the use of the Frequency Modulation image based on the instantaneous phase, the use of the instantaneous amplitude, and the original gray scale image. To generate the FM and AM inputs, the thesis uses dominant component analysis that aims to decrease the training overhead while maintaining interpretability.Comment: Master Thesi

    Prólogo

    Get PDF

    In situ decolorization monitoring of textile dyes for an optimized UV-LED/TiO2 reactor

    Get PDF
    Heterogeneous photocatalysis, using photocatalysts in suspension to eliminate diverse contaminants, including textile wastewater, has several advantages. Nevertheless, current absorbance and decolorization measurements imply sample acquisition by extraction at a fixed rate with consequent photocatalyst removal. This study presents online monitoring for the decolorization of six azo dyes, Orange PX-2R (OP2), Remazol Black B133 (RB), Procion Crimson H-EXL (PC), Procion Navy H-EXL (PN), Procion Blue H-EXL (PB), and Procion Yellow H-EXL (PY), analyzing the spectrum measured in situ by using the light scattering provided by the photocatalyst in suspension. The results obtained have corroborated the feasibility of obtaining absorbance and decolorization measurements, avoiding disturbances in the process due to a decrease in the volume in the reactor.Peer ReviewedPostprint (published version

    Maritime education and training in Chile : an analysis of the current management system and proposal for its restructure

    Get PDF

    The Importance of the Instantaneous Phase in Detecting Faces with Convolutional Neural Networks

    Get PDF
    Convolutional Neural Networks (CNN) have provided new and accurate methods for processing digital images and videos. Yet, training CNNs is extremely demanding in terms of computational resources. Also, for simple applications, the standard use of transfer learning also tends to require far more resources than what may be needed. Furthermore, the final systems tend to operate as black boxes that are difficult to interpret. The current thesis considers the problem of detecting faces from the AOLME video dataset. The AOLME dataset consists of a large video collection of group interactions that are recorded in unconstrained classroom environments. For the thesis, still image frames were extracted at every minute from 18 24-minute videos. Then, each video frame was divided into 9x5 blocks with 50x50 pixels each. For each of the 19440 blocks, the percentage of face pixels was set as ground truth. Face detection was then defined as a regression problem for determining the face pixel percentage for each block. For testing different methods, 12 videos were used for training and validation. The remaining 6 videos were used for testing. The thesis examines the impact of using the instantaneous phase for the AOLME block-based face detection application. For comparison, the thesis compares the use of the Frequency modulation image based on the instantaneous phase, the use of the instantaneous amplitude, and the original gray scale image. To generate the FM and AM inputs, the thesis uses dominant component analysis that aims to decrease the training overhead while maintaining interpretability. The results indicate that the use of the FM image yielded about the same performance as the MobileNet V2 architecture (AUC of 0.78 vs 0.79), with vastly reduced training times. Training was 7x faster for an Intel Xeon with a GTX 1080 based desktop and 11x faster on a laptop with Intel i5 with a GTX 1050. Furthermore, the proposed architecture trains 123x less parameters than what is needed for MobileNet V2. The FM-based neural network architecture uses a single convolutional layer. In comparison, the full LeNet-5 on the same image block using the original image could not be trained for face detection (AUC of 0.5)

    IX. La cuestión nacional

    Get PDF
    • …
    corecore