12 research outputs found

    AMNet: Memorability Estimation with Attention

    In this paper we present the design and evaluation of an end-to-end trainable deep neural network with a visual attention mechanism for memorability estimation in still images. We analyze the suitability of transfer learning of deep models from image classification to the memorability task. We further study the impact of the attention mechanism on memorability estimation and evaluate our network on the SUN Memorability and LaMem datasets. Our network outperforms the existing state-of-the-art models on both datasets in terms of Spearman's rank correlation as well as mean squared error, closely matching human consistency.

    Unsupervised Video Summarization via Attention-Driven Adversarial Learning

    This paper presents a new video summarization approach that integrates an attention mechanism to identify the significant parts of the video, and is trained in an unsupervised manner via generative adversarial learning. Starting from the SUM-GAN model, we first develop an improved version of it (called SUM-GAN-sl) that has a significantly reduced number of learned parameters, performs incremental training of the model's components, and applies a stepwise label-based strategy for updating the adversarial part. Subsequently, we introduce an attention mechanism to SUM-GAN-sl in two ways: i) by integrating an attention layer within the variational auto-encoder (VAE) of the architecture (SUM-GAN-VAAE), and ii) by replacing the VAE with a deterministic attention auto-encoder (SUM-GAN-AAE). Experimental evaluation on two datasets (SumMe and TVSum) documents the contribution of the attention auto-encoder to faster and more stable training of the model, resulting in a significant performance improvement with respect to the original model and demonstrating the competitiveness of the proposed SUM-GAN-AAE against the state of the art.

    A Comparison of Embedded Deep Learning Methods for Person Detection

    Recent advancements in parallel computing, GPU technology and deep learning provide a new platform for complex image processing tasks such as person detection to flourish. Person detection is a fundamental preliminary operation for several high-level computer vision tasks. One industry that can significantly benefit from person detection is retail. In recent years, various studies have attempted to find an optimal solution for person detection using neural networks and deep learning. This study compares state-of-the-art deep learning-based object detectors, with a focus on person detection performance in indoor environments. The performance of various implementations of YOLO, SSD, RCNN, R-FCN and SqueezeDet has been assessed using our in-house proprietary dataset, which consists of over 10 thousand indoor images captured from shopping malls, retail outlets and stores. Experimental results indicate that Tiny YOLO-416 and SSD (VGG-300) are the fastest, and Faster-RCNN (Inception ResNet-v2) and R-FCN (ResNet-101) are the most accurate, of the detectors investigated in this study. Further analysis shows that YOLO v3-416 delivers relatively accurate results in a reasonable amount of time, which makes it an ideal model for person detection on embedded platforms.

    Ethnic disparities in progression rates for sight-threatening diabetic retinopathy in diabetic eye screening: a population-based retrospective cohort study

    INTRODUCTION: The English Diabetic Eye Screening Programme (DESP) offers people living with diabetes (PLD) annual eye screening. We examined incidence and determinants of sight-threatening diabetic retinopathy (STDR) in a sociodemographically diverse multi-ethnic population. RESEARCH DESIGN AND METHODS: North East London DESP cohort data (January 2012 to December 2021) with 137 591 PLD with no retinopathy, or non-STDR at baseline in one/both eyes, were used to calculate STDR incidence rates by sociodemographic factors, diabetes type, and duration. Hazard ratios (HRs) from Cox models examined associations with STDR. RESULTS: There were 16 388 incident STDR cases over a median of 5.4 years (IQR 2.8-8.2; STDR rate 2.214, 95% CI 2.214 to 2.215 per 100 person-years). Compared with people with no retinopathy at baseline, those with non-STDR in one eye (HR 3.03, 95% CI 2.91 to 3.15, p<0.001) or both eyes (HR 7.88, 95% CI 7.59 to 8.18, p<0.001) had a higher risk of STDR. Black and South Asian individuals had higher STDR hazards than white individuals (HR 1.57, 95% CI 1.50 to 1.64 and HR 1.36, 95% CI 1.31 to 1.42, respectively). Additionally, every 5-year increase in age at inclusion was associated with an 8% reduction in STDR hazards (p<0.001). CONCLUSIONS: Ethnic disparities exist in a health system limited by capacity rather than patient economic circumstances. Diabetic retinopathy at first screen is a strong determinant of STDR development. By using basic demographic characteristics, screening programmes or clinical practices can stratify risk for STDR development.

    Two-year recall for people with no diabetic retinopathy: A multi-ethnic population-based retrospective cohort study using real-world data to quantify the effect

    BACKGROUND/AIMS: The English Diabetic Eye Screening Programme (DESP) offers people living with diabetes (PLD) annual screening. Less frequent screening has been advocated among PLD without diabetic retinopathy (DR), but evidence for each ethnic group is limited. We examined the potential effect of biennial versus annual screening on the detection of sight-threatening diabetic retinopathy (STDR) and proliferative diabetic retinopathy (PDR) among PLD without DR from a large urban multi-ethnic English DESP. METHODS: PLD in North-East London DESP (January 2012 to December 2021) with no DR on two prior consecutive screening visits with up to 8 years of follow-up were examined. Annual STDR and PDR incidence rates, overall and by ethnicity, were quantified. Delays in identification of STDR and PDR events, had 2-year screening intervals been used, were determined. FINDINGS: Among 82 782 PLD (37% white, 36% South Asian, and 16% black people), there were 1788 incident STDR cases over mean (SD) 4.3 (2.4) years (STDR rate 0.51, 95% CI 0.47 to 0.55 per 100 person-years). STDR incidence rates per 100 person-years by ethnicity were 0.55 (95% CI 0.48 to 0.62) for South Asian, 0.34 (95% CI 0.29 to 0.40) for white, and 0.77 (95% CI 0.65 to 0.90) for black people. Biennial screening would have delayed diagnosis by 1 year for 56.3% (1007/1788) with STDR and 43.6% (45/103) with PDR. Standardised cumulative rates of delayed STDR per 100 000 persons for each ethnic group were 1904 (95% CI 1683 to 2154) for black people, 1276 (95% CI 1153 to 1412) for South Asian people, and 844 (95% CI 745 to 955) for white people. INTERPRETATION: Biennial screening would have delayed detection of some STDR and PDR by 1 year, especially among those of black ethnic origin, leading to healthcare inequalities.

    Rammed Earth: Synergy of Natural Material in Artificial Earthscape

    This thesis explores the use of Rammed Earth, a traditional and sustainable building material, in the context of an abandoned surface quarry mine. It illuminates some of the historical precedents and developments in its technology which, it is argued, merit further research into, and future use of, this building method. The thesis analyses the McCoy limestone quarry as a theoretical site and its soil's suitability for Rammed Earth construction. To demonstrate the technique's viability, the thesis proposes a conceptual vocational campus which provides a testing ground for application and research of Rammed Earth construction and serves as a proof of concept that brings awareness of this building technique. Architecture and Design, Gerald D. Hines College of; Honors Colleg