148 research outputs found

    EfficientWord-Net: An Open Source Hotword Detection Engine based on One-shot Learning

    Full text link
    Voice assistants like Siri, Google Assistant, Alexa etc. are used widely across the globe for home automation, these require the use of special phrases also known as hotwords to wake it up and perform an action like "Hey Alexa!", "Ok Google!" and "Hey Siri!" etc. These hotwords are detected with lightweight real-time engines whose purpose is to detect the hotwords uttered by the user. This paper presents the design and implementation of a hotword detection engine based on one-shot learning which detects the hotword uttered by the user in real-time with just one or few training samples of the hotword. This approach is efficient when compared to existing implementations because the process of adding a new hotword in the existing systems requires enormous amounts of positive and negative training samples and the model needs to retrain for every hotword. This makes the existing implementations inefficient in terms of computation and cost. The architecture proposed in this paper has achieved an accuracy of 94.51%.Comment: 9 pages, 17 figure

    Joint Neural Networks for One-shot Object Recognition and Detection.

    Get PDF
    PhD ThesesThis thesis presents a study of techniques for one-shot recognition and detec- tion of objects. In computer vision, the one-shot object recognition task aims to categorize matches and mismatches between patterns from which a single sample is available for training. Whereas, the one-shot object detection task additionally de nes a bounding box locating the position of the matching pat- tern within a target image. Such task is trivial for humans but challenging for machines. Classical object recognition and detection have drawn a considerable amount of attention in the last decades leading to the development of several machine learning related approaches. Most approaches require a substantial amount of data to achieve state-of-art performance; making them unsuitable for use cases where collecting the data can be more costly than performing the task manu- ally. In contrast, a relatively low amount of work is present in the literature tackling the one-shot recognition problem. Just until recently, Siamese Neural Networks have emerged in the literature as a deep learning approach to ad- dress the one-shot object recognition and detection problems while achieving reasonable results This research focuses on building modi cations to pairwise Siamese Neural Networks introducing the concept of joint layer, that improve their performance when addressing the one-shot object recognition problem in realistic scenarios. The proposed Joint Neural Networks achieved an accuracy of 70.0% when com- pared with the standard Siamese Neural Networks approach, outperforming the later by 10.0% accuracy tested on the MiniImageNet and QMUL-OpenLogo datasets. Subsequently, a novel approach for one-shot object detection is presented as an extension of the Joint Neural Networks. The proposed one-shot detection approach is inspired by state-of-art one-stage detection approaches and does not rely on contextual information or support test sets to generate predictions. This approach shows competitive results, achieving a 56.1% mAP on the Pascal VOC dataset split for one-shot detection. In adition, Joint Neural Networks push the state-of-art when trained on the COCO dataset and tested on the pascal VOC dataset, achieving 47.1% mAP.

    VisualPhishNet: Zero-Day Phishing Website Detection by Visual Similarity

    Get PDF
    Phishing websites are still a major threat in today's Internet ecosystem. Despite numerous previous efforts, similarity-based detection methods do not offer sufficient protection for the trusted websites - in particular against unseen phishing pages. This paper contributes VisualPhishNet, a new similarity-based phishing detection framework, based on a triplet Convolutional Neural Network (CNN). VisualPhishNet learns profiles for websites in order to detect phishing websites by a similarity metric that can generalize to pages with new visual appearances. We furthermore present VisualPhish, the largest dataset to date that facilitates visual phishing detection in an ecologically valid manner. We show that our method outperforms previous visual similarity phishing detection approaches by a large margin while being robust against a range of evasion attacks
    • …
    corecore