13 research outputs found

    Text-Independent Speaker Verification Using 3D Convolutional Neural Networks

    Full text link
    In this paper, a novel method using 3D Convolutional Neural Network (3D-CNN) architecture has been proposed for speaker verification in the text-independent setting. One of the main challenges is the creation of the speaker models. Most of the previously-reported approaches create speaker models based on averaging the extracted features from utterances of the speaker, which is known as the d-vector system. In our paper, we propose an adaptive feature learning by utilizing the 3D-CNNs for direct speaker model creation in which, for both development and enrollment phases, an identical number of spoken utterances per speaker is fed to the network for representing the speakers' utterances and creation of the speaker model. This leads to simultaneously capturing the speaker-related information and building a more robust system to cope with within-speaker variation. We demonstrate that the proposed method significantly outperforms the traditional d-vector verification system. Moreover, the proposed system can also be an alternative to the traditional d-vector system which is a one-shot speaker modeling system by utilizing 3D-CNNs.Comment: Accepted to be published in IEEE International Conference on Multimedia and Expo (ICME) 201

    A new and automated risk prediction of coronary artery disease using clinical endpoints and medical imaging-derived patient-specific insights: protocol for the retrospective GeoCAD cohort study

    Full text link
    INTRODUCTION: Coronary artery disease (CAD) is the leading cause of death worldwide. More than a quarter of cardiovascular events are unexplained by current absolute cardiovascular disease risk calculators, and individuals without clinical risk factors have been shown to have worse outcomes. The 'anatomy of risk' hypothesis recognises that adverse anatomical features of coronary arteries enhance atherogenic haemodynamics, which in turn mediate the localisation and progression of plaques. We propose a new risk prediction method predicated on CT coronary angiography (CTCA) data and state-of-the-art machine learning methods based on a better understanding of anatomical risk for CAD. This may open new pathways in the early implementation of personalised preventive therapies in susceptible individuals as a potential key in addressing the growing burden of CAD. METHODS AND ANALYSIS: GeoCAD is a retrospective cohort study in 1000 adult patients who have undergone CTCA for investigation of suspected CAD. It is a proof-of-concept study to test the hypothesis that advanced image-derived patient-specific data can accurately predict long-term cardiovascular events. The objectives are to (1) profile CTCA images with respect to variations in anatomical shape and associated haemodynamic risk expressing, at least in part, an individual's CAD risk, (2) develop a machine-learning algorithm for the rapid assessment of anatomical risk directly from unprocessed CTCA images and (3) to build a novel CAD risk model combining traditional risk factors with these novel anatomical biomarkers to provide a higher accuracy CAD risk prediction tool. ETHICS AND DISSEMINATION: The study protocol has been approved by the St Vincent's Hospital Human Research Ethics Committee, Sydney-2020/ETH02127 and the NSW Population and Health Service Research Ethics Committee-2021/ETH00990. The project outcomes will be published in peer-reviewed and biomedical journals, scientific conferences and as a higher degree research thesis

    Spatial Mixture-of-Experts

    Full text link
    Many data have an underlying dependence on spatial location; it may be weather on the Earth, a simulation on a mesh, or a registered image. Yet this feature is rarely taken advantage of, and violates common assumptions made by many neural network layers, such as translation equivariance. Further, many works that do incorporate locality fail to capture fine-grained structure. To address this, we introduce the Spatial Mixture-of-Experts (SMoE) layer, a sparsely-gated layer that learns spatial structure in the input domain and routes experts at a fine-grained level to utilize it. We also develop new techniques to train SMoEs, including a self-supervised routing loss and damping expert errors. Finally, we show strong results for SMoEs on numerous tasks, and set new state-of-the-art results for medium-range weather prediction and post-processing ensemble weather forecasts.Comment: 20 pages, 3 figures; NeurIPS 202
    corecore