29 research outputs found

    A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design

    Audio fingerprinting, also known as audio hashing, is a well-established technique for audio identification and synchronization. It involves two major steps: fingerprint (voice pattern) design and matching search. The first step concerns the derivation of a robust and compact audio signature, while the second usually requires knowledge of databases and quick-search algorithms. Although the technique has a wide range of real-world applications, to the best of the authors' knowledge the last comprehensive survey of existing algorithms appeared more than eight years ago. In this paper, we therefore present a more up-to-date review and, to emphasize the audio signal processing aspect, focus our state-of-the-art survey on the fingerprint design step, for which various audio features and their tractable statistical models are discussed. Comment: http://www.iaria.org/conferences2015/PATTERNS15.html ; Seventh International Conferences on Pervasive Patterns and Applications (PATTERNS 2015), Mar 2015, Nice, France

    MediaEval 2016 Predicting Media Interestingness Task

    Volume: 1739. Host publication title: MediaEval 2016 Multimedia Benchmark Workshop. Host publication sub-title: Working Notes Proceedings of the MediaEval 2016 Workshop. Non peer reviewed

    The impact of cataract surgery on vision-related quality of life for bilateral cataract patients in Ho Chi Minh City, Vietnam: a prospective study

    BACKGROUND: To determine the impact of cataract surgery on vision-related quality of life (VRQOL) and examine the association between objective visual measures and change in VRQOL after surgery among bilateral cataract patients in Ho Chi Minh City, Vietnam. METHODS: A cohort of older patients with bilateral cataract was assessed one week before and one to three months after first-eye or both-eye cataract surgery. Visual measures including visual acuity, contrast sensitivity and stereopsis were obtained. Vision-related quality of life was assessed using the NEI VFQ-25. Descriptive analyses and a generalized linear estimating equation (GEE) analysis were undertaken to measure change in VRQOL after surgery. RESULTS: Four hundred and thirteen patients were assessed before cataract surgery and 247 completed the follow-up assessment one to three months after first-eye or both-eye cataract surgery. Overall, VRQOL significantly improved after cataract surgery (p < 0.001), particularly after both-eye surgery. Binocular contrast sensitivity (p < 0.001) and stereopsis (p < 0.001) were also associated with change in VRQOL after cataract surgery. Visual acuity was not associated with VRQOL. CONCLUSIONS: Cataract surgery significantly improved VRQOL among bilateral cataract patients in Vietnam. Contrast sensitivity and stereopsis, rather than visual acuity, significantly affected VRQOL after cataract surgery.

    Associations of Underlying Health Conditions With Anxiety and Depression Among Outpatients: Modification Effects of Suspected COVID-19 Symptoms, Health-Related and Preventive Behaviors

    Objectives: We explored the association of underlying health conditions (UHC) with depression and anxiety, and examined the modification effects of suspected COVID-19 symptoms (S-COVID-19-S), health-related behaviors (HB), and preventive behaviors (PB). Methods: A cross-sectional study was conducted on 8,291 outpatients aged 18–85 years, in 18 hospitals and health centers across Vietnam from February 14 to May 31, 2020. We collected data on participants' characteristics, UHC, HB, PB, depression, and anxiety. Results: People with UHC had higher odds of depression (OR = 2.11; p < 0.001) and anxiety (OR = 2.86; p < 0.001) than those without UHC. The odds of depression and anxiety were significantly higher for those with UHC and S-COVID-19-S (p < 0.001); and were significantly lower for those who had UHC and interacted with “unchanged/more” physical activity (p < 0.001), or “unchanged/more” drinking (p < 0.001 for only anxiety), or “unchanged/healthier” eating (p < 0.001), and high PB score (p < 0.001), as compared to those without UHC and without S-COVID-19-S, “never/stopped/less” physical activity, drinking, “less healthy” eating, and low PB score, respectively. Conclusion: S-COVID-19-S worsen psychological health in patients with UHC. Physical activity, drinking, healthier eating, and high PB score were protective factors.

    Underdetermined reverberant audio source separation using a full-rank spatial covariance model

    This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covariance encodes the spatial characteristics of the source. We then consider four specific covariance models, including a full-rank unconstrained model. We derive a family of iterative expectation-maximization (EM) algorithms to estimate the parameters of each model and propose suitable procedures adapted from the state-of-the-art to initialize the parameters and to align the order of the estimated sources across all frequency bins. Experimental results over reverberant synthetic mixtures and live recordings of speech data show the effectiveness of the proposed approach. Index Terms: Convolutive blind source separation, under-determined mixtures, spatial covariance models, EM algorithm, permutation problem.