894 research outputs found

    Enhanced independent vector analysis for audio separation in a room environment

    Get PDF
    Independent vector analysis (IVA) is studied as a frequency domain blind source separation method, which can theoretically avoid the permutation problem by retaining the dependency between different frequency bins of the same source vector while removing the dependency between different source vectors. This thesis focuses upon improving the performance of independent vector analysis when it is used to solve the audio separation problem in a room environment. A specific stability problem of IVA, i.e. the block permutation problem, is identified and analyzed. Then a robust IVA method is proposed to solve this problem by exploiting the phase continuity of the unmixing matrix. Moreover, an auxiliary function based IVA algorithm with an overlapped chain type source prior is proposed as well to mitigate this problem. Then an informed IVA scheme is proposed which combines the geometric information of the sources from video to solve the problem by providing an intelligent initialization for optimal convergence. The proposed informed IVA algorithm can also achieve a faster convergence in terms of iteration numbers and better separation performance. A pitch based evaluation method is defined to judge the separation performance objectively when the information describing the mixing matrix and sources is missing. In order to improve the separation performance of IVA, an appropriate multivariate source prior is needed to better preserve the dependency structure within the source vectors. A particular multivariate generalized Gaussian distribution is adopted as the source prior. The nonlinear score function derived from this proposed source prior contains the fourth order relationships between different frequency bins, which provides a more informative and stronger dependency structure compared with the original IVA algorithm and thereby improves the separation performance. Copula theory is a central tool to model the nonlinear dependency structure. The t copula is proposed to describe the dependency structure within the frequency domain speech signals due to its tail dependency property, which means if one variable has an extreme value, other variables are expected to have extreme values. A multivariate student's t distribution constructed by using a t copula with the univariate student's t marginal distribution is proposed as the source prior. Then the IVA algorithm with the proposed source prior is derived. The proposed algorithms are tested with real speech signals in different reverberant room environments both using modelled room impulse response and real room recordings. State-of-the-art criteria are used to evaluate the separation performance, and the experimental results confirm the advantage of the proposed algorithms

    Ability, Parental Valuation of Education and the High School Dropout Decision

    Get PDF
    We use a large, rich Canadian micro-level dataset to examine the channels through which family socio-economic status and unobservable characteristics affect children's decisions to drop out of high school. First, we document the strength of observable socio-economic factors: our data suggest that teenage boys with two parents who are themselves high school dropouts have a 16% chance of dropping out, compared to a dropout rate of less than 1% for boys whose parents both have a university degree. We examine the channels through which this socio-economic gradient arises using an extended version of the factor model set out in Carneiro, Hansen, and Heckman (2003). Specifically, we consider the impact of cognitive and non-cognitive ability and the value that parents place on education. Our results support three main conclusions. First, cognitive ability at age 15 has a substantial impact on dropping out. Second, parental valuation of education has an impact of approximately the same size as cognitive ability effects for medium and low ability teenagers. A low ability teenager has a probability of dropping out of approximately .03 if his parents place a high value on education but .36 if their education valuation is low. Third, parental education has no direct effect on dropping out once we control for ability and parental valuation of education. Our results point to the importance of whatever determines ability at age 15 (including, potentially, early childhood interventions) and of parental valuation of education during the teenage years. We also make a small methodological contribution by extending the standard factor based estimator to allow a non-linear relationship between the factors and a covariate of interest. We show that allowing for non-linearities has a substantial impact on estimated effects.Idiosyncratic Shocks, Disability, Insurance, Marriage

    Ability, parental valuation of education and the high school dropout decision

    Get PDF
    We use a large, rich Canadian micro-level dataset to examine the channels through which family socio-economic status and unobservable characteristics affect children's decisions to drop out of high school. First, we document the strength of observable socio-economic factors: our data suggest that teenage boys with two parents who are themselves high school dropouts have a 16 per cent chance of dropping out, compared to a dropout rate of less than 1 per cent for boys whose parents both have a university degree. We examine the channels through which this socio-economic gradient arises using an extended version of the factor model set out in Carneiro, Hansen, and Heckman (2003). Specifically, we consider the impact of cognitive and non-cognitive ability and the value that parents place on education. Our results support three main conclusions. First, cognitive ability at age 15 has a substantial impact on dropping out. The highest ability individuals are predicted never to drop out regardless of parental education or parental valuation of education. In contrast, the lowest ability teenagers have a probability of dropping out of approximately .36 if their parents have a low valuation of education. Second, parental valuation of education has a substantial impact on medium and low ability teenagers. A low ability teenager has a probability of dropping out of approximately .03 if his parents place a high value on education but .36 if their educational valuation is low. These effects are estimated while conditioning on ability at age 15. Thus, under some assumptions, they reflect parental influences during the upper teenage years and are in addition to any impact they might have in the early childhood years leading up to age 15. Third, parental education has no direct effect on dropping out once we control for ability and parental valuation of education. Overall, our results point to the importance of whatever determines ability at age 15 (including, potentially, early childhood interventions) and of parental valuation of education during the teenage years. Our work also provides a small methodological contribution by extending the standard factor based estimator to allow a more non-linear relationship between the factors and a co-variate of interest. We show that allowing for non-linearities has a substantial impact on estimated effects.

    Online source separation in reverberant environments exploiting known speaker locations

    Get PDF
    This thesis concerns blind source separation techniques using second order statistics and higher order statistics for reverberant environments. A focus of the thesis is algorithmic simplicity with a view to the algorithms being implemented in their online forms. The main challenge of blind source separation applications is to handle reverberant acoustic environments; a further complication is changes in the acoustic environment such as when human speakers physically move. A novel time-domain method which utilises a pair of finite impulse response filters is proposed. The method of principle angles is defined which exploits a singular value decomposition for their design. The pair of filters are implemented within a generalised sidelobe canceller structure, thus the method can be considered as a beamforming method which cancels one source. An adaptive filtering stage is then employed to recover the remaining source, by exploiting the output of the beamforming stage as a noise reference. A common approach to blind source separation is to use methods that use higher order statistics such as independent component analysis. When dealing with realistic convolutive audio and speech mixtures, processing in the frequency domain at each frequency bin is required. As a result this introduces the permutation problem, inherent in independent component analysis, across the frequency bins. Independent vector analysis directly addresses this issue by modeling the dependencies between frequency bins, namely making use of a source vector prior. An alternative source prior for real-time (online) natural gradient independent vector analysis is proposed. A Student's t probability density function is known to be more suited for speech sources, due to its heavier tails, and is incorporated into a real-time version of natural gradient independent vector analysis. The final algorithm is realised as a real-time embedded application on a floating point Texas Instruments digital signal processor platform. Moving sources, along with reverberant environments, cause significant problems in realistic source separation systems as mixing filters become time variant. A method which employs the pair of cancellation filters, is proposed to cancel one source coupled with an online natural gradient independent vector analysis technique to improve average separation performance in the context of step-wise moving sources. This addresses `dips' in performance when sources move. Results show the average convergence time of the performance parameters is improved. Online methods introduced in thesis are tested using impulse responses measured in reverberant environments, demonstrating their robustness and are shown to perform better than established methods in a variety of situations

    Multivariate soft rank via entropic optimal transport: sample efficiency and generative modeling

    Full text link
    The framework of optimal transport has been leveraged to extend the notion of rank to the multivariate setting while preserving desirable properties of the resulting goodness of-fit (GoF) statistics. In particular, the rank energy (RE) and rank maximum mean discrepancy (RMMD) are distribution-free under the null, exhibit high power in statistical testing, and are robust to outliers. In this paper, we point to and alleviate some of the practical shortcomings of these proposed GoF statistics, namely their high computational cost, high statistical sample complexity, and lack of differentiability with respect to the data. We show that all these practically important issues are addressed by considering entropy-regularized optimal transport maps in place of the rank map, which we refer to as the soft rank. We consequently propose two new statistics, the soft rank energy (sRE) and soft rank maximum mean discrepancy (sRMMD), which exhibit several desirable properties. Given nn sample data points, we provide non-asymptotic convergence rates for the sample estimate of the entropic transport map to its population version that are essentially of the order n−1/2n^{-1/2}. This compares favorably to non-regularized estimates, which typically suffer from the curse-of-dimensionality and converge at rate that is exponential in the data dimension. We leverage this fast convergence rate to demonstrate the sample estimate of the proposed statistics converge rapidly to their population versions, enabling efficient rank-based GoF statistical computation, even in high dimensions. Our statistics are differentiable and amenable to popular machine learning frameworks that rely on gradient methods. We leverage these properties towards showcasing the utility of the proposed statistics for generative modeling on two important problems: image generation and generating valid knockoffs for controlled feature selection.Comment: 43 pages, 10 figures. Replacement note: Title change, author changes, new theoretical results, revised and expanded experimental evaluation

    Learning Contextualized Semantics from Co-occurring Terms via a Siamese Architecture

    Get PDF
    One of the biggest challenges in Multimedia information retrieval and understanding is to bridge the semantic gap by properly modeling concept semantics in context. The presence of out of vocabulary (OOV) concepts exacerbates this difficulty. To address the semantic gap issues, we formulate a problem on learning contextualized semantics from descriptive terms and propose a novel Siamese architecture to model the contextualized semantics from descriptive terms. By means of pattern aggregation and probabilistic topic models, our Siamese architecture captures contextualized semantics from the co-occurring descriptive terms via unsupervised learning, which leads to a concept embedding space of the terms in context. Furthermore, the co-occurring OOV concepts can be easily represented in the learnt concept embedding space. The main properties of the concept embedding space are demonstrated via visualization. Using various settings in semantic priming, we have carried out a thorough evaluation by comparing our approach to a number of state-of-the-art methods on six annotation corpora in different domains, i.e., MagTag5K, CAL500 and Million Song Dataset in the music domain as well as Corel5K, LabelMe and SUNDatabase in the image domain. Experimental results on semantic priming suggest that our approach outperforms those state-of-the-art methods considerably in various aspects

    Analysis and design of a capsule landing system and surface vehicle control system for Mars exploration

    Get PDF
    Problems related to an unmanned exploration of the planet Mars by means of an autonomous roving planetary vehicle are investigated. These problems include: design, construction and evaluation of the vehicle itself and its control and operating systems. More specifically, vehicle configuration, dynamics, control, propulsion, hazard detection systems, terrain sensing and modelling, obstacle detection concepts, path selection, decision-making systems, and chemical analyses of samples are studied. Emphasis is placed on development of a vehicle capable of gathering specimens and data for an Augmented Viking Mission or to provide the basis for a Sample Return Mission

    Statistical mechanics, generalisation and regularisation of neural network models

    Get PDF
    • …
    corecore