10 research outputs found

    Transceiver design for non-regenerative MIMO relay systems with decision feedback detection

    Get PDF
    In this paper we consider the design of zero forcing (ZF) and minimum mean square error (MMSE) transceivers for non-regenerative multiple input multiple output (MIMO) relay networks. Our designs utilise linear processors at each stage of the network along with a decision feedback detection device at the receiver. Under the assumption of full channel state information (CSI) across the entire link the processors are jointly optimised to minimise the system arithmetic mean square error (MSE) whilst meeting average power constraints at both the source and the relay terminals. We compare the presented methods to linear designs available in the literature and show the advantages of the proposed transceivers through simulation results

    Local binary patterns for 1-D signal processing

    Get PDF
    Local Binary Patterns (LBP) have been used in 2-D image processing for applications such as texture segmentation and feature detection. In this paper a new 1-dimensional local binary pattern (LBP) signal processing method is presented. Speech systems such as hearing aids require fast and computationally inexpensive signal processing. The practical use of LBP based speech processing is demonstrated on two signal processing problems: - (i) signal segmentation and (ii) voice activity detection (VAD). Both applications use the underlying features extracted from the 1-D LBP. The proposed VAD algorithm demonstrates the simplicity of 1-D LBP processing with low computational complexity. It is also shown that distinct LBP features are obtained to identify the voiced and the unvoiced components of speech signal

    Multi-view through-the-wall radar imaging using compressed sensing

    Get PDF
    This paper considers the problem of Through-the-Wall Radar Imaging (TWRI) from multiple views using Compressed Sensing (CS). The scene reconstruction problem is reformulated in terms of finding a sparse representation of the target locations, consistent with the observations. In contrast to the common approach of first applying image formation to each view and then fusing the single-view images, observations from the different views are combined together into a composite measurement vector and a new dictionary is constructed accordingly. A sparse image representation of the scene is then obtained from the composite measurement vector and the new dictionary using 1 -norm minimization. Experimental results demonstrate that the proposed approach using various standoff distances and perspectives achieves a better performance in terms of detecting targets compared to the alternative approach of image formation followed by fusion

    Interactive Video Mashup Based on Emotional Identity

    Get PDF
    The growth of new multimedia technologies has provided the user with the ability to become a videomaker, instead of being merely part of a passive audience. In such a scenario, a new generation of audiovisual content, referred to as video mashup, is gaining consideration and popularity. A mashup is created by editing and remixing pre-existing material to obtain a product which has its own identity and, in some cases, an artistic value itself. In this work we propose an emotional-driven interactive framework for the creation of video mashup. Given a set of feature movies as primary material, during the mixing task the user is supported by a selection of sequences belonging to different movies which share a similar emotional identity, defined through the investigation of cinematographic techniques used by directors to convey emotions

    An Iterative Adaptive Approach for Blood Velocity Estimation Using Ultrasound

    Get PDF
    Publication in the conference proceedings of EUSIPCO, Aalborg, Denmark, 201

    Asynchronous P300 BCI:SSVEP based control state detection

    Get PDF
    Publication in the conference proceedings of EUSIPCO, Aalborg, Denmark, 201

    On the Predictability of the Intelligibility of Speech to Hearing Impaired Listeners

    Get PDF
    What information do we need to know about listeners to predict their performance on a speech intelligibility task and how well can we predict intelligibility anyway? This paper performs a meta-analysis on two speech intelligibility studies of hearing-impaired listeners in which we evaluate different approaches to building a predictive model of intelligibility. The model has two components: a cochlear loss term based on a number of psychoacoustic measures of hearing, and a supracochlear loss term to explain residual performance variation. These models are trained using a method of cross-validation to determine how well they might perform on new listeners and new tasks. We found that cochlear loss could only explain 40% of the variability in performance across hearing-impaired listeners, while the supra-cochlear loss can account for a further 20-40% depending on the task. The combined cochlear and supra-cochlear loss terms allow good estimates of intelligibility scores in the data, with speech reception thresholds on a novel listening task being predictable to within 1dB on average

    THE VISNET II DVC CODEC: ARCHITECTURE, TOOLS AND PERFORMANCE

    Get PDF
    ABSTRACT This paper introduces the VISNET II DVC codec. This codec achieves very high RD performance thanks to the efficient combination of many state-of-the-art coding tools into INTRODUCTION With the wide deployment of mobile and wireless networks, a growing number of emerging applications, such as lowpower sensor networks, video surveillance cameras and mobile communications, rely on an up-link model rather than the typical down-link communication model. Typically, these applications are characterized by many senders transmitting data to a central receiver. In this context, light encoding or a flexible distribution of the codec complexity, robustness to packet losses, high compression efficiency and low latency/delay are important requirements. To address the needs of these up-link applications, the usual predictive video coding paradigm has been revisited based on Information Theory theorems from the 70s. The Slepian-Wolf (SW) theorem [1] establishes lower bounds on the achievable rates for the lossless coding of two or more correlated sources. More specifically, considering two statistically dependent random signals X and Y, it is well-known that the lower bound for the rate is given by the joint entropy H(X,Y) when these two signals are jointly encoded (as in conventional predictive coding). Conversely, when these two signals are independently encoded but jointly decoded (distributed coding), the SW theorem states that the minimum rate is still H(X,Y) with a residual error probability which tends towards 0 for long sequences. Later, Wyner and Ziv (WZ) have extended the SW theorem and showed that the result holds for the lossy coding case under the assumptions that the sources are jointly Gaussian and a mean square error distortion measure is used [2]. Subsequently, it was shown that this result remains valid as long as the difference between X and Y is Gaussian. Video coding schemes based on these theorems are referred to as Distributed Video Coding (DVC) solutions. Since the new coding paradigm is based on a statistical framework and does not rely on joint encoding, DVC architectures may provide several functional benefits which are rather important for many emerging applications: i) flexible allocation of the global video codec complexity; ii) improved error resilience; iii) codec independent scalability; and iv) exploitation of multiview correlation. Based on these theoretical results, practical implementations of DVC have been proposed since 2002. The PRISM (Power-efficient, Robust, hIgh compression Syndrome-based Multimedia coding) [3] solution works at the block level and performs motion estimation at the decoder. Based on the amount of temporal correlation, estimated using a zeromotion block difference, each block can either be conventionally (intra) coded, skipped or coded using distributed coding principles. Another DVC architecture working at frame level has been proposed in In this paper, the DVC codec developed within the European Network of Excellence VISNET II project [5] is described. This codec is based on the early architecture in VISNET II CODEC ARCHITECTURE AND TOOLS This section provides a description of the VISNET II DVC codec architecture and tools illustrated i
    corecore