121 research outputs found

    Modulation Classification for MIMO-OFDM Signals via Approximate Bayesian Inference

    Full text link
    The problem of modulation classification for a multiple-antenna (MIMO) system employing orthogonal frequency division multiplexing (OFDM) is investigated under the assumption of unknown frequency-selective fading channels and signal-to-noise ratio (SNR). The classification problem is formulated as a Bayesian inference task, and solutions are proposed based on Gibbs sampling and mean field variational inference. The proposed methods rely on a selection of the prior distributions that adopts a latent Dirichlet model for the modulation type and on the Bayesian network formalism. The Gibbs sampling method converges to the optimal Bayesian solution and, using numerical results, its accuracy is seen to improve for small sample sizes when switching to the mean field variational inference technique after a number of iterations. The speed of convergence is shown to improve via annealing and random restarts. While most of the literature on modulation classification assume that the channels are flat fading, that the number of receive antennas is no less than that of transmit antennas, and that a large number of observed data symbols are available, the proposed methods perform well under more general conditions. Finally, the proposed Bayesian methods are demonstrated to improve over existing non-Bayesian approaches based on independent component analysis and on prior Bayesian methods based on the `superconstellation' method.Comment: To be appear in IEEE Trans. Veh. Technolog

    Spatio-Temporal Approaches to Denoising and Feature Extraction in Rapid Image Triage

    Get PDF
    Ph.DDOCTOR OF PHILOSOPH

    Bio-inspired log-polar based color image pattern analysis in multiple frequency channels

    Get PDF
    The main topic addressed in this thesis is to implement color image pattern recognition based on the lateral inhibition subtraction phenomenon combined with a complex log-polar mapping in multiple spatial frequency channels. It is shown that the individual red, green and blue channels have different recognition performances when put in the context of former work done by Dragan Vidacic. It is observed that the green channel performs better than the other two channels, with the blue channel having the poorest performance. Following the application of a contrast stretching function the object recognition performance is improved in all channels. Multiple spatial frequency filters were designed to simulate the filtering channels that occur in the human visual system. Following these preprocessing steps Dragan Vidacic\u27s methodology is followed in order to determine the benefits that are obtained from the preprocessing steps being investigated. It is shown that performance gains are realized by using such preprocessing steps

    Quantitative Multidimensional Stress Assessment from Facial Videos

    Get PDF
    Stress has a significant impact on the physical and mental health of an individual and is a growing concern for society, especially during the COVID-19 pandemic. Facial video-based stress evaluation from non-invasive cameras has proven to be a significantly more efficient method to evaluate stress in comparison to approaches that use questionnaires or wearable sensors. Plenty of classification models have been built for stress detection. However, most do not consider individual differences. Also, the results for such models are limited by a uni-dimensional definition of stress levels lacking a comprehensive quantitative definition of stress. The dissertation focuses on building a framework that utilizes the multilevel video frame representations from deep learning and the remote photoplethysmography signals extracted from the facial videos for stress assessment. The fusion model takes the inputs of a baseline video and a target video of the subject. The physiological features such as heart rate and heart rate variability are used with the initial stress scores generated from deep learning are used to predict the stress scores in cognitive anxiety, somatic anxiety, and self-confidence. To generate stress scores with better accuracy, the signal extraction method is improved by introducing the CWT-SNR method that uses the signal-to-noise ratio to assist the adaptive bandpass filtering in the post-processing of the signals. A study on phase space reconstruction features is performed and the results show the potential for additional accuracy improvement for the heart rate variability detection. To select the best deep learning architecture, multiple deep learning architectures are tested to build the deep learning model. Support Vector Regression is used to generate the output stress score results. Testing with the data from the UBFC-Phys dataset, the fusion model shows a strong correlation between ground truth and the predicted results

    Development of Fuzzy System Based Channel Equalisers

    Get PDF
    Channel equalisers are used in digital communication receivers to mitigate the effects of inter symbol interference (ISI) and inter user interference in the form of co-channel interference (CCI) and adjacent channel interference (ACI) in the presence of additive white Gaussian noise (AWGN). An equaliser uses a large part of the computations involved in the receiver. Linear equalisers based on adaptive filtering techniques have long been used for this application. Recently, use of nonlinear signal processing techniques like artificial neural networks (ANN) and radial basis functions (RBF) have shown encouraging results in this application. This thesis presents the development of a nonlinear fuzzy system based equaliser for digital communication receivers. The fuzzy equaliser proposed in this thesis provides a parametric implementation of symbolby-symbol maximum a-posteriori probability (MAP) equaliser based on Bayes’s theory. This MAP equaliser is also called Bayesian equaliser. Its decision function uses an estimate of the noise free received vectors, also called channel states or channel centres. The fuzzy equaliser developed here can be implemented with lower computational complexity than the RBF implementation of the MAP equaliser by using scalar channel states instead of channel states. It also provides schemes for performance tradeoff with complexity and schemes for subset centre selection. Simulation studies presented in this thesis suggests that the fuzzy equaliser by using only 10%-20% of the Bayesian equaliser channel states can provide near optimal performance. Subsequently, this fuzzy equaliser is modified for CCI suppression and is termed fuzzy–CCI equaliser. The fuzzy–CCI equaliser provides a performance comparable to the MAP equaliser designed for channels corrupted with CCI. However the structure of this equaliser is similar to the MAP equaliser that treats CCI as AWGN. A decision feedback form of this equaliser which uses a subset of channel states based on the feedback state is derived. Simulation studies presented in this thesis demonstrate that the fuzzy–CCI equaliser can effectively remove CCI without much increase in computational complexity. This equaliser is also successful in removing interference from more than one CCI sources, where as the MAP equalisers treating CCI as AWGN fail. This fuzzy–CCI equaliser can be treated as a fuzzy equaliser with a preprocessor for CCI suppression, and the preprocessor can be removed under high signal to interference ratio condition

    Coded-OFDM for PLC systems in non-Gaussian noise channels

    Get PDF
    PhD ThesisNowadays, power line communication (PLC) is a technology that uses the power line grid for communication purposes along with transmitting electrical energy, for providing broadband services to homes and offices such as high-speed data, audio, video and multimedia applications. The advantages of this technology are to eliminate the need for new wiring and AC outlet plugs by using an existing infrastructure, ease of installation and reduction of the network deployment cost. However, the power line grid is originally designed for the transmission of the electric power at low frequencies; i.e. 50/60 Hz. Therefore, the PLC channel appears as a harsh medium for low-power high-frequency communication signals. The development of PLC systems for providing high-speed communication needs precise knowledge of the channel characteristics such as the attenuation, non-Gaussian noise and selective fading. Non-Gaussian noise in PLC channels can classify into Nakagami-m background interference (BI) noise and asynchronous impulsive noise (IN) modelled by a Bernoulli-Gaussian mixture (BGM) model or Middleton class A (MCA) model. Besides the effects of the multipath PLC channel, asynchronous impulsive noise is the main reason causing performance degradation in PLC channels. Binary/non-binary low-density parity check B/NB-(LDPC) codes and turbo codes (TC) with soft iterative decoders have been proposed for Orthogonal Frequency Division Multiplexing (OFDM) system to improve the bit error rate (BER) performance degradation by exploiting frequency diversity. The performances are investigated utilizing high-order quadrature amplitude modulation (QAM) in the presence of non-Gaussian noise over multipath broadband power-line communication (BBPLC) channels. OFDM usually spreads the effect of IN over multiple sub-carriers after discrete Fourier transform (DFT) operation at the receiver, hence, it requires only a simple single-tap zero forcing (ZF) equalizer at the receiver. The thesis focuses on improving the performance of iterative decoders by deriving the effective, complex-valued, ratio distributions of the noise samples at the zeroforcing (ZF) equalizer output considering the frequency-selective multipath PLCs, background interference noise and impulsive noise, and utilizing the outcome for computing the apriori log likelihood ratios (LLRs) required for soft decoding algorithms. On the other hand, Physical-Layer Network Coding (PLNC) is introduced to help the PLC system to extend the range of operation for exchanging information between two users (devices) using an intermediate relay (hub) node in two-time slots in the presence of non-Gaussian noise over multipath PLC channels. A novel detection scheme is proposed to transform the transmit signal constellation based on the frequency-domain channel coefficients to optimize detection at the relay node with newly derived noise PDF at the relay and end nodes. Additionally, conditions for optimum detection utilizing a high-order constellation are derived. The closedform expressions of the BER and average BER upper-bound (AUB) are derived for a point-to-point system, and for a PLNC system at the end node to relay, relay to end node and at the end-to-end nodes. Moreover, the convergence behaviour of iterative decoders is evaluated using EXtrinsic Information Transfer (EXIT) chart analysis and upper bound analyses. Furthermore, an optimization of the threshold determination for clipping and blanking impulsive noise mitigation methods are derived. The proposed systems are compared in performance using simulation in MATLAB and analytical methods.Ministry of Higher Education in Ira

    Investigating the build-up of precedence effect using reflection masking

    Get PDF
    The auditory processing level involved in the build‐up of precedence [Freyman et al., J. Acoust. Soc. Am. 90, 874–884 (1991)] has been investigated here by employing reflection masked threshold (RMT) techniques. Given that RMT techniques are generally assumed to address lower levels of the auditory signal processing, such an approach represents a bottom‐up approach to the buildup of precedence. Three conditioner configurations measuring a possible buildup of reflection suppression were compared to the baseline RMT for four reflection delays ranging from 2.5–15 ms. No buildup of reflection suppression was observed for any of the conditioner configurations. Buildup of template (decrease in RMT for two of the conditioners), on the other hand, was found to be delay dependent. For five of six listeners, with reflection delay=2.5 and 15 ms, RMT decreased relative to the baseline. For 5‐ and 10‐ms delay, no change in threshold was observed. It is concluded that the low‐level auditory processing involved in RMT is not sufficient to realize a buildup of reflection suppression. This confirms suggestions that higher level processing is involved in PE buildup. The observed enhancement of reflection detection (RMT) may contribute to active suppression at higher processing levels

    Speech Recognition

    Get PDF
    Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation for speech signals and the methods for speech-features extraction, acoustic and language modeling, efficient algorithms for searching the hypothesis space, and multimodal approaches to speech recognition. The last part of the book is devoted to other speech processing applications that can use the information from automatic speech recognition for speaker identification and tracking, for prosody modeling in emotion-detection systems and in other speech processing applications that are able to operate in real-world environments, like mobile communication services and smart homes

    Effects of intracranial stimulation and the involvement of the human parahippocampal cortex in perception

    Get PDF
    How the human brain translates photons hitting the retina into conscious perception remains an open question. Throughout the medial temporal lobe (MTL), there are neurons (called concept cells) that change their firing rate when that neuron's preferred concept, e.g., a specific person or object, is seen. The firing rate of concept cells is correlated with perception. Nevertheless, it remains unclear whether or to what extent concept cells are involved in perceptogenesis, i.e., the creation of conscious percepts. Inferring from studies in monkeys, concept-specific neurons involved in perceptogenesis would be expected along the ventral and dorsal stream of visual processing (also called the what and where pathway, respectively). Various regions that are part of the dorsal stream are connected to the parahippocampal cortex (PHC), a region within the MTL. Compared to other MTL regions, lower selectivity, the absence of multimodal responses, and especially the shorter response latencies do not exclude an involvement of the PHC in perceptogenesis. In fact, damage to the parahippocampal place area (PPA, a part of the PHC) results in topographical disorientation. The goal of this thesis is to test the involvement of the PHC in perception by using electrical stimulation during a forced-choice categorization task involving landscapes versus animals. First, we determined effective parameters for intracranial stimulation of brain tissue in epilepsy patients implanted with depth-electrodes for seizure monitoring. We investigated the effects of amplitude, phase width, frequency, and pulse-train duration on neuronal firing, the local field potential (LFP), and behavioral responses to evoked percepts. Frequency and charge per phase were the most influential parameters on all three signals. Both parameters showed a positive effect on event-related potentials (ERPs) in the LFP. Higher frequencies (especially around 200 Hz) lead to a short-term inhibition of neuronal firing, while higher charge per phase can have an inhibitory or excitatory effect on neuronal firing. All parameters had a positive effect on the reports of evoked percepts; on reports of phosphenes in response to stimulating close to the optic radiation as well as on reports of auditory verbal hallucinations in response to stimulating Heschl's gyrus. Using functional magnetic resonance imaging (fMRI), we found that the PPA, i.e., the part of the PHC that is most selective towards images of landscapes, is rather small (up to 1‰ of total brain volume per hemisphere) with varying degrees of hemispheric laterality. Stimulating the PHC outside of the PPA - using a 100 ms high-frequency pulse train delivered at the natural response latency of the PHC - had no effect on categorizing landscapes. However, stimulating inside the PPA, close to the peak activation of the fMRI cluster, resulted in a 7% to 10% increase in landscape responses to ambiguous stimuli. Furthermore, stimulating the PPA also led to an increase in behavioral response time, especially to images with a predominant landscape component. None of our patients reported visual hallucinations of places or scenes in response to our stimulation protocols. Our data suggests that the PPA is involved in the perceptogenesis of landscapes at a stage that does not reach awareness, while the rest of the PHC is unlikely to be involved in perceptogenesis, at least not as it pertains to the perception of landscapes or animals. We also developed an online spike sorting algorithm and an adaptive screening procedure for concept cells to pave the way for new paradigms involving informed feedback
    corecore