
    PLOMaR: An Ontology Framework for Context Modeling and Reasoning on Crowd-Sensing Platforms

    Crowd-sensing is a popular way to sense and collect data using smartphones, revealing user behaviors and their correlations with device performance. PhoneLab is one of the largest crowd-sensing platforms based on the Android system. Through experimental instrumentation and system modifications, researchers can tap into a sea of insightful information that can be further processed to reveal valuable context information about the device, the user, and the environment. However, PhoneLab data is in JSON format, and reasoning over data in this format is not straightforward. In this paper, we introduce PLOMaR, an ontology framework that uses SPARQL rules to help researchers access information and derive new information without complex data processing. The goals are to (i) make the measurement data more accessible, (ii) increase the interoperability and reusability of data gathered from different sources, and (iii) develop an extensible data representation to support future development of the PhoneLab platform. We describe the models, the JSON-to-RDF mapping processes, and the SPARQL rules used for deriving new information. We evaluate our framework with three application examples based on the sample dataset provided.
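
    As a rough illustration of the JSON-to-RDF mapping and SPARQL-rule derivation described above, the following Python sketch uses the rdflib library. The pl: namespace, the property names, and the sample log record are hypothetical stand-ins, not PLOMaR's actual vocabulary.

        import json
        from rdflib import Graph, Literal, Namespace, RDF

        PL = Namespace("http://example.org/plomar#")  # hypothetical namespace

        record = json.loads(
            '{"LogcatToken": 42, "tag": "BatteryService", "level": 87}'
        )

        g = Graph()
        g.bind("pl", PL)

        # Map each JSON field to a triple on a per-record subject node.
        subject = PL[f"record/{record['LogcatToken']}"]
        g.add((subject, RDF.type, PL.LogRecord))
        g.add((subject, PL.tag, Literal(record["tag"])))
        g.add((subject, PL.batteryLevel, Literal(record["level"])))

        # A SPARQL rule-style query deriving a new fact: records whose
        # battery level is below 90 are flagged as low-battery contexts.
        results = g.query("""
            PREFIX pl: <http://example.org/plomar#>
            SELECT ?r WHERE {
                ?r a pl:LogRecord ;
                   pl:batteryLevel ?lvl .
                FILTER (?lvl < 90)
            }
        """)
        for row in results:
            print(row.r)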

    Data-driven Attention and Data-independent DCT based Global Context Modeling for Text-independent Speaker Recognition

    Learning an effective speaker representation is crucial for achieving reliable performance in speaker verification tasks. Speech signals are high-dimensional, long, variable-length sequences with a complex hierarchical structure, and they may contain diverse information at each time-frequency (TF) location. For example, it may be more beneficial to focus on high-energy regions for phoneme classes such as fricatives. Standard convolutional layers, which operate on neighboring local regions, cannot capture this complex global TF context. In this study, a general global time-frequency context modeling framework is proposed to leverage context information specifically for speaker representation modeling. First, a data-driven attention-based context model is introduced to capture the long-range, non-local relationships across different time-frequency locations. Second, a data-independent 2D-DCT based context model is proposed to improve model interpretability, and a multi-DCT attention mechanism is presented to improve modeling power with alternate DCT basis forms. Finally, the global context information is used to recalibrate salient time-frequency locations by computing the similarity between the global context and local features. The proposed lightweight blocks can be easily incorporated into a speaker model with little additional computational cost, and they improve speaker verification performance over the standard ResNet model and Squeeze-and-Excitation block by a large margin. Detailed ablation studies analyze the factors that affect the performance of the individual modules. Experimental results show that the proposed global context modeling framework efficiently improves the learned speaker representations through channel-wise and time-frequency feature recalibration.
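
    A minimal PyTorch sketch of the attention-based variant of this idea follows: an attention map over all time-frequency locations pools a global context vector, which then recalibrates the channels of the local features. The layer sizes and shapes are illustrative assumptions, not the paper's exact architecture.

        import torch
        import torch.nn as nn
        import torch.nn.functional as F

        class GlobalTFContext(nn.Module):
            def __init__(self, channels: int, reduction: int = 8):
                super().__init__()
                # 1x1 conv produces one attention logit per TF location.
                self.attn = nn.Conv2d(channels, 1, kernel_size=1)
                self.transform = nn.Sequential(
                    nn.Linear(channels, channels // reduction),
                    nn.ReLU(inplace=True),
                    nn.Linear(channels // reduction, channels),
                    nn.Sigmoid(),
                )

            def forward(self, x: torch.Tensor) -> torch.Tensor:
                b, c, f, t = x.shape
                # Data-driven attention over all TF locations (long-range, non-local).
                logits = self.attn(x).view(b, 1, f * t)
                weights = F.softmax(logits, dim=-1)                    # (b, 1, f*t)
                context = torch.bmm(
                    weights, x.view(b, c, f * t).transpose(1, 2)
                ).squeeze(1)                                           # (b, c) global context
                # Channel-wise recalibration of local features by the global context.
                scale = self.transform(context).view(b, c, 1, 1)
                return x * scale

        # Example: recalibrate a batch of 2 feature maps with 64 channels.
        block = GlobalTFContext(64)
        y = block(torch.randn(2, 64, 40, 100))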

    Context-awareness

    Context-aware computing has increasingly gained the attention of the research community because, as is the case with human interactions, context information provides the background against which communicative acts can be interpreted more accurately, without the need to explicitly state everything that might be relevant. If, within an agent negotiation for buying some specific service, the service provider says “the price is 20 Euros”, the receiver would not be capable of fully interpreting the meaning of the message without using the context created by the whole conversation. Context information also provides the basis for more efficient information processing mechanisms, since irrelevant information can be discarded in the early stages of processing. For instance, if a patient’s personal assistance agent is looking for a service that would sell a specific medicine and deliver it to the patient’s home, this would be achieved by creating a compound service consisting of an on-line pharmacy and a medicine transportation service. Using context information about the patient’s location, the service composition process may discard service providers located far away from the client and create the compound service considering only a small number of all existing services in the relevant categories. Finally, context information enables better-adapted behavior, since a context-aware system can be directed more closely towards the client’s requirements in the circumstances of the interaction.
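
    The early-discarding idea in the pharmacy example can be sketched in a few lines of Python; all names, coordinates, and the 10 km threshold are invented for illustration.

        from math import radians, sin, cos, asin, sqrt

        def haversine_km(lat1, lon1, lat2, lon2):
            """Great-circle distance between two (lat, lon) points in km."""
            dlat, dlon = radians(lat2 - lat1), radians(lon2 - lon1)
            a = (sin(dlat / 2) ** 2
                 + cos(radians(lat1)) * cos(radians(lat2)) * sin(dlon / 2) ** 2)
            return 2 * 6371 * asin(sqrt(a))

        patient = (38.72, -9.14)  # hypothetical patient location
        pharmacies = [
            {"name": "PharmaA", "pos": (38.74, -9.15)},
            {"name": "PharmaB", "pos": (41.15, -8.61)},  # far away: discarded early
        ]

        # Context filter: keep only providers within 10 km of the patient.
        nearby = [
            p for p in pharmacies
            if haversine_km(*patient, *p["pos"]) < 10
        ]
        print([p["name"] for p in nearby])  # only PharmaA survives the filter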

    PKCAM: Previous Knowledge Channel Attention Module

    Recently, attention mechanisms have been explored with ConvNets across both the spatial and channel dimensions. However, to our knowledge, all existing methods devote their attention modules to capturing local interactions at a single scale. In this paper, we propose a Previous Knowledge Channel Attention Module (PKCAM) that captures channel-wise relations across different layers to model the global context. PKCAM is easily integrated into any feed-forward CNN architecture and trained in an end-to-end fashion, with a negligible footprint due to its lightweight design. We validate our architecture through extensive experiments on image classification and object detection tasks with different backbones. Our experiments show consistent performance improvements over the counterparts. Our code is published at https://github.com/eslambakr/EMCA.
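
    A rough PyTorch sketch of the cross-layer channel attention idea follows; the layer sizes and the fusion scheme are assumptions on our part, and the authors' actual implementation is available at the repository linked above.

        import torch
        import torch.nn as nn

        class CrossLayerChannelAttention(nn.Module):
            def __init__(self, prev_channels: int, curr_channels: int,
                         reduction: int = 16):
                super().__init__()
                self.pool = nn.AdaptiveAvgPool2d(1)
                self.fuse = nn.Sequential(
                    nn.Linear(prev_channels + curr_channels,
                              curr_channels // reduction),
                    nn.ReLU(inplace=True),
                    nn.Linear(curr_channels // reduction, curr_channels),
                    nn.Sigmoid(),
                )

            def forward(self, prev_feat, curr_feat):
                b = curr_feat.size(0)
                # Squeeze both feature maps to per-channel descriptors.
                d_prev = self.pool(prev_feat).view(b, -1)
                d_curr = self.pool(curr_feat).view(b, -1)
                # Fuse descriptors across layers into channel-wise weights.
                w = self.fuse(torch.cat([d_prev, d_curr], dim=1))
                return curr_feat * w.view(b, -1, 1, 1)

        attn = CrossLayerChannelAttention(prev_channels=64, curr_channels=128)
        out = attn(torch.randn(2, 64, 56, 56), torch.randn(2, 128, 28, 28))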

    Bayesian parameter estimation with prior weighting in ALT model

    This paper provides an overview of the application of Bayesian inference to accelerated life testing (ALT) models, for the concrete case of estimation by the Maximum A Posteriori (MAP) method under constant stress levels. It studies Bayesian inference over the accelerated life model presented in [1], and it unites, integrates, and generalizes the particular cases presented in [2] and [3]. Towards the end, a weighting of the prior information according to the data is integrated. The paper also illustrates the approach with an experimental example.
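
    The following Python sketch shows generic MAP estimation with a weighted prior, in the spirit of the approach described above, for an exponential lifetime model at a single constant stress level. The data, the Gaussian prior, and the weight w are illustrative assumptions, not the paper's formulation.

        import numpy as np
        from scipy.optimize import minimize_scalar

        lifetimes = np.array([120.0, 95.0, 150.0, 80.0, 110.0])  # fake ALT data
        mu0, sigma0 = 100.0, 30.0   # Gaussian prior on the mean lifetime
        w = 0.5                     # prior weight: 0 = pure ML, 1 = full prior

        def neg_log_posterior(theta):
            if theta <= 0:
                return np.inf
            # Exponential likelihood with mean lifetime theta.
            nll = len(lifetimes) * np.log(theta) + lifetimes.sum() / theta
            # Weighted Gaussian log-prior on theta.
            nlp = w * (theta - mu0) ** 2 / (2 * sigma0 ** 2)
            return nll + nlp

        map_est = minimize_scalar(neg_log_posterior, bounds=(1.0, 1000.0),
                                  method="bounded")
        print(f"MAP estimate of mean lifetime: {map_est.x:.1f}")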

    Speech Recognition Using Augmented Conditional Random Fields

    Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although HMMs are a natural choice for warping the time axis and modeling the temporal phenomena in the speech signal, their conditional independence properties limit their ability to model spectral phenomena well. In this paper, a new acoustic modeling paradigm based on augmented conditional random fields (ACRFs) is investigated and developed. This paradigm addresses some limitations of HMMs while maintaining many of the aspects that have made them successful. In particular, the acoustic modeling problem is reformulated in a data-driven, sparse, augmented space to increase discrimination, and acoustic context modeling is explicitly integrated to handle the sequential phenomena of the speech signal. We present an efficient framework for estimating these models that ensures scalability and generality. On the TIMIT phone recognition task, a phone error rate of 23.0% was recorded on the full test set, a significant improvement over comparable HMM-based systems.
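
    For orientation, the sketch below implements the forward algorithm of a plain linear-chain CRF in NumPy, the machinery that ACRFs build on by augmenting the feature space; the dimensions and potentials are invented.

        import numpy as np
        from scipy.special import logsumexp

        T, K = 6, 4                       # frames, phone states
        unary = np.random.randn(T, K)     # per-frame (acoustic) log-potentials
        trans = np.random.randn(K, K)     # state-transition log-potentials

        # Forward recursion in log space: after step t, alpha[j] sums over
        # all label prefixes ending in state j at frame t.
        alpha = unary[0].copy()
        for t in range(1, T):
            alpha = unary[t] + logsumexp(alpha[:, None] + trans, axis=0)
        log_Z = logsumexp(alpha)          # log-partition over all sequences
        print(log_Z)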

    Scene Graph Lossless Compression with Adaptive Prediction for Objects and Relations

    The scene graph is a data structure describing objects and their pairwise relationships within image scenes. As the size of scene graphs in vision applications grows, losslessly and efficiently storing such data on disk or transmitting it over the network becomes an unavoidable problem. However, scene graph compression has seldom been studied, because of the complicated data structures and distributions involved. Existing solutions usually rely on general-purpose compressors or graph-structure compression methods, which are weak at reducing the redundancy in scene graph data. This paper introduces a new lossless compression framework with adaptive predictors for the joint compression of objects and relations in scene graph data. The proposed framework consists of a unified prior extractor and specialized element predictors adapted to the different data elements. Furthermore, to exploit the context information within and between graph elements, Graph Context Convolution is proposed to support different graph context modeling schemes for different graph elements. Finally, a learned distribution model is devised to predict numerical data under complicated conditional constraints. Experiments conducted on labeled and generated scene graphs prove the effectiveness of the proposed framework on the scene graph lossless compression task.
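
    The principle underlying such learned lossless coders can be sketched briefly in Python: an adaptive predictor assigns a probability to each symbol, and an arithmetic coder approaches the negative log-likelihood in bits. The toy distribution below stands in for the paper's learned, context-conditioned predictors.

        import numpy as np

        symbols = [2, 0, 2, 3, 1]                 # e.g. object-class indices
        probs = np.array([0.1, 0.2, 0.5, 0.2])    # predictor's distribution

        # Ideal code length an arithmetic coder approaches: -log2 p(symbol).
        bits = -np.log2(probs[symbols])
        print(f"total ~ {bits.sum():.2f} bits "
              f"({bits.sum() / len(symbols):.2f} bits/symbol)")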

    Improving SPIHT-based Compression of Volumetric Medical Data

    Volumetric medical data (CT, MR) are useful tools for diagnostic investigation; however, their use may be hindered by the amount of data to store or by the duration of transmission over a limited-capacity channel. To code such information sources, we present a progressive three-dimensional image compression algorithm based on a zerotree wavelet coder with arithmetic coding. We use a 3D separable biorthogonal wavelet transform and extend the zerotree SPIHT algorithm to three dimensions. Moreover, we propose some improvements to the SPIHT encoder that yield better rate-distortion performance without increasing computational complexity. Finally, we propose an efficient context-based adaptive arithmetic coder that eliminates high-order redundancy. The results obtained for progressive coding of a test CT volume are better than those presented in recent similar works, both for the mean PSNR over the whole volume and for the PSNR homogeneity between slices.
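
    The first stage of this pipeline, a separable 3D biorthogonal wavelet transform, can be sketched in Python with PyWavelets; the random volume and the 'bior4.4' filter choice are illustrative, and the paper's exact filter bank may differ.

        import numpy as np
        import pywt

        volume = np.random.rand(64, 64, 64)  # stand-in for a CT volume

        # Multi-level separable 3D decomposition; coeffs[0] is the coarse
        # approximation, later entries hold detail subbands per level.
        coeffs = pywt.wavedecn(volume, wavelet="bior4.4", level=3)
        print(coeffs[0].shape, sorted(coeffs[1].keys()))

        # Perfect reconstruction confirms the transform is invertible up to
        # floating-point precision, as progressive coding requires.
        rec = pywt.waverecn(coeffs, wavelet="bior4.4")
        assert np.allclose(rec, volume)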