87 research outputs found

    Enhancing Context Modeling with a Query-Guided Capsule Network for Document-level Translation

    Full text link
    Context modeling is essential to generate coherent and consistent translation for Document-level Neural Machine Translations. The widely used method for document-level translation usually compresses the context information into a representation via hierarchical attention networks. However, this method neither considers the relationship between context words nor distinguishes the roles of context words. To address this problem, we propose a query-guided capsule networks to cluster context information into different perspectives from which the target translation may concern. Experiment results show that our method can significantly outperform strong baselines on multiple data sets of different domains.Comment: 11 pages, 7 figures, 2019 Conference on Empirical Methods in Natural Language Processin

    Quality at the Tail

    Full text link
    Practical applications employing deep learning must guarantee inference quality. However, we found that the inference quality of state-of-the-art and state-of-the-practice in practical applications has a long tail distribution. In the real world, many tasks have strict requirements for the quality of deep learning inference, such as safety-critical and mission-critical tasks. The fluctuation of inference quality seriously affects its practical applications, and the quality at the tail may lead to severe consequences. State-of-the-art and state-of-the-practice with outstanding inference quality designed and trained under loose constraints still have poor inference quality under constraints with practical application significance. On the one hand, the neural network models must be deployed on complex systems with limited resources. On the other hand, safety-critical and mission-critical tasks need to meet more metric constraints while ensuring high inference quality. We coin a new term, ``tail quality,'' to characterize this essential requirement and challenge. We also propose a new metric, ``X-Critical-Quality,'' to measure the inference quality under certain constraints. This article reveals factors contributing to the failure of using state-of-the-art and state-of-the-practice algorithms and systems in real scenarios. Therefore, we call for establishing innovative methodologies and tools to tackle this enormous challenge.Comment: 9 pages, 4 figure

    RAFL: A Robust and Adaptive Federated Meta-Learning Framework Against Adversaries

    Get PDF
    With the emergence of data silos and increasing privacy awareness, traditional centralized machine learning provides limited support. Federated learning (FL), as a promising alternative machine learning approach, is capable of leveraging distributed personalized datasets from multiple clients to train a shared global model in a privacy-preserving manner. However, FL systems are vulnerable to attacker-controlled adversarial clients that potentially conduct adversarial attacks by uploading unreliable model updates or clients unintentionally uploading low-quality models leading to degraded FL performance and reduced resilience to attacks. In this paper, we propose RAFL: a new robust-by-design federated meta learning framework capable of mitigating adversarial model updates on non-IID data. RAFL leverages 1) a residual rule-based detection method and a Variational AutoEncoder (VAE) learning based detection method combined to distinguish adversarial clients from benign clients. 2) a similarity-based model aggregation method to reduce the likelihood of uploading adversarial models from adversarial clients. 3) multiple learning loops to collaboratively train multiple personalized detection models against adversaries effectively. Experimental results demonstrate that our proposed FL framework is robust by design and outperforms other defensive methods against adversaries in terms of model accuracy and efficiency

    PPFM:An Adaptive and Hierarchical Peer-to-Peer Federated Meta-Learning Framework

    Get PDF

    Targeted aspect based multimodal sentiment analysis:an attention capsule extraction and multi-head fusion network

    Get PDF
    Multimodal sentiment analysis has currently identified its significance in a variety of domains. For the purpose of sentiment analysis, different aspects of distinguishing modalities, which correspond to one target, are processed and analyzed. In this work, we propose the targeted aspect-based multimodal sentiment analysis (TABMSA) for the first time. Furthermore, an attention capsule extraction and multi-head fusion network (EF-Net) on the task of TABMSA is devised. The multi-head attention (MHA) based network and the ResNet-152 are employed to deal with texts and images, respectively. The integration of MHA and capsule network aims to capture the interaction among the multimodal inputs. In addition to the targeted aspect, the information from the context and the image is also incorporated for sentiment delivered. We evaluate the proposed model on two manually annotated datasets. the experimental results demonstrate the effectiveness of our proposed model for this new task

    Location-based Robust Beamforming Design for Cellular-enabled UAV Communications

    Get PDF
    Cellular communications have been regarded as promising approaches to deliver high-broadband communication links for Unmanned Aerial Vehicles (UAVs), which have been widely deployed to conduct various missions, e.g. precision agriculture, forest monitoring and border patrol. However, the unique features of aerial UAVs including high-altitude manipulation, three-dimension (3D) mobility, and rapid velocity changes, pose challenging issues to realize reliable cellular-enabled UAV communications, especially with the severe inter-cell interference generated by UAVs. To deal with this issue, we propose a novel position-based robust beamforming algorithm through complementarily integrating the navigation information and wireless channel information to improve the performance of cellular-enabled UAV communications. Specifically, in order to achieve the optimal beam weight vector, the navigation information of the UAV system is innovatively exploited to predict the changes of Direction-of-arrival (DoA) angle. To fight against the high mobility of UAV operations, an optimization problem is formed by considering the tapered surface of DoA angle and solved to correct the inherent position error. Comprehensive simulation experiments are conducted and the results show that the proposed robust beamforming algorithm could achieve over 90% DoA estimation error reduction and up to 14dB SINR gain compared with five benchmark beamforming algorithms, including Linearly Constrained Minimum Variance (LCMV), Position-based beamforming, Diagonal Loading (DL), Robust Capon Beamforming (RCB) and Robust LCMV algorithm

    Edge-Computing-Based Channel Allocation for Deadline-Driven IoT Networks

    Get PDF
    Multichannel communication is an important means to improve the reliability of low-power Internet-of-Things (IoT) networks. Typically, data transmissions in IoT networks are often required to be delivered before a given deadline, making deadline-driven channel allocation an essential task. The existing works on time-division multiple access often fail to establish channel schedules to meet the deadline requirement, as they often assume that transmissions can be successful within one transmission slot. Besides, the allocation and link estimation incur considerable overhead for the IoT nodes. In this article, we propose an edge-based channel allocation (ECA) for unreliable IoT networks. In ECA, we explicitly consider the impact of allocation sequences and employ a recurrent-neural-network-based channel estimation scheme. We utilize link quality and retransmission opportunities to maximize the packet delivery ratio before deadline. The allocation algorithms are executed on edge servers such that: 1) the channel allocation can be updated more frequently to deal with the wireless dynamics; 2) the allocation results can be obtained in real time; and 3) channel estimation can be more accurate. Extensive evaluation results show that ECA can significantly improve the reliability of deadline-driven IoT networks

    Dynamic Context-guided Capsule Network for Multimodal Machine Translation

    Full text link
    Multimodal machine translation (MMT), which mainly focuses on enhancing text-only translation with visual features, has attracted considerable attention from both computer vision and natural language processing communities. Most current MMT models resort to attention mechanism, global context modeling or multimodal joint representation learning to utilize visual features. However, the attention mechanism lacks sufficient semantic interactions between modalities while the other two provide fixed visual context, which is unsuitable for modeling the observed variability when generating translation. To address the above issues, in this paper, we propose a novel Dynamic Context-guided Capsule Network (DCCN) for MMT. Specifically, at each timestep of decoding, we first employ the conventional source-target attention to produce a timestep-specific source-side context vector. Next, DCCN takes this vector as input and uses it to guide the iterative extraction of related visual features via a context-guided dynamic routing mechanism. Particularly, we represent the input image with global and regional visual features, we introduce two parallel DCCNs to model multimodal context vectors with visual features at different granularities. Finally, we obtain two multimodal context vectors, which are fused and incorporated into the decoder for the prediction of the target word. Experimental results on the Multi30K dataset of English-to-German and English-to-French translation demonstrate the superiority of DCCN. Our code is available on https://github.com/DeepLearnXMU/MM-DCCN
    corecore