87 research outputs found
Enhancing Context Modeling with a Query-Guided Capsule Network for Document-level Translation
Context modeling is essential to generate coherent and consistent translation
for Document-level Neural Machine Translations. The widely used method for
document-level translation usually compresses the context information into a
representation via hierarchical attention networks. However, this method
neither considers the relationship between context words nor distinguishes the
roles of context words. To address this problem, we propose a query-guided
capsule networks to cluster context information into different perspectives
from which the target translation may concern. Experiment results show that our
method can significantly outperform strong baselines on multiple data sets of
different domains.Comment: 11 pages, 7 figures, 2019 Conference on Empirical Methods in Natural
Language Processin
Quality at the Tail
Practical applications employing deep learning must guarantee inference
quality. However, we found that the inference quality of state-of-the-art and
state-of-the-practice in practical applications has a long tail distribution.
In the real world, many tasks have strict requirements for the quality of deep
learning inference, such as safety-critical and mission-critical tasks. The
fluctuation of inference quality seriously affects its practical applications,
and the quality at the tail may lead to severe consequences. State-of-the-art
and state-of-the-practice with outstanding inference quality designed and
trained under loose constraints still have poor inference quality under
constraints with practical application significance. On the one hand, the
neural network models must be deployed on complex systems with limited
resources. On the other hand, safety-critical and mission-critical tasks need
to meet more metric constraints while ensuring high inference quality.
We coin a new term, ``tail quality,'' to characterize this essential
requirement and challenge. We also propose a new metric,
``X-Critical-Quality,'' to measure the inference quality under certain
constraints. This article reveals factors contributing to the failure of using
state-of-the-art and state-of-the-practice algorithms and systems in real
scenarios. Therefore, we call for establishing innovative methodologies and
tools to tackle this enormous challenge.Comment: 9 pages, 4 figure
RAFL: A Robust and Adaptive Federated Meta-Learning Framework Against Adversaries
With the emergence of data silos and increasing privacy awareness, traditional centralized machine learning provides limited support. Federated learning (FL), as a promising alternative machine learning approach, is capable of leveraging distributed personalized datasets from multiple clients to train a shared global model in a privacy-preserving manner. However, FL systems are vulnerable to attacker-controlled adversarial clients that potentially conduct adversarial attacks by uploading unreliable model updates or clients unintentionally uploading low-quality models leading to degraded FL performance and reduced resilience to attacks. In this paper, we propose RAFL: a new robust-by-design federated meta learning framework capable of mitigating adversarial model updates on non-IID data. RAFL leverages 1) a residual rule-based detection method and a Variational AutoEncoder (VAE) learning based detection method combined to distinguish adversarial clients from benign clients. 2) a similarity-based model aggregation method to reduce the likelihood of uploading adversarial models from adversarial clients. 3) multiple learning loops to collaboratively train multiple personalized detection models against adversaries effectively. Experimental results demonstrate that our proposed FL framework is robust by design and outperforms other defensive methods against adversaries in terms of model accuracy and efficiency
Targeted aspect based multimodal sentiment analysis:an attention capsule extraction and multi-head fusion network
Multimodal sentiment analysis has currently identified its significance in a
variety of domains. For the purpose of sentiment analysis, different aspects of
distinguishing modalities, which correspond to one target, are processed and
analyzed. In this work, we propose the targeted aspect-based multimodal
sentiment analysis (TABMSA) for the first time. Furthermore, an attention
capsule extraction and multi-head fusion network (EF-Net) on the task of TABMSA
is devised. The multi-head attention (MHA) based network and the ResNet-152 are
employed to deal with texts and images, respectively. The integration of MHA
and capsule network aims to capture the interaction among the multimodal
inputs. In addition to the targeted aspect, the information from the context
and the image is also incorporated for sentiment delivered. We evaluate the
proposed model on two manually annotated datasets. the experimental results
demonstrate the effectiveness of our proposed model for this new task
Location-based Robust Beamforming Design for Cellular-enabled UAV Communications
Cellular communications have been regarded as promising approaches to deliver high-broadband communication links for Unmanned Aerial Vehicles (UAVs), which have been widely deployed to conduct various missions, e.g. precision agriculture, forest monitoring and border patrol. However, the unique features of aerial UAVs including high-altitude manipulation, three-dimension (3D) mobility, and rapid velocity changes, pose challenging issues to realize reliable cellular-enabled UAV communications, especially with the severe inter-cell interference generated by UAVs. To deal with this issue, we propose a novel position-based robust beamforming algorithm through complementarily integrating the navigation information and wireless channel information to improve the performance of cellular-enabled UAV communications. Specifically, in order to achieve the optimal beam weight vector, the navigation information of the UAV system is innovatively exploited to predict the changes of Direction-of-arrival (DoA) angle. To fight against the high mobility of UAV operations, an optimization problem is formed by considering the tapered surface of DoA angle and solved to correct the inherent position error. Comprehensive simulation experiments are conducted and the results show that the proposed robust beamforming algorithm could achieve over 90% DoA estimation error reduction and up to 14dB SINR gain compared with five benchmark beamforming algorithms, including Linearly Constrained Minimum Variance (LCMV), Position-based beamforming, Diagonal Loading (DL), Robust Capon Beamforming (RCB) and Robust LCMV algorithm
Edge-Computing-Based Channel Allocation for Deadline-Driven IoT Networks
Multichannel communication is an important means to improve the reliability of low-power Internet-of-Things (IoT) networks. Typically, data transmissions in IoT networks are often required to be delivered before a given deadline, making deadline-driven channel allocation an essential task. The existing works on time-division multiple access often fail to establish channel schedules to meet the deadline requirement, as they often assume that transmissions can be successful within one transmission slot. Besides, the allocation and link estimation incur considerable overhead for the IoT nodes. In this article, we propose an edge-based channel allocation (ECA) for unreliable IoT networks. In ECA, we explicitly consider the impact of allocation sequences and employ a recurrent-neural-network-based channel estimation scheme. We utilize link quality and retransmission opportunities to maximize the packet delivery ratio before deadline. The allocation algorithms are executed on edge servers such that: 1) the channel allocation can be updated more frequently to deal with the wireless dynamics; 2) the allocation results can be obtained in real time; and 3) channel estimation can be more accurate. Extensive evaluation results show that ECA can significantly improve the reliability of deadline-driven IoT networks
Dynamic Context-guided Capsule Network for Multimodal Machine Translation
Multimodal machine translation (MMT), which mainly focuses on enhancing
text-only translation with visual features, has attracted considerable
attention from both computer vision and natural language processing
communities. Most current MMT models resort to attention mechanism, global
context modeling or multimodal joint representation learning to utilize visual
features. However, the attention mechanism lacks sufficient semantic
interactions between modalities while the other two provide fixed visual
context, which is unsuitable for modeling the observed variability when
generating translation. To address the above issues, in this paper, we propose
a novel Dynamic Context-guided Capsule Network (DCCN) for MMT. Specifically, at
each timestep of decoding, we first employ the conventional source-target
attention to produce a timestep-specific source-side context vector. Next, DCCN
takes this vector as input and uses it to guide the iterative extraction of
related visual features via a context-guided dynamic routing mechanism.
Particularly, we represent the input image with global and regional visual
features, we introduce two parallel DCCNs to model multimodal context vectors
with visual features at different granularities. Finally, we obtain two
multimodal context vectors, which are fused and incorporated into the decoder
for the prediction of the target word. Experimental results on the Multi30K
dataset of English-to-German and English-to-French translation demonstrate the
superiority of DCCN. Our code is available on
https://github.com/DeepLearnXMU/MM-DCCN
- …