92 research outputs found
Towards a Deep Understanding of Multilingual End-to-End Speech Translation
In this paper, we employ Singular Value Canonical Correlation Analysis
(SVCCA) to analyze representations learnt in a multilingual end-to-end speech
translation model trained over 22 languages. SVCCA enables us to estimate
representational similarity across languages and layers, enhancing our
understanding of the functionality of multilingual speech translation and its
potential connection to multilingual neural machine translation. The
multilingual speech translation model is trained on the CoVoST 2 dataset in all
possible directions, and we utilize LASER to extract parallel bitext data for
SVCCA analysis. We derive three major findings from our analysis: (I)
Linguistic similarity loses its efficacy in multilingual speech translation
when the training data for a specific language is limited. (II) Enhanced
encoder representations and well-aligned audio-text data significantly improve
translation quality, surpassing the bilingual counterparts when the training
data is not compromised. (III) The encoder representations of multilingual
speech translation demonstrate superior performance in predicting phonetic
features in linguistic typology prediction. With these findings, we propose
that releasing the constraint of limited data for low-resource languages and
subsequently combining them with linguistically related high-resource languages
could offer a more effective approach for multilingual end-to-end speech
translation.Comment: Accepted to Findings of EMNLP 202
Towards Efficient Viscous Modeling Based on Cartesian Methods for Automated Flow Simulation
The advanced Computational Fluid Dynamics (CFD) techniques that address the current limitations of Cartesian-based Navier-Stokes CFD schemes are explored in current investigation. Three promising methods of implementing improved wall boundary conditions are applied: (1) the enhanced diamond path stencil approach, (2) the reformulated extended extrapolation boundary condition, and (3) the ghost cell method. Several initial testing cases have been conducted with all these three boundary conditions, including the flow past a circular cylinder, flow past a flat plate at different inclined angles and flow past an AGARD RAE2822 airfoil. All the results show the effectiveness of these boundary conditions in resolving both laminar and turbulent boundary layer. Among all these methods, the extended extrapolation boundary condition attains the better results than the other two methods
Application of Deep Learning Methods for Distinguishing Gamma-Ray Bursts from Fermi/GBM TTE Data
To investigate GRBs in depth, it is crucial to develop an effective method
for identifying GRBs accurately. Current criteria, e.g., onboard blind search,
ground blind search, and target search, are limited by manually set thresholds
and perhaps miss GRBs, especially for sub-threshold events. We propose a novel
approach that utilizes convolutional neural networks (CNNs) to distinguish GRBs
and non-GRBs directly. We structured three CNN models, plain-CNN, ResNet, and
ResNet-CBAM, and endeavored to exercise fusing strategy models. Count maps of
NaI detectors onboard Fermi/GBM were employed as the input samples of datasets
and models were implemented to evaluate their performance on different time
scale data. The ResNet-CBAM model trained on 64 ms dataset achieves high
accuracy overall, which includes residual and attention mechanism modules. The
visualization methods of Grad-CAM and t-SNE explicitly displayed that the
optimal model focuses on the key features of GRBs precisely. The model was
applied to analyze one-year data, accurately identifying approximately 98% of
GRBs listed in the Fermi burst catalog, 8 out of 9 sub-threshold GRBs, and 5
GRBs triggered by other satellites, which demonstrated the deep learning
methods could effectively distinguish GRBs from observational data. Besides,
thousands of unknown candidates were retrieved and compared with the bursts of
SGR J1935+2154 for instance, which exemplified the potential scientific value
of these candidates indeed. Detailed studies on integrating our model into
real-time analysis pipelines thus may improve their accuracy of inspection, and
provide valuable guidance for rapid follow-up observations of multi-band
telescopes.Comment: accepted for publication in ApJSS. 45 pages,17 figure
In-Orbit Instrument Performance Study and Calibration for POLAR Polarization Measurements
POLAR is a compact space-borne detector designed to perform reliable
measurements of the polarization for transient sources like Gamma-Ray Bursts in
the energy range 50-500keV. The instrument works based on the Compton
Scattering principle with the plastic scintillators as the main detection
material along with the multi-anode photomultiplier tube. POLAR has been
launched successfully onboard the Chinese space laboratory TG-2 on 15th
September, 2016. In order to reliably reconstruct the polarization information
a highly detailed understanding of the instrument is required for both data
analysis and Monte Carlo studies. For this purpose a full study of the in-orbit
performance was performed in order to obtain the instrument calibration
parameters such as noise, pedestal, gain nonlinearity of the electronics,
threshold, crosstalk and gain, as well as the effect of temperature on the
above parameters. Furthermore the relationship between gain and high voltage of
the multi-anode photomultiplier tube has been studied and the errors on all
measurement values are presented. Finally the typical systematic error on
polarization measurements of Gamma-Ray Bursts due to the measurement error of
the calibration parameters are estimated using Monte Carlo simulations.Comment: 43 pages, 30 figures, 1 table; Preprint accepted by NIM
The GECAM Real-Time Burst Alert System
Gravitational Wave High-energy Electromagnetic Counterpart All-sky Monitor
(GECAM), consisting of two micro-satellites, is designed to detect gamma-ray
bursts associated with gravitational-wave events. Here, we introduce the
real-time burst alert system of GECAM, with the adoption of the BeiDou-3 short
message communication service. We present the post-trigger operations, the
detailed ground-based analysis, and the performance of the system. In the first
year of the in-flight operation, GECAM was triggered by 42 GRBs. GECAM
real-time burst alert system has the ability to distribute the alert within
1 minute after being triggered, which enables timely follow-up
observations.Comment: 17 pages, 10 figures; Accepted for publication in RA
- …