Search CORE

98 research outputs found

Kinetic Monte Carlo simulation of mixtures: phase equilibria

Author: Tan Shiliang Johnathan
Publication venue: 'University of Queensland Library'
Publication date: 06/11/2015
Field of study

University of Queensland eSpace

Atomic packing and short-to-medium range order evolution of Zr-Pd metallic glass

Author: ChunLin Tan
Li Qi
Mei Liu
ShiLiang Zhang
XinYu Zhang
Publication venue: Springer Nature
Publication date: 01/01/2011
Field of study

Springer - Publisher Connector

Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer

Author: Chen Xie
Liang Zheng
Tan Tian
Wang Peng
Yang Yifan
Zhang Shiliang
Publication venue
Publication date: 14/09/2023
Field of study

In spite of the excellent strides made by end-to-end (E2E) models in speech recognition in recent years, named entity recognition is still challenging but critical for semantic understanding. In order to enhance the ability to recognize named entities in E2E models, previous studies mainly focus on various rule-based or attention-based contextual biasing algorithms. However, their performance might be sensitive to the biasing weight or degraded by excessive attention to the named entity list, along with a risk of false triggering. Inspired by the success of the class-based language model (LM) in named entity recognition in conventional hybrid systems and the effective decoupling of acoustic and linguistic information in the factorized neural Transducer (FNT), we propose a novel E2E model to incorporate class-based LMs into FNT, which is referred as C-FNT. In C-FNT, the language model score of named entities can be associated with the name class instead of its surface form. The experimental results show that our proposed C-FNT presents significant error reduction in named entities without hurting performance in general word recognition

arXiv.org e-Print Archive

Unified Normalization for Accelerating and Stabilizing Transformers

Author: Lan Chaoxiang
Li Zheyang
Pu Shiliang
Tan Wenming
Xiao Jun
Yang Qiming
Yang Zhi
Zhang Kai
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 02/08/2022
Field of study

Solid results from Transformers have made them prevailing architectures in various natural language and vision tasks. As a default component in Transformers, Layer Normalization (LN) normalizes activations within each token to boost the robustness. However, LN requires on-the-fly statistics calculation in inference as well as division and square root operations, leading to inefficiency on hardware. What is more, replacing LN with other hardware-efficient normalization schemes (e.g., Batch Normalization) results in inferior performance, even collapse in training. We find that this dilemma is caused by abnormal behaviors of activation statistics, including large fluctuations over iterations and extreme outliers across layers. To tackle these issues, we propose Unified Normalization (UN), which can speed up the inference by being fused with other linear operations and achieve comparable performance on par with LN. UN strives to boost performance by calibrating the activation and gradient statistics with a tailored fluctuation smoothing strategy. Meanwhile, an adaptive outlier filtration strategy is applied to avoid collapse in training whose effectiveness is theoretically proved and experimentally verified in this paper. We demonstrate that UN can be an efficient drop-in alternative to LN by conducting extensive experiments on language and vision tasks. Besides, we evaluate the efficiency of our method on GPU. Transformers equipped with UN enjoy about 31% inference speedup and nearly 18% memory reduction. Code will be released at https://github.com/hikvision-research/Unified-Normalization.Comment: ACM MM'2

arXiv.org e-Print Archive

A re-assessment of the isosteric heat for CCl4 adsorption on graphite

Author: (Johnathan) Tan Shiliang
Do D. D.
Nicholson D.
Zhang Han
Publication venue: 'Elsevier BV'
Publication date: 28/01/2019
Field of study

We have carried out molecular simulations of carbon tetrachloride adsorption on graphite, in order to investigate the role of the octopole in potential models for the CCl/graphite system, and the temperature dependence of the first-order gas-liquid transition in the first adsorbate layer. Two classes of potential model for carbon tetrachloride were considered: the first has 5 LJ sites and the second includes five partial charges to model the leading octopole. Both models are adequate to represent the vapour-liquid equilibrium, suggesting that the octopole makes an insignificant contribution to the properties of the bulk phase. Both models show that adsorbed CCl molecules are delocalized on a graphite surface because of the strong intermolecular interactions. It is found that the LJ sites on the chlorine atoms, not the octopole, play the most important role in matching the experimental isotherm and isosteric heat data with simulation. The heat is constant, across the first-order transition of the first adsorbate layer. The simulation results show that both the magnitude of the density jump, and the isosteric heat across the first-order transition, decrease as the temperature increases. This is in qualitative agreement with the 1972 experimental data of Avgul and Kiselev, but these experimental data exhibit an unusually strong decrease in the isosteric heat, and the coexistence region between the two phases displays an unusual asymmetrical shape. Detailed analysis of our simulation results, together with the calculated isosteric heat from the experimental isotherms of Machin and Ross, show that there may be errors associated with the heat data of Avgul and Kiselev at high temperatures

University of Queensland eSpace