7 research outputs found
TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection
Thanks to recent advancements in end-to-end speech modeling technology, it
has become increasingly feasible to imitate and clone a user`s voice. This
leads to a significant challenge in differentiating between authentic and
fabricated audio segments. To address the issue of user voice abuse and misuse,
the second Audio Deepfake Detection Challenge (ADD 2023) aims to detect and
analyze deepfake speech utterances. Specifically, Track 2, named the
Manipulation Region Location (RL), aims to pinpoint the location of manipulated
regions in audio, which can be present in both real and generated audio
segments. We propose our novel TranssionADD system as a solution to the
challenging problem of model robustness and audio segment outliers in the trace
competition. Our system provides three unique contributions: 1) we adapt
sequence tagging task for audio deepfake detection; 2) we improve model
generalization by various data augmentation techniques; 3) we incorporate
multi-frame detection (MFD) module to overcome limited representation provided
by a single frame and use isolated-frame penalty (IFP) loss to handle outliers
in segments. Our best submission achieved 2nd place in Track 2, demonstrating
the effectiveness and robustness of our proposed system
Stabilizing a high-voltage LiNi0.5Mn1.5O4 cathode towards all solid state batteries: a Li–Al–Ti–P–O solid electrolyte nano-shell with a host material
LiNi0.5Mn1.5O4 (LNMO) spinel has drawn increasing attention due to its high voltage, stabilized electrochemical performance and safety features as a cathode for lithium-ion batteries. However, the main challenge lies in its unstable surface structure, especially at elevated temperatures. In this paper, we decorate the LNMO precursor with a solid electrolyte of Li1.4Al0.4Ti1.6(PO4)3 (LATP) via a facile sol–gel method, followed by a co-crystallization process at 820 °C, to successfully generate a LATP modification shell at the surface of LNMO. The LATP modification shell could not only optimize the morphology of LNMO including the limitation of particle growth and control of crystalline orientation, but also realize ion doping during the co-crystallization process. By tuning the LATP contents, the 2 wt% LATP modification is found to be the most effective at balancing the interfacial stability and Li+ diffusion kinetics of LNMO, as well as enhancing its rate capability and capacity retention at high temperatures. As a result, the 2 wt% LATP-modified LNMO cathode exhibits a high reversible capacity of 84.8 mA h g−1 after 500 cycles with a capacity retention of 68.9%, and a superior rate capability (102.0 mA h g−1 at 20 C) at room temperature. Moreover, this electrode also delivers a good capacity retention of 85.7% after 100 cycles at 55 °C, which is ascribed to the stabilized interface with a LATP protective layer