7 research outputs found

    TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection

    Full text link
    Thanks to recent advancements in end-to-end speech modeling technology, it has become increasingly feasible to imitate and clone a user`s voice. This leads to a significant challenge in differentiating between authentic and fabricated audio segments. To address the issue of user voice abuse and misuse, the second Audio Deepfake Detection Challenge (ADD 2023) aims to detect and analyze deepfake speech utterances. Specifically, Track 2, named the Manipulation Region Location (RL), aims to pinpoint the location of manipulated regions in audio, which can be present in both real and generated audio segments. We propose our novel TranssionADD system as a solution to the challenging problem of model robustness and audio segment outliers in the trace competition. Our system provides three unique contributions: 1) we adapt sequence tagging task for audio deepfake detection; 2) we improve model generalization by various data augmentation techniques; 3) we incorporate multi-frame detection (MFD) module to overcome limited representation provided by a single frame and use isolated-frame penalty (IFP) loss to handle outliers in segments. Our best submission achieved 2nd place in Track 2, demonstrating the effectiveness and robustness of our proposed system

    Stabilizing a high-voltage LiNi0.5Mn1.5O4 cathode towards all solid state batteries: a Li–Al–Ti–P–O solid electrolyte nano-shell with a host material

    No full text
    LiNi0.5Mn1.5O4 (LNMO) spinel has drawn increasing attention due to its high voltage, stabilized electrochemical performance and safety features as a cathode for lithium-ion batteries. However, the main challenge lies in its unstable surface structure, especially at elevated temperatures. In this paper, we decorate the LNMO precursor with a solid electrolyte of Li1.4Al0.4Ti1.6(PO4)3 (LATP) via a facile sol–gel method, followed by a co-crystallization process at 820 °C, to successfully generate a LATP modification shell at the surface of LNMO. The LATP modification shell could not only optimize the morphology of LNMO including the limitation of particle growth and control of crystalline orientation, but also realize ion doping during the co-crystallization process. By tuning the LATP contents, the 2 wt% LATP modification is found to be the most effective at balancing the interfacial stability and Li+ diffusion kinetics of LNMO, as well as enhancing its rate capability and capacity retention at high temperatures. As a result, the 2 wt% LATP-modified LNMO cathode exhibits a high reversible capacity of 84.8 mA h g−1 after 500 cycles with a capacity retention of 68.9%, and a superior rate capability (102.0 mA h g−1 at 20 C) at room temperature. Moreover, this electrode also delivers a good capacity retention of 85.7% after 100 cycles at 55 °C, which is ascribed to the stabilized interface with a LATP protective layer
    corecore