121 research outputs found

    A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

    Full text link
    In this paper, we introduce our work of building a Streaming Multilingual Speech Model (SM2), which can transcribe or translate multiple spoken languages into texts of the target language. The backbone of SM2 is Transformer Transducer, which has high streaming capability. Instead of human labeled speech translation (ST) data, SM2 models are trained using weakly supervised data generated by converting the transcriptions in speech recognition corpora with a machine translation service. With 351 thousand hours of anonymized speech training data from 25 languages, SM2 models achieve comparable or even better ST quality than some recent popular large-scale non-streaming speech models. More importantly, we show that SM2 has the truly zero-shot capability when expanding to new target languages, yielding high quality ST results for {source-speech, target-text} pairs that are not seen during training.Comment: submitted to ICASSP 202

    LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

    Full text link
    End-to-end formulation of automatic speech recognition (ASR) and speech translation (ST) makes it easy to use a single model for both multilingual ASR and many-to-many ST. In this paper, we propose streaming language-agnostic multilingual speech recognition and translation using neural transducers (LAMASSU). To enable multilingual text generation in LAMASSU, we conduct a systematic comparison between specified and unified prediction and joint networks. We leverage a language-agnostic multilingual encoder that substantially outperforms shared encoders. To enhance LAMASSU, we propose to feed target LID to encoders. We also apply connectionist temporal classification regularization to transducer training. Experimental results show that LAMASSU not only drastically reduces the model size but also outperforms monolingual ASR and bilingual ST models.Comment: Submitted to ICASSP 202

    Simulation of centrifugal pumps based on MPS solver

    Get PDF
    In this article, a three dimensional full-scale centrifugal pump is, for the first time, simulated using the moving particle semi-implicit method(MPS). The genetic smooth wall boundary(GSW) is used and extended to three dimension to deal with the complicated wall shape and thin blades in turbo-machines. The non-surface detection technique(NSD) based on conceptual particles is combined with this wall model to avoid the loss of particle number density near wall boundaries. A local wall particle refinement method is developed. Fine particles and coarse particles are applied to curved surface and flat surface respectively so as to reduce the computational cost while maintain the high discretization precision. The fully developed velocity inflow and pressure outflow boundary conditions are proposed. Three typical cases including hydrostatic cases with complicated geometry, flows over a two-dimensional backward-facing step, and flows in a three-dimensional tube are tested to verify the proposed models. This paper constructs a framework for the simulation of incompressible fluid machines, in which 3D complex revolving bodies can be integrally discretized and interactions between the stator and rotor can be integrally solved within a single coordinate. This paper provides a particle-based solver for incompressible fluid machinery and has the potential to study its inner flow with multiphase or phase change

    STICKERCONV: Generating Multimodal Empathetic Responses from Scratch

    Full text link
    Stickers, while widely recognized for enhancing empathetic communication in online interactions, remain underexplored in current empathetic dialogue research, notably due to the challenge of a lack of comprehensive datasets. In this paper, we introduce the Agent for STICKERCONV (Agent4SC), which uses collaborative agent interactions to realistically simulate human behavior with sticker usage, thereby enhancing multimodal empathetic communication. Building on this foundation, we develop a multimodal empathetic dialogue dataset, STICKERCONV, comprising 12.9K dialogue sessions, 5.8K unique stickers, and 2K diverse conversational scenarios. This dataset serves as a benchmark for multimodal empathetic generation. To advance further, we propose PErceive and Generate Stickers (PEGS), a multimodal empathetic response generation framework, complemented by a comprehensive set of empathy evaluation metrics based on LLM. Our experiments demonstrate PEGS's effectiveness in generating contextually relevant and emotionally resonant multimodal empathetic responses, contributing to the advancement of more nuanced and engaging empathetic dialogue systems

    Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

    Full text link
    We propose gated language experts and curriculum training to enhance multilingual transformer transducer models without requiring language identification (LID) input from users during inference. Our method incorporates a gating mechanism and LID loss, enabling transformer experts to learn language-specific information. By combining gated transformer experts with shared transformer layers, we construct multilingual transformer blocks and utilize linear experts to effectively regularize the joint network. The curriculum training scheme leverages LID to guide the gated experts in improving their respective language performance. Experimental results on a bilingual task involving English and Spanish demonstrate significant improvements, with average relative word error reductions of 12.5% and 7.3% compared to the baseline bilingual and monolingual models, respectively. Notably, our method achieves performance comparable to the upper-bound model trained and inferred with oracle LID. Extending our approach to trilingual, quadrilingual, and pentalingual models reveals similar advantages to those observed in the bilingual models, highlighting its ease of extension to multiple languages

    Cell cycle arrest mediated by Cd-induced DNA damage in Arabidopsis root tips

    Get PDF
    Accumulating evidence demonstrates that the aberrant expression of cell cycle regulation and DNA repair genes can result in abnormal cell proliferation and genomic instability in eukaryotic cells under different stresses. Herein, Arabidopsis thaliana (Arabidopsis) seedlings were grown hydroponically on 0.5 × MS media containing cadmium (Cd) at 0–2.5 mg L−1 for 5 d of treatment. Real time quantitative reverse transcription-polymerase chain reaction (qRT-PCR) analysis revealed that expression of DNA damage repair and cell cycle regulation genes, including BRCA1, MRE11, WEE1, CDKA;1 and PCNA1, showed an inverted U-shaped dose-response. In contrast, notably reduced expression was observed for G1-to-S transition-related genes, Histone H4, E2Fa and PCNA2; DSB end processing, GR1; G2-to-M transition-related gene, CYCB1;1; and DNA mismatch repair, MSH2, MSH6 and MLH1 genes in root tips exposed to 0.125–2.5 mg/L Cd for 5 d. Flow cytometry (FCM) analysis revealed significant increases of cells with a 2C nuclear content and with a 4C and 8C nuclear content under Cd stresses of 0.125 and 1–2.5 mg L−1, respectively. Our results suggest that 0.125 mg L−1 Cd-induced DNA damage induced the marked G1/S arrest, leading to accelerated growth in root tips, while 1.0–2.5 mg L−1 Cd-induced DNA damage caused a notable G2/M arrest in root tips, leading to reduced growth in root tips. This may be a protective mechanism that prevents cells with damaged DNA from dividing under Cd stress

    A Single-Cell Atlas of Bovine Skeletal Muscle Reveals Mechanisms Regulating Intramuscular Adipogenesis and Fibrogenesis

    Get PDF
    Background Intramuscular fat (IMF) and intramuscular connective tissue (IMC) are often seen in human myopathies and are central to beef quality. The mechanisms regulating their accumulation remain poorly understood. Here, we explored the possibility of using beef cattle as a novel model for mechanistic studies of intramuscular adipogenesis and fibrogenesis. Methods Skeletal muscle single-cell RNAseq was performed on three cattle breeds, including Wagyu (high IMF), Brahman (abundant IMC but scarce IMF), and Wagyu/Brahman cross. Sophisticated bioinformatics analyses, including clustering analysis, gene set enrichment analyses, gene regulatory network construction, RNA velocity, pseudotime analysis, and cell-cell communication analysis, were performed to elucidate heterogeneities and differentiation processes of individual cell types and differences between cattle breeds. Experiments were conducted to validate the function and specificity of identified key regulatory and marker genes. Integrated analysis with multiple published human and non-human primate datasets was performed to identify common mechanisms. Results A total of 32 708 cells and 21 clusters were identified, including fibro/adipogenic progenitor (FAP) and other resident and infiltrating cell types. We identified an endomysial adipogenic FAP subpopulation enriched for COL4A1 and CFD (log2FC = 3.19 and 1.92, respectively; P \u3c 0.0001) and a perimysial fibrogenic FAP subpopulation enriched for COL1A1 and POSTN (log2FC = 1.83 and 0.87, respectively; P \u3c 0.0001), both of which were likely derived from an unspecified subpopulation. Further analysis revealed more progressed adipogenic programming of Wagyu FAPs and more advanced fibrogenic programming of Brahman FAPs. Mechanistically, NAB2 drives CFD expression, which in turn promotes adipogenesis. CFD expression in FAPs of young cattle before the onset of intramuscular adipogenesis was predictive of IMF contents in adulthood (R2 = 0.885, P \u3c 0.01). Similar adipogenic and fibrogenic FAPs were identified in humans and monkeys. In aged humans with metabolic syndrome and progressed Duchenne muscular dystrophy (DMD) patients, increased CFD expression was observed (P \u3c 0.05 and P \u3c 0.0001, respectively), which was positively correlated with adipogenic marker expression, including ADIPOQ (R2 = 0.303, P \u3c 0.01; and R2 = 0.348, P \u3c 0.01, respectively). The specificity of Postn/POSTN as a fibrogenic FAP marker was validated using a lineage-tracing mouse line. POSTN expression was elevated in Brahman FAPs (P \u3c 0.0001) and DMD patients (P \u3c 0.01) but not in aged humans. Strong interactions between vascular cells and FAPs were also identified. Conclusions Our study demonstrates the feasibility of beef cattle as a model for studying IMF and IMC. We illustrate the FAP programming during intramuscular adipogenesis and fibrogenesis and reveal the reliability of CFD as a predictor and biomarker of IMF accumulation in cattle and humans
    • …
    corecore