84 research outputs found

    Connecting Speech Encoder and Large Language Model for ASR

    The impressive capability and versatility of large language models (LLMs) have aroused increasing attention in automatic speech recognition (ASR), with several pioneering studies attempting to build integrated ASR models by connecting a speech encoder with an LLM. This paper presents a comparative study of three commonly used structures as connectors, including fully connected layers, multi-head cross-attention, and Q-Former. Speech encoders from the Whisper model series as well as LLMs from the Vicuna model series with different model sizes were studied. Experiments were performed on the commonly used LibriSpeech, Common Voice, and GigaSpeech datasets, where the LLMs with Q-Formers demonstrated consistent and considerable word error rate (WER) reductions over LLMs with other connector structures. Q-Former-based LLMs can generalise well to out-of-domain datasets, where 12% relative WER reductions over the Whisper baseline ASR model were achieved on the Eval2000 test set without using any in-domain training data from Switchboard. Moreover, a novel segment-level Q-Former is proposed to enable LLMs to recognise speech segments with a duration exceeding the limitation of the encoders, which results in 17% relative WER reductions over other connector structures on 90-second-long speech data

    Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models

    Audio-visual large language models (LLMs) have drawn significant attention, yet the fine-grained combination of both input streams is rather under-explored, which is challenging but necessary for LLMs to understand general video inputs. To this end, a fine-grained audio-visual joint representation (FAVOR) learning framework for multimodal LLMs is proposed in this paper, which extends a text-based LLM to simultaneously perceive speech and audio events in the audio input stream and images or videos in the visual input stream, at the frame level. To fuse the audio and visual feature streams into joint representations and to align the joint space with the LLM input embedding space, we propose a causal Q-Former structure with a causal attention module to enhance the capture of causal relations of the audio-visual frames across time. An audio-visual evaluation benchmark (AVEB) is also proposed, which comprises six representative single-modal tasks and five cross-modal tasks reflecting audio-visual co-reasoning abilities. While achieving competitive single-modal performance on audio, speech and image tasks in AVEB, FAVOR achieved over 20% accuracy improvements on the video question-answering task when fine-grained information or temporal causal reasoning is required. FAVOR, in addition, demonstrated remarkable video comprehension and reasoning abilities on tasks that are unprecedented by other multimodal LLMs. An interactive demo of FAVOR is available at https://github.com/BriansIDP/AudioVisualLLM.git, and the training code and model checkpoints will be released soon.

    Mutations of genes in synthesis of the carotenoid precursors of ABA lead to pre-harvest sprouting and photo-oxidation in rice

    Pre-harvest sprouting (PHS) or vivipary in cereals is an important agronomic trait that results in significant economic loss. A considerable number of mutations that cause PHS have been identified in several species. However, relatively few viviparous mutants in rice (Oryza sativa L.) have been reported. To explore the mechanism of PHS in rice, we carried out an extensive genetic screening and identified 12 PHS mutants (phs). Based on their phenotypes, these phs mutants were classified into three groups. Here we characterize in detail one of these groups, which contains mutations in genes encoding major enzymes of the carotenoid biosynthesis pathway, including phytoene desaturase (OsPDS), ζ-carotene desaturase (OsZDS), carotenoid isomerase (OsCRTISO) and lycopene β-cyclase (β-OsLCY), which are essential for the biosynthesis of carotenoid precursors of ABA. As expected, the amount of ABA was reduced in all four phs mutants compared with that in the wild type. Chlorophyll fluorescence analysis revealed the occurrence of photoinhibition in the photosystem and decreased capacity for eliminating excess energy by thermal dissipation. The greatly increased activities of reactive oxygen species (ROS) scavenging enzymes, and reduced photosystem (PS) II core proteins CP43, CP47 and D1 in leaves of the Oscrtiso/phs3-1 mutant and OsLCY RNAi transgenic rice indicated that photo-oxidative damage occurred in PS II, consistent with the accumulation of ROS in these plants. These results suggest that the impairment of carotenoid biosynthesis causes photo-oxidation and ABA-deficiency phenotypes, of which the latter is a major factor controlling the PHS trait in rice

    H9N2 Viruses Isolated From Mammals Replicated in Mice at Higher Levels Than Avian-Origin Viruses

    H9N2 subtype influenza A virus (IAV) has more than 20 genotypes that are able to cross species barriers and expand from birds to mammals and humans. To better understand the impact of different H9N2 genotypes and their characteristics, five H9N2 viruses from different hosts including chickens, geese, pigs, mink, and humans representing the B69 88(Gs/14, Ck/15, and Mi/14), B35 (Sw/08) and G9 genotypes (Hu/04) were used to infect chickens and mice. In mice, mammal-origin viruses replicated at higher levels in the lungs than avian-origin viruses. The goose-origin virus replicated at the lowest levels, indicating poor adaptation. Increased pro-inflammatory cytokines were positively correlated with viral loads in the lung. In chickens, all viruses were excreted and detected in cloacal and/or oropharyngeal swabs. Interestingly, the mink-origin virus exhibited higher virulence and replication in mice and chickens. Our data indicate that mammal-origin H9N2 viruses are more adapted and virulent in mice than the avian-origin viruses.

    The Protective Efficacy of a SARS-CoV-2 Vaccine Candidate B.1.351V against Several Variant Challenges in K18-hACE2 Mice

    The emergence of SARS-CoV-2 variants of concern (VOCs) with increased transmissibility and partial resistance to neutralization by antibodies has been observed globally. There is an urgent need for an effective vaccine to combat these variants. Our study demonstrated that the B.1.351 variant inactivated vaccine candidate (B.1.351V) generated strong binding and neutralizing antibody responses in BALB/c mice against the B.1.351 virus and other SARS-CoV-2 variants after two doses within 28 days. Immunized K18-hACE2 mice also exhibited elevated levels of live virus-neutralizing antibodies against various SARS-CoV-2 viruses. Following infection with these viruses, K18-hACE2 mice displayed a stable body weight, a high survival rate, minimal virus copies in lung tissue, and no lung damage compared to the control group. These findings indicate that B.1.351V offered protection against infection with multiple SARS-CoV-2 variants in mice, providing insights for the development of a vaccine targeting SARS-CoV-2 VOCs for human use

    Recombinant proteins A29L, M1R, A35R, and B6R vaccination protects mice from mpox virus challenge

    Since May 2022, mutant strains of mpox (formerly monkeypox) virus (MPXV) have been rapidly spreading among individuals who have not traveled to endemic areas in multiple locations, including Europe and the United States. Both intracellular and extracellular forms of mpox virus have multiple outer membrane proteins that can stimulate an immune response. Here, we investigated the immunogenicity of the MPXV structural proteins A29L, M1R, A35R, and B6R as a combination vaccine, and also evaluated the protective effect against the 2022 mpox mutant strain in BALB/c mice. After mixing with 15 μg of QS-21 adjuvant, all four virus structural proteins were administered subcutaneously to mice. Antibody titers in mouse sera rose sharply after the initial boost, along with an increased capacity of immune cells to produce IFN-γ and an elevated level of cellular immunity mediated by Th1 cells. The vaccine-induced neutralizing antibodies significantly inhibited the replication of MPXV in mice and reduced the pathological damage of organs. This study demonstrates the feasibility of a multiple recombinant vaccine for MPXV variant strains.

    Evaluation and identification of powdery mildew-resistant genes in 137 wheat relatives

    Powdery mildew is one of the most severe diseases affecting wheat yield and quality and is caused by Blumeria graminis f. sp. tritici (Bgt). Host resistance is the preferred strategy to prevent this disease. However, the narrow genetic basis of common wheat has increased the demand for diversified germplasm resources against powdery mildew. Wheat relatives, especially the secondary gene pool of common wheat, are important gene donors in the genetic improvement of common wheat because of their abundant genetic variation and close kinship with wheat. In this study, a series of 137 wheat relatives, including 53 Triticum monococcum L. (2n = 2x = 14, AA), 6 T. urartu Thumanjan ex Gandilyan (2n = 2x = 14, AA), 9 T. timopheevii Zhuk. (2n = 4x = 28, AAGG), 66 T. aestivum subsp. spelta (2n = 6x = 42, AABBDD), and 3 Aegilops speltoides (2n = 2x = 14, SS) were systematically evaluated for their powdery mildew resistance and composition of Pm genes. Of the 137 accessions, 83 (60.58%) were resistant to Bgt isolate E09 at the seedling stage, and 116 (84.67%) were resistant to the mixture of Bgt isolates at the adult stage. This indicates that these accessions show a high level of resistance to powdery mildew. Some 31 markers for 23 known Pm genes were used to test these 137 accessions, and only Pm2, Pm4, Pm6, Pm58, and Pm68 were detected. Among them, three Pm4 alleles (Pm4a, Pm4b, and Pm4f) were identified in four T. aestivum subsp. spelta accessions. qRT-PCR further confirmed that Pm4 alleles played a role in disease resistance in these four accessions. The phylogenetic tree showed that the kinship of Pm4 was close to Pm24 and Sr62. This study not only provides reference information and valuable germplasm resources for breeding new wheat varieties with disease resistance but also lays a foundation for enriching the genetic basis of wheat resistance to powdery mildew.

    Chinese Antarctic Magnetometer Chain at the Cusp Latitude

    A Chinese Antarctic Magnetometer (CAM) chain from Zhongshan Station (ZHS) to Dome-A (DMA) has been established since February 2009. A regular magnetometer is operated at ZHS, and four low power magnetometers are operated along the interior route from ZHS to DMA in the cusp latitude, extending over a distance of 1260 km. These stations fill an important void in the Antarctic magnetometer network. Furthermore, the CAM chain is magnetically conjugated with the Arctic region reaching from the Svalbard archipelago to Daneborg, on the east coast of Greenland. Conjugate measurements using the Arctic and Antarctic magnetometers provide excellent opportunities to investigate phenomena related to the coupling of the solar wind to the magnetosphere and ionosphere, such as magnetic impulse events, flux transfer events, traveling convection vortices and ultra-low frequency waves

    The Ark: Sanctuary for ISFs

    This thesis aims to reduce potential flood damage to the ISFs and to balance limited resources against high population density, relieving flooding through the renovation and reconstruction of local churches in the Baseco Compound. Flooding has become one of the most common and devastating problems in developing countries, and it is worsened by the housing crisis, which has led to the formation of informal settlements with shoddy construction and destabilized communities. What is an applicable approach to reducing potential flood damage to the ISFs? How can a balance be found between limited resources and high population density to relieve flooding architecturally?