59 research outputs found

    gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window

    Following its success in the language domain, the self-attention mechanism (the transformer) has recently been adopted in the vision domain with great success. As a separate stream, the multi-layer perceptron (MLP) has also been explored for vision. These architectures, as alternatives to traditional CNNs, have attracted attention recently, and many methods have been proposed. To combine parameter efficiency and performance with locality and hierarchy in image recognition, we propose gSwin, which merges the two streams: Swin Transformer and (multi-head) gMLP. We show that gSwin achieves better accuracy than Swin Transformer on three vision tasks (image classification, object detection, and semantic segmentation) with a smaller model size. Comment: 5 pages, 7 figures, IEEE ICASSP 202

    Accent Estimation of Japanese Words from Their Surfaces and Romanizations for Building Large Vocabulary Accent Dictionaries

    In Japanese text-to-speech (TTS), accent information must be added to the input sentence. However, only a limited number of accent dictionaries are publicly available, and those dictionaries, e.g. UniDic, lack many of the compound words, proper nouns, etc. that a practical TTS system requires. To build a large-scale accent dictionary that contains those words, the authors developed an accent estimation technique that predicts the accent of a word from limited information, namely its surface (e.g. kanji) and its yomi (simplified phonetic information). Experiments show that the technique can estimate accents with high accuracy, especially for some categories of words. The authors applied this technique to NEologd, an existing large-vocabulary Japanese dictionary, and obtained a large-vocabulary Japanese accent dictionary. In many observed cases, this dictionary yields more appropriate phonetic information than UniDic. Comment: 7 pages, 2 figures. IEEE ICASSP 202

    A Study of Music Signal Processing Based on Spectral Fluctuation of the Singing Voice Using Harmonic/Percussive Sound Separation

    Degree type: Doctorate (by coursework). University of Tokyo (東京大学)

    Multichannel harmonic and percussive component separation by joint modeling of spatial and spectral continuity

    This paper considers the blind separation of the harmonic and percussive components of multichannel music signals. We model the contribution of each source to all mixture channels in the time-frequency domain via a spatial covariance matrix, which encodes its spatial characteristics, and a scalar spectral variance, which represents its spectral structure. We then exploit the spatial continuity and the different spectral continuity structures of harmonic and percussive components as prior information to derive maximum a posteriori (MAP) estimates of the parameters using the expectation-maximization (EM) algorithm. Experimental results over professional musical mixtures show the effectiveness of the proposed approach.
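
    As a rough illustration of the source model described in this abstract, the sketch below treats each source's contribution at one time-frequency bin as a zero-mean Gaussian whose covariance is a scalar spectral variance times a spatial covariance matrix, and separates the sources with a multichannel Wiener filter. This is only a sketch under stated assumptions: the function name `multichannel_wiener`, the array shapes, and the use of a plain Wiener filter are illustrative choices, not taken from the paper, and the continuity priors and the MAP/EM parameter updates the abstract mentions are omitted entirely (the parameters are assumed to be already known).

```python
import numpy as np


def multichannel_wiener(x, variances, spatial_covs):
    """Separate one time-frequency bin, assuming known model parameters.

    x            : (I,) complex mixture coefficient over I channels
    variances    : (J,) non-negative scalar spectral variances, one per source
    spatial_covs : (J, I, I) Hermitian positive-definite spatial covariances
    returns      : (J, I) estimated contribution of each source to each channel
    """
    # Covariance of each source's contribution: variance times spatial covariance.
    source_covs = variances[:, None, None] * spatial_covs
    # Sources are assumed uncorrelated, so the mixture covariance is their sum.
    mix_cov = source_covs.sum(axis=0)
    # Wiener gain of source j: its covariance times the inverse mixture covariance.
    gains = source_covs @ np.linalg.inv(mix_cov)
    # Apply each (I, I) gain matrix to the mixture vector.
    return gains @ x


# Hypothetical toy data: 2 sources (e.g. harmonic, percussive), 2 channels.
rng = np.random.default_rng(0)
mixing = rng.standard_normal((2, 2, 2)) + 1j * rng.standard_normal((2, 2, 2))
# Build Hermitian positive-definite matrices as A A^H plus a small ridge.
spatial = mixing @ mixing.conj().transpose(0, 2, 1) + 1e-3 * np.eye(2)
mixture = rng.standard_normal(2) + 1j * rng.standard_normal(2)
estimates = multichannel_wiener(mixture, np.array([1.0, 0.5]), spatial)
```

    Because the Wiener gains of all sources sum to the identity matrix, the estimated contributions add back up to the observed mixture, which gives a quick sanity check on any implementation of this model.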

    Observation of morphological changes of female osmiophilic bodies prior to Plasmodium gametocyte egress from erythrocytes

    Plasmodium parasites cause malaria in mammalian hosts and are transmitted by Anopheles mosquitoes. Gametocytes, which differentiate from asexual-stage parasites, are activated by environmental changes when ingested into the mosquito midgut, and are rapidly released from erythrocytes prior to fertilization. Secretory proteins localized to osmiophilic bodies (OBs), organelles unique to gametocytes, have been reported to be involved in female gametocyte egress. In this study, we investigate the dynamics of OBs in activated gametocytes of Plasmodium falciparum and Plasmodium yoelii using the female OB-specific marker protein G377. After activation, female gametocyte OBs migrate to the parasite surface and fuse to form large vesicles beneath the parasite plasma membrane. At the marginal region of female gametocytes, the fused vesicles secrete their contents by exocytosis into the parasitophorous vacuole space, prior to parasite egress via the breakdown of the erythrocyte membrane. This is the first detailed description of how proteins are transported through osmiophilic bodies.