128 research outputs found

    TriECCC: Trilingual Corpus of the Extraordinary Chambers in the Courts of Cambodia for Speech Recognition and Translation Studies

    Get PDF
    This paper presents an extended work on the trilingual spoken language translation corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC), namely TriECCC. TriECCC is a simultaneously spoken language translation corpus with parallel resources of speech and text in three languages: Khmer, English, and French. This corpus has approximately [Formula: see text] thousand utterances, approximately [Formula: see text], [Formula: see text], and [Formula: see text] h in length of speech, and [Formula: see text], [Formula: see text] and [Formula: see text] million words in text, in Khmer, English, and French, respectively. We first report the baseline results of machine translation (MT), and speech translation (ST) systems, which show reasonable performance. We then investigate the use of the ROVER method to combine multiple MT outputs and fine-tune the pre-trained English–French MT models to enhance the Khmer MT systems. Experimental results show that the ROVER is effective for combining English-to-Khmer and French-to-Khmer systems. Fine-tuning from both single and multiple parents shows the effective improvement on the BLEU scores for Khmer-to-English/French and English/French-to-Khmer MT systems

    MYCL promotes iPSC-like colony formation via MYC Box 0 and 2 domains

    Get PDF
    iPS細胞作製過程における初期化因子MYCLのタンパク質ドメインの機能解析. 京都大学プレスリリース. 2021-12-20.Protein domain structures affect the quality of stem cells. 京都大学プレスリリース. 2021-12-20.Human induced pluripotent stem cells (hiPSCs) can differentiate into cells of the three germ layers and are promising cell sources for regenerative medicine therapies. However, current protocols generate hiPSCs with low efficiency, and the generated iPSCs have variable differentiation capacity among different clones. Our previous study reported that MYC proteins (c-MYC and MYCL) are essential for reprogramming and germline transmission but that MYCL can generate hiPSC colonies more efficiently than c-MYC. The molecular underpinnings for the different reprogramming efficiencies between c-MYC and MYCL, however, are unknown. In this study, we found that MYC Box 0 (MB0) and MB2, two functional domains conserved in the MYC protein family, contribute to the phenotypic differences and promote hiPSC generation in MYCL-induced reprogramming. Proteome analyses suggested that in MYCL-induced reprogramming, cell adhesion-related cytoskeletal proteins are regulated by the MB0 domain, while the MB2 domain regulates RNA processes. These findings provide a molecular explanation for why MYCL has higher reprogramming efficiency than c-MYC

    Structural Analysis of Instruction Utterances

    Full text link
    Abstract. In realizing video retrieval system, the crucial point is how to provide an effective access method of video contents. This paper fo-cuses on Japanese cooking instruction utterances and describes a method of analyzing structure of them, which leads to a summary of video. We detect a hierarchical structure of video contents by using linguistic and visual information. We found that the integration of visual information can improve the detection of task units better than using linguistic in-formation alone.
    corecore