67 research outputs found
Onsetsu hyoki no kyotsusei ni motozuita Ajia moji nyuryoku intafesu ni kansuru kenkyu
制度:新 ; 報告番号:甲3450号 ; 学位の種類:博士(国際情報通信学) ; 授与年月日:2011/10/26 ; 早大学位記番号:新577
UCSY-SC1: A Myanmar speech corpus for automatic speech recognition
This paper introduces a speech corpus which is developed for Myanmar Automatic Speech Recognition (ASR) research. Automatic Speech Recognition (ASR) research has been conducted by the researchers around the world to improve their language technologies. Speech corpora are important in developing the ASR and the creation of the corpora is necessary especially for low-resourced languages. Myanmar language can be regarded as a low-resourced language because of lack of pre-created resources for speech processing research. In this work, a speech corpus named UCSY-SC1 (University of Computer Studies Yangon - Speech Corpus1) is created for Myanmar ASR research. The corpus consists of two types of domain: news and daily conversations. The total size of the speech corpus is over 42 hrs. There are 25 hrs of web news and 17 hrs of conversational recorded data.The corpus was collected from 177 females and 84 males for the news data and 42 females and 4 males for conversational domain. This corpus was used as training data for developing Myanmar ASR. Three different types of acoustic models such as Gaussian Mixture Model (GMM) - Hidden Markov Model (HMM), Deep Neural Network (DNN), and Convolutional Neural Network (CNN) models were built and compared their results. Experiments were conducted on different data sizes and evaluation is done by two test sets: TestSet1, web news and TestSet2, recorded conversational data. It showed that the performance of Myanmar ASRs using this corpus gave satisfiable results on both test sets. The Myanmar ASR using this corpus leading to word error rates of 15.61% on TestSet1 and 24.43% on TestSet2
Empirical Dependency-Based Head Finalization for Statistical Chinese-, English-, and French-to-Myanmar (Burmese) Machine Translation
Abstract We conduct dependency-based head finalization for statistical machine translation (SMT) for Myanmar (Burmese). Although Myanmar is an understudied language, linguistically it is a head-final language with similar syntax to Japanese and Korean. So, applying the efficient techniques of Japanese and Korean processing to Myanmar is a natural idea. Our approach is a combination of two approaches. The first is a head-driven phrase structure grammar (HPSG) based head finalization for English-to-Japanese translation, the second is dependency-based pre-ordering originally designed for English-to-Korean translation. We experiment on Chinese-, English-, and French-to-Myanmar translation, using a statistical pre-ordering approach as a comparison method. Experimental results show the dependency-based head finalization was able to consistently improve a baseline SMT system, for different source languages and different segmentation schemes for the Myanmar language
Aging-Affected MSC Functions and Severity of Periodontal Tissue Destruction in a Ligature-Induced Mouse Periodontitis Model
Mesenchymal stem cells (MSCs) are known to play important roles in the repair of lost or damaged tissues and immunotolerance. On the other hand, aging is known to impair MSC function. However, little is currently known about how aged MSCs affect the host response to the local inflammatory condition and tissue deterioration in periodontitis, which is a progressive destructive disease of the periodontal tissue potentially leading to multiple tooth loss. In this study, we examined the relationship between aging-induced impairment of MSC function and the severity of periodontal tissue destruction associated with the decrease in host immunomodulatory response using a ligature-induced periodontitis model in young and aged mice. The results of micro computerized tomography (micro-CT) and histological analysis revealed a more severe bone loss associated with increased osteoclast activity in aged (50-week-old) mice compared to young (5-week-old) mice. Immunostaining analysis revealed that, in aged mice, the accumulation of inflammatory T and B cells was higher, whereas the percentage of platelet-derived growth factor receptor alpha (PDGFR alpha)(+) MSCs, which are known to modulate the apoptosis of T cells, was significantly lower than in young mice. In vitro analysis of MSC function showed that the expression of surface antigen markers for MSCs (Sca-1, CD90, CD146), colony formation, migration, and osteogenic differentiation of aged MSCs were significantly declined compared to those of young MSCs. Moreover, a significantly higher proportion of aged MSCs were positive for the senescence-associated beta galactosidase activity. Importantly, aged MSCs presented a decreased expression of FAS-L, which was associated with a lower immunomodulatory property of aged MSCs to induce T cell apoptosis in co-cultures compared with young MSCs. In summary, this is the first study showing that aging-induced impairment of MSC function, including immunomodulatory response, is potentially correlated with progressive periodontal tissue deterioration
Statistical Machine Translation between Myanmar Sign Language and Myanmar Written Text
This paper contributes the first evaluation of the quality of automatic translation between Myanmar sign language (MSL) and Myanmar written text, in both directions. Our developing MSL-Myanmar parallel corpus was used for translations and the experiments were carried out using three different statistical machine translation (SMT) approaches: phrase-based, hierarchical phrase-based, and the operation sequence model. In addition, three different segmentation schemes were studies, these were syllable segmentation, word segmentation and sign unit based word segmentation. The results show that the highest quality machine translation was attained with syllable segmentations for both MSL and Myanmar written text
Development of Natural Language Processing based Communication and Educational Assisted Systems for the People with Hearing Disability in Myanmar
Information and communication technologies (ICTs) provide people with disabilities to better integrate socially and economically into their communities by supporting access to information and knowledge, learning and teaching situations, personal communication and interaction. Our research purpose is to develop systems that will provide communication and educational assistance to persons with hearing disability using Natural Language Processing (NLP). In this paper, we present corpus building for Myanmar sign language (MSL), Machine Translation (MT) between MSL, Myanmar written text (MWT) and Myanmar SignWriting (MSW) and two Fingerspelling keyboard layouts for Myanmar SignWriting. We believe that the outcome of this research is useful for educational contents and communication between hearing disability and general people
Distribution of <i>salmonella</i> serovars in humans, foods, farm animals and environment, companion and wildlife animals in Singapore
We analyzed the epidemiological distribution of Salmonella serovars in humans, foods, animals and the environment as a One-Health step towards identifying risk factors for human salmonellosis. Throughout the 2012–2016 period, Salmonella ser. Enteritidis was consistently the predominating serovar attributing to >20.0% of isolates in humans. Other most common serovars in humans include Salmonella ser. Stanley, Salmonella ser. Weltevreden, Salmonella ser. Typhimurium and Salmonella ser. 4,5,12:b:-(dT+). S. Enteritidis was also the most frequent serovar found among the isolates from chicken/chicken products (28.5%) and eggs/egg products (61.5%) during the same period. In contrast, S. Typhimurium (35.2%) and Salmonella ser. Derby (18.8%) were prevalent in pork/pork products. S. Weltevreden was more frequent in seafood (19.2%) than others (≤3.0%). Most isolates (>80.0%) from farms, companion and wildlife animals belonged to serovars other than S. Enteritidis or S. Typhimurium. Findings demonstrate the significance of a One-Health investigative approach to understand the epidemiology Salmonella for more effective and integrated surveillance systems
Pf7: an open dataset of Plasmodium falciparum genome variation in 20,000 worldwide samples
We describe the MalariaGEN Pf7 data resource, the seventh release of Plasmodium falciparum genome variation data from the MalariaGEN network. It comprises over 20,000 samples from 82 partner studies in 33 countries, including several malaria endemic regions that were previously underrepresented. For the first time we include dried blood spot samples that were sequenced after selective whole genome amplification, necessitating new methods to genotype copy number variations. We identify a large number of newly emerging crt mutations in parts of Southeast Asia, and show examples of heterogeneities in patterns of drug resistance within Africa and within the Indian subcontinent. We describe the profile of variations in the C-terminal of the csp gene and relate this to the sequence used in the RTS,S and R21 malaria vaccines. Pf7 provides high-quality data on genotype calls for 6 million SNPs and short indels, analysis of large deletions that cause failure of rapid diagnostic tests, and systematic characterisation of six major drug resistance loci, all of which can be freely downloaded from the MalariaGEN website
- …