1,051 research outputs found

    Improving accuracy of Part-of-Speech (POS) tagging using hidden markov model and morphological analysis for Myanmar Language

    In Natural Language Processing (NLP), word segmentation and Part-of-Speech (POS) tagging are fundamental tasks. POS information is also necessary in NLP preprocessing for applications such as machine translation (MT) and information retrieval (IR). Currently, many research efforts develop word segmentation and POS tagging separately, using different methods to achieve high performance and accuracy. For the Myanmar language, there are also separate word segmenters and POS taggers based on statistical approaches such as Neural Networks (NN) and Hidden Markov Models (HMMs). However, because of the Myanmar language's complex morphological structure, the out-of-vocabulary (OOV) problem still exists. To avoid errors and improve segmentation by using POS information, segmentation and tagging should be performed at the same time. The main goal of developing a POS tagger for any language is to improve tagging accuracy and remove ambiguity in sentences caused by language structure. This paper focuses on developing word segmentation and a POS tagger for the Myanmar language. It presents a comparison of separate word segmentation and POS tagging with joint word segmentation and POS tagging.

    Statistical Machine Translation between Myanmar Sign Language and Myanmar Written Text

    This paper contributes the first evaluation of the quality of automatic translation between Myanmar sign language (MSL) and Myanmar written text, in both directions. Our MSL-Myanmar parallel corpus, which is still under development, was used for translation, and the experiments were carried out using three different statistical machine translation (SMT) approaches: phrase-based, hierarchical phrase-based, and the operation sequence model. In addition, three different segmentation schemes were studied: syllable segmentation, word segmentation, and sign-unit-based word segmentation. The results show that the highest-quality machine translation was attained with syllable segmentation for both MSL and Myanmar written text.

    Harvesting Paragraph-Level Question-Answer Pairs from Wikipedia

    We study the task of generating from Wikipedia articles question-answer pairs that cover content beyond a single sentence. We propose a neural network approach that incorporates coreference knowledge via a novel gating mechanism. Compared to models that only take into account sentence-level information (Heilman and Smith, 2010; Du et al., 2017; Zhou et al., 2017), we find that the linguistic knowledge introduced by the coreference representation aids question generation significantly, producing models that outperform the current state-of-the-art. We apply our system (composed of an answer span extraction system and the passage-level QG system) to the 10,000 top-ranking Wikipedia articles and create a corpus of over one million question-answer pairs. We also provide a qualitative analysis for this large-scale generated corpus from Wikipedia. Comment: Accepted to ACL 2018 (long paper)

    Recent Trends in Computational Intelligence

    Traditional models struggle to cope with complexity, noise, and changing environments, while Computational Intelligence (CI) offers solutions to complicated problems as well as reverse problems. The main feature of CI is adaptability, spanning the fields of machine learning and computational neuroscience. CI also comprises biologically-inspired technologies such as swarm intelligence as part of evolutionary computation, and encompasses wider areas such as image processing, data collection, and natural language processing. This book aims to discuss the use of CI for optimally solving various applications, demonstrating its wide reach and relevance. Bounding of optimization methods and data mining strategies makes a strong and reliable prediction tool for handling real-life applications.

    Preliminary Gibbon Status Review for Lao PDR 2008

    Provides a site directory for gibbon habitats and populations in Laos. Examines patterns of threats and conservation and documents the national conservation status and outlook for each species. Makes recommendations for urgent interventions

    A Study of The Flouting and Hedging Maxims on Politics Articles Found in The Jakarta Post

    Communication needs cooperation between the speaker and the hearer. They share the same assumptions and expectations about the topic and about how to develop their communication. The Cooperative Principle and its maxims, studied in Discourse Analysis, are often referred to because they provide a lucid description of how listeners and readers can distill information from an utterance even though that information has not been mentioned outright. It can therefore be concluded that conversation or communication goes smoothly if the Cooperative Principle is observed. However, not all communication, whether verbal or non-verbal, observes all four maxims; some of it disobeys one or more of Grice's maxims. In relation to these phenomena, two research problems are explored, namely "What are the maxims flouted and hedged by the writers of politics articles found in The Jakarta Post?" and "When are the maxims flouted and hedged by the writers of politics articles found in The Jakarta Post?". This study uses a descriptive qualitative design, since the data were descriptively analyzed based on Grice's theory of the Cooperative Principle, especially how the maxims are flouted and hedged by the writers of politics articles in The Jakarta Post. The findings of the study reveal that there are many kinds of maxim flouting and hedging in the politics articles of The Jakarta Post. The researcher finds that the most frequent type of maxim flouting is overstatement, followed by metaphor, rhetorical question, and irony. Maxim hedging, moreover, occurs more often than rhetorical question and irony. The study shows that the flouting and hedging of the maxims of the Cooperative Principle, as theorized by Grice, occur not only in conversational utterances but also in sentences in written discourse when writers are delivering and maintaining their opinions.
From these findings, it is hoped that this study will help journalists consider and understand the maxims, particularly the flouting and hedging of maxims, when they write articles, especially articles on politics, so that what they intend to convey will be easily understood by readers.

    Spatiotemporal patterns of earthquakes and their implications for earthquake hazards

    [EMBARGOED UNTIL 5/31/2023] This work focuses on characterizing spatiotemporal patterns of earthquakes, their possible causes, and their implications for seismic hazard assessment. I studied both local and global earthquakes from the perspective of complex fault systems. Specifically, I studied the background seismicity and long-lived aftershock activity in intraplate North China and the Central and Eastern United States (CEUS), characterized the correlation between strain rate and seismicity, and evaluated the predictive power of strain rate in different tectonic settings. I found that periodic or quasiperiodic earthquake recurrence on individual faults, as predicted by the elastic rebound model, is not common in nature. Instead, most earthquake sequences are complex and variable, and often show clusters of events separated by long but irregular intervals of quiescence. This common earthquake clustering may be caused by earthquake-induced viscoelastic relaxation and fault interaction. Most earthquake sequences are burstier than the Poisson model, implying a higher probability of repeating events soon after a large earthquake. Possible long-lived aftershocks are found in intraplate North China and the CEUS. Background seismicity in intraplate regions may vary with time, highlighting the complexity of intraplate seismicity. Mistakenly identifying long-lived aftershocks as background earthquakes may lead to overestimating seismic hazard in intraplate regions. The correlation between strain rate and seismicity varies between different tectonic settings and is time-dependent. Good strain rate-seismicity correlations are found in plate boundary regions and during seismically active periods, while no correlations are found in stable continents and during inactive periods. All these variations need to be considered in hazard assessment. Includes bibliographical references.

    Connected Developments: The Governance of Formal Global Knowledge Networks in Sustainability Transformations

    Climate change adds pressure to the international community to work cooperatively, find ways to govern technologies and expert knowledge, develop better policies, and mobilise resources, tools, and practices to deal with potential consequences and impacts. The institutional drivers underpinning current knowledge applications in globally connected spaces of sustainable development practice are increasingly complex, intertwined, and empirically understudied. In this context, this PhD thesis aims to advance our empirical understanding of why and how formal cooperation networks form, negotiate, mobilise and utilise particular technologies and expert knowledge and attempt to steer visions and pathways for change. This research combines multi-sited ethnography with social network analysis and policy analysis and investigates formal contexts of global connection. This thesis examines practices of science and technology policy through technology-driven networks in multiple locations in Europe and Southeast Asia. In particular, this thesis analyses the processes and conditions through which tools (e.g. modelling technologies), practices (e.g. climate negotiations, technology transfer activities, risk management, and environmental planning), and ways of dealing with climate-related uncertainties are implemented in a global knowledge network articulated under the UN system. The participant observation that is applied in the research is grounded in mobile contexts of project-based interactions, intergovernmental negotiations, international expert meetings, high-level advisory boards, technology assessments, implementation of technology transfer programmes, capacity-building workshops, expert discussions on anticipation and uncertainty, and the production of reports, climate policies, and procurement systems. 
This thesis examines how the artefacts of transfer interact in the implementation of the Technology Mechanism under the UNFCCC, drawing on cases of climate and hydrological modelling ranging from the Climate Technology Centre and Network (CTCN) to Thailand and Myanmar. It maps and analyses the global response of networked organisations, with special attention to persistent North-South power dynamics imposed by global environmental governance regimes and their emergent ‘transformational claims’. This thesis delves into a critical evaluation of transformational change narratives in institutionalised knowledge systems, practices of technology transfer, and science policy spaces inside the United Nations. It contributes to a better foundational understanding of knowledge governance relating to critical social and environmental challenges, and rethinks futures of collective climate action in light of sustainability transformations theory and practice.