11 research outputs found

    Construction of Grammar Based Term Extraction Model for Japanese

    Get PDF
    Colloque avec actes et comité de lecture. internationale.International audienceIn this paper we propose a grammar-based term extraction model for Japanese toward construction of multilingual term dictionary. The proposed approach evaluates termhood using morphological patterns of terms. Most of terms in Japanese consist of compound nouns or simple phrases, but detailed grammatical patterns for term extraction have not been constructed because of their complex compounding mechanisms. Applying detailed morphological investigation to Japanese compounding, we make sure their structures of word formation that can be comparable to those of Indo-European terms such as English and French

    Pattern Based Term Extraction Using ACABIT System

    Get PDF
    International audienceIn this paper, we propose a pattern-based term extraction approach for Japanese, applying ACABIT system originally developed for French. The proposed approach evaluates termhood using morphological patterns of basic terms and term variants. After extracting term candidates, ACABIT system filters out non-terms from the candidates based on log-likelihood. This approach is suitable for Japanese term extraction because most of Japanese terms are compound nouns or simple phrasal patterns

    Deverbal Compound Noun Analysis Based on Lexical Conceptual Structure

    No full text
    This paper proposes a principled approach for analysis of semantic relations between constituents in compound nouns based on lexical semantic structure. One of the difficulties of compound noun analysis is that the mechanisms governing the decision system of semantic relations and the representation method of semantic relations associated with lexical and contextual meaning are not obvious. The aim of our research is to clarify how lexical semantics contribute to the relations in compound nouns since such nouns are very productive and are supposed to be governed by systematic mechanisms. The results of applying our approach to the analysis of noun-deverbal compounds in Japanese and English show that lexical conceptual structure contributes to the restrictional rules in compounds

    Deverbal Compound Noun Analysis Based on Lexical Conceptual Structure

    No full text
    This paper proposes a principled approach for analysis of semantic relations between constituents in compound nouns based on lexical semantic structure. One of the difficulties of compound noun analysis is that the mechanisms governing the decision system of semantic relations and the representation method of semantic relations associated with lexical and contextual meaning are not obvious. The aim of our research is to clarify how lexical semantics contribute to the relations in compound nouns since such nouns are very productive and are supposed to be governed by systematic mechanisms. The results of applying our approach to the analysis of noun-deverbal compounds in Japanese and English show that lexical conceptual structure contributes to the restrictional rules in compounds.

    A Method for Extracting Composite Terms from Japanese Domain Corpora

    Get PDF
    テキストコーパスからの用語抽出は、自然言語処理技術の重要な応用である。従来テキストコーパスから用語候補を抽出する方法として、主として候補出現に関わる統計的指標を用いて用語性を判定する方法が採用されて来たが、統計的手法では出現頻度の低い候補についての判定が困難であった。今回の発表では、複合語に注目し、用語性を損なう形態素出現パターンを排除する形での用語候補抽出を行うことにより、高い精度で複合語用語抽出が可能となることを示す。Term extraction is one of the most important application of natural language processing technologies. Statistic criteria are widely adopted to evaluate the termhood of the extracted candidates. However, it is difficult to evaluate the termhood of less frequent candidates. In this study we propose a method for Japanese composite term extraction in which unproper morpheme patterns are eliminated. Using the new method, high precision of term extraction can be attained for Japanese composite terms

    Brains, not Brawn: The Use of ‘’Smart’’ Comparable Corpora in Bilingual Terminology Mining

    No full text
    International audienc

    Computational terminology and filtering of terminological information : Introduction to the special issue

    No full text
    corecore