374 research outputs found

    Paraphrase concept and typology. A linguistically based and computationally oriented approach

    In this paper, we present a critical analysis of the state of the art in the definition and typologies of paraphrasing. This analysis shows that there exists no characterization of paraphrasing that is comprehensive, linguistically based and computationally tractable at the same time. We then set out to define and delimit the concept on the basis of propositional content. We present a general, inclusive and computationally oriented typology of the linguistic mechanisms that give rise to form variations between paraphrase pairs.

    WRPA: A system for relational paraphrase acquisition from Wikipedia

    In this paper we present WRPA, a system for Relational Paraphrase Acquisition from Wikipedia. WRPA extracts paraphrasing patterns that express a particular relation between two entities, taking advantage of Wikipedia's structure. What is new in this system is that its exploitation of Wikipedia goes beyond infoboxes, reaching itemized information embedded in Wikipedia pages. WRPA is language independent, provided that a Wikipedia edition and shallow linguistic tools exist for the language in question, and it is also independent of the relation addressed.

    Plagiarism meets paraphrasing: insights for the new generation in automatic plagiarism detection

    Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little attention has been paid to its analysis in the framework of automatic plagiarism detection. Therefore, state-of-the-art plagiarism detectors find it difficult to detect cases of paraphrase plagiarism. In this article, we analyse the relationship between paraphrasing and plagiarism, paying special attention to which paraphrase phenomena underlie acts of plagiarism and which of them are detected by plagiarism detection systems. With this aim in mind, we created the P4P corpus, a new resource which uses a paraphrase typology to annotate a subset of the PAN-PC-10 corpus for automatic plagiarism detection. The results of the Second International Competition on Plagiarism Detection were analysed in the light of this annotation. The presented experiments show that (i) more complex paraphrase phenomena and a high density of paraphrase mechanisms make plagiarism detection more difficult, (ii) lexical substitutions are the paraphrase mechanisms used the most when plagiarising, and (iii) paraphrase mechanisms tend to shorten the plagiarized text. For the first time, the paraphrase mechanisms behind plagiarism have been analysed, providing critical insights for the improvement of automatic plagiarism detection systems

    CoCo, a web interface for corpora compilation

    CoCo is a collaborative web interface for the compilation of linguistic resources. In this demo we present one of its possible applications: paraphrase acquisition.

    ClInt: A bilingual Spanish-Catalan spoken corpus of clinical interviews

    In this paper we present ClInt (Clinical Interview), a bilingual Spanish-Catalan spoken corpus that contains 15 hours of clinical interviews. It consists of audio files aligned with multiple-level transcriptions comprising orthographic, phonetic and morphological information, as well as linguistic and extralinguistic encoding. No such resource previously existed for these languages, and it offers wide-ranging potential for exploitation in a broad variety of disciplines such as Linguistics, Natural Language Processing and related fields.

    Inverse EEG source problems and approximation

    We consider the inverse EEG (ElectroEncephaloGraphy) problem, which consists in recovering, from measurements on electrodes of the electric potential on the scalp, a distribution of pointwise dipolar current sources located in the brain and modeling, e.g., the presence of epileptic foci.

    Influences of the G2350A polymorphism in the ACE Gene on cardiac structure and function of ball game players

    Background: Apart from the I/D polymorphism in the angiotensin I-converting enzyme (ACE) gene, there have been few reports on the relationship between other genetic polymorphisms in this gene and changes in the cardiac structure and function of athletes. We therefore investigated whether the G2350A polymorphism in the ACE gene is associated with changes in the cardiac structure and function of ball game players. A total of 85 healthy participants were recruited, comprising 35 controls and 50 ball game players. Cardiac structure and function were measured by 2-D echocardiography, and the G2350A polymorphism in the ACE gene was analyzed by the SNaPshot method. Results: There were significant differences in left ventricular mass index (LVmassI) among the sporting disciplines studied. In particular, basketball players showed the highest LVmassI value among the sporting disciplines studied (p < 0.05). However, there was no significant association between any echocardiographic data and the G2350A polymorphism in the ACE gene in either the controls or the ball game players. Conclusions: Our data suggest that the G2350A polymorphism in the ACE gene may not contribute significantly to changes in the cardiac structure and function of ball game players, although the sporting discipline of ball game players may influence changes in the LVmassI value of these athletes. Further studies using a larger sample size and other genetic markers in the ACE gene will be needed.

    Angiotensin converting enzyme gene polymorphism is associated with severity of coronary artery disease in men with high total cholesterol levels

    This study examines whether renin-angiotensin-aldosterone system gene polymorphisms: ACE (encoding for angiotensin converting enzyme) c.2306-117_404 I/D, AGTR1 (encoding for angiotensin II type-1 receptor) c.1080*86A>C and CYP11B2 (encoding for aldosterone synthase) c.-344C>T are associated with the extension of coronary atherosclerosis in a group of 647 patients who underwent elective coronary angiography. The extension of CAD was evaluated using the Gensini score. The polymorphisms were determined by PCR and RFLP assays. The associations between genotypes and the extent of coronary atherosclerosis were tested by the Kruskal-Wallis test, followed by pairwise comparisons using Wilcoxon test. The population has been divided into groups defined by: sex, smoking habit, past myocardial infarction, BMI (>, ≤ 25), age (>, ≤ 55), diabetes mellitus, level of total cholesterol (>, ≤ 200 mg/dl), LDL cholesterol (>, ≤ 130 mg/dl), HDL cholesterol (>, ≤ 40 mg/dl), triglycerides (>, ≤ 150 mg/dl). Significant associations between the ACE c.2306-117_404 I/D polymorphism and the Gensini score in men with high total cholesterol levels (PKruskal-Wallis = 0.008; Padjusted = 0.009), high level of LDL cholesterol (PKruskal-Wallis = 0.016; Padjusted = 0.028) and low level of HDL cholesterol (PKruskal-Wallis = 0.04; Padjusted = 0.055) have been found. No association between the AGTR1 c.1080*86A>C and CYP11B2 c.-344C>T and the Gensini score has been found. These results suggest that men who carry ACE c.2306-117_404 DD genotype and have high total cholesterol, high LDL cholesterol and low HDL cholesterol levels may be predisposed to the development of more severe CAD