8 research outputs found

    Using Conjunctions and Adverbs for Author Verification

    Get PDF
    Abstract: Linguistics and stylistics have been investigated for author identification for quite a while, but recently, we have testified a impressive growth in the volume with which lawyers and courts have called upon the expertise of linguists in cases of disputed authorship. This motivates computer science researchers to look to the problem of author identification from a different perspective. In this work, we propose a stylometric feature set based on conjunctions and adverbs of the Portuguese language to address the problem of author identification. Two different approaches of classification were considered. The first one is called writer-independent and it reduces the pattern recognition problem to a single model and two classes, hence, makes it possible to build robust system even when few genuine samples per writer are available. The second one is called the personal model, or writer-dependent, which very often performs better but needs a bigger number of samples per writer. Experiments on a database composed of short articles from 30 different authors and Support Vector Machine (SVM) as classifier demonstrate that the proposed strategy can produced results comparable to the literature

    Using Conjunctions and Adverbs for Author Verification

    No full text
    Linguistics and stylistics have been investigated for author identification for quite awhile, but recently, we have testified a impressive growth in the volume with which lawyers and courts have called upon the expertise of linguists in cases of disputed authorship. This motivatescomputer science researchers to look to the problem of author identification from a different perspective. In this work, we propose a stylometric feature set based on conjunctions and ad-verbs of the Portuguese language to address the problem of author identification. Two different approaches of classification were considered. The first one is called writer-independent and it re-duces the pattern recognition problem to a single model and two classes, hence, makes it possible to build robust system even when few genuine samples per writer are available. The second oneis called the personal model, or writer-dependent, which very often performs better but needs a bigger number of samples per writer. Experiments on a database composed of short articlesfrom 30 different authors and Support Vector Machine (SVM) as classifier demonstrate that the proposed strategy can produced results comparable to the literature

    Charge Trapping at the Step Edges of TiO2 Anatase (101)

    No full text
    A combination of photoemission, atomic force, and scanning tunneling microscopy/spectroscopy measurements shows that excess electrons in the TiO2 anatase (101) surface are trapped at step edges. Consequently, steps act as preferred adsorption sites for O-2. In density functional theory calculations electrons localize at clean step edges, this tendency is enhanced by O vacancies and hydroxylation. The results show the importance of defects for the wide-ranging applications of titania
    corecore