7 research outputs found

    Language comparison via network topology

    Full text link
    Modeling relations between languages can offer understanding of language characteristics and uncover similarities and differences between languages. Automated methods applied to large textual corpora can be seen as opportunities for novel statistical studies of language development over time, as well as for improving cross-lingual natural language processing techniques. In this work, we first propose how to represent textual data as a directed, weighted network by the text2net algorithm. We next explore how various fast, network-topological metrics, such as network community structure, can be used for cross-lingual comparisons. In our experiments, we employ eight different network topology metrics, and empirically showcase on a parallel corpus, how the methods can be used for modeling the relations between nine selected languages. We demonstrate that the proposed method scales to large corpora consisting of hundreds of thousands of aligned sentences on an of-the-shelf laptop. We observe that on the one hand properties such as communities, capture some of the known differences between the languages, while others can be seen as novel opportunities for linguistic studies

    RaKUn: Rank-based Keyword extraction via Unsupervised learning and Meta vertex aggregation

    Full text link
    Keyword extraction is used for summarizing the content of a document and supports efficient document retrieval, and is as such an indispensable part of modern text-based systems. We explore how load centrality, a graph-theoretic measure applied to graphs derived from a given text can be used to efficiently identify and rank keywords. Introducing meta vertices (aggregates of existing vertices) and systematic redundancy filters, the proposed method performs on par with state-of-the-art for the keyword extraction task on 14 diverse datasets. The proposed method is unsupervised, interpretable and can also be used for document visualization.Comment: The final authenticated publication is available online at https://doi.org/10.1007/978-3-030-31372-2_2

    Specific Binding of the Pathogenic Prion Isoform: Development and Characterization of a Humanized Single-Chain Variable Antibody Fragment

    Get PDF
    Murine monoclonal antibody V5B2 which specifically recognizes the pathogenic form of the prion protein represents a potentially valuable tool in diagnostics or therapy of prion diseases. As murine antibodies elicit immune response in human, only modified forms can be used for therapeutic applications. We humanized a single-chain V5B2 antibody using variable domain resurfacing approach guided by computer modelling. Design based on sequence alignments and computer modelling resulted in a humanized version bearing 13 mutations compared to initial murine scFv. The humanized scFv was expressed in a dedicated bacterial system and purified by metal-affinity chromatography. Unaltered binding affinity to the original antigen was demonstrated by ELISA and maintained binding specificity was proved by Western blotting and immunohistochemistry. Since monoclonal antibodies against prion protein can antagonize prion propagation, humanized scFv specific for the pathogenic form of the prion protein might become a potential therapeutic reagent

    Mining Exceptional Social Behaviour

    Get PDF
    Essentially, our lives are made of social interactions. These can be recorded through personal gadgets as well as sensors adequately attached to people for research purposes. In particular, such sensors may record real time location of people. This location data can then be used to infer interactions, which may be translated into behavioural patterns. In this paper, we focus on the automatic discovery of exceptional social behaviour from spatio-temporal data. For that, we propose a method for Exceptional Behaviour Discovery (EBD). The proposed method combines Subgroup Discovery and Network Science techniques for finding social behaviour that deviates from the norm. In particular, it transforms movement and demographic data into attributed social interaction networks, and returns descriptive subgroups. We applied the proposed method on two real datasets containing location data from children playing in the school playground. Our results indicate that this is a valid approach which is able to obtain meaningful knowledge from the data.This work has been partially supported by the German Research Foundation (DFG) project “MODUS” (under grant AT 88/4-1). Furthermore, the research leading to these results has received funding (JG) from ESRC grant ES/N006577/1. This work was financed by the project Kids First, project number 68639
    corecore