4 research outputs found

    Using text mining to uncover hidden information in historical sources

    Full text link
    Π’ ΡΡ‚Π°Ρ‚ΡŒΠ΅ прСдставлСн ΠΎΠ±Π·ΠΎΡ€ Ρ‚ΠΎΠ³ΠΎ, ΠΊΠ°ΠΊ ΠΈΠ½Ρ‚Π΅Π»Π»Π΅ΠΊΡ‚ΡƒΠ°Π»ΡŒΠ½Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· тСкста ΠΈΡΠΏΠΎΠ»ΡŒΠ·ΡƒΠ΅Ρ‚ΡΡ для выявлСния скрытой ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΠΈ Π² историчСских тСкстах. Π’Π½ΠΈΠΌΠ°Π½ΠΈΠ΅ акцСнтируСтся Π½Π° ΠΌΠ΅Ρ‚ΠΎΠ΄Π΅ тСматичСского модСлирования ΠΈ модСлях эмбСддингов слов. Π‘Ρ‚Π°Ρ‚ΡŒΡ ΠΈΠ»Π»ΡŽΡΡ‚Ρ€ΠΈΡ€ΡƒΠ΅Ρ‚, ΠΊΠ°ΠΊ эти ΠΌΠ΅Ρ‚ΠΎΠ΄Ρ‹ использовались Π² ΠΊΠΎΠ½ΠΊΡ€Π΅Ρ‚Π½Ρ‹Ρ… историчСских исслСдованиях. ДСлаСтся Π²Ρ‹Π²ΠΎΠ΄ ΠΎ Ρ‚ΠΎΠΌ, Ρ‡Ρ‚ΠΎ ΠΈΠ½Ρ‚Π΅Π»Π»Π΅ΠΊΡ‚ΡƒΠ°Π»ΡŒΠ½Ρ‹ΠΉ Π°Π½Π°Π»ΠΈΠ· тСкста являСтся ΠΏΠΎΠ»Π΅Π·Π½Ρ‹ΠΌ инструмСнтом для обнаруТСния скрытой ΠΈΠ½Ρ„ΠΎΡ€ΠΌΠ°Ρ†ΠΈΠΈ Π² историчСских тСкстах.The article presents an overview of how text mining can be employed to reveal hidden information in historical texts. The attention is focused on the method of thematic modeling and word embedding models. The article illustrates how these techniques have been utilized in historical research. It concludes that text mining is a useful tool for uncovering hidden information in historical

    A Corpus Approach to Roman Law Based on Justinian’s Digest

    Get PDF
    Traditional philological methods in Roman legal scholarship such as close reading and strict juristic reasoning have analysed law in extraordinary detail. Such methods, however, have paid less attention to the empirical characteristics of legal texts and occasionally projected an abstract framework onto the sources. The paper presents a series of computer-assisted methods to open new frontiers of inquiry. Using a Python coding environment, we have built a relational database of the Latin text of the Digest, a historical sourcebook of Roman law compiled under the order of Emperor Justinian in 533 CE. Subsequently, we investigated the structure of Roman law by automatically clustering the sections of the Digest according to their linguistic profile. Finally, we explored the characteristics of Roman legal language according to the principles and methods of computational distributional semantics. Our research has discovered an empirical structure of Roman law which arises from the sources themselves and complements the dominant scholarly assumption that Roman law rests on abstract structures. By building and comparing Latin word embeddings models, we were also able to detect a semantic split in words with general and legal sense. These investigations point to a practical focus in Roman law which is consistent with the view that ancient law schools were more interested in training lawyers for practice rather than in philosophical neatness.</jats:p

    Vir is to Moderatus as Mulier is to Intemperans. Lemma Embeddings for Latin

    No full text
    This paper presents a new set of lemma embeddings for the Latin language. Embeddings are trained on a manually annotated corpus of texts belonging to the Classical era: different models, architectures and dimensions are tested and evaluated using a novel benchmark for the synonym selection task. A qualitative evaluation is also performed on the embeddings of rare lemmas. In addition, we release vectors pre-trained on the β€œOpera Maiora” by Thomas Aquinas, thus providing a resource to analyze Latin in a diachronic perspective
    corecore