A Legal Perspective on Training Models for Natural Language Processing
A significant concern in processing natural language data is the often unclear legal status of the input and output data and resources. In this paper, we investigate this problem by discussing a typical activity in Natural Language Processing: the training of a machine learning model from an annotated corpus. We examine which legal rules apply at the relevant steps and how they affect the legal status of the results, especially in terms of copyright and copyright-related rights.
From manuscript catalogues to a handbook of Syriac literature: Modeling an infrastructure for Syriaca.org
Despite increasing interest in Syriac studies and growing digital
availability of Syriac texts, there is currently no up-to-date infrastructure
for discovering, identifying, classifying, and referencing works of Syriac
literature. The standard reference work (Baumstark's Geschichte) is over ninety
years old, and the perhaps 20,000 Syriac manuscripts extant worldwide can be
accessed only through disparate catalogues and databases. The present article
proposes a tentative data model for Syriaca.org's New Handbook of Syriac
Literature, an open-access digital publication that will serve as both an
authority file for Syriac works and a guide to accessing their manuscript
representations, editions, and translations. The authors hope that by
publishing a draft data model they can receive feedback and incorporate
suggestions into the next stage of the project.
Comment: Part of special issue: Computer-Aided Processing of Intertextuality
in Ancient Languages. 15 pages, 4 figures
The use of corpora and other electronic tools in historical research on translation
Translation history and historiographical approaches to translation have traditionally relied on the knowledge provided by the historical context and by both contextual and paratextual features of the translated texts, together with their reception. Nonetheless, only by correlating historiographical insights with empirical evidence obtained from the translated texts will it be possible to produce a coherent and sound translation history. In this line of work, technology and the digital humanities offer the translation historian tools which can complement non-computational methods and more traditional approaches to the sources, and which can be very beneficial if implemented correctly. This chapter advocates the use of tools such as corpora, derived from linguistics, to complement research carried out from a historiographical point of view, while also indicating some of their possible drawbacks or limitations. In this increasingly technological world, the translation history researcher should be aware of both the opportunities and the challenges provided by these tools and embrace their use with the aim of facilitating interdisciplinary avenues and progress in the field.
Linguistics in the digital humanities: (computational) corpus linguistics
Corpus linguistics has been closely intertwined with digital technology since the introduction of university computer mainframes in the 1960s. Making use of both digitized data in the form of the language corpus and computational methods of analysis involving concordancers and statistics software, corpus linguistics arguably has a place in the digital humanities. Still, it remains obscure and figures only sporadically in the literature on the digital humanities. This article provides an overview of the main principles of corpus linguistics and the role of computer technology in relation to data and method, and also offers a bird's-eye view of the history of corpus linguistics with a focus on its intimate relationship with digital technology and how digital technology has impacted the very core of corpus linguistics and shaped the identity of the corpus linguist. Ultimately, the article is oriented towards an acknowledgment of corpus linguistics' alignment with the digital humanities.