Search CORE

29 research outputs found

Compiling Linguistic Constraints into Finite State Automata

Author: B. Courtois
D. Maurel
E. Roche
E. Roche
Karttunen
M. Constant
M. Gross
M.D. Silberztein
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2006
Field of study

International audienceThis paper deals with linguistic constraints encoded in the form of (binary) tables, generally called lexicon-grammar tables. We describe a unified method to compile sets of tables of linguistic constraints into Finite State Automata. This method has been practically implemented in the linguistic platform Unitex

Crossref

HAL Université de Tours

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Finite-State Technology as a Programming Environment

Author: C.D. Johnson
G. Noord van
J. Daciuk
J.W. Amtrup
K.R. Beesley
M. Holzer
M. Mohri
M. Mohri
M. Mohri
M. Silberztein
R.C. Carrasco
R.M. Kaplan
Y. Cohen-Sygal
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

Detecting Latin-Based Medical Terminology in Croatian Texts

Author: A Gjuran-Coha
B Schneier
C Herrero-Zorita
D Proux
GL Smith
H Liu
JML Piñero
L Norton
M Pacak
M Silberztein
MG Pacak
MP Buono di
P Dujols
P Simon
S Wolff
T Davenport
T Liang
U Hahn
Ž Poljak
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

No matter what the main language of texts in the medical domain is, there is always an evidence of the usage of Latin-derived words and formative elements in terminology development. Generally speaking, this usage presents language-specific morpho-semantic behaviors in forming both technical-scientific and common-usage words. Nevertheless, this usage of Latin in Croatian medical texts does not seem consistent due to the fact that diferent mechanisms of word formation may be applied to the same term. In our pursuit to map all the diferent occurrences of the same concept to only one, we propose a model designed within NooJ and based on dictionaries and morphological grammars. Starting from the manual detection of nouns and their variations, we recognize some word formation mechanisms and develop grammars suitable to recognize Latinisms and Croatinized Latin medical terminology

Repozitorij Filozofskog fakulteta u Zagrebu' at University of Zagreb

Crossref

Università degli Studi di Napoli L'Orientale: CINECA IRIS

Archivio della Ricerca - Università di Salerno

Digitalni arhiv Filozofskog fakulteta u Zagrebu

On Heads and Coordination in Valence Acquisition

Author: A. Böhmová
A. Przepiórkowski
A. Wright
C. Pollard
C. Pollard
C.J. Fillmore
D. Janus
I.A. Mel’čuk
I.A. Sag
J. Beavers
L. Bloomfield
L. Tesnière
M. Silberztein
P. Sgall
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Abstract. The aim of this paper is to present the design of a partial syntactic annotation of the IPI PAN Corpus of Polish [22] and the cor-responding extension of the corpus search engine Poliqarp [25,12] devel-oped at the Institue of Computer Science PAS and currently employed in Polish and Portuguese corpora projects. In particular, we will argue for the need to distinguish between, and represent both, syntactic and se-mantic heads, and we will sketch the representation of coordination, the area traditionally controversial both in theoretical and in computational linguistics. The annotation is designed in a way intended to maximise the usefulness of the resulting corpus for the task of automatic valence acquisition

CiteSeerX

Crossref