Search CORE

20 research outputs found

Creating a Live, Public Short Message Service Corpus: The NUS SMS Corpus

Author: A. B. Bodomo
A. Deumert
C. Dürscheid
C. Thurlow
D. Crystal
D. Pietrini
F. Liu
I. Hutchby
K. -l. Zhou
M. D. Back
M. Žic Fuchs
Min-Yen Kan
P. G. Ipeirotis
R. Ling
R. Rettie
S. Herring
S. Sotillo
Tao Chen
W. Liu
Y. Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Short Message Service (SMS) messages are largely sent directly from one person to another from their mobile phones. They represent a means of personal communication that is an important communicative artifact in our current digital era. As most existing studies have used private access to SMS corpora, comparative studies using the same raw SMS data has not been possible up to now. We describe our efforts to collect a public SMS corpus to address this problem. We use a battery of methodologies to collect the corpus, paying particular attention to privacy issues to address contributors' concerns. Our live project collects new SMS message submissions, checks their quality and adds the valid messages, releasing the resultant corpus as XML and as SQL dumps, along with corpus statistics, every month. We opportunistically collect as much metadata about the messages and their sender as possible, so as to enable different types of analyses. To date, we have collected about 60,000 messages, focusing on English and Mandarin Chinese.Comment: It contains 31 pages, 6 figures, and 10 tables. It has been submitted to Language Resource and Evaluation Journa

arXiv.org e-Print Archive

CiteSeerX

Crossref

ScholarBank@NUS

Sprache im Fokus

Author: Dürscheid C
Ramers K.H
Schwarz M
Publication venue: Niemeyer
Publication date: 01/01/1997
Field of study

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Pädagogische Fachkräfte als Sprachvorbild in der Kindertagesstätte

Author: Beckerle C.
Dürscheid Ch.
Fried L.
Grimm H.
Hellrung U.
Jeuk S.
Kniffka G.
Motsch H.-J.
Vygotski L. S.
Publication venue: 'Hogrefe Publishing Group'
Publication date
Field of study

Crossref

Verbal morphosyntactic disambiguation through topological field recognition in German-language law texts

Author: A. Frank
A. Kathol
A. Voutilainen
A. Voutilainen
C. Dürscheid
K. Foth
M. Becker
M. Nussbaumer
M.P. Harper
S. Hansen-Schirra
S. Höfler
S. Höfler
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

The morphosyntactic disambiguation of verbs is a crucial pre-processing step for the syntactic analysis of morphologically rich languages like German and domains with complex clause structures like law texts. This paper explores how much linguistically motivated rules can contribute to the task. It introduces an incremental system of verbal morphosyntactic disambiguation that exploits the concept of topological fields. The system presented is capable of reducing the rate of POS-tagging mistakes from 10.2% to 1.6%. The evaluation shows that this reduction is mostly gained through checking the compatibility of morphosyntactic features within the long-distance syntactic relationships of discontinuous verbal elements. Furthermore, the present study shows that in law texts, the average distance between the left and right bracket of clauses is relatively large (9.5 tokens), and that in this domain, a wide context window is therefore necessary for the morphosyntactic disambiguation of verbs

Crossref

ZORA

Sequential patterns in SMS and WhatsApp dialogues: Practices for coordinating actions and managing topics

Author: Beißwenger M
Dürscheid C
Goffman E
Günthner S
Herring S
Herring S
Imo W
Jucker AH
Katharina König
König K
König K
König K
Sacks H
Wyss EL
Örnberg Berglund T
Publication venue: 'SAGE Publications'
Publication date
Field of study

Crossref

The lexicography of German

Author: A Klosa
B Schaeder
C Dürscheid
G Stötzel
HE Wiegand
K Grubmüller
K Kemmer
M Mann
M Schlaefer
O Reichmann
O Reichmann
P Kühn
P Kühn
S Engelberg
S Ulsamer
SP Szlek
U Haß-Zumkehr
W Haubrichs
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/10/2017
Field of study

This chapter discusses the main dictionaries of the German language as it is spoken and written in Germany, and also German as it is spoken and written in Austria, Switzerland, the eastern fringes of Belgium, and South Tyrol. It also briefly describes Pennsylvania German. Corpora and other language resources used in German dictionary-making are also presented. Finally, there is a discussion of some current issues in German lexicography, as well as future prospects

Crossref

Publikationsserver des Instituts für Deutsche Sprache

The lexicography of German

Author: B Schaeder
C Dürscheid
D Herberg
D Steffens
G Stötzel
HE Wiegand
HE Wiegand
HE Wiegand
K Grubmüller
K Kemmer
M Mann
M Schlaefer
O Reichmann
O Reichmann
P Kühn
P Kühn
S Engelberg
S Ulsamer
U Haß-Zumkehr
W Haubrichs
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref