36 research outputs found
Paraphrastic Reformulations in Spoken Corpora
International audienceOur work addresses the automatic detection of paraphrastic reformulation in French spoken corpora. The proposed approach is syn-tagmatic. It is based on specific markers and the specificities of the spoken language. Manual multi-dimensional annotation performed by two annotators provides fine-grained reference data. An automatic method is proposed in order to decide whether sentences contain or not paraphras-tic relations. The obtained results show up to 66.4% precision. Analysis of the manual annotations indicates that few paraphrastic segments show morphological modifications (inflection, derivation or compounding) and that the syntactic equivalence between the segments is seldom respected, as these usually belong to different syntactic categories
Recommended from our members
Evidence for insertional codemixing: mixed compounds and French nominal groups in Brussels Dutch
In this paper we analyse mixed compounds, such as legume+winkel âvegetable shop, greengroceryâ and winter+paletot âwinter coatâ which contain a French and a Dutch element, and French nominal groups, such as carte dâidentitĂ© âidentity cardâ, and journal parlĂ© âradio newsâ, which bilingual speakers from Brussels frequently insert into Brussels Dutch utterances. Using Muyskenâs (2000) typology of bilingual speech, we claim that the mixed compounds and the nominal groups display the characteristics of insertional code-mixing. In addition, some evidence for the existence of a continuum between borrowing and code-switching can be obtained from these examples. As the multimorphemic units that are inserted into Dutch are neither single words, nor full constituents, their status in the lexicon raises interesting issues for researchers interested in the interface between syntax and the lexicon (see also Backus 2003). We try to argue that nominal groups such as carte dâidentitĂ© and journal parlĂ© are probably best seen as lexical templates or constructional idioms (Booij, 2002b). The insertion of French constructional idioms in Brussels Dutch represents an innovation in the lexical patterns that are available to speakers of this language, which is highly relevant for theories of language change
Print awareness of adult illiterates: a comparison with young pre-readers and low-educated adult readers
In this study the print awareness of 25 unschooled adult illiterates in the Netherlands was compared with that of 24 pre-reading children and of 23 low-educated literate adults with approximately four years of primary schooling. The illiterates were interviewed about their experiences with writing and all participants completed six assessments of print awareness in the language they preferred (first or second language). The outcomes revealed that the three groups did not differ in distinguishing conventional written signs from other visual signs, that both groups of non-readers differed significantly from low educated readers but not from each other in knowledge of logos, inscriptions and knowledge of the written register, while the adult illiterates performed significantly better than the children on grapheme knowledge. Adult illiterates in literate societies seem to be well informed about the uses and functions of written language and about what writing looks like, but like young children they are not good at reading environmental print out of context and in explaining what exactly is represented in writing. The variation in reactions within the group of illiterate adults could be related to existing models of emergent literacy. Implications for adult literacy education are discussed