Multiword Expressions We Live by:A Validated Usage-based Dataset from Corpora of Written Italian

Abstract

none5siThe paper describes the creation of a manually validated dataset of Italian multiword expressions, building on candidates automatically extracted from corpora of written Italian. The main features of the resource, such as POS-pattern and lemma distribution, are also discussed, together with possible applications.openFrancesca Masini, M. Silvia Micheli, Andrea Zaninello, Sara Castagnoli, Malvina NissimFrancesca Masini, M. Silvia Micheli, Andrea Zaninello, Sara Castagnoli, Malvina Nissi

    Similar works

    Available Versions