In this paper, we present an ontology population approach for legal ontologies. We exploit Wikipedia as a source of manually annotated examples of legal entities. We align YAGO, a Wikipedia-based ontology, with LKIF, an ontology specifically designed for the legal domain. Through this alignment, we can effectively populate the LKIF ontology, with the aim of obtaining examples to train a Named Entity Recognizer and Classifier for finding and classifying entities in legal texts. Since annotated data in the legal domain is scarce, we apply a machine learning strategy called curriculum learning, which aims to overcome overfitting by learning increasingly complex concepts. We compare the performance of this method at identifying Named Entities against batch learning and two other baselines. Results are satisfactory and encourage further research in this direction.

Alonso Alemany, Laura

Cardellino, Cristian

Teruel, Milagro

Villata, Serena

HAL-UNICE

Learning Slowly To Learn Better: Curriculum Learning for Legal Ontology Population

Archive Ouverte en Sciences de l'Information et de la Communication

INRIA a CCSD electronic archive server

Paper presented in the Proceedings of the Thirtieth International Florida Artificial Intelligence Research Society Conference.

Affiliation: Cardellino, Cristian. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía, Física y Computación; Argentina.

Affiliation: Teruel, Milagro. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía, Física y Computación; Argentina.

Affiliation: Alonso Alemany, Laura. Universidad Nacional de Córdoba. Facultad de Matemática, Astronomía, Física y Computación; Argentina.

Affiliation: Alonso Alemany, Laura. Université Côte d'Azur; France.

https://aaai.org/ocs/index.php/FLAIRS/FLAIRS17/paper/view/15526

Other Computer and Information Sciences

Repositorio Digital de la Universidad Nacional de Córdoba

English

https://hal.archives-ouvertes.fr/hal-01572442/document
