Search CORE

4 research outputs found

Named Entity Recognition in Indian court judgments

Author: Agarwal Astha
Gupta Smita
Kalamkar Prathamesh
Karn Saurabh
Raghavan Vivek
Tiwari Aman
Publication venue
Publication date: 07/11/2022
Field of study

Identification of named entities from legal texts is an essential building block for developing other legal Artificial Intelligence applications. Named Entities in legal texts are slightly different and more fine-grained than commonly used named entities like Person, Organization, Location etc. In this paper, we introduce a new corpus of 46545 annotated legal named entities mapped to 14 legal entity types. The Baseline model for extracting legal named entities from judgment text is also developed.Comment: to be published in NLLP 2022 Workshop at EMNL

arXiv.org e-Print Archive

Lynx: A knowledge-based AI service platform for content processing, enrichment and analysis for the legal domain

Author: Boil Ballesteros P.
Bosque Gil J.
Gomez Diaz E.
Gracia Jorge
Kaltenböck M.
Karampatakis S.
Kernerman I.
Lagzdins A.
Lonke D.
Maganza F.
Martín-Chozas P.
Montiel-Ponsoda E.
Moreno Schneider J.
Navas-Loro M.
Rehm G.
Revenko A.
Rodríguez-Doncel V.
Sageder C.
Verhoeven P.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2021
Field of study

The EU-funded project Lynx focuses on the creation of a knowledge graph for the legal domain (Legal Knowledge Graph, LKG) and its use for the semantic processing, analysis and enrichment of documents from the legal domain. This article describes the use cases covered in the project, the entire developed platform and the semantic analysis services that operate on the documents. © 202

Repositorio Universidad de Zaragoza

Towards Unstructured Knowledge Integration in Natural Language Processing

Author: Ruggeri Federico <1993>
Publication venue: Alma Mater Studiorum - Università di Bologna
Publication date: 23/06/2022
Field of study

In the last decades, Artificial Intelligence has witnessed multiple breakthroughs in deep learning. In particular, purely data-driven approaches have opened to a wide variety of successful applications due to the large availability of data. Nonetheless, the integration of prior knowledge is still required to compensate for specific issues like lack of generalization from limited data, fairness, robustness, and biases. In this thesis, we analyze the methodology of integrating knowledge into deep learning models in the field of Natural Language Processing (NLP). We start by remarking on the importance of knowledge integration. We highlight the possible shortcomings of these approaches and investigate the implications of integrating unstructured textual knowledge. We introduce Unstructured Knowledge Integration (UKI) as the process of integrating unstructured knowledge into machine learning models. We discuss UKI in the field of NLP, where knowledge is represented in a natural language format. We identify UKI as a complex process comprised of multiple sub-processes, different knowledge types, and knowledge integration properties to guarantee. We remark on the challenges of integrating unstructured textual knowledge and bridge connections with well-known research areas in NLP. We provide a unified vision of structured knowledge extraction (KE) and UKI by identifying KE as a sub-process of UKI. We investigate some challenging scenarios where structured knowledge is not a feasible prior assumption and formulate each task from the point of view of UKI. We adopt simple yet effective neural architectures and discuss the challenges of such an approach. Finally, we identify KE as a form of symbolic representation. From this perspective, we remark on the need of defining sophisticated UKI processes to verify the validity of knowledge integration. To this end, we foresee frameworks capable of combining symbolic and sub-symbolic representations for learning as a solution

AMS Tesi di Dottorato