A Hybrid Approach to General Information Extraction

Grap, Marie Belen

A Hybrid Approach to General Information Extraction

Authors: Marie Belen Grap
Publication date: 1 September 2015
Publisher: DigitalCommons@CalPoly

Abstract

Information Extraction (IE) is the process of analyzing documents and identifying desired pieces of information within them. Many IE systems have been developed over the last couple of decades, but there is still room for improvement as IE remains an open problem for researchers. This work discusses the development of a hybrid IE system that attempts to combine the strengths of rule-based and statistical IE systems while avoiding their unique pitfalls in order to achieve high performance for any type of information on any type of document. Test results show that this system operates competitively in cases where target information belongs to a highly-structured data type and when critical contextual information is in close proximity to the target

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

DigitalCommons@CalPoly

oai:digitalcommons.calpoly.edu...

Last time updated on 05/05/2016