Skip to main content
Article thumbnail
Location of Repository

Manuscript ID 0094-SIP-2011-PIEEE.R1 1 Statistical Entity Extraction from Web

By Zaiqing Nie, Ji-rong Wen and Wei-ying Ma

Abstract

Abstract — There are various kinds of valuable semantic information about real-world entities embedded in web pages and databases. Extracting and integrating these entity information from the Web is of great significance. Comparing to traditional information extraction problems, web entity extraction needs to solve several new challenges to fully take advantage of the unique characteristic of the Web. In this paper, we introduce our recent work on statistical extraction of structured entities, named entities, entity facts and relations from Web. We also briefly introduce iKnoweb, an interactive knowledge mining framework for entity information integration. We will use two novel web applications, Microsoft Academic Search (aka Libra) and EntityCube, as working examples

Topics: Index Terms — Entity Extraction, Named Entity Extraction, Entity Search, Entity Relationship Mining, Natural Language Processing, Web Page Segmentation, Interactive Knowledge Mining, Crowdsourcing T
Year: 2013
OAI identifier: oai:CiteSeerX.psu:10.1.1.352.7436
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://research.microsoft.com/... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.