Concept Extraction and Clustering for Topic Digital Library Construction

Authors: Zhang Chengzhi, Wu Dan
Publication date: 1 December 2008

Abstract

This paper introduces a new approach to building a topic digital library using concept extraction and document clustering. First, documents in a specific domain are obtained automatically using a document classification approach. Then, the keywords of each document are extracted using a machine learning approach. The keywords are used to cluster the document subset, and the clustering result serves as the taxonomy of the subset. Finally, the taxonomy is manually adjusted into a hierarchical structure for user navigation. The topic digital library is constructed by combining full-text retrieval with hierarchical navigation.
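
The keyword extraction and clustering steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which learning method or clustering algorithm is used, so this sketch swaps in plain TF-IDF ranking for the machine learning keyword extractor and a greedy keyword-overlap (Jaccard) grouping for the clustering. All function names, the sample documents, and the `top_k` and `threshold` parameters are hypothetical. The document classification step and the manual taxonomy adjustment are not covered.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def extract_keywords(docs, top_k=3):
    """Return the top_k TF-IDF terms per document.

    Assumption: TF-IDF stands in for the trained keyword extractor
    mentioned in the abstract.
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    keywords = []
    for tokens in tokenized:
        tf = Counter(tokens)
        scores = {
            term: count * math.log((1 + n_docs) / (1 + doc_freq[term]))
            for term, count in tf.items()
        }
        # Sort by descending score, then alphabetically to break ties.
        ranked = sorted(scores, key=lambda term: (-scores[term], term))
        keywords.append(set(ranked[:top_k]))
    return keywords


def jaccard(a, b):
    """Jaccard similarity of two keyword sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_keywords(keywords, threshold=0.2):
    """Greedy single-pass clustering on keyword overlap.

    Each document joins the first cluster whose pooled keywords are
    similar enough; otherwise it starts a new cluster.
    """
    clusters = []  # list of (member indices, pooled keyword set)
    for idx, kws in enumerate(keywords):
        for members, pooled in clusters:
            if jaccard(kws, pooled) >= threshold:
                members.append(idx)
                pooled |= kws  # grow the cluster's keyword pool in place
                break
        else:
            clusters.append(([idx], set(kws)))
    return [members for members, _ in clusters]


# Hypothetical toy documents from two topics.
docs = [
    "library catalog library metadata",
    "metadata library catalog catalog",
    "neural network training network",
    "training neural network neural",
]
clusters = cluster_by_keywords(extract_keywords(docs))
print(clusters)  # prints [[0, 1], [2, 3]]
```

In this sketch each resulting cluster would correspond to one candidate node of the taxonomy, which would then be arranged into a hierarchy by hand, as the abstract describes.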

Available Versions

E-LIS

oai:eprints.rclis.org:12692

Last time updated on 16/07/2013
