Search CORE

6 research outputs found

Knowledge Discovery and Management within Service Centers

Author: Zaman Nazia
Publication venue: North Dakota State University
Publication date: 01/01/2016
Field of study

These days, most enterprise service centers deploy Knowledge Discovery and Management (KDM) systems to address the challenge of timely delivery of a resourceful service request resolution while efficiently utilizing the huge amount of data. These KDM systems facilitate prompt response to the critical service requests and if possible then try to prevent the service requests getting triggered in the first place. Nevertheless, in most cases, information required for a request resolution is dispersed and suppressed under the mountain of irrelevant information over the Internet in unstructured and heterogeneous formats. These heterogeneous data sources and formats complicate the access to reusable knowledge and increase the response time required to reach a resolution. Moreover, the state-of-the art methods neither support effective integration of domain knowledge with the KDM systems nor promote the assimilation of reusable knowledge or Intellectual Capital (IC). With the goal of providing an improved service request resolution within the shortest possible time, this research proposes an IC Management System. The proposed tool efficiently utilizes domain knowledge in the form of semantic web technology to extract the most valuable information from those raw unstructured data and uses that knowledge to formulate service resolution model as a combination of efficient data search, classification, clustering, and recommendation methods. Our proposed solution also handles the technology categorization of a service request which is very crucial in the request resolution process. The system has been extensively evaluated with several experiments and has been used in a real enterprise customer service center

NDSU Libraries Institutional Repository

CARACTERIZACIÓN DE ORACIONES CLAVE DE RESÚMENES MEDIANTE MEDIDAS DE CALIDAD DE AGRUPACIÓN INTERNA

Author: Hernández Castañeda Néstor
Publication venue: 'Universidad Autonoma del Estado de Mexico'
Publication date: 01/07/2017
Field of study

El gran aumento de información digital compartida a través de internet y de otros medios ha hecho necesaria la creación de sistemas que permitan la generación de resúmenes automáticos con el objetivo de presentar a los usuarios la información más relevante del texto o el documento, lo que permite reducir los tiempos de búsqueda y obtención de la información. Los resúmenes se pueden generar por diversos métodos, pero de forma general se clasifican en dos métodos. Los métodos abstractivos y los métodos extractivos. Estos últimos son los que vamos a utilizar para el propósito de este trabajo. Existen técnicas de generación de resúmenes extractivos que difieren en la forma de generar el resumen. Algunas de estas técnicas se basan en la selección de frases similares al título del documento, otras por la posición de frases u oraciones en el texto o asignando pesos a las oraciones. Generalmente, estas técnicas de generación de resúmenes son dependientes del idioma o del dominio. Por esta razón se han desarrollado técnicas de generación de resúmenes independientes del idioma y del dominio, estas técnicas también difieren en la forma de generar el resumen. En este trabajo se va estudiar la generación de resúmenes extractivos por agrupamiento ya que existe gran incertidumbre sobre la relación que existe entre la calidad de las agrupaciones generadas y la calidad del resumen obtenido. Debido a que estos resúmenes son generados por agrupamiento obtienen características propias de los grupos, como pueden ser: compactación, separación, distribución y densidad. Por lo que algunos algoritmos de agrupación son incapaces de evaluar características propias de los grupos. Por esta razón en este trabajo se utilizan medidas de calidad interna de agrupación, las cuales mantienen independencia del algoritmo empleado. A través de estas medidas se evalúa la relación que existe entre la calidad de los grupos y la calidad de los resúmenes obtenidos. Además, en este trabajo se hace un estudio para saber cómo afectan las características de los grupos en la calidad de la agrupación. A través de los experimentos realizados se determina que dos medidas de calidad interna de agrupación pueden evaluar correctamente la relación entre la calidad de los grupos generados con la calidad de los resúmenes utilizados, así como las características de los grupos que son: separación, compactación, ruido, densidad y distribución. Estas medidas son el índice Silhouette y el índice Davies Bouldin

Red Mexicana de Repositorios Institucionales

Repositorio Institucional de la Universidad Autónoma del Estado de México

Semantically enhanced document clustering

Author: Stankov Ivan
Publication venue
Publication date
Field of study

This thesis advocates the view that traditional document clustering could be significantly improved by representing documents at different levels of abstraction at which the similarity between documents is considered. The improvement is with regard to the alignment of the clustering solutions to human judgement. The proposed methodology employs semantics with which the conceptual similarity be-tween documents is measured. The goal is to design algorithms which implement the meth-odology, in order to solve the following research problems: (i) how to obtain multiple deter-ministic clustering solutions; (ii) how to produce coherent large-scale clustering solutions across domains, regardless of the number of clusters; (iii) how to obtain clustering solutions which align well with human judgement; and (iv) how to produce specific clustering solu-tions from the perspective of the user’s understanding for the domain of interest. The developed clustering methodology enhances separation between and improved coher-ence within clusters generated across several domains by using levels of abstraction. The methodology employs a semantically enhanced text stemmer, which is developed for the pur-pose of producing coherent clustering, and a concept index that provides generic document representation and reduced dimensionality of document representation. These characteristics of the methodology enable addressing the limitations of traditional text document clustering by employing computationally expensive similarity measures such as Earth Mover’s Distance (EMD), which theoretically aligns the clustering solutions closer to human judgement. A threshold for similarity between documents that employs many-to-many similarity matching is proposed and experimentally proven to benefit the traditional clustering algorithms in pro-ducing clustering solutions aligned closer to human judgement. 4 The experimental validation demonstrates the scalability of the semantically enhanced document clustering methodology and supports the contributions: (i) multiple deterministic clustering solutions and different viewpoints to a document collection are obtained; (ii) the use of concept indexing as a document representation technique in the domain of document clustering is beneficial for producing coherent clusters across domains; (ii) SETS algorithm provides an improved text normalisation by using external knowledge; (iv) a method for measuring similarity between documents on a large scale by using many-to-many matching; (v) a semantically enhanced methodology that employs levels of abstraction that correspond to a user’s background, understanding and motivation. The achieved results will benefit the research community working in the area of document management, information retrieval, data mining and knowledge management

Online Research @ Cardiff

Document Clustering with Semantic Analysis

Author
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006
Field of study

Crossref