Search CORE

3 research outputs found

Evaluation Measures for Text Summarization

Author: Ježek Karel
Steinberger Josef
Publication venue: Institute of Informatics, Slovak Academy of Sciences
Publication date: 26/01/2012
Field of study

We explain the ideas of automatic text summarization approaches and the taxonomy of summary evaluation methods. Moreover, we propose a new evaluation measure for assessing the quality of a summary. The core of the measure is covered by Latent Semantic Analysis (LSA) which can capture the main topics of a document. The summarization systems are ranked according to the similarity of the main topics of their summaries and their reference documents. Results show a high correlation between human rankings and the LSA-based evaluation measure. The measure is designed to compare a summary with its full text. It can compare a summary with a human written abstract as well; however, in this case using a standard ROUGE measure gives more precise results. Nevertheless, if abstracts are not available for a given corpus, using the LSA-based measure is an appropriate choice

Computing and Informatics (E-Journal - Institute of Informatics, SAS, Bratislava)

Comparing Approaches for Weighting Applications Specific Data in Multi-Application User Interest Modeling

Author: Patra Atish Kumar
Publication venue
Publication date: 05/02/2015
Field of study

This thesis presents a framework known as User Interest Modeling and Personalization (UIMAP) which builds a model by identifying and aggregating an individual user's interest expressed through their interactions with different applications at different times. To do this, we have implemented a content consumer/producer architecture. For this thesis, Microsoft Word and PowerPoint are treated as content producer applications while a web browser is used as a content consumer application. We unobtrusively observe user interactions with these applications as well as the actual content consumed/prepared in them. The challenge is to understand the importance of each application towards the user's real interest. Based on user activity data in these applications, Multilayer Perceptron (MLP), Support Vector Machine (SVM) and Weighted K-Nearest Neighborhood (WKNN) techniques are compared in their ability to combine these kinds of heterogeneous interest indicators into a single model. Thus, each application is weighted differently based on its contributing indicators to predict the relevant content for the specific need of an individual. We found that textual content from content producer applications plays an equally important role as content from consumer applications. Implicit feedbacks from consumer applications also have a major role in user's interest. The results indicated that WKNN is preferred if feature weighting is the primary goal while SVM is the preferred choice if identifying relevant content is the main objective

Texas A&M Repository

Visualización de esquemas de representación de conocimiento para el acceso a recursos en repositorios digitales

Author: Gaona García Paulo Alonso
Publication venue
Publication date: 01/01/2014
Field of study

El siguiente documento presenta los resultados de investigación realizados a partir de estudios enfocados en el desarrollo e implementación de interfaces de búsqueda de objetos de aprendizaje, a partir de técnicas de visualización sobre repositorios digitales. Actualmente existen una gran cantidad de recursos digitales sobre Internet, y el acceso a los mismos en gran medida dependen de las estrategias que puedan ofrecer motores de búsquedas convencionales, o soluciones especializadas que permitan su clasificación, gestión y administración, como es el caso de los repositorios digitales. Sin embargo, existen una serie de factores que influyen sobre el acceso a los mismos, partiendo de la definición de los metadatos, y las estrategias de búsqueda que se definan sobre grandes volúmenes de información. Una de las áreas de mayor aceptación a lo largo de los últimos años es la visualización de información, área de trabajo que facilita la presentación visual de información compleja haciendo uso adecuado de espacios y estructuras gráficas, con el fin de facilitar su rápida asimilación y comprensión. Por lo tanto, para los propósitos específicos de esta investigación, abordaremos el área de visualización de información mediante el uso de metodologías de evaluación y estrategias de diseño para el desarrollo e implementación de interfaces de búsquedas efectivas, para el acceso a colecciones de recursos digitales alojados en repositorios digitales. El propósito fundamental de esta investigación es ofrecer alternativas de acceso a partir de técnicas de visualización, para facilitar a creadores de repositorios digitales el análisis, desarrollo e implementación de interfaces de búsqueda visual

e_Buah - Biblioteca Digital de la Universidad de Alcalá

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Biblioteca Digital de la Universidad de Alcalá