8 research outputs found

    A EVOLUÇÃO DA TECNOLOGIA NA ELABORAÇÃO DO ÍNDICE DE FINAL DE LIVROS: UMA REVISÃO DE LITERATURA [The Evolution of Technology in the Preparation of Back-of-Book Indexes: A Literature Review]

    The back-of-book index is one of the oldest devices used for organizing and retrieving information. Its significance lies in its being one of the main access points to the content of a book, helping the reader understand the relevant concepts contained in the manuscript. However, in Brazil the back-of-book index is usually prepared manually, which demands time and money and explains the low number of publications that contain an index. This article therefore presents studies that discuss the use of technology in preparing the index, based on 16 publications selected through a literature review. The results show that publications from the earliest years presented technological involvement only incipiently and that, over the years, there were several reports of indexes produced by automatic indexing. The article concludes that studies are needed on the use of semi-automatic indexing to assist in preparing the index: the indexer's intervention, even with the help of technology, remains necessary to make the semantics and context explicit, while technology is needed to speed up the finalization of the index.

    A novel, Language-Independent Keyword Extraction method

    Obtaining the most representative set of words in a document is a significant task, since it characterizes the document and simplifies search and classification. This paper presents a novel method, called LIKE, that automatically extracts keywords from a document regardless of the language used in it. To do so, it uses a three-stage process: the first stage identifies the most representative terms, the second stage builds an appropriate numeric representation for those terms, and the third uses a feed-forward neural network to obtain a predictive model. To measure the efficacy of the LIKE method, the articles published by the Workshop of Computer Science Researchers (WICC) over 14 years (1999-2012) were used. The results show that LIKE performs better than the KEA method, one of the most widely mentioned solutions in the literature on this topic.
    X Workshop bases de datos y minería de datos. Red de Universidades con Carreras en Informática (RedUNCI)
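
The three-stage process described in this abstract can be illustrated with a minimal sketch. It is not the LIKE implementation: the abstract does not say how terms are selected or represented, so the frequency-based candidate selection, the three features (relative frequency, relative first position, scaled length), the network size, and all function and class names below are assumptions chosen only to show the overall shape of such a pipeline.

```python
import math
import random
import re
from collections import Counter


def candidate_terms(text, top_n=6):
    """Stage 1 (assumed): keep the most frequent tokens as candidate terms."""
    tokens = re.findall(r"\w+", text.lower())
    return tokens, Counter(tokens).most_common(top_n)


def term_features(term, count, tokens):
    """Stage 2 (assumed): language-independent numeric features for one term."""
    freq = count / len(tokens)                # relative frequency
    first = tokens.index(term) / len(tokens)  # relative first position
    length = min(len(term), 20) / 20          # term length scaled to [0, 1]
    return [freq, first, length]


class TinyFeedForward:
    """Stage 3 (assumed): one tanh hidden layer, sigmoid output, plain SGD."""

    def __init__(self, n_in, n_hidden=4, seed=0):
        rng = random.Random(seed)
        # Each weight row carries one extra entry for the bias term.
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
                   for _ in range(n_hidden)]
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]

    def _forward(self, x):
        xb = x + [1.0]
        hb = [math.tanh(sum(w * v for w, v in zip(row, xb)))
              for row in self.w1] + [1.0]
        z = sum(w * v for w, v in zip(self.w2, hb))
        return xb, hb, 1.0 / (1.0 + math.exp(-z))

    def predict(self, x):
        """Return the estimated probability that the term is a keyword."""
        return self._forward(x)[2]

    def train(self, samples, epochs=500, lr=0.5):
        for _ in range(epochs):
            for x, y in samples:
                xb, hb, p = self._forward(x)
                delta_out = p - y  # cross-entropy gradient at the output
                # Update hidden weights first, using the old output weights.
                for j, row in enumerate(self.w1):
                    delta_h = delta_out * self.w2[j] * (1 - hb[j] ** 2)
                    for i in range(len(row)):
                        row[i] -= lr * delta_h * xb[i]
                for j in range(len(self.w2)):
                    self.w2[j] -= lr * delta_out * hb[j]


if __name__ == "__main__":
    doc = "keyword extraction finds the keyword terms that describe the document"
    tokens, candidates = candidate_terms(doc)
    gold = {"keyword", "extraction"}  # hypothetical labels for illustration
    samples = [(term_features(t, c, tokens), 1.0 if t in gold else 0.0)
               for t, c in candidates]
    net = TinyFeedForward(n_in=3)
    net.train(samples)
    for (term, _), (x, _) in zip(candidates, samples):
        print(term, round(net.predict(x), 3))
```

None of the features depends on a stop-word list, stemmer, or dictionary, which is one plausible way to read "regardless of the language"; a real system would train on many labeled documents rather than a single sentence.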

    Keywords at Work: Investigating Keyword Extraction in Social Media Applications

    This dissertation examines a long-standing problem in Natural Language Processing (NLP) -- keyword extraction -- from a new angle. We investigate how keyword extraction can be formulated on social media data, such as emails, product reviews, student discussions, and student statements of purpose. We design novel graph-based features for supervised and unsupervised keyword extraction from emails, and use the resulting system successfully to uncover patterns in a new dataset -- student statements of purpose. Furthermore, the system is used with new features on the problem of usage expression extraction from product reviews, where we obtain interesting insights. When applied to student discussions, the system uncovers new patterns. While each of the above problems is conceptually distinct, they share two key common elements -- keywords and social data. Social data can be messy, hard to interpret, and not easily amenable to existing NLP resources. We show that our system is robust enough in the face of such challenges to discover useful and important patterns. We also show that the problem definition of keyword extraction itself can be expanded to accommodate new and challenging research questions and datasets.
    PhD, Computer Science & Engineering. University of Michigan, Horace H. Rackham School of Graduate Studies. https://deepblue.lib.umich.edu/bitstream/2027.42/145929/1/lahiri_1.pd
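
To make "graph-based features" concrete, here is a minimal sketch of one common approach: build a word co-occurrence graph and read off a degree and a PageRank score for each word, in the style of TextRank. The abstract does not specify the dissertation's actual features, so the window size, the two features, and every function name below are assumptions for illustration only.

```python
import re
from collections import defaultdict


def cooccurrence_graph(text, window=2):
    """Link each word to the distinct words that follow it within `window`."""
    tokens = re.findall(r"[a-z]+", text.lower())
    graph = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != word:
                graph[word].add(other)
                graph[other].add(word)  # undirected edge
    return graph


def pagerank(graph, damping=0.85, iterations=30):
    """Plain power-iteration PageRank on an undirected graph."""
    nodes = list(graph)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        rank = {
            n: (1 - damping) / len(nodes)
            + damping * sum(rank[m] / len(graph[m]) for m in graph[n])
            for n in nodes
        }
    return rank


def graph_features(text, window=2):
    """Return per-word features usable for ranking or as classifier input."""
    graph = cooccurrence_graph(text, window)
    rank = pagerank(graph)
    return {w: {"degree": len(graph[w]), "pagerank": rank[w]} for w in graph}


if __name__ == "__main__":
    email = "please review the budget report and send the budget summary today"
    feats = graph_features(email)
    ranked = sorted(feats, key=lambda w: feats[w]["pagerank"], reverse=True)
    for word in ranked[:5]:
        print(word, feats[word])
```

In an unsupervised setting the scores can rank candidate keywords directly; in a supervised setting they would be fed to a classifier alongside other features. Stop-word filtering is omitted here for brevity.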

    Computer Science & Technology Series : XIX Argentine Congress of Computer Science. Selected papers

    CACIC’13 was the nineteenth Congress in the CACIC series. It was organized by the Department of Computer Systems at the CAECE University in Mar del Plata. The Congress included 13 Workshops with 165 accepted papers, 5 Conferences, 3 invited tutorials, various meetings related to Computer Science Education (Professors, PhD students, Curricula) and an International School with 5 courses. CACIC 2013 followed the traditional Congress format, with 13 Workshops covering diverse areas of Computer Science research. Each topic was supervised by a committee of 3-5 chairs from different universities. The call for papers attracted a total of 247 submissions. An average of 2.5 review reports were collected for each paper, for a grand total of 676 review reports that involved about 210 different reviewers. A total of 165 full papers, involving 489 authors and 80 universities, were accepted, and 25 of them were selected for this book.
    Red de Universidades con Carreras en Informática (RedUNCI)