Search CORE

138 research outputs found

TIR 2015 Workshop Preface

Author: Granitzer Michael
Seifert Christin
Stein Benno
Publication venue
Publication date: 01/01/2015
Field of study

Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record

University of Twente Research Information

TU Graz: Course: 707.000 Web Science and Web Technology: Lecture 10: Text Mining

Author: Granitzer Michael
Publication venue
Publication date: 30/09/2008
Field of study

This class introduces basics of web mining and information retrieval including, for example, an introduction to the Vector Space Model and Text Mining. Guest Lecturer: Dr. Michael Granitzer Optional: Modeling the Internet and the Web: Probabilistic Methods and Algorithms, Pierre Baldi, Paolo Frasconi, Padhraic Smyth, Wiley, 2003 (Chapter 4, Text Analysis

EdShare

QueryCrumbs for Experts: A Compact Visual Query Support System to Facilitate Insights into Search Engine Internals

Author: Granitzer Michael
Schlötterer Jörg
Seifert Christin
Publication venue
Publication date: 01/01/2018
Field of study

Crossref

University of Twente Research Information

From Tail to Head: Browser Based Suggestion of Long-tail Resources

Author: Granitzer Michael
Kern Roman
Schlötterer Jörg
Seifert Christin
Publication venue
Publication date: 01/09/2014
Field of study

University of Twente Research Information

Search-based Entity Disambiguation with Document-Centric Knowledge Bases

Author: Granitzer Michael
Seifert Christin
Zwicklbauer Stefan
Publication venue: ACM Press
Publication date: 01/01/2015
Field of study

Entity disambiguation is the task of mapping ambiguous terms in natural-language text to its entities in a knowledge base. One possibility to describe these entities within a knowledge base is via entity-annotated documents (document-centric knowledge base). It has been shown that entity disambiguation with search-based algorithms that use document-centric knowledge bases perform well on the biomedical domain. In this context, the question remains how the quantity of annotated entities within documents and the document count used for entity classification influence disambiguation results. Another open question is whether disambiguation results hold true on more general knowledge data sets (e.g. Wikipedia). In our work we implement a search-based, document-centric disambiguation system and explicitly evaluate the mentioned issues on the biomedical data set CALBC and general knowledge data set Wikipedia, respectively. We show that the number of documents used for classification and the amount of annotations within these documents must be well-matched to attain the best result. Additionally, we reveal that disambiguation accuracy is poor on Wikipedia. We show that disambiguation results significantly improve when using shorter but more documents (e.g. Wikipedia paragraphs). Our results indicate that search-based, document-centric disambiguation systems must be carefully adapted with reference to the underlying domain and availability of user dat

Crossref

ZENODO

University of Twente Research Information

Web-based Just-In-Time Retrieval for Cultural Content

Author: Granitzer Michael
Schlötterer Jörg
Seifert Christin
Publication venue
Publication date: 01/02/2014
Field of study

University of Twente Research Information