505 research outputs found

    Modeling Documents as Mixtures of Persons for Expert Finding

    Get PDF
    In this paper we address the problem of searching for knowledgeable persons within the enterprise, known as the expert finding (or expert search) task. We present a probabilistic algorithm using the assumption that terms in documents are produced by people who are mentioned in them.We represent documents retrieved to a query as mixtures of candidate experts language models. Two methods of personal language models extraction are proposed, as well as the way of combining them with other evidences of expertise. Experiments conducted with the TREC Enterprise collection demonstrate the superiority of our approach in comparison with the best one among existing solutions

    Being Omnipresent To Be Almighty: The Importance of The Global Web Evidence for Organizational Expert Finding

    Get PDF
    Modern expert nding algorithms are developed under the assumption that all possible expertise evidence for a person is concentrated in a company that currently employs the person. The evidence that can be acquired outside of an enterprise is traditionally unnoticed. At the same time, the Web is full of personal information which is sufficiently detailed to judge about a person's skills and knowledge. In this work, we review various sources of expertise evidence out-side of an organization and experiment with rankings built on the data acquired from six dierent sources, accessible through APIs of two major web search engines. We show that these rankings and their combinations are often more realistic and of higher quality than rankings built on organizational data only

    University of Twente at the TREC 2007 Enterprise Track : modeling relevance propagation for the expert search task

    Get PDF
    This paper describes several approaches which we used for the expert search task of the TREC 2007 Enterprise track.\ud We studied several methods of relevance propagation from documents to related candidate experts. Instead of one-step propagation from documents to directly related candidates, used by many systems in the previous years, we do not limit the relevance flow and disseminate it further through mutual documents-candidates connections. We model relevance propagation using random walk principles, or in formal terms, discrete Markov processes. We experiment with\ud innite and nite number of propagation steps. We also demonstrate how additional information, namely hyperlinks among documents, organizational structure of the enterprise and relevance feedback may be utilized by the presented techniques

    University of Twente at the TREC 2008 Enterprise Track: using the Global Web as an expertise evidence source

    Get PDF
    This paper describes the details of our participation in expert search task of the TREC 2007 Enterprise track.\ud This is the fourth (and the last) year of TREC 2007 Enterprise Track and the second year the University of Twente (Database group) submitted runs for the expert nding task. In the methods that were used to produce these runs, we mostly rely on the predicting potential of those expertise evidence sources that are publicly available on the Global Web, but not hosted at the website of the organization under study (CSIRO). This paper describes the follow-up studies\ud complimentary to our recent research [8] that demonstrated how taking the web factor seriously signicantly improves the performance of expert nding in the enterprise

    Using the Global Web as an Expertise Evidence Source

    Get PDF
    This paper describes the details of our participation in expert search task of the TREC 2007 Enterprise track. The presented study demonstrates the predicting potential of the expertise evidence that can be found outside of the organization. We discovered that combining the ranking built solely on the Enterprise data with the Global Web based ranking may produce significant increases in performance. However, our main goal was to explore whether this result can be further improved by using various quality measures to distinguish among web result items. While, indeed, it was beneficial to use some of these measures, especially those measuring relevance of URL strings and titles, it stayed unclear whether they are decisively important

    Intent Models for Contextualising and Diversifying Query Suggestions

    Full text link
    The query suggestion or auto-completion mechanisms help users to type less while interacting with a search engine. A basic approach that ranks suggestions according to their frequency in the query logs is suboptimal. Firstly, many candidate queries with the same prefix can be removed as redundant. Secondly, the suggestions can also be personalised based on the user's context. These two directions to improve the aforementioned mechanisms' quality can be in opposition: while the latter aims to promote suggestions that address search intents that a user is likely to have, the former aims to diversify the suggestions to cover as many intents as possible. We introduce a contextualisation framework that utilises a short-term context using the user's behaviour within the current search session, such as the previous query, the documents examined, and the candidate query suggestions that the user has discarded. This short-term context is used to contextualise and diversify the ranking of query suggestions, by modelling the user's information need as a mixture of intent-specific user models. The evaluation is performed offline on a set of approximately 1.0M test user sessions. Our results suggest that the proposed approach significantly improves query suggestions compared to the baseline approach.Comment: A short version of this paper was presented at CIKM 201

    An advanced Jones calculus for the classification of periodic metamaterials

    Full text link
    By relying on an advanced Jones calculus we analyze the polarization properties of light upon propagation through metamaterial slabs in a comprehensive manner. Based on symmetry considerations, we show that all periodic metamaterials may be divided into five different classes only. It is shown that each class differently affects the polarization of the transmitted light and sustains different eigenmodes. We show how to deduce these five classes from symmetry considerations and provide a simple algorithm that can be applied to decide by measuring transmitted intensities to which class a given metamaterial is belonging to only

    Search for expertise : going beyond direct evidence

    Get PDF
    The automatic search for knowledgeable people in the scope of an organization\ud is a key function which makes modern Enterprise search systems\ud commercially successful and socially demanded. A number of effective approaches\ud to expert finding were recently proposed in academic publications.\ud Although, most of them use reasonably defined measures of personal expertise,\ud they often limit themselves to rather unrealistic and sometimes oversimplified\ud principles. In this thesis, we explore several ways to go beyond\ud state-of-the-art assumptions used in research on expert finding and propose\ud several novel solutions for this and related tasks.\ud First, we describe measures of expertise that do not assume independent\ud occurrence of terms and persons in a document what makes them perform\ud better than the measures based on independence of all entities in a document.\ud One of these measures makes persons central to the process of terms generation\ud in a document. Another one assumes that the position of the personā€™s\ud mention in a document with respect to the positions of query terms indicates\ud the relation of the person to the documentā€™s relevant content. Second,\ud we find the ways to use not only direct expertise evidence for a person concentrated\ud within the document space of the personā€™s current employer and\ud only within those organizational documents that mention the person. We\ud successfully utilize the predicting potential of additional indirect expertise\ud evidence publicly available on the Web and in the organizational documents\ud implicitly related to a person. Finally, besides the expert finding methods we\ud proposed, we also demonstrate solutions for the tasks from related domains.\ud In one case, we use several algorithms of multi-step relevance propagation to\ud search for typed entities in Wikipedia. In another case, we suggest generic\ud methods for placing photos uploaded to Flickr on the World map using language\ud models of locations built entirely on the annotations provided by users\ud with a few task specific extensions

    Contextual Ranking of Database Query Results

    Get PDF

    Automatic tagging and geotagging in video collections and communities

    Get PDF
    Automatically generated tags and geotags hold great promise to improve access to video collections and online communi- ties. We overview three tasks offered in the MediaEval 2010 benchmarking initiative, for each, describing its use scenario, definition and the data set released. For each task, a reference algorithm is presented that was used within MediaEval 2010 and comments are included on lessons learned. The Tagging Task, Professional involves automatically matching episodes in a collection of Dutch television with subject labels drawn from the keyword thesaurus used by the archive staff. The Tagging Task, Wild Wild Web involves automatically predicting the tags that are assigned by users to their online videos. Finally, the Placing Task requires automatically assigning geo-coordinates to videos. The specification of each task admits the use of the full range of available information including user-generated metadata, speech recognition transcripts, audio, and visual features
    • ā€¦
    corecore