21 research outputs found

    Using Sensor Metadata Streams to Identify Topics of Local Events in the City

    Get PDF
    In this paper, we study the emerging Information Retrieval (IR) task of local event retrieval using sensor metadata streams. Sensor metadata streams include information such as the crowd density from video processing, audio classifications, and social media activity. We propose to use these metadata streams to identify the topics of local events within a city, where each event topic corresponds to a set of terms representing a type of events such as a concert or a protest. We develop a supervised approach that is capable of mapping sensor metadata observations to an event topic. In addition to using a variety of sensor metadata observations about the current status of the environment as learning features, our approach incorporates additional background features to model cyclic event patterns. Through experimentation with data collected from two locations in a major Spanish city, we show that our approach markedly outperforms an alternative baseline. We also show that modelling background information improves event topic identification

    A Methodology for Simulated Experiments in Interactive Search

    Get PDF
    Interactive information retrieval has received much attention in recent years, e.g. [7]. Furthermore, increased activity in developing interactive features in search systems used across existing popular Web search engines suggests that interactive systems are being recognised as a promising next step in assisting information search. One of the most challenging problems with interactive systems however remains evaluation. We describe the general specifications of a methodology for conducting controlled and reproducible experiments in the context of interactive search. It was developed in the AutoAdapt project1 focusing on search in intranets, but the methodology is more generic than that and can be applied to interactive Web search as well. The goal of this methodology is to evaluate the ability of different algorithms to produce domain models that provide accurate suggestions for query modifications. The AutoAdapt project investigates the application of automatically constructed adaptive domain models for providing suggestions for query modifications to the users of an intranet search engine. This goes beyond static models such as the one employed to guide users who search the Web site of the University of Essex which is based on a domain model that has been built in advance using the documents’ markup structure

    Preface

    Get PDF
    These proceedings contain the papers of the Third International Workshop on Recent Trends in News Informa-tion Retrieval (NewsIR\u201919) held in conjunction with the ACM SIGIR 2019 conference in Paris, France, on the25thof July 2019. Ten full papers and two short papers (one position paper and one demo paper) were selectedby the programme committee from a total of 21 submissions. Each submitted paper was reviewed by at leastthree members of an international programme committee. In addition to the selected papers, the workshopfeatures one keynote and one invited talk. The Keynote speech is given by Aron Pilhofer \u201cFrom Redlining toRobots: How newsrooms apply technology to the craft of journalism\u201d. The invited talk is given by FriedrichLindenberg \u201cMining Leaks and Open Data to Follow the Money\u201d. We would like to thank SIGIR for hostingus. Thanks also go to the keynote speakers, the program committee, the paper authors, and the participants,for without these people there would be no worksho

    Moving towards adaptive search in digital libraries

    Get PDF
    Search applications have become very popular over the last two decades, one of the main drivers being the advent of the Web. Nevertheless, searching on the Web is very different to searching on smaller, often more structured collections such as digital libraries, local Web sites, and intranets. One way of helping the searcher locating the right information for a specific information need in such a collection is by providing well-structured domain knowledge to assist query modification and navigation. There are two main challenges which we will both address in this chapter: acquiring the domain knowledge and adapting it automatically to the specific interests of the user community. We will outline how in digital libraries a domain model can automatically be acquired using search engine query logs and how it can be continuously updated using methods resembling ant colony behaviour. © 2011 Springer-Verlag

    The role of search for field force knowledge management

    No full text
    Search has become a ubiquitous, everyday activity, but finding the right information at the right time in an electronic document collection can still be a very challenging process. Significant time is being spent on identifying suitable search terms, exploring matching documents, rephrasing the search request and assessing whether a document contains the information sought. Once another user is faced with a similar information need, the whole process starts again. There is significant potential in cutting down on this activity by taking a user straight to the required information. As well as delivering technical information and vital regulatory information, a knowledge management solution is concerned with capturing valuable insight and experience in order to share it amongst workers. A search engine has been developed and deployed to technical support staff and we were able to assess its impact on mobile workers. The architecture is based on open-source software to satisfy the basic search functionality, such as indexing, search result ranking, faceting and spell checking. The search engine indexes a number of knowledge repositories relevant to the field engineers. On top of that we have developed an adaptive query suggestion mechanism called Sunny Aberdeen. Query suggestions provide an interactive feature that can guide the user through the search process by providing alternative terminology or suggesting 'best matches'. In our search engine, the query suggestions are generated and adapted over time using state-of-the-art machine learning approaches, which exploit past user interactions with the search engine to derive query suggestions. Apart from continuously updating the suggestions, this framework is also capable of reflecting current search trends as well as forgetting relations that are no longer relevant. Query log analysis of the system running in a real-life context indicates that the system was able to cut down the number of repeat faults and speeds up the decision process for sending out staff to certain jobs

    Identifying local events by using microblogs as social sensors

    No full text
    Local search is increasingly attracting more demand, whereby the users are interested to find out about places or events in their local vicinity. In this paper, we propose to use the Twitter microblogging platform to detect and rank local events of interest in real-time. We present a novel event retrieval framework, where both the contents of the tweets and the volume of the microblogging activity are exploited to locate an event happening in a certain area within a city that matches the user's interests as expressed in the form of a query. In particular, the framework measures unusual microblogging activities in a certain area and uses that as an indication of the occurrence of an event which is then used by the ranking function. Since the proposed event retrieval task is a new Information Retrieval (IR) task, we devise a methodology that is inspired by the conceptually similar IR problem of video segmentation to thoroughly evaluate our approach. Our evaluation is conducted on a set of tweets collected over a period of twelve days from different areas of London, as well as two sets of local events collected within the same period using crowdsourcing and local news sources in London. In addition to new insights on the factors that in uence the development of an effective event ranking model, our empirical results show the promise and effectiveness of our proposed approach in identifying and ranking local events in real-time

    Do topic shift and query reformulation patterns correlate in academic search?

    No full text
    While it is known that academic searchers differ from typical web searchers, little is known about the search behavior of academic searchers over longer periods of time. In this study we take a look at academic searchers through a large-scale log analysis on a major academic search engine. We focus on two aspects: query reformulation patterns and topic shifts in queries. We first analyze how each of these aspects evolve over time. We identify important query reformulation patterns: revisiting and issuing new queries tend to happen more often over time. We also find that there are two distinct types of users: one type of users becomes increasingly focused on the topics they search for as time goes by, and the other becomes increasingly diversifying. After analyzing these two aspects separately, we investigate whether, and to which degree, there is a correlation between topic shifts and query reformulations. Surprisingly, users’ preferences of query reformulations correlate little with their topic shift tendency. However, certain reformulations may help predict the magnitude of the topic shift that happens in the immediate next timespan. Our results shed light on academic searchers’ information seeking behavior and may benefit search personalization

    Third international workshop on recent trends in news information retrieval (NEWSIR'19)

    No full text
    The journalism industry has undergone a revolution in the past decade, leading to new opportunities as well as challenges. News consumption, production and delivery have all been affected and transformed by technology. Readers require new mechanisms to cope with the vast volume of information in order to be informed about news events. Reporters have begun to use natural language processing (NLP) and (IR) techniques for investigative work. Publishers and aggregators are seeking new business models, and new ways to reach and retain their audience. A shift in business models has led to a gradual shift in styles of journalism in attempts to increase page views; and, far more concerning, to real mis- and dis-information, alongside allegations of “fake news” threatening the journalistic freedom and integrity of legitimate news outlets. Social media platforms drive viewership, creating filter bubbles and an increasingly polarized readership. News documents have always been a part of research on information access and retrieval methods. Over the last few years, the IR community has increasingly recognized these challenges in journalism and opened a conversation about how we might begin to address them. Evidence of this recognition is the participation in the two previous editions of our NewsIR workshop, held in ECIR 2016 and 2018. One of the most important outcomes of those workshops is an increasing awareness in the community about the changing nature of journalism and the IR challenges it entails. To move yet another step forward, the goal of the third edition of our workshop is to create a multidisciplinary venue that brings together news experts from both technology and journalism. This would take NewsIR from a European forum targeting mainly IR researchers, into a more inclusive and influential international forum. We hope that this new format will foster further understanding for both news professionals and IR researchers, as well as producing better outcomes for news consumers. We will address the possibilities and challenges that technology offers to the journalists, the challenges that new developments in journalism create for IR researchers, and the complexity of information access tasks for news readers
    corecore