Search CORE

4,787 research outputs found

BGS Sigma 2012 open source user guide

Author: Adlam K.A.M.
Bow J.
Turner P.
Publication venue: British Geological Survey
Publication date: 01/01/2013
Field of study

The British Geological Survey began developing digital field mapping systems in 1989. However, it was apparent that the commercially available hardware was not suitable at that time. In 2001, we revisited the topic under the System for Integrated Geoscience Mapping (SIGMA) programme. By 2003, BGS had developed a PDA (personal digital assistant) field system, which was superseded in 2005, when we began deploying a beta system on rugged Tablet PCs. The Tablet PC system, which we called BGS•SIGMAmobile was used by BGS in mapping projects across the UK as well as overseas. It first became available in Open Source form, in June 2009 via the BGS website, www.bgs.ac.uk, under an agreement which stipulates that updates and modifications must be supplied to BGS in order to stimulate further developments. In 2011/2012, BGS•SIGMAmobile was rewritten in .NET and combined with our office based mapping software BGS•SIGMAdesktop within ArcGIS 10.x to create BGS•SIGMA 2012. It is envisaged that future releases will be made available from the BGS website incorporating new modules, modifications and upgrades supplied by BGS and external users of the system. This document has been written to guide users through the installation and use of BGS•SIGMA 2012 (mobile and desktop), which is the third free release. We are happy to receive feedback and modifications emailed to [email protected]

NERC Open Research Archive

Evaluating the retrieval effectiveness of Web search engines using a representative query sample

Author: Lewandowski Dirk
Publication venue
Publication date: 09/05/2014
Field of study

Search engine retrieval effectiveness studies are usually small-scale, using only limited query samples. Furthermore, queries are selected by the researchers. We address these issues by taking a random representative sample of 1,000 informational and 1,000 navigational queries from a major German search engine and comparing Google's and Bing's results based on this sample. Jurors were found through crowdsourcing, data was collected using specialised software, the Relevance Assessment Tool (RAT). We found that while Google outperforms Bing in both query types, the difference in the performance for informational queries was rather low. However, for navigational queries, Google found the correct answer in 95.3 per cent of cases whereas Bing only found the correct answer 76.6 per cent of the time. We conclude that search engine performance on navigational queries is of great importance, as users in this case can clearly identify queries that have returned correct results. So, performance on this query type may contribute to explaining user satisfaction with search engines

arXiv.org e-Print Archive

REPOSIT

Scalable Semantic Matching of Queries to Ads in Sponsored Search Advertising

Author: Baeza-Yates Ricardo
Djuric Nemanja
Feng Andrew
Grbovic Mihajlo
Ordentlich Erik
Owens Gavin
Radosavljevic Vladan
Silvestri Fabrizio
Yang Lee
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2016
Field of study

Sponsored search represents a major source of revenue for web search engines. This popular advertising model brings a unique possibility for advertisers to target users' immediate intent communicated through a search query, usually by displaying their ads alongside organic search results for queries deemed relevant to their products or services. However, due to a large number of unique queries it is challenging for advertisers to identify all such relevant queries. For this reason search engines often provide a service of advanced matching, which automatically finds additional relevant queries for advertisers to bid on. We present a novel advanced matching approach based on the idea of semantic embeddings of queries and ads. The embeddings were learned using a large data set of user search sessions, consisting of search queries, clicked ads and search links, while utilizing contextual information such as dwell time and skipped ads. To address the large-scale nature of our problem, both in terms of data and vocabulary size, we propose a novel distributed algorithm for training of the embeddings. Finally, we present an approach for overcoming a cold-start problem associated with new ads and queries. We report results of editorial evaluation and online tests on actual search traffic. The results show that our approach significantly outperforms baselines in terms of relevance, coverage, and incremental revenue. Lastly, we open-source learned query embeddings to be used by researchers in computational advertising and related fields.Comment: 10 pages, 4 figures, 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016, Pisa, Ital

arXiv.org e-Print Archive

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Metadata categorization for identifying search patterns in a digital library

Author: Bogaard T. (Tessel)
Hardman L. (Lynda)
Hollink L. (Laura)
Ossenbruggen J.R. (Jacco) van
Wielemaker J. (Jan)
Publication venue: 'Emerald'
Publication date: 06/03/2019
Field of study

Purpose: For digital libraries, it is useful to understand how users search in a collection. Investigating search patterns can help them to improve the user interface, collection management and search algorithms. However, search patterns may vary widely in different parts of a collection. The purpose of this paper is to demonstrate how to identify these search patterns within a well-curated historical newspaper collection using the existing metadata.Design/methodology/approach: The authors analyzed search logs combined with metadata

CWI's Institutional Repository

Personalized Search

Author: Carlsen Fredrik Nygård
Publication venue
Publication date: 01/01/2015
Field of study

As the volume of electronically available information grows, relevant items become harder to find. This work presents an approach to personalizing search results in scientific publication databases. This work focuses on re-ranking search results from existing search engines like Solr or ElasticSearch. This work also includes the development of Obelix, a new recommendation system used to re-rank search results. The project was proposed and performed at CERN, using the scientific publications available on the CERN Document Server (CDS). This work experiments with re-ranking using offline and online evaluation of users and documents in CDS. The experiments conclude that the personalized search result outperform both latest first and word similarity in terms of click position in the search result for global search in CDS

arXiv.org e-Print Archive

CERN Document Server

Enhanced Web Search Engines with Query-Concept Bipartite Graphs

Author: Chen Yan
Publication venue: ScholarWorks @ Georgia State University
Publication date: 16/08/2010
Field of study

With rapid growth of information on the Web, Web search engines have gained great momentum for exploiting valuable Web resources. Although keywords-based Web search engines provide relevant search results in response to users’ queries, future enhancement is still needed. Three important issues include (1) search results can be diverse because ambiguous keywords in queries can be interpreted to different meanings; (2) indentifying keywords in long queries is difficult for search engines; and (3) generating query-specific Web page summaries is desirable for Web search results’ previews. Based on clickthrough data, this thesis proposes a query-concept bipartite graph for representing queries’ relations, and applies the queries’ relations to applications such as (1) personalized query suggestions, (2) long queries Web searches and (3) query-specific Web page summarization. Experimental results show that query-concept bipartite graphs are useful for performance improvement for the three applications

ScholarWorks @ Georgia State University