16,642 research outputs found

    Can Google searches help nowcast and forecast unemployment rates in the Visegrad Group countries?

    Get PDF
    Online activity of the Internet users has been repeatedly shown to provide a rich information set for various research fields. We focus on the job-related searches on Google and their possible usefulness in the region of the Visegrad Group -- the Czech Republic, Hungary, Poland and Slovakia. Even for rather small economies, the online searches of their inhabitants can be successfully utilized for macroeconomic predictions. Specifically, we study the unemployment rates and their interconnection to the job-related searches. We show that the Google searches strongly enhance both nowcasting and forecasting models of the unemployment rates.Comment: 22 pages, 2 figures, 3 table

    Auditing Search Engines for Differential Satisfaction Across Demographics

    Get PDF
    Many online services, such as search engines, social media platforms, and digital marketplaces, are advertised as being available to any user, regardless of their age, gender, or other demographic factors. However, there are growing concerns that these services may systematically underserve some groups of users. In this paper, we present a framework for internally auditing such services for differences in user satisfaction across demographic groups, using search engines as a case study. We first explain the pitfalls of na\"ively comparing the behavioral metrics that are commonly used to evaluate search engines. We then propose three methods for measuring latent differences in user satisfaction from observed differences in evaluation metrics. To develop these methods, we drew on ideas from the causal inference literature and the multilevel modeling literature. Our framework is broadly applicable to other online services, and provides general insight into interpreting their evaluation metrics.Comment: 8 pages Accepted at WWW 201

    You can't see what you can't see: Experimental evidence for how much relevant information may be missed due to Google's Web search personalisation

    Full text link
    The influence of Web search personalisation on professional knowledge work is an understudied area. Here we investigate how public sector officials self-assess their dependency on the Google Web search engine, whether they are aware of the potential impact of algorithmic biases on their ability to retrieve all relevant information, and how much relevant information may actually be missed due to Web search personalisation. We find that the majority of participants in our experimental study are neither aware that there is a potential problem nor do they have a strategy to mitigate the risk of missing relevant information when performing online searches. Most significantly, we provide empirical evidence that up to 20% of relevant information may be missed due to Web search personalisation. This work has significant implications for Web research by public sector professionals, who should be provided with training about the potential algorithmic biases that may affect their judgments and decision making, as well as clear guidelines how to minimise the risk of missing relevant information.Comment: paper submitted to the 11th Intl. Conf. on Social Informatics; revision corrects error in interpretation of parameter Psi/p in RBO resulting from discrepancy between the documentation of the implementation in R (https://rdrr.io/bioc/gespeR/man/rbo.html) and the original definition (https://dl.acm.org/citation.cfm?id=1852106) as per 20/05/201

    CHORUS Deliverable 2.2: Second report - identification of multi-disciplinary key issues for gap analysis toward EU multimedia search engines roadmap

    Get PDF
    After addressing the state-of-the-art during the first year of Chorus and establishing the existing landscape in multimedia search engines, we have identified and analyzed gaps within European research effort during our second year. In this period we focused on three directions, notably technological issues, user-centred issues and use-cases and socio- economic and legal aspects. These were assessed by two central studies: firstly, a concerted vision of functional breakdown of generic multimedia search engine, and secondly, a representative use-cases descriptions with the related discussion on requirement for technological challenges. Both studies have been carried out in cooperation and consultation with the community at large through EC concertation meetings (multimedia search engines cluster), several meetings with our Think-Tank, presentations in international conferences, and surveys addressed to EU projects coordinators as well as National initiatives coordinators. Based on the obtained feedback we identified two types of gaps, namely core technological gaps that involve research challenges, and “enablers”, which are not necessarily technical research challenges, but have impact on innovation progress. New socio-economic trends are presented as well as emerging legal challenges

    CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines

    Get PDF
    Based on the information provided by European projects and national initiatives related to multimedia search as well as domains experts that participated in the CHORUS Think-thanks and workshops, this document reports on the state of the art related to multimedia content search from, a technical, and socio-economic perspective. The technical perspective includes an up to date view on content based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmark inititiatives to measure the performance of multimedia search engines. From a socio-economic perspective we inventorize the impact and legal consequences of these technical advances and point out future directions of research

    "Geco" and its potential for real estate research: Evidence from the US housing market

    Get PDF
    Over the past few years, Google econometrics (Geco) turns out to be a powerful tool for research based on individuals rational. Following the seminal work of Ginsberg et al. (2009), this is the second academic journal contribution to be based on search query data from Google Insights for Search (I4S). Existing information on the Home Buying Process is embedded into existing literature on the price-volume relationship in the housing market. The main findings are: I4S subcategories yield inferences about prices and transactions in the near future. While the “Real Estate Agency” subcategory serves as a very robust indicator of transaction volume, "Home Financing" provides interesting insights into the corresponding financing decisions. Therefore, this study contributes towards improving the infor-mational efficiency of a relatively imperfect market and is addressed to policy makers as well as real estate professionals.

    DancingLines: An Analytical Scheme to Depict Cross-Platform Event Popularity

    Full text link
    Nowadays, events usually burst and are propagated online through multiple modern media like social networks and search engines. There exists various research discussing the event dissemination trends on individual medium, while few studies focus on event popularity analysis from a cross-platform perspective. Challenges come from the vast diversity of events and media, limited access to aligned datasets across different media and a great deal of noise in the datasets. In this paper, we design DancingLines, an innovative scheme that captures and quantitatively analyzes event popularity between pairwise text media. It contains two models: TF-SW, a semantic-aware popularity quantification model, based on an integrated weight coefficient leveraging Word2Vec and TextRank; and wDTW-CD, a pairwise event popularity time series alignment model matching different event phases adapted from Dynamic Time Warping. We also propose three metrics to interpret event popularity trends between pairwise social platforms. Experimental results on eighteen real-world event datasets from an influential social network and a popular search engine validate the effectiveness and applicability of our scheme. DancingLines is demonstrated to possess broad application potentials for discovering the knowledge of various aspects related to events and different media

    Arrivals of tourists in Cyprus: mind the web search intensity

    Get PDF
    This paper validates the raison d’être of the effortlessly recovered web Search Intensity Indices (SII) for predicting the arrivals of tourists in Cyprus. By using monthly data (2004-2015) and two causality testing procedures we find, for properly selected key-phrases, that web search intensity (adjusted for different languages and different search engines) turns out to convey a useful predictive content for the arrivals of tourists in Cyprus. Additionally, we show that whenever the prevailing shares of visitors come from countries in different languages, then the identification of the aggregate SII becomes complex. Hence, we argue that blindly using key-phrases to identify an aggregate SII is like an immersion into the unknown, since two sources of bias (the language bias and the search engine bias) are fully neglected. Given the importance of the tourism sector in the total economy activity of Cyprus, our findings might prove to be quite useful to governmental agencies, policy makers and other stakeholders of the sector when their purpose is to allocate effectively the existing limited resources, and to plan short- and long-run promotion and investment strategies
    corecore