Search CORE

5 research outputs found

Embellishing Text Search Queries to Protect User Privacy

Author: Adar E.
Baeza-Yates R.
Barbaro M.
Benaloh J. C.
Dumais S. T.
Husbands P.
Joho H.
Kushilevitz E.
Song D. X.
Text TREC.
Publication venue: 'VLDB Endowment'
Publication date: 01/01/2010
Field of study

Users of text search engines are increasingly wary that their activities may disclose confidential information about their business or personal profiles. It would be desirable for a search engine to perform document retrieval for users while protecting their intent. In this paper, we identify the privacy risks arising from semantically related search terms within a query, and from recurring highspecificity query terms in a search session. To counter the risks, we propose a solution for a similarity text retrieval system to offer anonymity and plausible deniability for the query terms, and hence the user intent, without degrading the system’s precision-recall performance. The solution comprises a mechanism that embellishes each user query with decoy terms that exhibit similar specificity spread as the genuine terms, but point to plausible alternative topics. We also provide an accompanying retrieval scheme that enables the search engine to compute the encrypted document relevance scores from only the genuine search terms, yet remain oblivious to their distinction from the decoys. Empirical evaluation results are presented to substantiate the effectiveness of our solution. 1

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Authenticating the Query Results of Text Search Engines

Author: Baeza-Yates R.
Cheng W.
Devanbu P. T.
Li F.
Merkle R.
Papadopoulos S.
Pfleeger C. P.
Proposed Federal Information Processing DSS.
Text TREC.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2008
Field of study

The number of successful attacks on the Internet shows that it is very difficult to guarantee the security of online search engines. A breached server that is not detected in time may return incorrect results to the users. To prevent that, we introduce a methodology for generating an integrity proof for each search result. Our solution is targeted at search engines that perform similarity-based document retrieval, and utilize an inverted list implementation (as most search engines do). We formulate the properties that define a correct result, map the task of processing a text search query to adaptations of existing threshold-based algorithms, and devise an authentication scheme for checking the validity of a result. Finally, we confirm the efficiency and practicality of our solution through an empirical evaluation with real documents and benchmark queries. 1

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Cross-Language Chinese Text Retrieval in NTCIR Workshop

Author: Bawden David
Hersh William
Hsin-Hsi Chen
Jones Karan Sparck
Jones Karan Sparck
Kando N.
Kando N.
Kitani Tsuyoshi
Kuang-hua Chen
Shaw William M.
Text TREC
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Results and Lessons of the Question Answering Track at CLEF

Author: A Cassan
A Peñas
Alessandro Vallin
Anselmo Peñas
B Magnini
B Magnini
Bernardo Magnini
D Laurent
DA Ferrucci
EM Voorhees
J Herrera
M Montes-y-Gómez
P Clark
Pamela Forner
V Jijkoun
Vanessa Lopez
Voorhees EM (2000) Overview of the TREC-9 question answering track. In: Proceedings of the ninth text retrieval conference TREC
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Document Retrieval: Shallow Data, Deep Theories; Historical Reflections, Potential Directions

Crossref