116 research outputs found

    Personalization of Queries based on User Preferences

    Get PDF
    Query Personalization is the process of dynamically enhancing a query with related user preferences stored in a user profile with the aim of providing personalized answers. The underlying idea is that different users may find different things relevant to a search due to different preferences. Essential ingredients of query personalization are: (a) a model for representing and storing preferences in user profiles, and (b) algorithms for the generation of personalized answers using stored preferences. Modeling the plethora of preference types is a challenge. In this paper, we present a preference model that combines expressivity and concision. In addition, we provide algorithms for the selection of preferences related to a query and the progressive generation of personalized results, which are ranked based on user interest

    Benchmarking the Utility of w-Event Differential Privacy Mechanisms: When Baselines Become Mighty Competitors

    Get PDF
    The w-event framework is the current standard for ensuring differential privacy on continuously monitored data streams. Following the proposition of w-event differential privacy, various mechanisms to implement the framework are proposed. Their comparability in empirical studies is vital for both practitioners to choose a suitable mechanism, and researchers to identify current limitations and propose novel mechanisms. By conducting a literature survey, we observe that the results of existing studies are hardly comparable and partially intrinsically inconsistent. To this end, we formalize an empirical study of w-event mechanisms by re-occurring elements found in our survey. We introduce requirements on these elements that ensure the comparability of experimental results. Moreover, we propose a benchmark that meets all requirements and establishes a new way to evaluate existing and newly proposed mechanisms. Conducting a large-scale empirical study, we gain valuable new insights into the strengths and weaknesses of existing mechanisms. An unexpected - yet explainable - result is a baseline supremacy, i.e., using one of the two baseline mechanisms is expected to deliver good or even the best utility. Finally, we provide guidelines for practitioners to select suitable mechanisms and improvement options for researchers

    Fairness-Aware Methods in Rankings and Recommenders

    Get PDF
    We increasingly depend on a variety of data-driven algorithmic systems to assist us in many aspects of life. Search engines and recommender systems amongst others are used as sources of information and to help us in making all sort of decisions from selecting restaurants and books, to choosing friends and careers. This has given rise to important concerns regarding the fairness of such systems. In this tutorial, we aim at presenting a toolkit of methods used for ensuring fairness in rankings and recommendations. Our objectives are two-fold: (a) to present related methods of this novel, quickly evolving and impactful domain, and put them into perspective, and (b) to highlight open challenges and research paths for future work.acceptedVersionPeer reviewe

    Fairness in rankings and recommenders : Models, methods and research directions

    Get PDF
    We increasingly depend on a variety of data-driven algorithmic systems to assist us in many aspects of life. Search engines and recommendation systems amongst others are used as sources of information and to help us in making all sort of decisions from selecting restaurants and books, to choosing friends and careers. This has given rise to important concerns regarding the fairness of such systems. This tutorial aims at presenting a toolkit of definitions, models and methods used for ensuring fairness in rankings and recommendations. Our objectives are three-fold: (a) to provide a solid framework on a novel, quickly evolving, and impactful domain, (b) to present related methods and put them into perspective, and (c) to highlight challenges and research paths for researchers and practitioners that work in data management and applications.Peer reviewe

    Synthesizing structured text from logical database subsets. EDBT

    Get PDF
    ABSTRACT In the classical database world, information access has been based on a paradigm that involves structured, schema-aware, queries and tabular answers. In the current environment, however, where information prevails in most activities of society, serving people, applications, and devices in dramatically increasing numbers, this paradigm has proved to be very limited. On the query side, much work has been done on moving towards keyword queries over structured data. In our previous work, we have touched the other side as well, and have proposed a paradigm that generates entire databases in response to keyword queries. In this paper, we continue in the same direction and propose synthesizing textual answers in response to queries of any kind over structured data. In particular, we study the transformation of a dynamically-generated logical database subset into a narrative through a customizable, extensible, and templatebased process. In doing so, we exploit the structured nature of database schemas and describe three generic translation modules for different formations in the schema, called unary, split, and join modules. We have implemented the proposed translation procedure into our own database front end and have performed several experiments evaluating the textual answers generated as several features and parameters of the system are varied. We have also conducted a set of experiments measuring the effectiveness of such answers on users. The overall results are very encouraging and indicate the promise that our approach has for several applications

    Study about the different use of explicit and implicit tags in social bookmarking

    Full text link
    This is the accepted version of the following article: Arolas, E. E., & Ladrón-de-Guevar, F. G. (2012). Uses of explicit and implicit tags in social bookmarking. Journal of the American Society for Information Science and Technology, 63(2), 313-322. doi:10.1002/asi.21663, which has been published in final form at http://dx.doi.org/10.1002/asi.21663Although Web 2.0 contains many tools with different functionalities, they all share a common social nature. One tool in particular, social bookmarking systems (SBSs), allows users to store and share links to different types of resources, i.e., websites, videos, images. To identify and classify these resources so that they can be retrieved and shared, fragments of text are used. These fragments of text, usually words, are called tags. A tag that is found on the inside of a resource text is referred to as an obvious or explicit tag. There are also nonobvious or implicit tags, which don't appear in the resource text. The purpose of this article is to describe the present situation of the SBSs tool and then to also determine the principal features of and how to use explicit tags. It will be taken into special consideration which HTML tags with explicit tags are used more frequently.Estelles Arolas, E.; González Ladrón De Guevara, FR. (2012). Study about the different use of explicit and implicit tags in social bookmarking. Journal of the American Society for Information Science and Technology. 63(2):313-322. doi:10.1002/asi.21663S313322632Bar-Ilan, J., Zhitomirsky-Geffet, M., Miller, Y., & Shoham, S. (2010). The effects of background information and social interaction on image tagging. Journal of the American Society for Information Science and Technology, 61(5), 940-951. doi:10.1002/asi.21306Bateman, S., Muller, M. J., & Freyne, J. (2009). Personalized retrieval in social bookmarking. Proceedinfs of the ACM 2009 international conference on Supporting group work - GROUP ’09. doi:10.1145/1531674.1531688Delicious' Blog 2010 What's next for Delicious http://blog.delicious.com/blog/2010/12/whats-next-for-delicious.htmlDing, Y., Jacob, E. K., Zhang, Z., Foo, S., Yan, E., George, N. L., & Guo, L. (2009). Perspectives on social tagging. Journal of the American Society for Information Science and Technology, 60(12), 2388-2401. doi:10.1002/asi.21190Eisterlehner , F. Hotho , A. Jäschke , R. ECML PKDD Discovery Challenge 2009 (DC09)Farooq, U., Kannampallil, T. G., Song, Y., Ganoe, C. H., Carroll, J. M., & Giles, L. (2007). Evaluating tagging behavior in social bookmarking systems. Proceedings of the 2007 international ACM conference on Conference on supporting group work - GROUP ’07. doi:10.1145/1316624.1316677Farooq , U. Zhang , S.M. Carroll , J. 2009 Sensemaking of scholarly literature through taggingFu, W.-T., Kannampallil, T., Kang, R., & He, J. (2010). Semantic imitation in social tagging. ACM Transactions on Computer-Human Interaction, 17(3), 1-37. doi:10.1145/1806923.1806926Furnas, G. W., Landauer, T. K., Gomez, L. M., & Dumais, S. T. (1987). The vocabulary problem in human-system communication. Communications of the ACM, 30(11), 964-971. doi:10.1145/32206.32212Golder , S.A. Huberman , B.A. 2005 The structure of collaborative tagging systems http://www.hpl.hp.com/research/idl/papers/tagsKörner, C., Benz, D., Hotho, A., Strohmaier, M., & Stumme, G. (2010). Stop thinking, start tagging. Proceedings of the 19th international conference on World wide web - WWW ’10. doi:10.1145/1772690.1772744Koutrika, G., Effendi, F. A., Gyöngyi, Z., Heymann, P., & Garcia-Molina, H. (2008). Combating spam in tagging systems. ACM Transactions on the Web, 2(4), 1-34. doi:10.1145/1409220.1409225Lipczak, M., & Milios, E. (2010). The impact of resource title on tags in collaborative tagging systems. Proceedings of the 21st ACM conference on Hypertext and hypermedia - HT ’10. doi:10.1145/1810617.1810648Marinho, L. B., Nanopoulos, A., Schmidt-Thieme, L., Jäschke, R., Hotho, A., Stumme, G., & Symeonidis, P. (2010). Social Tagging Recommender Systems. Recommender Systems Handbook, 615-644. doi:10.1007/978-0-387-85820-3_19Marlow, C., Naaman, M., Boyd, D., & Davis, M. (2006). HT06, tagging paper, taxonomy, Flickr, academic article, to read. Proceedings of the seventeenth conference on Hypertext and hypermedia - HYPERTEXT ’06. doi:10.1145/1149941.1149949Mathes , A. 2004 Folksonomies-Cooperative classification and communication through shared metadata http://www.adammathes.com/academic/computer-mediated-communication/folksonomies.htmlMelenhorst, M., & van Setten, M. (2007). Usefulness of Tags in Providing Access to Large Information Systems. 2007 IEEE International Professional Communication Conference. doi:10.1109/ipcc.2007.4464070Millen, D., Feinberg, J., & Kerr, B. (2005). Social bookmarking in the enterprise. Queue, 3(9), 28. doi:10.1145/1105664.1105676Robu, V., Halpin, H., & Shepherd, H. (2009). Emergence of consensus and shared vocabularies in collaborative tagging systems. ACM Transactions on the Web, 3(4), 1-34. doi:10.1145/1594173.1594176Schmitz, C., Hotho, A., Jäschke, R., & Stumme, G. (s. f.). Mining Association Rules in Folksonomies. Data Science and Classification, 261-270. doi:10.1007/3-540-34416-0_28Smith , G. 2004 Atomiq: Folksonomy: social classification http://atomiq.org/archives/2004/08/folksonomy_social_classification.htmlSubramanya, S. B., & Liu, H. (2008). Socialtagger - collaborative tagging for blogs in the long tail. Proceeding of the 2008 ACM workshop on Search in social media - SSM ’08. doi:10.1145/1458583.1458588Au Yeung, C., Gibbins, N., & Shadbolt, N. (2009). Contextualising tags in collaborative tagging systems. Proceedings of the 20th ACM conference on Hypertext and hypermedia - HT ’09. doi:10.1145/1557914.1557958Zhang, N., Zhang, Y., & Tang, J. (2009). A tag recommendation system for folksonomy. Proceeding of the 2nd ACM workshop on Social web search and mining - SWSM ’09. doi:10.1145/1651437.165144

    On the Complexity of Query Result Diversification

    Get PDF
    Query result diversification is a bi-criteria optimization problem for ranking query results. Given a database D, a query Q and a positive integer k, it is to find a set of k tuples from Q(D) such that the tuples are as relevant as possible to the query, and at the same time, as diverse as possible to each other. Subsets of Q(D) are ranked by an objective function defined in terms of relevance and diversity. Query result diversification has found a variety of applications in databases, information retrieval and operations research. This paper studies the complexity of result diversification for relational queries. We identify three problems in connection with query result diversification, to determine whether there exists a set of k tuples that is ranked above a bound with respect to relevance and diversity, to assess the rank of a given k-element set, and to count how many k-element sets are ranked above a given bound. We study these problems for a variety of query languages and for three objective functions. We establish the upper and lower bounds of these problems, all matching, for both combined complexity and data complexity. We also investigate several special settings of these problems, identifying tractable cases. 1
    corecore