49 research outputs found
Research Paper Recommender System with Serendipity Using Tweets vs. Diversification
21st International Conference on Asia-Pacific Digital Libraries, ICADL 2019, Kuala Lumpur, Malaysia, November 4–7, 2019. Part of the Lecture Notes in Computer Science book series (LNCS, volume 11853), also part of the Information Systems and Applications, incl. Internet/Web, and HCI book sub series (LNISA, volume 11853).So far, a lot of works have studied research paper recommender systems. However, most of them have focused only on the accuracy and ignored the serendipity, which is an important aspect for user satisfaction. The serendipity is concerned with the novelty of recommendations and to which extent recommendations positively surprise users. In this paper, we investigate a research paper recommender system focusing on serendipity. In particular, we examine (1) whether a user’s tweets lead to a generation of serendipitous recommendations and (2) whether the use of diversification on a recommendation list improves serendipity. We have conducted an online experiment with 22 subjects in the domain of computer science. The result of our experiment shows that tweets do not improve the serendipity, despite their heterogeneous nature. However, diversification delivers serendipitous research papers that cannot be generated by a traditional strategy
Towards Serendipitous Research Paper Recommender Using Tweets and Diversification
23rd International Conference on Theory and Practice of Digital Libraries, TPDL 2019, Oslo, Norway, September 9-12, 2019. Part of the Lecture Notes in Computer Science book series (LNCS, volume 11799), also part of the Information Systems and Applications, incl. Internet/Web, and HCI book sub series (LNISA, volume 11799).In this paper, we examine whether a user’s tweets can help to a generate more serendipitous recommendations. In addition, we investigate whether the use of diversification applied on a list of recommended items further improves serendipity. To this end, we conduct an experiment with n = 22 subjects. The result of our experiment shows that the subject’s tweets did not improve serendipity, but diversification results in more serendipitous recommendations
Peeking into the other half of the glass : handling polarization in recommender systems.
This dissertation is about filtering and discovering information online while using recommender systems. In the first part of our research, we study the phenomenon of polarization and its impact on filtering and discovering information. Polarization is a social phenomenon, with serious consequences, in real-life, particularly on social media. Thus it is important to understand how machine learning algorithms, especially recommender systems, behave in polarized environments. We study polarization within the context of the users\u27 interactions with a space of items and how this affects recommender systems. We first formalize the concept of polarization based on item ratings and then relate it to the item reviews, when available. We then propose a domain independent data science pipeline to automatically detect polarization using the ratings rather than the properties, typically used to detect polarization, such as item\u27s content or social network topology. We perform an extensive comparison of polarization measures on several benchmark data sets and show that our polarization detection framework can detect different degrees of polarization and outperforms existing measures in capturing an intuitive notion of polarization. We also investigate and uncover certain peculiar patterns that are characteristic of environments where polarization emerges: A machine learning algorithm finds it easier to learn discriminating models in polarized environments: The models will quickly learn to keep each user in the safety of their preferred viewpoint, essentially, giving rise to filter bubbles and making them easier to learn. After quantifying the extent of polarization in current recommender system benchmark data, we propose new counter-polarization approaches for existing collaborative filtering recommender systems, focusing particularly on the state of the art models based on Matrix Factorization. Our work represents an essential step toward the new research area concerned with quantifying, detecting and counteracting polarization in human-generated data and machine learning algorithms.We also make a theoretical analysis of how polarization affects learning latent factor models, and how counter-polarization affects these models. In the second part of our dissertation, we investigate the problem of discovering related information by recommendation of tags on social media micro-blogging platforms. Real-time micro-blogging services such as Twitter have recently witnessed exponential growth, with millions of active web users who generate billions of micro-posts to share information, opinions and personal viewpoints, daily. However, these posts are inherently noisy and unstructured because they could be in any format, hence making them difficult to organize for the purpose of retrieval of relevant information. One way to solve this problem is using hashtags, which are quickly becoming the standard approach for annotation of various information on social media, such that varied posts about the same or related topic are annotated with the same hashtag. However hashtags are not used in a consistent manner and most importantly, are completely optional to use. This makes them unreliable as the sole mechanism for searching for relevant information. We investigate mechanisms for consolidating the hashtag space using recommender systems. Our methods are general enough that they can be used for hashtag annotation in various social media services such as twitter, as well as for general item recommendations on systems that rely on implicit user interest data such as e-learning and news sites, or explicit user ratings, such as e-commerce and online entertainment sites. To conclude, we propose a methodology to extract stories based on two types of hashtag co-occurrence graphs. Our research in hashtag recommendation was able to exploit the textual content that is available as part of user messages or posts, and thus resulted in hybrid recommendation strategies. Using content within this context can bridge polarization boundaries. However, when content is not available, is missing, or is unreliable, as in the case of platforms that are rich in multimedia and multilingual posts, the content option becomes less powerful and pure collaborative filtering regains its important role, along with the challenges of polarization
Serendipitous News Discovery Increases News Consumption in News Recommender Systems
News recommender system users obtain news via incidental exposure to news and
experience serendipity in the incidental news consumption. Serendipitous news discovery, the
same as serendipity, refers to discovering unexpected and useful information unintentionally.
Researchers suggest building serendipitous news recommender systems and increasing
serendipitous news discovery to increase the diversity of the news consumption. However, the
impacts of serendipitous news discovery on news consumption are uninvestigated, and rare
research provides theoretical guidance to the serendipitous news recommender systems. The thesis
investigated the impacts of serendipitous news discovery on news consumption with a serendipityrelated
emotion, surprise, as a mediator and need for activation as a moderator. 463 participants
recruited from Amazon MTurk completed the online survey-experiment. The findings suggest that
surprise mediates the correlations between serendipitous news discovery and news consumption.
Users who experience higher serendipitous news discovery indicate more positive attitudes
on news consumption in the news recommender systems. The results also indicate the possibility
that the lack of constant serendipitous news discovery may lead to the consumption of the news
similar to the news that trigger serendipity. The research suggests that serendipitous news
discovery increases news consumption, including news selection and reading
Personality-based recommendation: human curiosity applied to recommendation systems using implicit information from social networks
Tesis por compendioEn el día a día, las personas suelen confiar en recomendaciones, tradicionalmente aportadas por otras personas (familia, amigos, etc.) para sus decisiones más variadas. En el mundo digital esto no es diferente, dado que los sistemas de recomendación están presentes en todas partes y de modo transparente. El principal objetivo de estos sistemas es el de ayudar en el proceso de toma de decisiones, generando recomendaciones de su interés y basadas en sus gustos. Dichas recomendaciones van desde productos en sitios web de comercio electrónico, como libros o lugares a visitar, además de qué comer o cuánto tiempo uno debe caminar al día para tener una vida sana, con quién salir o a quién seguir en las redes sociales.
Esta es un área en ascensión. Por un lado, tenemos cada vez más usuarios en internet cuya vida está digitalizada, dado que lo que se hace en el "mundo real" está representado en cierto modo en el "mundo digital". Por otro lado, sufrimos una sobrecarga de información, que puede mitigarse mediante el uso de un sistema de recomendación. Sin embargo, estos sistemas también enfrentan algunos problemas, como el problema del arranque en frío y su necesidad de ser cada vez más "humanos", "personalizados" y "precisos" para satisfacer las exigencias de usuarios y empresas.
En este desafiante escenario, los sistemas de recomendación basados en la personalidad se están estudiando cada vez más, ya que son capaces de enfrentar esos problemas. Algunos proyectos recientes proponen el uso de la personalidad humana en los recomendadores, ya sea en su conjunto o individualmente por rasgos. Esta tesis está dedicada a este nuevo área de recomendación basada en la personalidad, centrándose en uno de sus rasgos más importantes, la curiosidad. Además, para explotar la información ya existente en internet, obtendremos de forma implícita información de las redes sociales.
Por lo tanto, este trabajo tiene como objetivo proporcionar una mejor experiencia al usuario final a través de un nuevo enfoque que ofrece una alternativa a algunos de los retos identificados en los sistemas de recomendación basados en la personalidad. Entre estas mejoras, el uso de las redes sociales para alimentar los sistemas de recomendación reduce el problema del arranque en frío y, al mismo tiempo, proporciona datos valiosos para la predicción de la personalidad humana. Por otro lado, la curiosidad no ha sido utilizada por ninguno de los sistemas de recomendación estudiados; casi todos han usado la personalidad general de un individuo a través de los Cinco Grandes rasgos de la personalidad. Sin embargo, los estudios psicológicos confirman que la curiosidad es un rasgo relevante en el proceso de elegir un item, cuestión directamente relacionada con los sistemas de recomendación.
En resumen, creemos que un sistema de recomendación que mida implícitamente la curiosidad y la utilice en el proceso de recomendar nuevos ítems, especialmente en el sector turístico, podría claramente mejorar la capacidad de estos sistemas en términos de precisión, serendipidad y novedad, permitiendo a los usuarios obtener niveles positivos de satisfacción con las recomendaciones.
Esta tesis realiza un estudio exhaustivo del estado del arte, donde destacamos trabajos sobre sistemas de recomendación, la personalidad humana desde el punto de vista de la psicología tradicional y positiva y finalmente cómo se combinan ambos aspectos. Luego, desarrollamos una aplicación en línea capaz de extraer implícitamente información del perfil de usuario en una red social, generando predicciones de uno o más rasgos de su personalidad. Finalmente, desarrollamos el sistema CURUMIM, capaz de generar recomendaciones en línea con diferentes propiedades, combinando la curiosidad y algunas características sociodemográficas (como el nivel de educación) extraídas de Facebook. El sistema ha sido probado y evaluado en el contexto turístico por usuarios rEn el dia a dia, les persones solen confiar en recomanacions, tradicionalment aportades per altres persones (família, amics, etc.) per a les seues decisions més variades. En el món digital això no és diferent, atès que els sistemes de recomanació estan presents a tot arreu i de manera transparent. El principal objectiu d'aquests sistemes és el d'ajudar en el procés de presa de decisions, generant recomanacions del seu interès i basades en els seus gustos. Aquestes recomanacions van des de productes en pàgines web de comerç electrònic, com a llibres o llocs a visitar, a més de què menjar o quant temps una persona ha de caminar al dia per a tindre una vida sana, amb qui eixir o a qui seguir en les xarxes socials.
Aquesta és una àrea en ascensió. D'una banda, tenim cada vegada més usuaris en internet la vida de les quals està digitalitzada, atès que el que es fa en el "món real" està representat en certa manera en el "món digital". D'altra banda, patim una sobrecàrrega d'informació, que pot mitigar-se mitjançant l'ús d'un sistema de recomanació. No obstant això, aquests sistemes també enfronten alguns problemes, com el problema de l'arrencada en fred i la seua necessitat de ser cada vegada més "humans", "personalitzats" i "precisos" per a satisfer les exigències d'usuaris i empreses.
En aquest desafiador escenari, els sistemes de recomanació basats en la personalitat s'estan estudiant cada vegada més, ja que són capaços d'enfrontar eixos problemes. Alguns projectes recents proposen l'ús de la personalitat humana en els recomendadors, ja siga en el seu conjunt o individualment per trets. Aquesta tesi està dedicada a aquest nou àrea de recomanació basada en la personalitat, centrant-se en un dels seus trets més importants, la curiositat. A més, per a explotar la informació ja existent en internet, obtindrem de forma implícita informació de les xarxes socials.
Per tant, aquest treball té com a objectiu proporcionar una millor experiència a l'usuari final a través d'un nou enfocament que ofereix una alternativa a alguns dels reptes identificats en els sistemes de recomanació basats en la personalitat. Entre aquestes millores, l'ús de les xarxes socials per a alimentar els sistemes de recomanació redueix el problema de l'arrencada en fred i, al mateix temps, proporciona dades valuoses per a la predicció de la personalitat humana. D'altra banda, la curiositat no ha sigut utilitzada per cap dels sistemes de recomanació estudiats; quasi tots han usat la personalitat general d'un individu a través dels Cinc Grans trets de la personalitat. No obstant això, els estudis psicològics confirmen que la curiositat és un tret rellevant en el procés de triar un item, qüestió directament relacionada amb els sistemes de recomanació.
En resum, creiem que un sistema de recomanació que mesure implícitament la curiositat i la utilitze en el procés de recomanar nous ítems, especialment en el sector turístic, podria clarament millorar la capacitat d'aquests sistemes en termes de precisió, sorpresa i novetat, permetent als usuaris obtindre nivells positius de satisfacció amb les recomanacions.
Aquesta tesi realitza un estudi exhaustiu de l'estat de l'art, on destaquem treballs sobre sistemes de recomanació, la personalitat humana des del punt de vista de la psicologia tradicional i positiva i finalment com es combinen tots dos aspectes. Després, desenvolupem una aplicació en línia capaç d'extraure implícitament informació del perfil d'usuari en una xarxa social, generant prediccions d'un o més trets de la seua personalitat. Finalment, desenvolupem el sistema CURUMIM, capaç de generar recomanacions en línia amb diferents propietats, combinant la curiositat i algunes característiques sociodemogràfiques (com el nivell d'educació) extretes de Facebook. El sistema ha sigut provat i avaluat en el context turístic per usuaris reals. Els resultats demostren la seua capacitat perIn daily life, people usually rely on recommendations, traditionally given by other people (family, friends, etc.) for their most varied decisions. In the digital world, this is not different, given that recommender systems are present everywhere in such a way that we no longer realize. The main goal of these systems is to assist users in the decision-making process, generating recommendations that are of their interest and based on their tastes. These recommendations range from products in e-commerce websites, like books to read or places to visit to what to eat or how long one should walk a day to have a healthy life, who to date or who one should follow on social networks.
And this is an increasing area. On the one hand, we have more and more users on the internet whose life is somewhat digitized, given than what one does in the "real world" is represented in a certain way in the "digital world". On the other hand, we suffer from information overload, which can be mitigated by the use of recommendation systems. However, these systems also face some problems, such as the cold start problem and their need to be more and more "human", "personalised" and "precise" in order to meet the yearning of users and companies.
In this challenging scenario, personality-based recommender systems are being increasingly studied, since they are able to face these problems. Some recent projects have proposed the use of the human personality in recommenders, whether as a whole or individually by facet in order to meet those demands. Therefore, this thesis is devoted to this new area of personality-based recommendation, focusing on one of its most important traits, the curiosity. Additionally, in order to exploit the information already present on the internet, we will implicitly obtain information from social networks.
Thus, this work aims to build a better experience for the end user through a new approach that offers an option for some of the gaps identified in personality-based recommendation systems. Among these gap improvements, the use of social networks to feed the recommender systems soften the cold start problem and, at the same time, it provides valuable data for the prediction of the human personality. Another found gap is that the curiosity was not used by any of the studied recommender systems; almost all of them have used the overall personality of an individual through the Big Five personality traits. However, psychological studies confirm that the curiosity is a relevant trait in the process of choosing an item, which is directly related to recommendation systems.
In summary, we believe that a recommendation system that implicitly measures the curiosity and uses it in the process of recommending new items, especially in the tourism sector, could clearly improve the capacity of these systems in terms of accuracy, serendipity and novelty, allowing users to obtain positive levels of satisfaction with the recommendations.
This thesis begins with an exhaustive study of the state of the art, where we highlight works about recommender systems, the human personality from the point of view of traditional and positive psychology and how these aspects are combined. Then, we develop an online application capable of implicitly extracting information from the user profile in a social network, thus generating predictions of one or more personality traits. Finally, we develop the CURUMIM system, able to generate online recommendations with different properties, combining the curiosity and some sociodemographic characteristics (such as level of education) extracted from Facebook. The system is tested and assessed within the tourism context by real users. The results demonstrate its ability to generate novel and serendipitous recommendations, while maintaining a good level of accuracy, independently of the degree of curiosity of the users.Menk Dos Santos, A. (2018). Personality-based recommendation: human curiosity applied to recommendation systems using implicit information from social networks [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/114798TESISCompendi
An Evaluation of the Use of Diversity to Improve the Accuracy of Predicted Ratings in Recommender Systems
The diversity; versus accuracy trade off, has become an important area of research within recommender systems as online retailers attempt to better serve their customers and gain a competitive advantage through an improved customer experience. This dissertation attempted to evaluate the use of diversity measures in predictive models as a means of improving predicted ratings. Research literature outlines a number of influencing factors such as personality, taste, mood and social networks in addition to approaches to the diversity challenge post recommendation. A number of models were applied included DecisionStump, Linear Regression, J48 Decision Tree and Naive Bayes. Various evaluation metrics such as precision, recall, ROC area, mean squared error and correlation coefficient were used to evaluate the model types. The results were below a benchmark selected during the literature review. The experiment did not demonstrate that diversity measures as inputs improve the accuracy of predicted ratings. However, the evaluation results for the model without diversity measures were low also and comparable to those with diversity indicating that further research in this area may be worthwhile. While the experiment conducted did not clearly demonstrate that the inclusion of diversity measures as inputs improve the accuracy of predicted ratings, approaches to data extraction, pre-processing, and model selection could inform further research. Areas of further research identified within this paper may also add value for those interested in this topic
Recommended from our members
Human-Centered Technologies for Inclusive Collection and Analysis of Public-Generated Data
The meteoric rise in the popularity of public engagement platforms such as social media, customer review websites, and public input solicitation efforts strives for establishing an inclusive environment for the public to share their thoughts, ideas, opinions, and experiences. Many decisions made at a personal, local, or national scale are often fueled by data generated by the public. As such, inclusive collection, analysis, sensemaking, and utilization of pubic-generated data are crucial to support the exercise of successful decision-making processes. However, people often struggle to engage, participate, and share their opinions due to inaccessibility, the rigidity of traditional public engagement methods, and the lack of options to provide opinions while avoiding potential confrontations. Concurrently, data analysts and decision-makers grapple with the challenges of analyzing, sensemaking, and making informed decisions based on public-generated data, which includes high dimensionality, ambiguity present in human language, and a lack of tools and techniques catered to their needs. Novel technological interventions are therefore necessary to enable the public to share their input without barriers and allow decision-makers to capture, forage, peruse, and sublimate public-generated data into concrete and actionable insights.
The goal of this dissertation is to demonstrate how human-centered approaches involve the stakeholders in the design, development, and evaluation of tools and techniques that can lead to inclusive, effective, and efficient approaches to public-generated data collection and analysis to support informed decision-making. To that end, in this dissertation, I first addressed the challenges of empowering the public to share their opinions by exploring two major opinion-sharing avenues --- social media and public consultation. To learn more about people\u27s social media experiences and challenges, I built two technology probes and conducted a qualitative exploratory study with 16 participants. This study is followed up by exploring the challenges of inclusive participation during public consultations such as town halls. Based on a formative study with 66 participants and 20 organizers, I designed and developed CommunityClick to enable reticent share their opinions silently and anonymously during town halls. Equipped with the knowledge and experiences from these works, I designed, developed, and evaluated technologies and methods to facilitate and accelerate informed data-driven decision-making based on increased public-generated data. Based on interviews with 14 analysts and decision-makers in the civic domain, I built a visual analytics system CommunityClick that can facilitate public input analysis by surfacing hidden insights, people\u27s reflections, and priorities. Leveraging the lessons learned during this work, I created a visual text analytics system that supports serendipitous discovery and balanced analysis of textual data to help make informed decisions.
In this work, I contribute an understanding of how people collect and analyze public-generated data to fuel their decisions when they have increased exposure to alternative avenues for opinion-sharing. Through a series of human-centered studies, I highlight the challenges that inhibit inclusivity in opinion sharing and shortcomings of existing methods that prevent decision-makers to account for comprehensive public input that includes marginalized or unpopular opinions. To address these challenges, I designed, developed, and evaluated a collection of interactive systems including CommunityClick, CommunityPulse, and Serendyze. Through a rigorous set of evaluation strategies which include creativity sessions, controlled lab studies, in-the-wild deployment, and field experiments, I involved stakeholders to assess the effectiveness and utility of the built systems. Through the empirical evidence from these studies, I demonstrate how alternative designs for social media could enhance people\u27s social media experiences and enable them to make new connections with others to share opinions. In addition, I show how CommunityClick can be utilized to enable reticent attendees during public consultation to share their opinions while avoiding unwanted confrontation and allowing organizers to capture and account for silent feedback. I highlight how CommunityPulse allowed analysts and decision-makers to examine public input from multiple angles for an accelerated analysis and more informed decision-making. Furthermore, I demonstrate how supporting serendipitous discovery and balanced analysis using Serendyze can lead to more informed data-driven decision-making. I conclude the dissertation with a discussion on future avenues to expand this research including the facilitation of multi-user collaborative analysis, integration of multi-modal signals in the analysis of public-generated data, and potential adoption strategies for decision-support systems designed for inclusive collection and analysis of public-generated data
IMPROVING COLLABORATIVE FILTERING RECOMMENDER BY USING MULTI-CRITERIA RATING AND IMPLICIT SOCIAL NETWORKS TO RECOMMEND RESEARCH PAPERS
Research paper recommender systems (RSs) aim to alleviate the information overload of researchers by suggesting relevant and useful papers. The collaborative filtering in the area of recommending research papers can benefit by using richer user feedback data through multi-criteria rating, and by integrating richer social network data into the recommender algorithm.
Existing approaches using collaborative filtering or hybrid approaches typically allow only one rating criterion (overall liking) for users to evaluate papers. We conducted a qualitative study using focus group to explore the most important criteria for rating research papers that can be used to control the paper recommendation by enabling users to set the weight for each criterion. We investigated also the effect of using different rating criteria on the user interface design and how the user can control the weight of the criteria. We followed that by a quantitative study using a questionnaire to validate our findings from the focus group and to find if the chosen criteria are domain independent.
Combining social network information with collaborative filtering recommendation algorithms has successfully reduced some of the drawbacks of collaborative filtering and increased the accuracy of recommendations. All existing recommendation approaches that combine social network information with collaborative filtering in this domain have used explicit social relations that are initiated by users (e.g. “friendship”, “following”). The results have shown that the recommendations produced using explicit social relations cannot compete with traditional collaborative filtering and suffer from the low user coverage. We argue that the available data in social bookmarking Web sites can be exploited to connect similar users using implicit social connections based on their bookmarking behavior. We explore the implicit social relations between users in social bookmarking Web sites (such as CiteULike and Mendeley), and propose three different implicit social networks to recommend relevant papers to users: readership, co-readership and tag-based implicit social networks. First, for each network, we tested the interest similarities of users who are connected using the proposed implicit social networks and compare them with the interest similarities using two explicit social networks: co-authorship and friendship. We found that the readership implicit social network connects users with more similarities than users who are connected using co-authorship and friendship explicit social networks. Then, we compare the recommendation using three different recommendation approaches and implicit social network alone with the recommendation using implicit and explicit social network. We found that fusing recommendation from implicit and explicit social networks can increase the prediction accuracy, and user coverage. The trade-off between the prediction accuracy and diversity was also studied with different social distances between users. The results showed that the diversity of the recommended list increases with the increase of social distance.
To summarize, the main contributions of this dissertation to the area of research paper recommendation are two-fold. It is the first to explore the use of multi-criteria rating for research papers. Secondly, it proposes and evaluates a novel approach to improve collaborative filtering in both prediction accuracy (performance) and user coverage and diversity (nonperformance measures) in social bookmarking systems for sharing research papers, by defining and exploiting several implicit social networks from usage data that is widely available