630 research outputs found

    Semi-supervised Adversarial Learning for Complementary Item Recommendation

    Full text link
    Complementary item recommendations are a ubiquitous feature of modern e-commerce sites. Such recommendations are highly effective when they are based on collaborative signals like co-purchase statistics. In certain online marketplaces, however, e.g., on online auction sites, constantly new items are added to the catalog. In such cases, complementary item recommendations are often based on item side-information due to a lack of interaction data. In this work, we propose a novel approach that can leverage both item side-information and labeled complementary item pairs to generate effective complementary recommendations for cold items, i.e., for items for which no co-purchase statistics yet exist. Given that complementary items typically have to be of a different category than the seed item, we technically maintain a latent space for each item category. Simultaneously, we learn to project distributed item representations into these category spaces to determine suitable recommendations. The main learning process in our architecture utilizes labeled pairs of complementary items. In addition, we adopt ideas from Cycle Generative Adversarial Networks (CycleGAN) to leverage available item information even in case no labeled data exists for a given item and category. Experiments on three e-commerce datasets show that our method is highly effective.Comment: ACM Web Conference 202

    Text-based Sentiment Analysis and Music Emotion Recognition

    Get PDF
    Nowadays, with the expansion of social media, large amounts of user-generated texts like tweets, blog posts or product reviews are shared online. Sentiment polarity analysis of such texts has become highly attractive and is utilized in recommender systems, market predictions, business intelligence and more. We also witness deep learning techniques becoming top performers on those types of tasks. There are however several problems that need to be solved for efficient use of deep neural networks on text mining and text polarity analysis. First of all, deep neural networks are data hungry. They need to be fed with datasets that are big in size, cleaned and preprocessed as well as properly labeled. Second, the modern natural language processing concept of word embeddings as a dense and distributed text feature representation solves sparsity and dimensionality problems of the traditional bag-of-words model. Still, there are various uncertainties regarding the use of word vectors: should they be generated from the same dataset that is used to train the model or it is better to source them from big and popular collections that work as generic text feature representations? Third, it is not easy for practitioners to find a simple and highly effective deep learning setup for various document lengths and types. Recurrent neural networks are weak with longer texts and optimal convolution-pooling combinations are not easily conceived. It is thus convenient to have generic neural network architectures that are effective and can adapt to various texts, encapsulating much of design complexity. This thesis addresses the above problems to provide methodological and practical insights for utilizing neural networks on sentiment analysis of texts and achieving state of the art results. Regarding the first problem, the effectiveness of various crowdsourcing alternatives is explored and two medium-sized and emotion-labeled song datasets are created utilizing social tags. One of the research interests of Telecom Italia was the exploration of relations between music emotional stimulation and driving style. Consequently, a context-aware music recommender system that aims to enhance driving comfort and safety was also designed. To address the second problem, a series of experiments with large text collections of various contents and domains were conducted. Word embeddings of different parameters were exercised and results revealed that their quality is influenced (mostly but not only) by the size of texts they were created from. When working with small text datasets, it is thus important to source word features from popular and generic word embedding collections. Regarding the third problem, a series of experiments involving convolutional and max-pooling neural layers were conducted. Various patterns relating text properties and network parameters with optimal classification accuracy were observed. Combining convolutions of words, bigrams, and trigrams with regional max-pooling layers in a couple of stacks produced the best results. The derived architecture achieves competitive performance on sentiment polarity analysis of movie, business and product reviews. Given that labeled data are becoming the bottleneck of the current deep learning systems, a future research direction could be the exploration of various data programming possibilities for constructing even bigger labeled datasets. Investigation of feature-level or decision-level ensemble techniques in the context of deep neural networks could also be fruitful. Different feature types do usually represent complementary characteristics of data. Combining word embedding and traditional text features or utilizing recurrent networks on document splits and then aggregating the predictions could further increase prediction accuracy of such models

    Cross domain recommender systems using matrix and tensor factorization

    Get PDF
    Today, the amount and importance of available data on the internet are growing exponentially. These digital data has become a primary source of information and the people’s life bonded to them tightly. The data comes in diverse shapes and from various resources and users utilize them in almost all their personal or social activities. However, selecting a desirable option from the huge list of available options can be really frustrating and time-consuming. Recommender systems aim to ease this process by finding the proper items which are more likely to be interested by users. Undoubtedly, there is not even one social media or online service which can continue its’ work properly without using recommender systems. On the other hand, almost all available recommendation techniques suffer from some common issues: the data sparsity, the cold-start, and the new-user problems. This thesis tackles the mentioned problems using different methods. While, most of the recommender methods rely on using single domain information, in this thesis, the main focus is on using multi-domain information to create cross-domain recommender systems. A cross-domain recommender system is not only able to handle the cold-start and new-user situations much better, but it also helps to incorporate different features exposed in diverse domains together and capture a better understanding of the users’ preferences which means producing more accurate recommendations. In this thesis, a pre-clustering stage is proposed to reduce the data sparsity as well. Various cross-domain knowledge-based recommender systems are suggested to recommend items in two popular social media, the Twitter and LinkedIn, by using different information available in both domains. The state of art techniques in this field, namely matrix factorization and tensor decomposition, are implemented to develop cross-domain recommender systems. The presented recommender systems based on the coupled nonnegative matrix factorization and PARAFAC-style tensor decomposition are evaluated using real-world datasets and it is shown that they superior to the baseline matrix factorization collaborative filtering. In addition, network analysis is performed on the extracted data from Twitter and LinkedIn

    Sparsity-aware neural user behavior modeling in online interaction platforms

    Get PDF
    Modern online platforms offer users an opportunity to participate in a variety of content-creation, social networking, and shopping activities. With the rapid proliferation of such online services, learning data-driven user behavior models is indispensable to enable personalized user experiences. Recently, representation learning has emerged as an effective strategy for user modeling, powered by neural networks trained over large volumes of interaction data. Despite their enormous potential, we encounter the unique challenge of data sparsity for a vast majority of entities, e.g., sparsity in ground-truth labels for entities and in entity-level interactions (cold-start users, items in the long-tail, and ephemeral groups). In this dissertation, we develop generalizable neural representation learning frameworks for user behavior modeling designed to address different sparsity challenges across applications. Our problem settings span transductive and inductive learning scenarios, where transductive learning models entities seen during training and inductive learning targets entities that are only observed during inference. We leverage different facets of information reflecting user behavior (e.g., interconnectivity in social networks, temporal and attributed interaction information) to enable personalized inference at scale. Our proposed models are complementary to concurrent advances in neural architectural choices and are adaptive to the rapid addition of new applications in online platforms. First, we examine two transductive learning settings: inference and recommendation in graph-structured and bipartite user-item interactions. In chapter 3, we formulate user profiling in social platforms as semi-supervised learning over graphs given sparse ground-truth labels for node attributes. We present a graph neural network framework that exploits higher-order connectivity structures (network motifs) to learn attributed structural roles of nodes that identify structurally similar nodes with co-varying local attributes. In chapter 4, we design neural collaborative filtering models for few-shot recommendations over user-item interactions. To address item interaction sparsity due to heavy-tailed distributions, our proposed meta-learning framework learns-to-recommend few-shot items by knowledge transfer from arbitrary base recommenders. We show that our framework consistently outperforms state-of-art approaches on overall recommendation (by 5% Recall) while achieving significant gains (of 60-80% Recall) for tail items with fewer than 20 interactions. Next, we explored three inductive learning settings: modeling spread of user-generated content in social networks; item recommendations for ephemeral groups; and friend ranking in large-scale social platforms. In chapter 5, we focus on diffusion prediction in social networks where a vast population of users rarely post content. We introduce a deep generative modeling framework that models users as probability distributions in the latent space with variational priors parameterized by graph neural networks. Our approach enables massive performance gains (over 150% recall) for users with sparse activities while being faster than state-of-the-art neural models by an order of magnitude. In chapter 6, we examine item recommendations for ephemeral groups with limited or no historical interactions together. To overcome group interaction sparsity, we present self-supervised learning strategies that exploit the preference co-variance in observed group memberships for group recommender training. Our framework achieves significant performance gains (over 30% NDCG) over prior state-of-the-art group recommendation models. In chapter 7, we introduce multi-modal inference with graph neural networks that captures knowledge from multiple feature modalities and user interactions for multi-faceted friend ranking. Our approach achieves notable higher performance gains for critical populations of less-active and low degree users

    Semantic discovery and reuse of business process patterns

    Get PDF
    Patterns currently play an important role in modern information systems (IS) development and their use has mainly been restricted to the design and implementation phases of the development lifecycle. Given the increasing significance of business modelling in IS development, patterns have the potential of providing a viable solution for promoting reusability of recurrent generalized models in the very early stages of development. As a statement of research-in-progress this paper focuses on business process patterns and proposes an initial methodological framework for the discovery and reuse of business process patterns within the IS development lifecycle. The framework borrows ideas from the domain engineering literature and proposes the use of semantics to drive both the discovery of patterns as well as their reuse

    A Normative Classification of Consumer Big Data

    Get PDF
    The big data phenomenon has transformed every area of life and business. Businesses today rely on the volume, velocity, and variety (3Vs) of data available today in product design, advertisement, sales, and post-sale follow up activities. Communication between the firm and the consumer is personalized using data collected on the consumer to match the consumer’s location, time, and needs. Some marketers argue that this has birth a new era of marketing; transformative marketing, in which the firm’s ability to deliver value and to acquire and maintain long-run competitive advantage determined by the firm’s data resources. In other words, data are the currency of the transformative marketing era. This sentiment is pervasive and has led to massive investments in data in recent years. This dissertation puts forward a classification of consumer big data to aid the firm extract value out of big data despite the 3Vs. The classification also demonstrates how value in a transformative marketing era does not have to be created at the expense of the consumer, but with the consumer. Five conceptual dichotomies are put forward in essay two that are more comprehensive than any other classification of data available in the research. Finally, the third essay investigates how the big data phenomenon affects consumer freedom and emotions. Most people agree that freedom is a fundamental human right, and that business practices should respect consumer freedom. However, research on consumer freedom is scant. Two experiments investigate how the characteristics of data collected on consumers affects consumer perception of decision freedom and satisfaction with value propositions. With the big data phenomenon has come a push toward algorithmic decision making. Consumer’s anxiety toward algorithmic decision making is investigated along with the satisfaction derived from decisions made by third parties that collect data on consumers

    O impacto da inteligência artificial no negócio eletrónico

    Get PDF
    Pela importância que a Inteligência Artificial exibe na atualidade, revela-se de grande interesse verificar até que ponto ela está a transformar o Negócio Eletrónico. Para esse efeito, delineou-se uma revisão sistemática com o objetivo de avaliar os impactos da proliferação destes instrumentos. A investigação empreendida pretendeu identificar artigos científicos que, através de pesquisas realizadas a Fontes de Dados Eletrónicas, pudessem responder às questões de investigação implementadas: a) que tipo de soluções, baseadas na Inteligência Artificial (IA), têm sido usadas para melhorar o Negócio Eletrónico (NE); b) em que domínios do NE a IA foi aplicada; c) qual a taxa de sucesso ou fracasso do projeto. Simultaneamente, tiveram de respeitar critérios de seleção, nomeadamente, estar escritos em inglês, encontrarem-se no intervalo temporal 2015/2021 e tratar-se de estudos empíricos, suportados em dados reais. Após uma avaliação de qualidade final, procedeu-se à extração dos dados pertinentes para a investigação, para formulários criados em MS Excel. Estes dados estiveram na base da análise quantitativa e qualitativa que evidenciaram as descobertas feitas e sobre os quais se procedeu, posteriormente, à sua discussão. A dissertação termina com as conclusão e discussão de trabalhos futuros.Due to the importance that Artificial Intelligence exhibits today, it is of great interest to see to what extent it is transforming the Electronic Business. To this end, a systematic review was designed to evaluate the impacts of the proliferation of these instruments. The research aimed to identify scientific articles that, through research carried out on Electronic Data Sources, could answer the research questions implemented: a) what kind of solutions, based on Artificial Intelligence, have been used to improve the Electronic Business; b) in which areas of the Electronic Business Artificial Intelligence has been applied; c) what the success rate or failure of the project is. At the same time, they must comply with selection criteria, to be written in English, to be found in the 2015/2021-time interval and to be empirical studies supported by actual data. After a final quality evaluation, the relevant data for the investigation were extracted for forms created in MS Excel. These data were the basis of the quantitative and qualitative analysis that evidenced the findings found and on which they were subsequently discussed. The dissertation ends with the conclusion and discussion of future works

    DIGITAL WINE: HOW PLATFORMS AND ALGORITHMS WILL RESHAPE THE WINE INDUSTRY

    Get PDF
    La tesi si propone di analizzare come la digitalizzazione e gli approcci basati sui dati, in particolare quelli che sfruttano l'intelligenza artificiale, stiano impattando il settore vitivinicolo e facendo emergere modelli nuovi di business. Quest'ultimo aspetto sarà approfondito tramite due casi studio di piattaforme digitali che, attraverso approcci diversi, stanno contribuendo a generare un ecosistema digitale virtuoso, con potenziali benefici per tutta la catena del valore a livello di settore.The thesis aims to analyze how digitalization and data-driven approaches, in particular those that leverage artificial intelligence, are impacting the wine industry and generating new business models. The latter aspect will be explored through two case studies of digital platforms which, through different approaches, are helping to generate a virtuous digital ecosystem, with potential benefits for the entire value chain at the industry level
    • …
    corecore