225 research outputs found

    Deep Learning based Recommender System: A Survey and New Perspectives

    Full text link
    With the ever-growing volume of online information, recommender systems have been an effective strategy to overcome such information overload. The utility of recommender systems cannot be overstated, given its widespread adoption in many web applications, along with its potential impact to ameliorate many problems related to over-choice. In recent years, deep learning has garnered considerable interest in many research fields such as computer vision and natural language processing, owing not only to stellar performance but also the attractive property of learning feature representations from scratch. The influence of deep learning is also pervasive, recently demonstrating its effectiveness when applied to information retrieval and recommender systems research. Evidently, the field of deep learning in recommender system is flourishing. This article aims to provide a comprehensive review of recent research efforts on deep learning based recommender systems. More concretely, we provide and devise a taxonomy of deep learning based recommendation models, along with providing a comprehensive summary of the state-of-the-art. Finally, we expand on current trends and provide new perspectives pertaining to this new exciting development of the field.Comment: The paper has been accepted by ACM Computing Surveys. https://doi.acm.org/10.1145/328502

    Degradation stage classification via interpretable feature learning

    Get PDF
    Predictive maintenance (PdM) advocates for the usage of machine learning technologies to monitor asset's health conditions and plan maintenance activities accordingly. However, according to the specific degradation process, some health-related measures (e.g. temperature) may be not informative enough to reliably assess the health stage. Moreover, each measure needs to be properly treated to extract the information linked to the health stage. Those issues are usually addressed by performing a manual feature engineering, which results in high management cost and poor generalization capability of those approaches. In this work, we address this issue by coupling a health stage classifier with a feature learning mechanism. With feature learning, minimally processed data are automatically transformed into informative features. Many effective feature learning approaches are based on deep learning. With those, the features are obtained as a non-linear combination of the inputs, thus it is difficult to understand the input's contribution to the classification outcome and so the reasoning behind the model. Still, these insights are increasingly required to interpret the results and assess the reliability of the model. In this regard, we propose a feature learning approach able to (i) effectively extract high-quality features by processing different input signals, and (ii) provide useful insights about the most informative domain transformations (e.g. Fourier transform or probability density function) of the input signals (e.g. vibration or temperature). The effectiveness of the proposed approach is tested with publicly available real-world datasets about bearings' progressive deterioration and compared with the traditional feature engineering approach

    A Comprehensive Survey of Deep Learning in Remote Sensing: Theories, Tools and Challenges for the Community

    Full text link
    In recent years, deep learning (DL), a re-branding of neural networks (NNs), has risen to the top in numerous areas, namely computer vision (CV), speech recognition, natural language processing, etc. Whereas remote sensing (RS) possesses a number of unique challenges, primarily related to sensors and applications, inevitably RS draws from many of the same theories as CV; e.g., statistics, fusion, and machine learning, to name a few. This means that the RS community should be aware of, if not at the leading edge of, of advancements like DL. Herein, we provide the most comprehensive survey of state-of-the-art RS DL research. We also review recent new developments in the DL field that can be used in DL for RS. Namely, we focus on theories, tools and challenges for the RS community. Specifically, we focus on unsolved challenges and opportunities as it relates to (i) inadequate data sets, (ii) human-understandable solutions for modelling physical phenomena, (iii) Big Data, (iv) non-traditional heterogeneous data sources, (v) DL architectures and learning algorithms for spectral, spatial and temporal data, (vi) transfer learning, (vii) an improved theoretical understanding of DL systems, (viii) high barriers to entry, and (ix) training and optimizing the DL.Comment: 64 pages, 411 references. To appear in Journal of Applied Remote Sensin

    Clustering of LMS Use Strategies with Autoencoders

    Get PDF
    Learning Management Systems provide teachers with many functionalities to offer materials to students, interact with them and manage their courses. Recognizing teachersโ€™ instructing styles from their course designs would allow recommendations and best practices to be made. We propose a method that determines teaching style in an unsupervised way from the course structure and use patterns. We define a course classification approach based on deep learning and clustering. We first use an autoencoder to reduce the dimensionality of the input data, while extracting the most important characteristics; thus, we obtain a latent representation of the courses. We then apply clustering techniques to the latent data to group courses based on their use patterns. The results show that this technique improves the clustering performance while avoiding the manual data pre-processing work. Furthermore, the obtained model defines seven course typologies that are clearly related to different use patterns of Learning Management Systems

    Enhancing VAEs for Collaborative Filtering: Flexible Priors & Gating Mechanisms

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ)--์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› :์œตํ•ฉ๊ณผํ•™๊ธฐ์ˆ ๋Œ€ํ•™์› ์œตํ•ฉ๊ณผํ•™๋ถ€(๋””์ง€ํ„ธ์ •๋ณด์œตํ•ฉ์ „๊ณต),2019. 8. ์„œ๋ด‰์›.Since Matrix Factorization based linear models have been dominant in the Collaborative Filtering context for a long time in the past, Neural Network based CF Models for recommendation have started to gain attention recently. One branch of research is based on using deep generative models to model user preferences and Variational Autoencoders where shown to give state-of-the-art results. However, there are some potentially problematic characteristics of the current Variational Autoencoder for CF. The first is the too simplistic prior VAEs incorporate for learning the latent representations of user preference, which may be restricting the model from learning more expressive and richer latent variables that could boost recommendation performance. The other is the models inability to learn deeper representations with more than one hidden layer. Our goal is to incorporate appropriate techniques in order to mitigate the aforementioned problems of Variational Autoencoder CF and further improve the recommendation performance of VAE based Collaborative Fil-tering. We bring the VampPrior, which successfully made improvements for image generation to tackle the restrictive prior problem. We also adopt Gat-ed Linear Units (GLUs) which were used in stacked convolutions for lan-guage modeling to control information flow in the easily deepening auto-encoder framework. We show that such simple priors (in original VAEs) may be too restric-tive to fully model user preferences and setting a more flexible prior gives significant gains. We also show that VAMP priors coupled with gating mechanisms outperform SOTA results including the Variational Autoencoder for Collaborative Filtering by meaningful margins on 4 benchmark datasets (MovieLens, Netflix, Pinterest, Melon).์ตœ๊ทผ ๋‰ด๋Ÿด๋„ท ๊ธฐ๋ฐ˜ ํ˜‘์—…ํ•„ํ„ฐ๋ง ์ถ”์ฒœ์•Œ๊ณ ๋ฆฌ์ฆ˜์ด ์ฃผ๋ชฉ์„ ๋ฐ›๊ณ  ์žˆ๋‹ค. ๊ทธ ์ค‘ ํ•œ ๊ฐˆ๋ž˜์˜ ์—ฐ๊ตฌ๋Š” ๊นŠ์€ ์ƒ์„ฑ๋ชจํ˜• (Deep Generative Model)์„ ์ด์šฉํ•ด ์‚ฌ์šฉ์ž๋“ค์˜ ์„ ํ˜ธ๋ฅผ ๋ชจ๋ธ๋งํ•˜๋Š” ๋ฐฉ๋ฒ•์ด๋‹ค. ์ด์ค‘ Variational Autoencoder๋ฅผ (VAE) ์ด์šฉํ•œ ๋ฐฉ๋ฒ•์ด ์ตœ๊ทผ state-of-the-art (SOTA) ์„ฑ๋Šฅ์„ ๋ณด์—ฌ์ฃผ์—ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ VAE๋ฅผ ์ด์šฉํ•œ ํ˜‘์—…ํ•„ํ„ฐ๋ง ์•Œ๊ณ ๋ฆฌ์ฆ˜์€ ํ˜„์žฌ ๋ช‡ ๊ฐ€์ง€์˜ ์ž ์žฌ์ ์ธ ๋ฌธ์ œ์ ๋“ค์„ ์ง€๋‹ˆ๊ณ  ์žˆ๋‹ค. ์ฒซ ๋ฒˆ์งธ๋Š” ์‚ฌ์šฉ์ž ์„ ํ˜ธ๋ฅผ ์••์ถ•ํ•˜๋Š” ์ž ์žฌ๋ณ€์ˆ˜๋ฅผ ํ•™์Šตํ•˜๋Š” ๊ณผ์ •์—์„œ ๋งค์šฐ ๋‹จ์ˆœํ•œ ์‚ฌ์ „๋ถ„ํฌ๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ๋˜ ๋‹ค๋ฅธ ๋ฌธ์ œ์ ์€ ๋ชจ๋ธ์ด ํ˜„์žฌ ์—ฌ๋Ÿฌ ๋‹จ์„ ์ด์šฉํ•œ ๊นŠ์€ ์ธ์ฝ”๋”์™€ ๋””์ฝ”๋”๋ฅผ ์‚ฌ์šฉํ•˜์ง€ ๋ชปํ•˜๊ณ  ์žˆ๋‹ค๋Š” ๊ฒƒ์ด๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋Š” ์ตœ์‹ ๊ธฐ์ˆ ๋“ค์„ ํ™œ์šฉํ•˜์—ฌ ์•ž์„  ๋ฌธ์ œ์ ๋“ค์„ ํ•ด๊ฒฐํ•˜๊ณ  VAE๋ฅผ ์ด์šฉํ•œ ํ˜‘์—…ํ•„ํ„ฐ๋ง ์•Œ๊ณ ๋ฆฌ์ฆ˜์˜ ์ถ”์ฒœ์„ฑ๋Šฅ์„ ๋”์šฑ ๋†’์ด๋Š” ๊ฒƒ์ด ๋ชฉํ‘œ์ด๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋Š” ํ˜‘์—…ํ•„ํ„ฐ๋ง ๋ฌธ์ œ์— ๋” ๋ณต์žกํ•œ ์‚ฌ์ „๋ถ„ํฌ (Flexible Prior)๋ฅผ ์ ์šฉํ•œ ์ฒซ ์—ฐ๊ตฌ๋กœ์„œ, ๊ธฐ์กด์˜ ๋‹จ์ˆœํ•œ ์‚ฌ์ „๋ถ„ํฌ๊ฐ€ ๋ชจ๋ธ์˜ ํ‘œํ˜„๋ ฅ์„ ์ œํ•œํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ ๋” ๋ณต์žกํ•œ ์‚ฌ์ „๋ถ„ํฌ๋ฅผ ์ •์˜ํ•จ์œผ๋กœ์จ ๋ชจ๋ธ์˜ ์„ฑ๋Šฅ์„ ๋”์šฑ ๋†’์ผ ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ์ด๋ฏธ์ง€ ์ƒ์„ฑ ๋ฌธ์ œ์—์„œ ์ข‹์€ ๊ฒฐ๊ณผ๋ฅผ ๋ณด์ธ VampPrior๋ฅผ ์ด์šฉํ•ด ์‹คํ—˜์„ ์ง„ํ–‰ํ•˜์˜€๋‹ค. ๋˜ํ•œ VampPrior๋ฅผ Gating Mechanisim๊ณผ ํ•จ๊ป˜ ์‚ฌ์šฉํ•˜์˜€์„ ๋•Œ ๊ธฐ์กด SOTA๋ฅผ ๋„˜์–ด์„œ๋Š” ์„ฑ๋Šฅ์„ ๋ณด์ž„์„ ์ถ”์ฒœ์•Œ๊ณ ๋ฆฌ์ฆ˜์—์„œ ์‚ฌ์šฉ๋˜๋Š” ๋Œ€ํ‘œ์ ์ธ ๋ฐ์ดํ„ฐ์…‹๋“ค์„ ํ†ตํ•ด ๋ณด์—ฌ์ค€๋‹ค.1 INTRODUCTION 1 1.1 Background and Motivation 1 1.2 Research Goal 3 1.3 Enhancing VAEs for Collaborative Filtering 3 1.4 Experiments 5 1.5 Contributions 5 2 RELATED WORK 7 2.1 Collaborative Filtering 7 2.1.1 Traditional methods & Matrix-Factorization based CF 8 2.1.2 Autoencoders for CF 12 2.2 Deep Generative Models (VAE) 17 2.2.1 Variational Bayes 18 2.2.2 Variational Autoencoder 18 2.3 Variational Autoencoder for Collaborative Filtering 20 2.3.1 VAE for CF 21 2.4 Recent research in Computer Vision & Deep Learning 24 2.4.1 VampPrior 24 2.4.2 Gated CNN 25 3 METHOD 28 3.1 Flexible Prior 29 3.1.1 Motivation 29 3.1.2 VampPrior 30 3.1.3 Hierarchical Stochastic Units 31 3.2 Gating Mechanism 32 3.2.1 Motivation 32 3.2.2 Gated Linear Units 34 4 EXPERIMENT 35 4.1 Setup 35 4.1.1 Baseline Models 35 4.1.2 Proposed Models 37 4.1.3 Strong Generalization 37 4.1.4 Evaluation Metrics 38 4.2 Datasets 38 4.3 Configurations 39 4.4 Results 40 4.4.1 Model Performance 40 4.4.5 Further Analysis on the Effect of Gating 44 5 CONCLUSION 45 Bibliography 47 ๊ตญ๋ฌธ์ดˆ๋ก 51Maste
    • โ€ฆ
    corecore