5 research outputs found

    A deep learning approach for context-aware citation recommendation using rhetorical zone classification and similarity to overcome cold-start problem

    Get PDF
    In the recent decade, the citation recommendation has emerged as an important research topic due to its need for the huge size of published scientific work. Among other citation recommendation techniques, the widely used content-based filtering (CBF) exploits research articlesโ€™ textual content to produce recommendations. However, CBF techniques are prone to the well-known cold-start problem. On the other hand, deep learning has shown its effectiveness in understanding the semantics of the text. The present paper proposes a citation recommendation system using deep learning models to classify rhetorical zones of the research articles and compute similarity using rhetorical zone embeddings that overcome the cold-start problem. Rhetorical zones are the predefined linguistic categories having some common characteristics about the text. A deep learning model is trained using ART and CORE datasets with an accuracy of 76 per cent. The final ranked lists of the recommendations have an average of 0.704 normalized discounted cumulative gain (nDCG) score involving ten domain experts. The proposed system is applicable for both local and global context-aware recommendations

    Distance-based Attention GNN for Node Embedding on Spatial Networks

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(์„์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ๊ณต๊ณผ๋Œ€ํ•™ ๊ฑด์„คํ™˜๊ฒฝ๊ณตํ•™๋ถ€, 2021.8. ์œ ๊ธฐ์œค.๊ณต๊ฐ„ ๋„คํŠธ์›Œํฌ๋Š” ๋…ธ๋“œ์™€ ์—ฃ์ง€๊ฐ€ ๊ฑฐ๋ฆฌ ๊ณต๊ฐ„์— ์†ํ•˜๋Š” ๊ทธ๋ž˜ํ”„ ์ž๋ฃŒ๊ตฌ์กฐ๋กœ ์ง€ํ‘œ๋ฉด ์ƒ์˜ ์—ฌ๋Ÿฌ ํ˜„์ƒ๊ณผ ๋ฌธ์ œ๋ฅผ ํ‘œํ˜„ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ตœ๊ทผ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ์ˆ ์˜ ๊ธ‰๊ฒฉํ•œ ๋ฐœ๋‹ฌ๊ณผ ๋”๋ถˆ์–ด ๊ทธ๋ž˜ํ”„๋ฅผ ํ•™์Šตํ•˜๋Š” ๋‹ค์–‘ํ•œ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ๋“ค์ด ์—ฐ๊ตฌ๋˜๊ณ  ์žˆ์ง€๋งŒ ๋…ธ๋“œ ์‚ฌ์ด์˜ ๊ฑฐ๋ฆฌ ์š”์†Œ๋ฅผ ๊ณ ๋ คํ•˜์ง€ ์•Š์•„ ๊ณต๊ฐ„ ๋„คํŠธ์›Œํฌ๊ฐ€ ๊ฐ€์ง„ ๊ฑฐ๋ฆฌ ์ •๋ณด๋ฅผ ํ™œ์šฉํ•˜์ง€ ๋ชปํ•œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ๋Š” ๊ณต๊ฐ„ ๋„คํŠธ์›Œํฌ์— ์†ํ•˜๋Š” ๊ฑฐ๋ฆฌ ํŠน์„ฑ์„ ๊ฐ€์ง„ ๋…ธ๋“œ๋“ค์˜ ์ด์›ƒ ๊ด€๊ณ„์—์„œ์˜ ์˜ํ–ฅ๋ ฅ์„ ์•Œ ์ˆ˜ ์žˆ๋Š” ๊ฑฐ๋ฆฌ ๊ธฐ๋ฐ˜ ์–ดํ…์…˜ ๊ธฐ๋ฒ•์„ ์†Œ๊ฐœํ•˜๊ณ  ์ด๋ฅผ ํ™œ์šฉํ•œ ๊ทธ๋ž˜ํ”„ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์ธ DWGNN์„ ์ œ์•ˆํ•œ๋‹ค. DWGNN์€ ์ž„๋ฒ ๋”ฉ ๊ณผ์ • ์ค‘ ๋Œ€์ƒ ๋…ธ๋“œ์—๊ฒŒ ๋”์šฑ ๊ฐ€๊นŒ์šด ์ด์›ƒ ๋…ธ๋“œ์˜ ์˜ํ–ฅ๋ ฅ์„ ์ƒ๋Œ€์ ์œผ๋กœ ๋ฉ€๋ฆฌ ์œ„์น˜ํ•œ ์ด์›ƒ ๋…ธ๋“œ๋“ค์˜ ์˜ํ–ฅ๋ ฅ๋ณด๋‹ค ํฌ๊ฒŒ ์ฃผ์–ด ๊ณต๊ฐ„ ๋„คํŠธ์›Œํฌ ์ƒ ๊ฐ ๋…ธ๋“œ์˜ ํŠน์ง• ์ถ”์ถœ์„ ํšจ๊ณผ์ ์œผ๋กœ ์ˆ˜ํ–‰ํ•œ๋‹ค. ์‹ค์„ธ๊ณ„ ๋…ธ๋“œ ์ž„๋ฒ ๋”ฉ ๋ฌธ์ œ์— ๋Œ€ํ•œ ์ œ์•ˆํ•œ ๊ธฐ๋ฒ•์˜ ๋‹ค์–‘ํ•œ ์ ์šฉ์„ ์œ„ํ•˜์—ฌ ์‹ค๋‚ด, ์‹ค์™ธ ๊ณต๊ฐ„ ๋„คํŠธ์›Œํฌ ๋…ธ๋“œ ์ž„๋ฒ ๋”ฉ ์‹คํ—˜์„ ์ง„ํ–‰ํ–ˆ๋‹ค. ๊ธฐ์กด์— ์ •๋ฆฝ๋œ GNN ๋ชจ๋ธ๊ณผ ์ œ์•ˆํ•œ ๊ธฐ๋ฒ•์„ ํ™œ์šฉํ•œ ๋ชจ๋ธ์„ ๋น„๊ตํ•ด๋ณธ ๊ฒฐ๊ณผ ๋…ธ๋“œ ๋ถ„๋ฅ˜์™€ ํŠน์„ฑ ๋ฒกํ„ฐ ์ถ”์ถœ์— ๋”์šฑ ๋›ฐ์–ด๋‚œ ์„ฑ๋Šฅ์„ ๋ณด์ด๋Š” ๊ฒƒ์„ ํ™•์ธํ–ˆ๋‹ค. ์ œ์•ˆํ•œ ๋ชจ๋ธ์„ ๋ชฉ์ ์— ๋งž๋Š” ๋‹ค์–‘ํ•œ ๋”ฅ๋Ÿฌ๋‹ ์•„ํ‚คํ…์ฒ˜ ๋‚ด ํ•˜๋‚˜์˜ ์š”์†Œ๋กœ์„œ ๊ฐœ๋ฐœํ–ˆ์œผ๋ฉฐ ๊ณต๊ฐ„ ๋„คํŠธ์›Œํฌ๋ฅผ ๋Œ€์ƒ์œผ๋กœ ํ•œ ๋‹ค์–‘ํ•œ ๋ถ„์„์„ ์œ„ํ•œ ๋ชจ๋ธ๋กœ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์„ ๊ฒƒ์œผ๋กœ ๊ธฐ๋Œ€ํ•œ๋‹ค.A spatial network is a graph data structure consisting of nodes and edges in a metric space, used to represent various real-world problems and phenomena. Recently, along with the rapid development of deep learning techniques, various Neural Network models for graphs have been studied. However, these models barely utilize distance information as they did not consider the distance factors between nodes present in spatial networks. This paper introduces the distance-based attention mechanism, which can measure the attention values from the relationship among the nodes with distance features in a spatial network, and proposes Distance-Weighted Graph Neural Networks(DWGNN). DWGNN effectively extracts features of each node in a spatial network by making the nearby neighboring nodes more influential to the target node than the distant neighboring nodes in the embedding update process. Two node embedding experiments with indoor and outdoor spatial networks have been conducted, to apply the proposed GNN model to real-world node embedding problems in spatial networks. Compared to the existing GNN models, DWGNN shows performance improvement in node classification tasks and extracting latent feature vectors. The suggested model is developed as an element for various GNN architecture depending on the purpose and is expected to be used as a model for analyzing various problems on spatial networks.1. ์„œ ๋ก  1 1.1. ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ ๋ฐ ๋ชฉ์  1 1.2. ๊ด€๋ จ ์—ฐ๊ตฌ 4 1.2.1. ๋„คํŠธ์›Œํฌ ์ž„๋ฒ ๋”ฉ 5 1.2.2. Graph Neural Networks 10 1.2.2.1. GNN์˜ ๊ธฐ๋ณธ ์ •์˜์™€ Deep Graph Encoder๋กœ์„œ์˜ ๊ธฐ๋Šฅ ์ˆ˜ํ–‰ 10 1.2.2.2. GNN ํ™œ์šฉ ๋ชจ๋ธ 14 1.2.3. ์—ฃ์ง€ ์†์„ฑ์„ ๊ณ ๋ คํ•œ ๋…ธ๋“œ ์ž„๋ฒ ๋”ฉ ๋ชจ๋ธ 16 2. ์—ฐ๊ตฌ ๋ฐฉ๋ฒ• 19 2.1. ๋ฌธ์ œ ์ •์˜ 19 2.2. ๊ฑฐ๋ฆฌ ๊ธฐ๋ฐ˜ ๋„คํŠธ์›Œํฌ ์–ดํ…์…˜ ๋ฉ”์ปค๋‹ˆ์ฆ˜ 22 2.3. Distance-Weighted Graph Neural Networks 28 2.3.1. ๊ฑฐ๋ฆฌ ๊ธฐ๋ฐ˜ Graph ConvNet 28 2.3.2. DWGNN ๋ชจ๋ธ ์•„ํ‚คํ…์ฒ˜ 30 3. ์‹คํ—˜ ์ ์šฉ ๋ฐ ๊ฒฐ๊ณผ 33 3.1. ๊ฒฐ๊ณผ ๋น„๊ต๋ฅผ ์œ„ํ•œ GNN ๋ชจ๋ธ ์ •์˜ 33 3.2. ์‹ค๋‚ด ๋„คํŠธ์›Œํฌ: ๋„๋ฉด ์š”์†Œ ์ถ”์ถœ ๋ฐ ๋ถ„๋ฅ˜ 35 3.2.1. ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๋ฐ ํ™˜๊ฒฝ 37 3.2.2. ๋ชจ๋ธ ๊ตฌ์„ฑ ๋ฐ ํ•™์Šต 39 3.2.3. ์‹คํ—˜ ๊ฒฐ๊ณผ ๋ถ„์„ 41 3.3. ์‹ค์™ธ ๋„คํŠธ์›Œํฌ: ๋ฒ„์Šค ๋„คํŠธ์›Œํฌ ์ •๋ฅ˜์žฅ ์ž„๋ฒ ๋”ฉ 48 3.3.1. ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๋ฐ ํ™˜๊ฒฝ 51 3.3.2. ๋ชจ๋ธ ๊ตฌ์„ฑ ๋ฐ ํ•™์Šต 53 3.3.3. ์‹คํ—˜ ๊ฒฐ๊ณผ ๋ถ„์„ 55 4. ๊ฒฐ ๋ก  59์„

    Exploring attributes, sequences, and time in Recommender Systems: From classical to Point-of-Interest recommendation

    Full text link
    Tesis Doctoral inรฉdita leรญda en la Universidad Autรณnoma de Madrid, Escuela Politรฉcnica Superior, Departamento de Ingenieria Informรกtica. Fecha de lectura: 08-07-2021Since the emergence of the Internet and the spread of digital communications throughout the world, the amount of data stored on the Web has been growing exponentially. In this new digital era, a large number of companies have emerged with the purpose of ltering the information available on the web and provide users with interesting items. The algorithms and models used to recommend these items are called Recommender Systems. These systems are applied to a large number of domains, from music, books, or movies to dating or Point-of-Interest (POI), which is an increasingly popular domain where users receive recommendations of di erent places when they arrive to a city. In this thesis, we focus on exploiting the use of contextual information, especially temporal and sequential data, and apply it in novel ways in both traditional and Point-of-Interest recommendation. We believe that this type of information can be used not only for creating new recommendation models but also for developing new metrics for analyzing the quality of these recommendations. In one of our rst contributions we propose di erent metrics, some of them derived from previously existing frameworks, using this contextual information. Besides, we also propose an intuitive algorithm that is able to provide recommendations to a target user by exploiting the last common interactions with other similar users of the system. At the same time, we conduct a comprehensive review of the algorithms that have been proposed in the area of POI recommendation between 2011 and 2019, identifying the common characteristics and methodologies used. Once this classi cation of the algorithms proposed to date is completed, we design a mechanism to recommend complete routes (not only independent POIs) to users, making use of reranking techniques. In addition, due to the great di culty of making recommendations in the POI domain, we propose the use of data aggregation techniques to use information from di erent cities to generate POI recommendations in a given target city. In the experimental work we present our approaches on di erent datasets belonging to both classical and POI recommendation. The results obtained in these experiments con rm the usefulness of our recommendation proposals, in terms of ranking accuracy and other dimensions like novelty, diversity, and coverage, and the appropriateness of our metrics for analyzing temporal information and biases in the recommendations producedDesde la aparici on de Internet y la difusi on de las redes de comunicaciones en todo el mundo, la cantidad de datos almacenados en la red ha crecido exponencialmente. En esta nueva era digital, han surgido un gran n umero de empresas con el objetivo de ltrar la informaci on disponible en la red y ofrecer a los usuarios art culos interesantes. Los algoritmos y modelos utilizados para recomendar estos art culos reciben el nombre de Sistemas de Recomendaci on. Estos sistemas se aplican a un gran n umero de dominios, desde m usica, libros o pel culas hasta las citas o los Puntos de Inter es (POIs, en ingl es), un dominio cada vez m as popular en el que los usuarios reciben recomendaciones de diferentes lugares cuando llegan a una ciudad. En esta tesis, nos centramos en explotar el uso de la informaci on contextual, especialmente los datos temporales y secuenciales, y aplicarla de forma novedosa tanto en la recomendaci on cl asica como en la recomendaci on de POIs. Creemos que este tipo de informaci on puede utilizarse no s olo para crear nuevos modelos de recomendaci on, sino tambi en para desarrollar nuevas m etricas para analizar la calidad de estas recomendaciones. En una de nuestras primeras contribuciones proponemos diferentes m etricas, algunas derivadas de formulaciones previamente existentes, utilizando esta informaci on contextual. Adem as, proponemos un algoritmo intuitivo que es capaz de proporcionar recomendaciones a un usuario objetivo explotando las ultimas interacciones comunes con otros usuarios similares del sistema. Al mismo tiempo, realizamos una revisi on exhaustiva de los algoritmos que se han propuesto en el a mbito de la recomendaci o n de POIs entre 2011 y 2019, identi cando las caracter sticas comunes y las metodolog as utilizadas. Una vez realizada esta clasi caci on de los algoritmos propuestos hasta la fecha, dise~namos un mecanismo para recomendar rutas completas (no s olo POIs independientes) a los usuarios, haciendo uso de t ecnicas de reranking. Adem as, debido a la gran di cultad de realizar recomendaciones en el ambito de los POIs, proponemos el uso de t ecnicas de agregaci on de datos para utilizar la informaci on de diferentes ciudades y generar recomendaciones de POIs en una determinada ciudad objetivo. En el trabajo experimental presentamos nuestros m etodos en diferentes conjuntos de datos tanto de recomendaci on cl asica como de POIs. Los resultados obtenidos en estos experimentos con rman la utilidad de nuestras propuestas de recomendaci on en t erminos de precisi on de ranking y de otras dimensiones como la novedad, la diversidad y la cobertura, y c omo de apropiadas son nuestras m etricas para analizar la informaci on temporal y los sesgos en las recomendaciones producida
    corecore