655 research outputs found
Graph BI & analytics: current state and future challenges
In an increasingly competitive market, making well-informed decisions requires the analysis of a wide range of heterogeneous, large and complex data. This paper focuses on the emerging field of graph warehousing. Graphs are widespread structures that yield a great expressive power. They are used for modeling highly complex and interconnected domains, and efficiently solving emerging big data application. This paper presents the current status and open challenges of graph BI and analytics, and motivates the need for new warehousing frameworks aware of the topological nature of graphs. We survey the topics of graph modeling, management, processing and analysis in graph warehouses. Then we conclude by discussing future research directions and positioning them within a unified architecture of a graph BI and analytics framework.Peer ReviewedPostprint (author's final draft
Mining Traversal Patterns from Weighted Traversals and Graph
μ€μΈκ³μ λ§μ λ¬Έμ λ€μ κ·Έλνμ κ·Έ κ·Έλνλ₯Ό μννλ νΈλμμ
μΌλ‘ λͺ¨λΈλ§λ μ μλ€. μλ₯Ό λ€λ©΄, μΉ νμ΄μ§μ μ°κ²°κ΅¬μ‘°λ κ·Έλνλ‘ ννλ μ μκ³ , μ¬μ©μμ μΉ νμ΄μ§ λ°©λ¬Έκ²½λ‘λ κ·Έ κ·Έλνλ₯Ό μννλ νΈλμμ
μΌλ‘ λͺ¨λΈλ§λ μ μλ€. μ΄μ κ°μ΄ κ·Έλνλ₯Ό μννλ νΈλμμ
μΌλ‘λΆν° μ€μνκ³ κ°μΉ μλ ν¨ν΄μ μ°Ύμλ΄λ κ²μ μλ―Έ μλ μΌμ΄λ€. μ΄λ¬ν ν¨ν΄μ μ°ΎκΈ° μν μ§κΈκΉμ§μ μ°κ΅¬μμλ μνλ κ·Έλνμ κ°μ€μΉλ₯Ό κ³ λ €νμ§ μκ³ λ¨μν λΉλ°νλ ν¨ν΄λ§μ μ°Ύλ μκ³ λ¦¬μ¦μ μ μνμλ€. μ΄λ¬ν μκ³ λ¦¬μ¦μ νκ³λ λ³΄λ€ μ λ’°μ± μκ³ μ νν ν¨ν΄μ νμ¬νλ λ° μ΄λ €μμ΄ μλ€λ κ²μ΄λ€.
λ³Έ λ
Όλ¬Έμμλ μνλ κ·Έλνμ μ μ μ λΆμ¬λ κ°μ€μΉλ₯Ό κ³ λ €νμ¬ ν¨ν΄μ νμ¬νλ λ κ°μ§ λ°©λ²λ€μ μ μνλ€. 첫 λ²μ§Έ λ°©λ²μ κ·Έλνλ₯Ό μννλ μ 보μ κ°μ€μΉκ° μ‘΄μ¬νλ κ²½μ°μ λΉλ° μν ν¨ν΄μ νμ¬νλ κ²μ΄λ€. κ·Έλν μνμ λΆμ¬λ μ μλ κ°μ€μΉλ‘λ λ λμκ°μ μ΄λ μκ°μ΄λ μΉ μ¬μ΄νΈλ₯Ό λ°©λ¬Έν λ ν νμ΄μ§μμ λ€λ₯Έ νμ΄μ§λ‘ μ΄λνλ μκ° λ±μ΄ λ μ μλ€. λ³Έ λ
Όλ¬Έμμλ μ’ λ μ νν μν ν¨ν΄μ λ§μ΄λνκΈ° μν΄ ν΅κ³νμ μ λ’° ꡬκ°μ μ΄μ©νλ€. μ¦, μ 체 μνμ κ° κ°μ μ λΆμ¬λ κ°μ€μΉλ‘λΆν° μ λ’° ꡬκ°μ ꡬν ν μ λ’° ꡬκ°μ λ΄μ μλ μνλ§μ μ ν¨ν κ²μΌλ‘ μΈμ νλ λ°©λ²μ΄λ€. μ΄λ¬ν λ°©λ²μ μ μ©ν¨μΌλ‘μ¨ λμ± μ λ’°μ± μλ μν ν¨ν΄μ λ§μ΄λν μ μλ€. λν μ΄λ κ² κ΅¬ν ν¨ν΄κ³Ό κ·Έλν μ 보λ₯Ό μ΄μ©νμ¬ ν¨ν΄ κ°μ μ°μ μμλ₯Ό κ²°μ ν μ μλ λ°©λ²κ³Ό μ±λ₯ ν₯μμ μν μκ³ λ¦¬μ¦λ μ μνλ€.
λ λ²μ§Έ λ°©λ²μ κ·Έλνμ μ μ μ κ°μ€μΉκ° λΆμ¬λ κ²½μ°μ κ°μ€μΉκ° κ³ λ €λ λΉλ° μν ν¨ν΄μ νμ¬νλ λ°©λ²μ΄λ€. κ·Έλνμ μ μ μ λΆμ¬λ μ μλ κ°μ€μΉλ‘λ μΉ μ¬μ΄νΈ λ΄μ κ° λ¬Έμμ μ 보λμ΄λ μ€μλ λ±μ΄ λ μ μλ€. μ΄ λ¬Έμ μμλ λΉλ° μν ν¨ν΄μ κ²°μ νκΈ° μνμ¬ ν¨ν΄μ λ°μ λΉλλΏλ§ μλλΌ λ°©λ¬Έν μ μ μ κ°μ€μΉλ₯Ό λμμ κ³ λ €νμ¬μΌ νλ€. μ΄λ₯Ό μν΄ λ³Έ λ
Όλ¬Έμμλ μ μ μ κ°μ€μΉλ₯Ό μ΄μ©νμ¬ ν₯νμ λΉλ° ν¨ν΄μ΄ λ κ°λ₯μ±μ΄ μλ ν보 ν¨ν΄μ κ° λ§μ΄λ λ¨κ³μμ μ κ±°νμ§ μκ³ μ μ§νλ μκ³ λ¦¬μ¦μ μ μνλ€. λν μ±λ₯ ν₯μμ μν΄ ν보 ν¨ν΄μ μλ₯Ό κ°μμν€λ μκ³ λ¦¬μ¦λ μ μνλ€.
λ³Έ λ
Όλ¬Έμμ μ μν λ κ°μ§ λ°©λ²μ λνμ¬ λ€μν μ€νμ ν΅νμ¬ μν μκ° λ° μμ±λλ ν¨ν΄μ μ λ±μ λΉκ΅ λΆμνμλ€.
λ³Έ λ
Όλ¬Έμμλ μνμ κ°μ€μΉκ° μλ κ²½μ°μ κ·Έλνμ μ μ μ κ°μ€μΉκ° μλ κ²½μ°μ λΉλ° μν ν¨ν΄μ νμ¬νλ μλ‘μ΄ λ°©λ²λ€μ μ μνμλ€. μ μν λ°©λ²λ€μ μΉ λ§μ΄λκ³Ό κ°μ λΆμΌμ μ μ©ν¨μΌλ‘μ¨ μΉ κ΅¬μ‘°μ ν¨μ¨μ μΈ λ³κ²½μ΄λ μΉ λ¬Έμμ μ κ·Ό μλ ν₯μ, μ¬μ©μλ³ κ°μΈνλ μΉ λ¬Έμ κ΅¬μΆ λ±μ΄ κ°λ₯ν κ²μ΄λ€.Abstract β
Ά
Chapter 1 Introduction
1.1 Overview
1.2 Motivations
1.3 Approach
1.4 Organization of Thesis
Chapter 2 Related Works
2.1 Itemset Mining
2.2 Weighted Itemset Mining
2.3 Traversal Mining
2.4 Graph Traversal Mining
Chapter 3 Mining Patterns from Weighted Traversals on
Unweighted Graph
3.1 Definitions and Problem Statements
3.2 Mining Frequent Patterns
3.2.1 Augmentation of Base Graph
3.2.2 In-Mining Algorithm
3.2.3 Pre-Mining Algorithm
3.2.4 Priority of Patterns
3.3 Experimental Results
Chapter 4 Mining Patterns from Unweighted Traversals on
Weighted Graph
4.1 Definitions and Problem Statements
4.2 Mining Weighted Frequent Patterns
4.2.1 Pruning by Support Bounds
4.2.2 Candidate Generation
4.2.3 Mining Algorithm
4.3 Estimation of Support Bounds
4.3.1 Estimation by All Vertices
4.3.2 Estimation by Reachable Vertices
4.4 Experimental Results
Chapter 5 Conclusions and Further Works
Reference
Web Mining for Web Personalization
Web personalization is the process of customizing a Web site to the needs of specific users, taking advantage of the knowledge acquired from the analysis of the user\u27s navigational behavior (usage data) in correlation with other information collected in the Web context, namely, structure, content, and user profile data. Due to the explosive growth of the Web, the domain of Web personalization has gained great momentum both in the research and commercial areas. In this article we present a survey of the use of Web mining for Web personalization. More specifically, we introduce the modules that comprise a Web personalization system, emphasizing the Web usage mining module. A review of the most common methods that are used as well as technical issues that occur is given, along with a brief overview of the most popular tools and applications available from software vendors. Moreover, the most important research initiatives in the Web usage mining and personalization areas are presented
Intelligent agents for matching information providers and consumers on the World-Wide-Web
In this paper, we discuss the various issues in designing intelligent software systems to assist world-wide-web users in locating relevant information. We identify a number of key components in such intelligent systems. These include a web document database management system, a client-based goal-directed search engine, an intelligent learning agent which discovers users' topics of interest by studying their browsing behavior, and an intelligent agent which monitors `hot' web sites. We give examples and suggestions on how these components are designed and implemented. We also describe the architecture of a prototype system that integrates the various components.published_or_final_versio
Information Visualization and Visual Data Mining
Data visualization is the graphical display of abstract information for two purposes: sense-making (also called data analysis) and communication. Important stories live in our data and data visualization is a powerful means to discover and understand these stories, and then to present them to others. In this paper, we propose a classification of information visualization and visual data mining techniques which is based on the data type to be visualized, the visualization technique and the interaction and distortion technique. We exemplify the classification using a few examples, most of them referring to techniques and systems presented in this special issue
- β¦