153,676 research outputs found

    M2HGCL: Multi-Scale Meta-Path Integrated Heterogeneous Graph Contrastive Learning

    Full text link
    Inspired by the successful application of contrastive learning on graphs, researchers attempt to impose graph contrastive learning approaches on heterogeneous information networks. Orthogonal to homogeneous graphs, the types of nodes and edges in heterogeneous graphs are diverse so that specialized graph contrastive learning methods are required. Most existing methods for heterogeneous graph contrastive learning are implemented by transforming heterogeneous graphs into homogeneous graphs, which may lead to ramifications that the valuable information carried by non-target nodes is undermined thereby exacerbating the performance of contrastive learning models. Additionally, current heterogeneous graph contrastive learning methods are mainly based on initial meta-paths given by the dataset, yet according to our deep-going exploration, we derive empirical conclusions: only initial meta-paths cannot contain sufficiently discriminative information; and various types of meta-paths can effectively promote the performance of heterogeneous graph contrastive learning methods. To this end, we propose a new multi-scale meta-path integrated heterogeneous graph contrastive learning (M2HGCL) model, which discards the conventional heterogeneity-homogeneity transformation and performs the graph contrastive learning in a joint manner. Specifically, we expand the meta-paths and jointly aggregate the direct neighbor information, the initial meta-path neighbor information and the expanded meta-path neighbor information to sufficiently capture discriminative information. A specific positive sampling strategy is further imposed to remedy the intrinsic deficiency of contrastive learning, i.e., the hard negative sample sampling issue. Through extensive experiments on three real-world datasets, we demonstrate that M2HGCL outperforms the current state-of-the-art baseline models.Comment: Accepted to the conference of ADMA2023 as an Oral presentatio

    Logic Programs as Declarative and Procedural Bias in Inductive Logic Programming

    Get PDF
    Machine Learning is necessary for the development of Artificial Intelligence, as pointed out by Turing in his 1950 article ``Computing Machinery and Intelligence''. It is in the same article that Turing suggested the use of computational logic and background knowledge for learning. This thesis follows a logic-based machine learning approach called Inductive Logic Programming (ILP), which is advantageous over other machine learning approaches in terms of relational learning and utilising background knowledge. ILP uses logic programs as a uniform representation for hypothesis, background knowledge and examples, but its declarative bias is usually encoded using metalogical statements. This thesis advocates the use of logic programs to represent declarative and procedural bias, which results in a framework of single-language representation. We show in this thesis that using a logic program called the top theory as declarative bias leads to a sound and complete multi-clause learning system MC-TopLog. It overcomes the entailment-incompleteness of Progol, thus outperforms Progol in terms of predictive accuracies on learning grammars and strategies for playing Nim game. MC-TopLog has been applied to two real-world applications funded by Syngenta, which is an agriculture company. A higher-order extension on top theories results in meta-interpreters, which allow the introduction of new predicate symbols. Thus the resulting ILP system Metagol can do predicate invention, which is an intrinsically higher-order logic operation. Metagol also leverages the procedural semantic of Prolog to encode procedural bias, so that it can outperform both its ASP version and ILP systems without an equivalent procedural bias in terms of efficiency and accuracy. This is demonstrated by the experiments on learning Regular, Context-free and Natural grammars. Metagol is also applied to non-grammar learning tasks involving recursion and predicate invention, such as learning a definition of staircases and robot strategy learning. Both MC-TopLog and Metagol are based on a ⊀\top-directed framework, which is different from other multi-clause learning systems based on Inverse Entailment, such as CF-Induction, XHAIL and IMPARO. Compared to another ⊀\top-directed multi-clause learning system TAL, Metagol allows the explicit form of higher-order assumption to be encoded in the form of meta-rules.Open Acces

    Attention Graph for Multi-Robot Social Navigation with Deep Reinforcement Learning

    Full text link
    Learning robot navigation strategies among pedestrian is crucial for domain based applications. Combining perception, planning and prediction allows us to model the interactions between robots and pedestrians, resulting in impressive outcomes especially with recent approaches based on deep reinforcement learning (RL). However, these works do not consider multi-robot scenarios. In this paper, we present MultiSoc, a new method for learning multi-agent socially aware navigation strategies using RL. Inspired by recent works on multi-agent deep RL, our method leverages graph-based representation of agent interactions, combining the positions and fields of view of entities (pedestrians and agents). Each agent uses a model based on two Graph Neural Network combined with attention mechanisms. First an edge-selector produces a sparse graph, then a crowd coordinator applies node attention to produce a graph representing the influence of each entity on the others. This is incorporated into a model-free RL framework to learn multi-agent policies. We evaluate our approach on simulation and provide a series of experiments in a set of various conditions (number of agents / pedestrians). Empirical results show that our method learns faster than social navigation deep RL mono-agent techniques, and enables efficient multi-agent implicit coordination in challenging crowd navigation with multiple heterogeneous humans. Furthermore, by incorporating customizable meta-parameters, we can adjust the neighborhood density to take into account in our navigation strategy

    Exploiting Style Transfer-based Task Augmentation for Cross-Domain Few-Shot Learning

    Full text link
    In cross-domain few-shot learning, the core issue is that the model trained on source domains struggles to generalize to the target domain, especially when the domain shift is large. Motivated by the observation that the domain shift between training tasks and target tasks usually can reflect in their style variation, we propose Task Augmented Meta-Learning (TAML) to conduct style transfer-based task augmentation to improve the domain generalization ability. Firstly, Multi-task Interpolation (MTI) is introduced to fuse features from multiple tasks with different styles, which makes more diverse styles available. Furthermore, a novel task-augmentation strategy called Multi-Task Style Transfer (MTST) is proposed to perform style transfer on existing tasks to learn discriminative style-independent features. We also introduce a Feature Modulation module (FM) to add random styles and improve generalization of the model. The proposed TAML increases the diversity of styles of training tasks, and contributes to training a model with better domain generalization ability. The effectiveness is demonstrated via theoretical analysis and thorough experiments on two popular cross-domain few-shot benchmarks

    Neural Interactive Collaborative Filtering

    Full text link
    In this paper, we study collaborative filtering in an interactive setting, in which the recommender agents iterate between making recommendations and updating the user profile based on the interactive feedback. The most challenging problem in this scenario is how to suggest items when the user profile has not been well established, i.e., recommend for cold-start users or warm-start users with taste drifting. Existing approaches either rely on overly pessimistic linear exploration strategy or adopt meta-learning based algorithms in a full exploitation way. In this work, to quickly catch up with the user's interests, we propose to represent the exploration policy with a neural network and directly learn it from the feedback data. Specifically, the exploration policy is encoded in the weights of multi-channel stacked self-attention neural networks and trained with efficient Q-learning by maximizing users' overall satisfaction in the recommender systems. The key insight is that the satisfied recommendations triggered by the exploration recommendation can be viewed as the exploration bonus (delayed reward) for its contribution on improving the quality of the user profile. Therefore, the proposed exploration policy, to balance between learning the user profile and making accurate recommendations, can be directly optimized by maximizing users' long-term satisfaction with reinforcement learning. Extensive experiments and analysis conducted on three benchmark collaborative filtering datasets have demonstrated the advantage of our method over state-of-the-art methods

    심측 신경망 검색 기법을 μ‚¬μš©ν•œ 이미지 볡원

    Get PDF
    ν•™μœ„λ…Όλ¬Έ(박사) -- μ„œμšΈλŒ€ν•™κ΅λŒ€ν•™μ› : κ³΅κ³ΌλŒ€ν•™ 전기·정보곡학뢀, 2021.8. μ•ˆμ€€μ˜.Image restoration is an important technology which can be used as a pre-processing step to increase the performances of various vision tasks. Image super-resolution is one of the important task in image restoration which restores a high-resolution (HR) image from low-resolution (LR) observation. The recent progress of deep convolutional neural networks has enabled great success in single image super-resolution (SISR). its performance is also being increased by deepening the networks and developing more sophisticated network structures. However, finding an optimal structure for the given problem is a difficult task, even for human experts. For this reason, neural architecture search (NAS) methods have been introduced, which automate the procedure of constructing the structures. In this dissertation, I propose a new single image super-resolution framework by using neural architecture search (NAS) method. As the performance improves, the network becomes more complex and deeper, so I apply NAS algorithm to find the optimal network while reducing the effort in network design. In detail, the proposed scheme is summarized to three topics: image super-resolution using efficient neural architecture search, multi-branch neural architecture search for lightweight image super-resolution, and neural architecture search for image super-resolution using meta-transfer learning. At first, I expand the NAS to the super-resolution domain and find a lightweight densely connected network named DeCoNASNet. I use a hierarchical search strategy to find the best connection with local and global features. In this process, I define a complexity-based-penalty and add it to the reward term of REINFORCE algorithm. Experiments show that my DeCoNASNet outperforms the state-of-the-art lightweight super-resolution networks designed by handcraft methods and existing NAS-based design. I propose a new search space design with multi-branch structure to enlarge the search space for capturing multi-scale features, resulting in better reconstruction on grainy areas. I also adopt parameter sharing scheme in multi-branch network to share their information and reduce the whole network parameter. Experiments show that the proposed method finds an optimal SISR network about twenty times faster than the existing methods, while showing comparable performance in terms of PSNR vs. parameters. Comparison of visual quality validates that the proposed SISR network reconstructs texture areas better than the previous methods because of the enlarged search space to find multi-scale features. Lastly, I apply meta-transfer learning to the NAS procedure for image super-resolution. I train the controller and child network with the meta-learning scheme, which enables the controllers to find promising network for several scale simultaneously. Furthermore, meta-trained child network is reused as the pre-trained parameters for final evaluation phase to improve the final image super-resolution results even better and search-evaluation gap problem is efficiently reduced.이미지 볡원은 λ‹€μ–‘ν•œ μ˜μƒμ²˜λ¦¬ 문제의 μ„±λŠ₯을 높이기 μœ„ν•œ μ „ 처리 λ‹¨κ³„λ‘œ μ‚¬μš©ν•  수 μžˆλŠ” μ€‘μš”ν•œ κΈ°μˆ μ΄λ‹€. 이미지 κ³ ν•΄μƒλ„ν™”λŠ” 이미지 볡원방법 쀑 μ€‘μš”ν•œ 문제의 ν•˜λ‚˜λ‘œμ¨ μ €ν•΄μƒλ„μ˜ 이미지λ₯Ό κ³ ν•΄μƒλ„μ˜ μ΄λ―Έμ§€λ‘œ λ³΅μ›ν•˜λŠ” 방법이닀. μ΅œκ·Όμ—λŠ” μ»¨λ²Œλ£¨μ…˜ 신경망 (CNN)을 μ‚¬μš©ν•˜λŠ” λ”₯ λŸ¬λ‹(deep learning) 기반의 방법듀이 단일 이미지 고해상도화 (SISR) 문제λ₯Ό ν‘ΈλŠ”λ° 많이 μ‚¬μš©λ˜κ³  μžˆλ‹€. 일반적으둜 이미지 고해상도화 μ„±λŠ₯은 CNN을 깊게 μŒ“κ±°λ‚˜ λ³΅μž‘ν•œ ꡬ쑰λ₯Ό μ„€κ³„ν•¨μœΌλ‘œμ¨ ν–₯μƒμ‹œν‚¬ 수 μžˆλ‹€. κ·ΈλŸ¬λ‚˜ 주어진 λ¬Έμ œμ— λŒ€ν•œ 졜적의 ꡬ쑰λ₯Ό μ°ΎλŠ” 것은 ν•΄λ‹Ή λΆ„μ•Όμ˜ 전문가라 ν•  지라도 μ–΄λ ΅κ³  μ‹œκ°„μ΄ 였래 κ±Έλ¦¬λŠ” μž‘μ—…μ΄λ‹€. μ΄λŸ¬ν•œ 이유둜 신경망 ꡬ좕 절차λ₯Ό μžλ™ν™”ν•˜λŠ” 신경망 ꡬ쑰 검색 (NAS) 방법이 λ„μž…λ˜μ—ˆλ‹€. 이 λ…Όλ¬Έμ—μ„œλŠ” 신경망 ꡬ쑰 검색 (NAS) 방법을 μ‚¬μš©ν•˜μ—¬ μƒˆλ‘œμš΄ 단일 이미지 고해상도화 방법을 μ œμ•ˆν•˜μ˜€λ‹€. 이 λ…Όλ¬Έμ—μ„œ μ œμ•ˆν•œ 방법은 크게 μ„Έ κ°€μ§€λ‘œ μš”μ•½ ν•  수 μžˆλ‹€. μ΄λŠ” 효율적인 신경망 검색기법(ENAS)을 μ΄μš©ν•œ 이미지 고해상도화, 병렬 신경망 검색 기법을 μ΄μš©ν•œ 이미지 고해상도화, 메타 전솑 ν•™μŠ΅μ„ μ΄μš©ν•˜λŠ” 신경망 검색기법을 ν†΅ν•œ 이미지 고해상도화 이닀. μš°μ„ , μš°λ¦¬λŠ” 주둜 μ˜μƒ λΆ„λ₯˜μ— μ“°μ΄λ˜ 신경망 검색 기법을 μ˜μƒ 고해상도화에 μ μš©ν•˜μ˜€μœΌλ©°, DeCoNASNet이라 λͺ…λͺ…λœ 신경망 ꡬ쑰λ₯Ό μ„€κ³„ν•˜μ˜€λ‹€. λ˜ν•œ 계측적 검색 μ „λž΅μ„ μ‚¬μš©ν•˜μ—¬ 지역/μ „μ—­ 피쳐(feature) 합병을 μœ„ν•œ μ΅œμƒμ˜ μ—°κ²° 방법을 κ²€μƒ‰ν•˜μ˜€λ‹€. 이 κ³Όμ •μ—μ„œ ν•„μš” λ³€μˆ˜κ°€ μ μœΌλ©΄μ„œ 쒋은 μ„±λŠ₯을 λ‚Ό 수 μžˆλ„λ‘ λ³΅μž‘μ„± 기반 νŽ˜λ„ν‹° (complexity-based penalty) λ₯Ό μ •μ˜ν•˜κ³  이λ₯Ό REINFORCE μ•Œκ³ λ¦¬μ¦˜μ˜ 보상 μ‹ ν˜Έμ— μΆ”κ°€ν•˜μ˜€λ‹€. μ‹€ν—˜ κ²°κ³Ό DeCoNASNet은 기쑴의 μ‚¬λžŒμ΄ 직접 μ„€κ³„ν•œ 신경망과 신경망 검색 기법을 기반으둜 μ„€κ³„λœ 졜근의 고해상도화 ꡬ쑰의 μ„±λŠ₯을 λŠ₯κ°€ν•˜λŠ” 것을 확인 ν•  수 μžˆμ—ˆλ‹€. μš°λ¦¬λŠ” λ˜ν•œ μ—¬λŸ¬ 크기의 피쳐(feature)λ₯Ό ν•™μŠ΅ν•˜κΈ° μœ„ν•΄ 신경망 검색 κΈ°λ²•μ˜ 검색 곡간을 ν™•λŒ€ν•˜μ—¬ 병렬 신경망을 μ„€κ³„ν•˜λŠ” 방법을 μ œμ•ˆν•˜μ˜€λ‹€. 이 λ•Œ, λ³‘λ ¬μ‹ κ²½λ§μ˜ 각 μœ„μΉ˜μ—μ„œ 맀개 λ³€μˆ˜λ₯Ό κ³΅μœ ν•  수 μžˆλ„λ‘ ν•˜μ—¬ λ³‘λ ¬μ‹ κ²½λ§μ˜ 각 ꡬ쑰끼리 정보λ₯Ό κ³΅μœ ν•˜κ³  전체 ꡬ쑰λ₯Ό μ„€κ³„ν•˜λŠ”λ° ν•„μš”ν•œ 맀개 λ³€μˆ˜λ₯Ό 쀄이도둝 ν•˜μ˜€λ‹€. μ‹€ν—˜ κ²°κ³Ό μ œμ•ˆλœ 방법을 톡해 맀개 λ³€μˆ˜ 크기 λŒ€λΉ„ μ„±λŠ₯이 쒋은 신경망 ꡬ쑰λ₯Ό 찾을 수 μžˆμ—ˆλ‹€. μ‹€ν—˜ κ²°κ³Όλ₯Ό 톡해 ν™•μž₯된 검색 κ³΅κ°„μ—μ„œ μ—¬λŸ¬ 크기의 피쳐 (feature)λ₯Ό ν•™μŠ΅ν•˜μ˜€κΈ° λ•Œλ¬Έμ— 이전 방법보닀 λ³΅μž‘ν•œ μ˜μ—­μ„ 더 잘 λ³΅μ›ν•˜λŠ” 것을 ν™•μΈν•˜μ˜€λ‹€. λ§ˆμ§€λ§‰μœΌλ‘œ 메타 전솑 ν•™μŠ΅(meta-transfer learning)을 신경망 검색에 μ μš©ν•˜μ—¬ λ‹€μ–‘ν•œ 크기의 이미지 고해상도화 문제λ₯Ό ν•΄κ²°ν•˜λŠ” 방법을 μ œμ•ˆν•˜μ˜€λ‹€. 이 λ…Όλ¬Έμ—μ„œλŠ” 메타 전솑 ν•™μŠ΅ 방법을 톡해 μ œμ–΄κΈ°κ°€ μ—¬λŸ¬ 크기의 쒋은 신경망 ꡬ쑰λ₯Ό λ™μ‹œμ— 찾을 수 μžˆλ„λ‘ μ„€κ³„ν•˜μ˜€λ‹€. λ˜ν•œ 메타 ν›ˆλ ¨λœ 신경망 κ΅¬μ‘°λŠ” μ΅œμ’… μ„±λŠ₯ 평가 μ‹œ ν•™μŠ΅μ˜ μ‹œμž‘μ μœΌλ‘œ μž¬μ‚¬μš© λ˜μ–΄ μ΅œμ’… 이미지 고해상도화 μ„±λŠ₯을 λ”μš± ν–₯μƒμ‹œν‚¬ 수 μžˆμ—ˆμœΌλ©°, 효과적으둜 검색-평가 괴리 문제λ₯Ό ν•΄κ²°ν•˜μ˜€λ‹€.1 INTRODUCTION 1 1.1 contribution 3 1.2 contents 4 2 Neural Architecture Search for Image Super-Resolution Using Densely Constructed Search Space: DeCoNAS 5 2.1 Introduction 5 2.2 Proposed Method 9 2.2.1 Overall structure of DeCoNASNet 9 2.2.2 Constructing the DNB 11 2.2.3 Constructing controller for the DeCoNASNet 13 2.2.4 Training DeCoNAS and complexity-based penalty 13 2.3 Experimental results 15 2.3.1 Settings 15 2.3.2 Results 16 2.3.3 Ablation study 21 2.4 Summary 22 3 Multi-Branch Neural Architecture Search for Lightweight Image Super-resolution 23 3.1 Introduction 23 3.2 Related Work 26 3.2.1 Single image super-resolution 26 3.2.2 Neural architecture search 27 3.2.3 Image super-resolution with neural architecture search 29 3.3 Method 32 3.3.1 Overview of the Proposed MBNAS 32 3.3.2 Controller and complexity-based penalty 33 3.3.3 MBNASNet 35 3.3.4 Multi-scale block with partially shared Nodes 37 3.3.5 MBNAS 38 3.4 datasets and experiments 39 3.4.1 Settings 39 3.4.2 Experiments on single image super-resolution (SISR) 41 3.5 Discussion 48 3.5.1 Effect of the complexity-based penalty to the performance of controller 49 3.5.2 Effect of multi-branch structure and partial parameter sharing scheme 50 3.5.3 Effect of gradient flow control weights and complexity-based penalty coefficient 51 3.6 Summary 52 4 Meta-transfer learning for simultaneous search of various scale image super-resolution 54 4.1 Introduction 54 4.2 Related Work 56 4.2.1 Single image super-resolution 56 4.2.2 Neural architecture search 57 4.2.3 Image super-resolution with neural architecture search 58 4.2.4 Meta-learning 59 4.3 Method 59 4.3.1 Meta-learning 60 4.3.2 Meta-transfer learning 62 4.3.3 Transfer-learning 63 4.4 datasets and experiments 63 4.4.1 Settings 63 4.4.2 Experiments on single image super-resolution(SISR) 64 4.5 Summary 66 5 Conclusion 69 Abstract (In Korean) 80λ°•

    Spiking Generative Adversarial Networks With a Neural Network Discriminator: Local Training, Bayesian Models, and Continual Meta-Learning

    Get PDF
    Neuromorphic data carries information in spatio-temporal patterns encoded by spikes. Accordingly, a central problem in neuromorphic computing is training spiking neural networks (SNNs) to reproduce spatio-temporal spiking patterns in response to given spiking stimuli. Most existing approaches model the input-output behavior of an SNN in a deterministic fashion by assigning each input to a specific desired output spiking sequence. In contrast, in order to fully leverage the time-encoding capacity of spikes, this work proposes to train SNNs so as to match distributions of spiking signals rather than individual spiking signals. To this end, the paper introduces a novel hybrid architecture comprising a conditional generator, implemented via an SNN, and a discriminator, implemented by a conventional artificial neural network (ANN). The role of the ANN is to provide feedback during training to the SNN within an adversarial iterative learning strategy that follows the principle of generative adversarial network (GANs). In order to better capture multi-modal spatio-temporal distribution, the proposed approach -- termed SpikeGAN -- is further extended to support Bayesian learning of the generator's weight. Finally, settings with time-varying statistics are addressed by proposing an online meta-learning variant of SpikeGAN. Experiments bring insights into the merits of the proposed approach as compared to existing solutions based on (static) belief networks and maximum likelihood (or empirical risk minimization)
    • …
    corecore