3,388 research outputs found

    Neural Attributed Community Search at Billion Scale

    Full text link
    Community search has been extensively studied in the past decades. In recent years, there is a growing interest in attributed community search that aims to identify a community based on both the query nodes and query attributes. A set of techniques have been investigated. Though the recent methods based on advanced learning models such as graph neural networks (GNNs) can achieve state-of-the-art performance in terms of accuracy, we notice that 1) they suffer from severe efficiency issues; 2) they directly model community search as a node classification problem and thus cannot make good use of interdependence among different entities in the graph. Motivated by these, in this paper, we propose a new neurAL attrIbuted Community sEarch model for large-scale graphs, termed ALICE. ALICE first extracts a candidate subgraph to reduce the search scope and subsequently predicts the community by the Consistency-aware Net , termed ConNet. Specifically, in the extraction phase, we introduce the density sketch modularity that uses a unified form to combine the strengths of two existing powerful modularities, i.e., classical modularity and density modularity. Based on the new modularity metric, we first adaptively obtain the candidate subgraph, formed by the k-hop neighbors of the query nodes, with the maximum modularity. Then, we construct a node-attribute bipartite graph to take attributes into consideration. After that, ConNet adopts a cross-attention encoder to encode the interaction between the query and the graph. The training of the model is guided by the structure-attribute consistency and the local consistency to achieve better performance. Extensive experiments over 11 real-world datasets including one billion-scale graph demonstrate the superiority of ALICE in terms of accuracy, efficiency, and scalability

    Efficient Unsupervised Community Search with Pre-trained Graph Transformer

    Full text link
    Community search has aroused widespread interest in the past decades. Among existing solutions, the learning-based models exhibit outstanding performance in terms of accuracy by leveraging labels to 1) train the model for community score learning, and 2) select the optimal threshold for community identification. However, labeled data are not always available in real-world scenarios. To address this notable limitation of learning-based models, we propose a pre-trained graph Transformer based community search framework that uses Zero label (i.e., unsupervised), termed TransZero. TransZero has two key phases, i.e., the offline pre-training phase and the online search phase. Specifically, in the offline pretraining phase, we design an efficient and effective community search graph transformer (CSGphormer) to learn node representation. To pre-train CSGphormer without the usage of labels, we introduce two self-supervised losses, i.e., personalization loss and link loss, motivated by the inherent uniqueness of node and graph topology, respectively. In the online search phase, with the representation learned by the pre-trained CSGphormer, we compute the community score without using labels by measuring the similarity of representations between the query nodes and the nodes in the graph. To free the framework from the usage of a label-based threshold, we define a new function named expected score gain to guide the community identification process. Furthermore, we propose two efficient and effective algorithms for the community identification process that run without the usage of labels. Extensive experiments over 10 public datasets illustrate the superior performance of TransZero regarding both accuracy and efficiency

    Massive Dirac fermions in moir\'e superlattices: a route towards topological flat minibands

    Full text link
    We demonstrate a generic mechanism to realize topological flat minibands by confining massive Dirac fermions in a periodic moir\'e potential, which can be achieved in a heterobilayer of transition metal dichalcogenides. We show that the topological phase can be protected by the symmetry of moir\'e potential and survive to arbitrarily large Dirac band gap. We take the MoTe2_2/WSe2_2 heterobilayer as an example and find that the topological phase can be driven by a vertical electric field. By projecting the Coulomb interaction onto the topological fat minibands, we identify a correlated Chern insulator at half filling and a quantum valley-spin Hall insulator at full filling which explains the topological states observed in the MoTe2_2/WSe2_2 in experiment. Our work clarifies the importance of Dirac structure for the topological minibands and unveils a general strategy to design topological moir\'e materials.Comment: 13 pages, 12 figure

    Diffractive Efficiency Prediction of Surface Relief Grating Waveguide Using Artificial Neural Network

    Get PDF
    This study aims to develop lightweight and comfortable wearable devices using surface-relief grating, which can be designed to meet different diffraction conditions. However, extensive calculations must be performed to obtain the impact of the variation in the structural dimensions. The finite element method is used to solve the diffractive efficiency and then replaced by trained artificial neural networks with a single hidden layer containing 25 neurons. By using raw data with geometric parameters as the features, the performance of the network is investigated with different numbers of raw data; in addition, the regression analysis shows a high R-value of approximately 0.999. The predicted results are compared with those calculated from the simulation. The diffraction efficiency tendencies vary with the different geometric parameters, which show a high level of agreement between the predicted and calculated data; this confirms that the proposed method supports and reduces the burden of extensive calculations

    Power of linkage analysis using traits generated from simulated longitudinal data of the Framingham Heart Study

    Get PDF
    The Framingham Heart Study is a very successful longitudinal research for cardiovascular diseases. The completion of a 10-cM genome scan in Framingham families provided an opportunity to evaluate linkage using longitudinal data. Several descriptive traits based on simulated longitudinal data from the Genetic Analysis Workshop 13 (GAW13) were generated, and linkage analyses were performed for these traits. We compared the power of detecting linkage for baseline and slope genes in the simulated data of GAW13 using these traits. We found that using longitudinal traits based on multiple follow-ups may not be more powerful than using cross-sectional traits for genetic linkage analysis
    • …
    corecore