3,388 research outputs found
Neural Attributed Community Search at Billion Scale
Community search has been extensively studied in the past decades. In recent
years, there is a growing interest in attributed community search that aims to
identify a community based on both the query nodes and query attributes. A set
of techniques have been investigated. Though the recent methods based on
advanced learning models such as graph neural networks (GNNs) can achieve
state-of-the-art performance in terms of accuracy, we notice that 1) they
suffer from severe efficiency issues; 2) they directly model community search
as a node classification problem and thus cannot make good use of
interdependence among different entities in the graph. Motivated by these, in
this paper, we propose a new neurAL attrIbuted Community sEarch model for
large-scale graphs, termed ALICE. ALICE first extracts a candidate subgraph to
reduce the search scope and subsequently predicts the community by the
Consistency-aware Net , termed ConNet. Specifically, in the extraction phase,
we introduce the density sketch modularity that uses a unified form to combine
the strengths of two existing powerful modularities, i.e., classical modularity
and density modularity. Based on the new modularity metric, we first adaptively
obtain the candidate subgraph, formed by the k-hop neighbors of the query
nodes, with the maximum modularity. Then, we construct a node-attribute
bipartite graph to take attributes into consideration. After that, ConNet
adopts a cross-attention encoder to encode the interaction between the query
and the graph. The training of the model is guided by the structure-attribute
consistency and the local consistency to achieve better performance. Extensive
experiments over 11 real-world datasets including one billion-scale graph
demonstrate the superiority of ALICE in terms of accuracy, efficiency, and
scalability
Efficient Unsupervised Community Search with Pre-trained Graph Transformer
Community search has aroused widespread interest in the past decades. Among
existing solutions, the learning-based models exhibit outstanding performance
in terms of accuracy by leveraging labels to 1) train the model for community
score learning, and 2) select the optimal threshold for community
identification. However, labeled data are not always available in real-world
scenarios. To address this notable limitation of learning-based models, we
propose a pre-trained graph Transformer based community search framework that
uses Zero label (i.e., unsupervised), termed TransZero. TransZero has two key
phases, i.e., the offline pre-training phase and the online search phase.
Specifically, in the offline pretraining phase, we design an efficient and
effective community search graph transformer (CSGphormer) to learn node
representation. To pre-train CSGphormer without the usage of labels, we
introduce two self-supervised losses, i.e., personalization loss and link loss,
motivated by the inherent uniqueness of node and graph topology, respectively.
In the online search phase, with the representation learned by the pre-trained
CSGphormer, we compute the community score without using labels by measuring
the similarity of representations between the query nodes and the nodes in the
graph. To free the framework from the usage of a label-based threshold, we
define a new function named expected score gain to guide the community
identification process. Furthermore, we propose two efficient and effective
algorithms for the community identification process that run without the usage
of labels. Extensive experiments over 10 public datasets illustrate the
superior performance of TransZero regarding both accuracy and efficiency
Massive Dirac fermions in moir\'e superlattices: a route towards topological flat minibands
We demonstrate a generic mechanism to realize topological flat minibands by
confining massive Dirac fermions in a periodic moir\'e potential, which can be
achieved in a heterobilayer of transition metal dichalcogenides. We show that
the topological phase can be protected by the symmetry of moir\'e potential and
survive to arbitrarily large Dirac band gap. We take the MoTe/WSe
heterobilayer as an example and find that the topological phase can be driven
by a vertical electric field. By projecting the Coulomb interaction onto the
topological fat minibands, we identify a correlated Chern insulator at half
filling and a quantum valley-spin Hall insulator at full filling which explains
the topological states observed in the MoTe/WSe in experiment. Our work
clarifies the importance of Dirac structure for the topological minibands and
unveils a general strategy to design topological moir\'e materials.Comment: 13 pages, 12 figure
Diffractive Efficiency Prediction of Surface Relief Grating Waveguide Using Artificial Neural Network
This study aims to develop lightweight and comfortable wearable devices using surface-relief grating, which can be designed to meet different diffraction conditions. However, extensive calculations must be performed to obtain the impact of the variation in the structural dimensions. The finite element method is used to solve the diffractive efficiency and then replaced by trained artificial neural networks with a single hidden layer containing 25 neurons. By using raw data with geometric parameters as the features, the performance of the network is investigated with different numbers of raw data; in addition, the regression analysis shows a high R-value of approximately 0.999. The predicted results are compared with those calculated from the simulation. The diffraction efficiency tendencies vary with the different geometric parameters, which show a high level of agreement between the predicted and calculated data; this confirms that the proposed method supports and reduces the burden of extensive calculations
Power of linkage analysis using traits generated from simulated longitudinal data of the Framingham Heart Study
The Framingham Heart Study is a very successful longitudinal research for cardiovascular diseases. The completion of a 10-cM genome scan in Framingham families provided an opportunity to evaluate linkage using longitudinal data. Several descriptive traits based on simulated longitudinal data from the Genetic Analysis Workshop 13 (GAW13) were generated, and linkage analyses were performed for these traits. We compared the power of detecting linkage for baseline and slope genes in the simulated data of GAW13 using these traits. We found that using longitudinal traits based on multiple follow-ups may not be more powerful than using cross-sectional traits for genetic linkage analysis
- …