13 research outputs found

    Exploratory factor analysis of graphical features for link prediction in social networks

    Get PDF
    Social Networks attract much attention due to their ability to replicate social interactions at scale. Link prediction, or the assessment of which unconnected nodes are likely to connect in the future, is an interesting but non-trivial research area. Three approaches exist to deal with the link prediction problem: feature-based models, Bayesian probabilistic models, probabilistic relational models. In feature-based methods, graphical features are extracted and used for classification. Usually, these features are subdivided into three feature groups based on their formula. Some formulas are extracted based on neighborhood graph traverse. Accordingly, there exists three groups of features, neighborhood features, path-based features, node-based features. In this paper, we attempt to validate the underlying structure of topological features used in feature-based link prediction. The results of our analysis indicate differing results from the prevailing grouping of these features, which indicates that current literatures\u27 classification of feature groups should be redefined. Thus, the contribution of this work is exploring the factor loading of graphical features in link prediction in social networks. To the best of our knowledge, there is no prior studies had addressed it

    Classification of multi-class imbalanced data streams using a dynamic data-balancing technique

    No full text
    The performance of classification algorithms with imbalanced streaming data depends upon efficient re-balancing strategy for learning tasks. The difficulty becomes more elevated with multi-class highly imbalanced streaming data. In this paper, we investigate the multi-class imbalance problem in data streams and develop an adaptive framework to cope with imbalanced data scenarios. The proposed One-Vs-All Adaptive Window re-Balancing with Retain Knowledge (OVA-AWBReK) classification framework will combine OVA binarization with Automated Re-balancing Strategy (ARS) using Racing Algorithm (RA). We conducted experiments on highly imbalanced datasets to demonstrate the use of the proposed OVA-AWBReK framework. The results show that OVA-AWBReK framework can enhance the classification performance of the multi-class highly imbalanced data
    corecore