2 research outputs found

    Review of Extreme Multilabel Classification

    Full text link
    Extreme multilabel classification or XML, is an active area of interest in machine learning. Compared to traditional multilabel classification, here the number of labels is extremely large, hence, the name extreme multilabel classification. Using classical one versus all classification wont scale in this case due to large number of labels, same is true for any other classifiers. Embedding of labels as well as features into smaller label space is an essential first step. Moreover, other issues include existence of head and tail labels, where tail labels are labels which exist in relatively smaller number of given samples. The existence of tail labels creates issues during embedding. This area has invited application of wide range of approaches ranging from bit compression motivated from compressed sensing, tree based embeddings, deep learning based latent space embedding including using attention weights, linear algebra based embeddings such as SVD, clustering, hashing, to name a few. The community has come up with a useful set of metrics to identify correctly the prediction for head or tail labels.Comment: 46 pages, 13 figure

    Improving Expressivity of Graph Neural Networks using Localization

    Full text link
    In this paper, we propose localized versions of Weisfeiler-Leman (WL) algorithms in an effort to both increase the expressivity, as well as decrease the computational overhead. We focus on the specific problem of subgraph counting and give localized versions of kβˆ’k-WL for any kk. We analyze the power of Local kβˆ’k-WL and prove that it is more expressive than kβˆ’k-WL and at most as expressive as (k+1)βˆ’(k+1)-WL. We give a characterization of patterns whose count as a subgraph and induced subgraph are invariant if two graphs are Local kβˆ’k-WL equivalent. We also introduce two variants of kβˆ’k-WL: Layer kβˆ’k-WL and recursive kβˆ’k-WL. These methods are more time and space efficient than applying kβˆ’k-WL on the whole graph. We also propose a fragmentation technique that guarantees the exact count of all induced subgraphs of size at most 4 using just 1βˆ’1-WL. The same idea can be extended further for larger patterns using k>1k>1. We also compare the expressive power of Local kβˆ’k-WL with other GNN hierarchies and show that given a bound on the time-complexity, our methods are more expressive than the ones mentioned in Papp and Wattenhofer[2022a]
    corecore