10 research outputs found

    Mining Representative Unsubstituted Graph Patterns Using Prior Similarity Matrix

    Full text link
    One of the most powerful techniques to study protein structures is to look for recurrent fragments (also called substructures or spatial motifs) and then use them as patterns to characterize the proteins under study. An emergent trend consists in parsing protein three-dimensional (3D) structures into graphs of amino acids. The search for recurrent spatial motifs is then formulated as a frequent subgraph discovery problem, where each subgraph represents a spatial motif. Several efficient approaches for frequent subgraph discovery have been proposed in the literature; however, the set of discovered frequent subgraphs is too large to be efficiently analyzed and explored in any further process. In this paper, we propose a novel pattern selection approach that shrinks the large number of discovered frequent subgraphs by selecting representative ones. Existing pattern selection approaches do not exploit domain knowledge, whereas our approach incorporates the evolutionary information of amino acids encoded in substitution matrices in order to select representative subgraphs. We show the effectiveness of our approach on a number of real datasets. The experimental results show that our approach considerably decreases the number of motifs while enhancing their interestingness.
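
    As a rough illustration of this kind of representative-pattern selection (not the authors' algorithm), the Python sketch below reduces each frequent subgraph to its list of amino-acid node labels, scores pairs of subgraphs with a toy stand-in for a substitution matrix, and greedily keeps a subgraph only if no already-kept representative is similar enough to it. The SUBSTITUTION table, the threshold, and the greedy rule are all illustrative assumptions.

    # A toy stand-in for a real amino-acid substitution matrix (e.g. BLOSUM62);
    # only a handful of hypothetical label pairs are scored here.
    SUBSTITUTION = {
        ("L", "L"): 4, ("L", "I"): 2, ("I", "I"): 4,
        ("K", "K"): 5, ("K", "R"): 2, ("R", "R"): 5,
        ("G", "G"): 6,
    }

    def label_score(a, b):
        # Symmetric lookup with a default penalty for unrelated labels.
        return SUBSTITUTION.get((a, b), SUBSTITUTION.get((b, a), -1))

    def subgraph_similarity(sg1, sg2):
        # Average best-match substitution score between the node labels of two
        # frequent subgraphs, each given simply as a list of labels.
        if not sg1 or not sg2:
            return 0.0
        best = [max(label_score(a, b) for b in sg2) for a in sg1]
        return sum(best) / len(best)

    def select_representatives(subgraphs, threshold=2.0):
        # Greedy selection: keep a subgraph only if no already-selected
        # representative is "similar enough" to it under the prior matrix.
        representatives = []
        for sg in sorted(subgraphs, key=len, reverse=True):
            if all(subgraph_similarity(sg, rep) < threshold for rep in representatives):
                representatives.append(sg)
        return representatives

    if __name__ == "__main__":
        frequent = [["L", "K", "G"], ["I", "R", "G"], ["L", "I"], ["K", "R"]]
        print(select_representatives(frequent))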

    Adversarial Attacks on Code Models with Discriminative Graph Patterns

    Full text link
    Pre-trained language models of code are now widely used in various software engineering tasks such as code generation, code completion, and vulnerability detection. This, in turn, poses security and reliability risks to these models. One of the important threats is adversarial attacks, which can lead to erroneous predictions and largely affect model performance on downstream tasks. Current adversarial attacks on code models usually adopt fixed sets of program transformations, such as variable renaming and dead code insertion, leading to limited attack effectiveness. To address these challenges, we propose a novel adversarial attack framework, GraphCodeAttack, to better evaluate the robustness of code models. Given a target code model, GraphCodeAttack automatically mines important code patterns that can influence the model's decisions and uses them to perturb the structure of the model's input code. To do so, GraphCodeAttack uses a set of input source codes to probe the model's outputs and identifies the discriminative AST patterns that can influence the model's decisions. GraphCodeAttack then selects appropriate AST patterns, concretizes the selected patterns as attacks, and inserts them as dead code into the model's input program. To effectively synthesize attacks from AST patterns, GraphCodeAttack uses a separate pre-trained code model to fill in the ASTs with concrete code snippets. We evaluate the robustness of two popular code models, CodeBERT and GraphCodeBERT, against our proposed approach on three tasks: Authorship Attribution, Vulnerability Prediction, and Clone Detection. The experimental results suggest that our proposed approach significantly outperforms state-of-the-art approaches such as CARROT and ALERT in attacking code models.
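
    The dead-code-insertion step lends itself to a small sketch. The snippet below is a hedged illustration, not GraphCodeAttack itself: the mined AST pattern is replaced by a hand-written dead-code line, and victim_predict is a hypothetical placeholder for the target model (the real framework concretizes mined patterns with a pre-trained code model and probes an actual classifier).

    def victim_predict(source_code: str) -> int:
        # Hypothetical placeholder for the target code model (e.g. a fine-tuned
        # CodeBERT classifier); returns a fake label so the sketch runs end to end.
        return 1 if "unused_flag" in source_code else 0

    def insert_dead_code(source: str, dead_code: str) -> str:
        # Insert the concretized pattern as dead code right after each function
        # header, preserving indentation so the program still parses.
        out = []
        for line in source.splitlines():
            out.append(line)
            if line.lstrip().startswith("def ") and line.rstrip().endswith(":"):
                indent = " " * (len(line) - len(line.lstrip()) + 4)
                out.append(indent + "if False:  # dead code, never executed")
                out.append(indent + "    " + dead_code)
        return "\n".join(out)

    if __name__ == "__main__":
        target = "def add(a, b):\n    return a + b\n"
        original_label = victim_predict(target)
        attacked = insert_dead_code(target, "unused_flag = 0")
        print(attacked)
        print("prediction changed:", victim_predict(attacked) != original_label)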

    Explainable Classification of Brain Networks via Contrast Subgraphs

    Full text link
    Mining human-brain networks to discover patterns that discriminate between healthy individuals and patients affected by some neurological disorder is a fundamental task in neuroscience. Learning simple and interpretable models is as important as mere classification accuracy. In this paper we introduce a novel approach for classifying brain networks based on extracting contrast subgraphs, i.e., a set of vertices whose induced subgraph is dense in one class of graphs and sparse in the other. We formally define the problem and present an algorithmic solution for extracting contrast subgraphs. We then apply our method to a brain-network dataset consisting of children affected by Autism Spectrum Disorder and typically developed children. Our analysis confirms the interestingness of the discovered patterns, which match background knowledge in the neuroscience literature. Further analysis on other classification tasks confirms the simplicity, soundness, and high explainability of our proposal, which also exhibits superior classification accuracy compared to more complex state-of-the-art methods.
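
    A minimal sketch of the contrast-subgraph intuition, under the simplifying assumption that a greedy heuristic over the difference of the two class-average adjacency matrices is good enough (the paper gives an exact formulation and a dedicated algorithm); the toy brain networks below are synthetic.

    import numpy as np

    def contrast_subgraph_greedy(graphs_a, graphs_b):
        # graphs_a / graphs_b: lists of symmetric 0/1 adjacency matrices over the
        # same vertex set. Work on the difference of the class-average matrices.
        diff = np.mean(graphs_a, axis=0) - np.mean(graphs_b, axis=0)
        n = diff.shape[0]
        # Seed with the vertex whose total difference-weight is largest, then
        # greedily add the vertex with the best marginal gain towards the set.
        selected = [int(np.argmax(diff.sum(axis=1)))]
        remaining = set(range(n)) - set(selected)
        while remaining:
            gains = {v: float(sum(diff[v, u] for u in selected)) for v in remaining}
            best = max(gains, key=gains.get)
            if gains[best] <= 0:
                break
            selected.append(best)
            remaining.remove(best)
        return sorted(selected)

    if __name__ == "__main__":
        rng = np.random.default_rng(0)
        n = 8

        def sample(dense_block):
            # Toy brain networks: class A has a dense block on vertices 0-3.
            m = (rng.random((n, n)) < 0.1).astype(float)
            if dense_block:
                m[:4, :4] = (rng.random((4, 4)) < 0.9).astype(float)
            m = np.triu(m, 1)
            return m + m.T

        graphs_a = [sample(True) for _ in range(20)]
        graphs_b = [sample(False) for _ in range(20)]
        print("contrast subgraph vertices:", contrast_subgraph_greedy(graphs_a, graphs_b))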

    Discriminative frequent subgraph mining with optimality guarantees

    No full text
    The goal of frequent subgraph mining is to detect subgraphs that occur frequently in a dataset of graphs. In classification settings, one is often interested in discovering discriminative frequent subgraphs, whose presence or absence is indicative of the class membership of a graph. In this article, we propose an approach to feature selection on frequent subgraphs, called CORK, that combines two central advantages. First, it optimizes a submodular quality criterion, which means that greedy feature selection yields a near-optimal solution. Second, the submodular quality criterion can be integrated into gSpan, the state-of-the-art tool for frequent subgraph mining, and helps to prune the search space for discriminative frequent subgraphs even during frequent subgraph mining.
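
    Greedy selection under a CORK-like submodular criterion can be sketched as follows. A "correspondence" is taken here to be a pair of graphs from different classes whose selected binary subgraph indicators are identical; the feature matrix below is synthetic, and the actual method is interleaved with gSpan's pattern search rather than run on a precomputed matrix, so this is an assumption-laden illustration only.

    from itertools import combinations

    def correspondences(features, labels, selected):
        # Count cross-class pairs of graphs that have identical values on the
        # selected binary subgraph-indicator features.
        count = 0
        for i, j in combinations(range(len(labels)), 2):
            if labels[i] != labels[j] and all(
                    features[i][f] == features[j][f] for f in selected):
                count += 1
        return count

    def greedy_cork(features, labels, k):
        # Greedily pick up to k features, each time taking the one that removes
        # the most remaining correspondences (near-optimal by submodularity).
        selected, n_feats = [], len(features[0])
        for _ in range(k):
            current = correspondences(features, labels, selected)
            gains = {f: current - correspondences(features, labels, selected + [f])
                     for f in range(n_feats) if f not in selected}
            best = max(gains, key=gains.get)
            if gains[best] <= 0:
                break
            selected.append(best)
        return selected

    if __name__ == "__main__":
        # Rows: graphs; columns: presence/absence of candidate frequent subgraphs.
        features = [[1, 0, 1], [1, 1, 1], [0, 0, 1], [0, 1, 0]]
        labels = [1, 1, 0, 0]
        print("selected subgraph features:", greedy_cork(features, labels, k=2))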

    Similarity search applications in medical images

    Get PDF

    Data-Mining Techniques for Call-Graph-Based Software-Defect Localisation

    Get PDF
    Defect localisation is an important problem in software engineering. This dissertation investigates call-graph-mining-based software defect localisation, which supports software developers by providing hints about where defects might be located. It extends the state of the art by proposing new graph representations and mining techniques for weighted graphs. This leads to a broader range of detectable defects, increased localisation precision, and enhanced scalability.
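
    A toy sketch of the weighted-call-graph idea, assuming each execution is summarised as a map from call edges to call frequencies; the suspiciousness score below (largest failing-vs-passing frequency gap over a method's incoming edges) is an illustrative simplification, not the dissertation's actual graph-mining techniques.

    from collections import defaultdict

    def suspiciousness(passing_runs, failing_runs):
        # Each run is a dict mapping call edges (caller, callee) -> call frequency.
        # A method is scored by the largest failing-vs-passing frequency gap over
        # its incoming call edges.
        edges = set()
        for run in passing_runs + failing_runs:
            edges.update(run)

        def mean_weight(runs, edge):
            return sum(r.get(edge, 0) for r in runs) / max(len(runs), 1)

        method_score = defaultdict(float)
        for caller, callee in edges:
            gap = abs(mean_weight(failing_runs, (caller, callee))
                      - mean_weight(passing_runs, (caller, callee)))
            method_score[callee] = max(method_score[callee], gap)
        return sorted(method_score.items(), key=lambda kv: kv[1], reverse=True)

    if __name__ == "__main__":
        passing = [{("main", "parse"): 3, ("parse", "validate"): 3}] * 5
        failing = [{("main", "parse"): 3, ("parse", "validate"): 40}] * 5
        for method, score in suspiciousness(passing, failing):
            print(f"{method}: {score:.1f}")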