Search CORE

14 research outputs found

Analyse de code en boîte noire pour la rétro ingénierie via acquisition de contraintes et synthèse de code

Author: Menguy Grégoire
Publication venue: HAL CCSD
Publication date: 14/03/2023
Field of study

Software always becomes larger and more complex, making crucial tasks like code testing, verification, or code understanding highly difficult for humans. Hence the need for methods to reason about code automatically. These are usually white-box, and use the code syntax to deduce its properties. While they have proven very powerful, they also show limitations: they need the source code, the code size and the data structures' complexity degrade their efficiency, they are highly impacted by syntactic code complexity amplified by optimizations obfuscations. This thesis explores how black-box code analysis can infer valuable properties for reverse engineering through data-driven learning. First, we consider the function contracts inference problem, which aims to infer over which inputs a code function can be executed to get good behaviors only. We extend the constraint acquisition learning framework, notably solving one of its major flaws: the dependency on a human user. It leads to PreCA, the first black-box approach enjoying clear theoretical guarantees. It makes PreCA especially suitable for development uses. Second, we consider the deobfuscation problem, which aims to simplify obfuscated code. Our proposal, Xyntia, synthesizes code block semantics through S-metaheuristics to offer an understandable version of the code. Xyntia significantly improves the state-of-the-art in terms of robustness and speed. In addition, we propose the two first protections efficient against black-box deobfuscation.Les logiciels sont de plus en plus grands et complexes. Ainsi, certaines tâches comme le test et la vérification de code, ou la compréhension de code, sont de plus en plus difficiles à réaliser pour un humain. D'où la nécessité de développer des méthodes d'analyse automatique. Celles-ci sont usuellement en boîte blanche, utilisant la syntaxe du code pour déduire ses propriétés. Elles sont très efficaces mais présentent certaines limitations: le code source est nécessaire, la taille et la complexité syntaxique du code (accentuée par des optimisations et de l'obfuscation) impactent leur efficacité. Cette thèse explore comment les méthodes en boîte noire peuvent inférer des propriétés utiles pour la rétro-ingénierie. Nous étudions, tout d'abord, l'inférence de contrat de fonction qui tente d'apprendre sur quelles entrées une fonction peut être exécutée pour obtenir les sorties souhaitées. Nous adaptons l'acquisition de contraintes, en résolvant une de ses principales limitations: la dépendance à un être humain. En ressort PreCA, la première approche totalement boîte noire offrant des garanties claires de correction. PreCA est ainsi particulièrement approprié pour l'aide au développement. Nous étudions ensuite la déobfuscation, qui vise à simplifier du code obfusqué. Nous proposons Xyntia qui synthétise, via des S-métaheuristiques, une version compréhensible de blocs de code. Xyntia est plus rapide et robuste que l'état de l'art. De plus, nous proposons les deux premières protections contre la déobfuscation en boîte noire

HAL-CEA

Analyse de code en boîte noire pour la rétro ingénierie via acquisition de contraintes et synthèse de code

Author: Menguy Grégoire
Publication venue: HAL CCSD
Publication date: 14/03/2023
Field of study

Thèses en Ligne

Active disjunctive constraint acquisition

Author: Bardin Sébastien
Gotlieb Arnaud
Lazaar Nadjib
Menguy Grégoire
Publication venue: IJCAI Organization
Publication date: 02/09/2023
Field of study

International audienceConstraint acquisition (CA) is a method for learning users' concepts by representing them as a conjunction of constraints. While this approach works well for many combinatorial problems over finite domains, some applications require the acquisition of disjunctive constraints, possibly coming from logical implications or negations. In this paper, we propose the first CA algorithm tailored to the automatic inference of disjunctive constraints, named DCA. A key ingredient there, is to build upon the computation of maximal satisfiable subsets. We demonstrate experimentally that DCA is faster and more effective than traditional CA with added disjunctive constraints, even for ultra-metric constraints with up to 5 variables. We also apply DCA to precondition acquisition in software verification, where it outperforms the previous CA-based approach PreCA, being 2.5 times faster. Specifically, in our evaluation DCA infers more preconditions in just 5 minutes than PreCA does in an hour, without requiring prior knowledge about disjunction size. Our results demonstrate the potential of DCA for improving the efficiency and scalability of constraint acquisition in the disjunctive case, enabling a wide range of novel applications

HAL-CEA

Automated program analysis: revisiting precondition inference through constraint acquisition

Author: Bardin Sébastien
Gotlieb Arnaud
Lazaar Nadjib
Menguy Grégoire
Publication venue: Curran Associates, Inc. ( May 2023 )
Publication date: 23/07/2022
Field of study

HAL-CEA

Analyse de Code Automatique: Revisiter l'Inférence de Préconditions via l'Acquisition de Contraintes

Author: Bardin Sébastien
Gotlieb Arnaud
Lazaar Nadjib
Menguy Grégoire
Publication venue: HAL CCSD
Publication date: 03/07/2023
Field of study

International audienceProgram annotations under the form of function pre/postconditions are crucial for many software engineering and program verification applications. Unfortunately, these are rarely available and must be retrofit by hand. This paper explores how Constraint Acquisition (CA) can be leveraged to automatically infer program preconditions. This leads to P RE C A, which infers preconditions from input-output observations only, and presents clear correctness guarantees.Les annotations de programme, sous forme de pré/postconditions de fonctions, sont cruciales pour accomplir différentes tâches, de l'ingénierie logicielle à la vérification de code. Malheureusement, ces annotations sont rarement fournies et doivent donc être rétro-ingéniées manuellement. Dans notre article, nous étudions comment l'acquisition de contraintes peut être utilisée pour inférer des préconditions. Cela a conduit à PRECA, un outil qui infère des préconditions à partir d'observations d'exécution du code uniquement, et assurant des garanties claires de correction. Mots-clés Acquisition de contraintes, analyse de code, précondition

HAL-CEA

Search-based local black-box deobfuscation: understand, improve and mitigate

Author: Bardin Sébastien
Bonichon Richard
de Souza Lima Cauim
Menguy Grégoire
Publication venue: HAL CCSD
Publication date: 15/11/2021
Field of study

International audienceCode obfuscation aims at protecting Intellectual Property and other secrets embedded into software from being retrieved. Recent works leverage advances in artificial intelligence (AI) with the hope of getting blackbox deobfuscators completely immune to standard (whitebox) protection mechanisms. While promising, this new field of AI-based, and more specifically search-based blackbox deobfuscation, is still in its infancy. In this article we deepen the state of search-based blackbox deobfuscation in three key directions: understand the current state-of-the-art, improve over it and design dedicated protection mechanisms. In particular, we define a novel generic framework for search-based blackbox deobfuscation encompassing prior work and highlighting key components; we are the first to point out that the search space underlying code deobfuscation is too unstable for simulation-based methods (e.g., Monte Carlo Tree Search used in prior work) and advocate the use of robust methods such as S-metaheuristics; we propose the new optimized search-based blackbox deobfuscator Xyntia which significantly outperforms prior work in terms of success rate (especially with small time budget) while being completely immune to the most recent anti-analysis code obfuscation methods; and finally we propose two novel protections against search-based blackbox deobfuscation, allowing to counter Xyntia powerful attacks

HAL-CEA

Actes de la conférence CAID 2021 (Conference on Artificial Intelligence for Defense)

Author: Aguas Elkin
Allali Ahmed
Barbaresco Frédéric
Bardin Sébastien
Bartoli Nathalie
Bazin Alexis
Bernardin Narayan
Bhor Tejas
Blanc Gregory
Bonichon Richard
Boudguiga Aimen
Brunessaux Stephan
Bécue Adrien
Caquot Léonard
Carron Bruno
Ceillier Tugdual
Cejka Thomas
Charayron Rémy
Charrier Tristan
Chevalier Yannick
Chihani Zakaria
Corbineau Marie-Caroline
Corruble Vincent
Costé Benjamin
de de Souza Lima Cauim
Debar Hervé
Dey Alexandre
El Bezzaz Semlali Badre
El Fallah-Seghrouchni Amal
Ferrari Romain
Flory Pierre-Elisée
Furon Teddy
Garcia Ricardo
Genetay Edouard
Gregorutti Baptiste
Guilhot-Gaudeffroy Jean-Eudes
Hynek Karel
Héliou Alice
Iancheruk Aleksei
Kapusta Katarzyna
Kassab Rami
La Rocca Daria
Lagrave Pierre-Yves
Lambert Anthony
Lefèvre Thierry
Medad Amine
Menguy Grégoire
Morlier Joseph
Nguema Alexandre Peter
Papadopoulou Georgia
Papaioannou Ioannis
Pasquero Oudomsack Pierre
Plana Robert
Rodriguez Julien
Rousset Mathias
Sirdey Renaud
Stan Oana
Thouvenot Vincent
Tit Karim
Totel Éric
Traonouez Louis-Marie
Vardoulias George
Vekinis Andreas
Vidal Vincent
Vincent Marc
Zuber Martin
Publication venue: HAL CCSD
Publication date: 18/11/2021
Field of study

6th European Cyber Week (6ème semaine européenne de la cybersécurité et de la cyberdéfense

INRIA a CCSD electronic archive server

Actes de la conférence CAID 2021 (Conference on Artificial Intelligence for Defense)

Author: Aguas Elkin
Allali Ahmed
Barbaresco Frédéric
Bardin Sébastien
Bartoli Nathalie
Bazin Alexis
Bernardin Narayan
Bhor Tejas
Blanc Gregory
Bonichon Richard
Boudguiga Aimen
Brunessaux Stephan
Bécue Adrien
Caquot Léonard
Carron bruno
Ceillier Tugdual
Cejka Thomas
Charayron Rémy
Charrier Tristan
Chevalier Yannick
Chihani Zakaria
Corbineau Marie-Caroline
Corruble Vincent
Costé Benjamin
de de Souza Lima Cauim
Debar Hervé
Dey Alexandre
El Bezzaz Semlali Badre
El Fallah-Seghrouchni Amal
Ferrari Romain
Flory Pierre-Elisée
Furon Teddy
Garcia Ricardo
Genetay Edouard
Gregorutti Baptiste
Guilhot-Gaudeffroy Jean-Eudes
Hynek Karel
Héliou Alice
IANCHERUK Aleksei
Kapusta Katarzyna
Kassab Rami
La Rocca Daria
Lagrave Pierre-Yves
Lambert Anthony
Lefèvre Thierry
Medad Amine
Menguy Grégoire
Morlier Joseph
Nguema Alexandre Peter
Papadopoulou Georgia
Papaioannou Ioannis
Pasquero Oudomsack Pierre
Plana Robert
Rodriguez Julien
Rousset Mathias
Sirdey Renaud
Stan Oana
Thouvenot Vincent
Tit Karim
Totel Éric
Traonouez Louis-Marie
Vardoulias George
Vekinis Andreas
Vidal Vincent
Vincent Marc
Zuber Martin
Publication venue: HAL CCSD
Publication date: 18/11/2021
Field of study

6th European Cyber Week (6ème semaine européenne de la cybersécurité et de la cyberdéfense)International audienc

HAL-CentraleSupelec

Actes de la conférence CAID 2021 (Conference on Artificial Intelligence for Defense)

Author: Aguas Elkin
Allali Ahmed
Barbaresco Frédéric
Bardin Sébastien
Bartoli Nathalie
Bazin Alexis
Bernardin Narayan
Bhor Tejas
Blanc Gregory
Bonichon Richard
Boudguiga Aimen
Brunessaux Stephan
Bécue Adrien
Caquot Léonard
Carron Bruno
Ceillier Tugdual
Cejka Thomas
Charayron Rémy
Charrier Tristan
Chevalier Yannick
Chihani Zakaria
Corbineau Marie-Caroline
Corruble Vincent
Costé Benjamin
de de Souza Lima Cauim
Debar Hervé
Dey Alexandre
El Bezzaz Semlali Badre
El Fallah-Seghrouchni Amal
Ferrari Romain
Flory Pierre-Elisée
Furon Teddy
Garcia Ricardo
Genetay Edouard
Gregorutti Baptiste
Guilhot-Gaudeffroy Jean-Eudes
Hynek Karel
Héliou Alice
Iancheruk Aleksei
Kapusta Katarzyna
Kassab Rami
La Rocca Daria
Lagrave Pierre-Yves
Lambert Anthony
Lefèvre Thierry
Medad Amine
Menguy Grégoire
Morlier Joseph
Nguema Alexandre Peter
Papadopoulou Georgia
Papaioannou Ioannis
Pasquero Oudomsack Pierre
Plana Robert
Rodriguez Julien
Rousset Mathias
Sirdey Renaud
Stan Oana
Thouvenot Vincent
Tit Karim
Totel Éric
Traonouez Louis-Marie
Vardoulias George
Vekinis Andreas
Vidal Vincent
Vincent Marc
Zuber Martin
Publication venue: HAL CCSD
Publication date: 18/11/2021
Field of study

6th European Cyber Week (6ème semaine européenne de la cybersécurité et de la cyberdéfense)International audienc

HAL-Rennes 1