Search CORE

4 research outputs found

Localisation de sources de données et optimisation de requêtes réparties en environnement pair-à-pair

Author: Al King Raddad
Publication venue
Publication date: 11/05/2010
Field of study

Malgré leur succès dans le domaine du partage de fichiers, les systèmes P2P sont capables d'évaluer uniquement des requêtes simples basées sur la recherche d'un fichier en utilisant son nom. Récemment, plusieurs travaux de recherche sont effectués afin d'étendre ces systèmes pour qu'ils permettent le partage de données avec une granularité fine (i.e. un attribut atomique) et l'évaluation de requêtes complexes (i.e. requêtes SQL). A cause des caractéristiques des systèmes P2P (e.g. grande-échelle, instabilité et autonomie de nœuds), il n'est pas pratique d'avoir un catalogue global qui contient souvent des informations sur: les schémas, les données et les hôtes des sources de données. L'absence d'un catalogue global rend plus difficiles: (i) la localisation de sources de données en prenant en compte l'hétérogénéité de schémas et (ii) l'optimisation de requêtes. Dans notre thèse, nous proposons une approche pour l'évaluation des requêtes SQL en environnement P2P. Notre approche est fondée sur une ontologie de domaine et sur des formules de similarité pour résoudre l'hétérogénéité sémantique des schémas locaux. Quant à l'hétérogénéité structurelle de ces schémas, elle est résolue grâce à l'extension d'un algorithme de routage de requêtes (i.e. le protocole Chord) par des Indexes de structure. Concernant l'optimisation de requêtes, nous proposons de profiter de la phase de localisation de sources de données pour obtenir toutes les méta-données nécessaires pour générer un plan d'exécution proche de l'optimal. Afin de montrer la faisabilité et la validité de nos propositions, nous effectuons une évaluation des performances et nous discutons les résultats obtenus.Despite of their great success in the file sharing domain, P2P systems support only simple queries usually based on looking up a file by using its name. Recently, several research works have made to extend P2P systems to be able to share data having a fine granularity (i.e. atomic attribute) and to process queries written with a highly expressive language (i.e. SQL). The characteristics of P2P systems (e.g. large-scale, node autonomy and instability) make impractical to have a global catalog that stores often information about data, schemas and data source hosts. Because of the absence of a global catalog, two problems become more difficult: (i) locating data sources with taking into account the schema heterogeneity and (ii) query optimization. In our thesis, we propose an approach for processing SQL queries in a P2P environment. To solve the semantic heterogeneity between local schemas, our approach is based on domain ontology and on similarity formulas. As for the structural heterogeneity of local schemas, it is solved by the extension of a query routing method (i.e. Chord protocol) with Structure Indexes. Concerning the query optimization problem, we propose to take advantage of the data source localization phase to obtain all metadata required for generating a close to optimal execution plan. Finally, in order to show the feasibility and the validity of our propositions, we carry out performance evaluations and we discuss the obtained results

Thèses en ligne de l'Université Toulouse III - Paul Sabatier

On the Relationship Between Differential Algebra and Tropical Differential Algebraic Geometry

Author: Boulier François
Falkensteiner Sebastian
Noordman Marc Paul
Sánchez Omar León
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/08/2021
Field of study

Proceedings - University of Groningen

University of Groningen

Dissertations of the University of Groningen

Knowledge Discovery from XML documents: PAKDD 2006 Workshop Proceedings First International Workshop, KDXD 2006, Singapore, April 9, 2006.

Author: Nayak Richi
Zaki Mohammad
Publication venue: Springer
Publication date: 01/01/2006
Field of study

The KDXD'06 (Knowledge Discovery from XML Documents) workshop is\ud the first international workshop running this year in conjunction\ud with the PAKDD'06 conference. The workshop provides an important\ud forum for the dissemination and exchange of new ideas and,\ud research related to XML data discovery and retrieval.\ud \ud The eXtensible Markup Language (XML) has become a standard\ud language for data representation and exchange. With the continuous\ud growth in XML data sources, the ability to manage collections of\ud XML documents and discover knowledge from them for decision\ud support becomes increasingly important. Due to the inherent\ud flexibility of XML, in both structure and semantics, inferring\ud important knowledge from XML data is faced with new challenges as\ud well as benefits. The objective of the workshop is to bring\ud together researchers and practitioners to discuss all aspects of\ud the emerging XML data management challenges. Thus, the topics of\ud interest included, but were not limited to: XML data mining\ud methods; XML data mining applications; XML data management\ud emerging issues and challenges; XML in improving knowledge\ud discovery process; and Benchmarks and mining performance using XML\ud databases.\ud \ud The workshop received 26 submissions. We would like to thank all\ud those who submitted their work to the workshop under relatively\ud pressuring time deadlines. We have selected 10 high quality full\ud papers for the discussion and presentation in the workshop and for\ud inclusion in the proceedings after peer-reviews by at least three\ud members of the Program Committee. Accepted papers have been\ud grouped in three sessions and allocated equal presentation time\ud slots. The first session is on XML data mining methods of\ud classification, clustering and association. The second session\ud focuses on the XML data reasoning and querying methods. Query\ud Optimization. And, the last session is on XML data applications of\ud transportation and security .\ud \ud Special thanks go to the program committee members who shared\ud their expertise and time to make KDXD'06 a success. The final\ud quality of selected papers depends on their efforts.\ud \ud Last but least, we would like to thank the organizers of PAKDD\ud 2006 for hosting KDXD'06

Queensland University of Technology ePrints Archive