Search CORE

17 research outputs found

Rocchio\u27s Model Based on Vector Space Basis Change for Pseudo Relevance Feedback

Author: Hattab Hawete
Mbarek Rabeb
Tmar Mohamed
Publication venue: OASIcs - OpenAccess Series in Informatics. 3rd Symposium on Languages, Applications and Technologies
Publication date: 01/01/2014
Field of study

Rocchio\u27s relevance feedback model is a classic query expansion method and it has been shown to be effective in boosting information retrieval performance. The main problem with this method is that the relevant and the irrelevant documents overlap in the vector space because they often share same terms (at least the terms of the query). With respect to the initial vector space basis (index terms), it is difficult to select terms that separate relevant and irrelevant documents. The Vector Space Basis Change is used to separate relevant and irrelevant documents without any modification on the query term weights. In this paper, first, we study how to incorporate Vector Space Basis Change into the Rocchio\u27s model. Second, we propose Rocchio\u27s models based on Vector Space Basis Change, called VSBCRoc models. Experimental results on a TREC collection show that our proposed models are effective

Dagstuhl Research Online Publication Server

Impact of Ngrams-based indexing on XML retrieval

Author: Ben Aouicha Mohamed
Boughanem Mohand
Tmar Mohamed
Publication venue: Demetra EOOD
Publication date: 01/01/2009
Field of study

We present in this paper a statistical approach of term clustering. This approach is based on a statistical analysis of NGrams shared by a pair of terms and is inspired from the t f × idf criterion commonly used in information retrieval. Being statistical, the approach is completely independent from the lexical and grammatical characteristics of the language in which documents to be indexed are written. Classical indexing is often based on stemming, which consists of transforming a term into its radical. This allows to provide large issues for customized information access. As for us, we consider that this can be made by building term clusters and perform information retrieval based on this concept. This approach is used for XML retrieval, therefore some experiments have been undertaken into a dataset provided by INEX to show its impact compared to Porter stemming method

Research at Sofia University

Bulgarian OpenAIRE Repository

Proposition pour l’intégration des réseaux petits mondes en recherche d’information

Author: Abid Mohamed
Boughanem Mohand
Khazri Mohamed
Tmar Mohamed
Publication venue: 'Centre pour la Communication Scientifique Directe (CCSD)'
Publication date: 01/01/2009
Field of study

International audienceWe propose in this paper an approach for document clustering. It consists of representing the corpus as a document graph, where the links are defined by some criteria. These links are quantified by simialrity measures. We aim join this context into the approach of classification to constitute small-worlds networks of homogeneous documents. The homogeneity of the clusters is measured according to the properties of small worlds. The clusters, as well as their proprietes, allow to rerank search results. Some experiments were done on a corpus provided by TREC and the obtained results show the contribution of small-worlds networks in information retrieval.Nous proposons dans ce papier une approche de classification d’un corpus de documents. Elle consiste en une représentation du corpus sous forme de graphe, où les liens sont définis par certains critères. Ces liens sont quantifiés par des mesures de similarité. Nous visons à intégrer ce contexte dans l’approche de classification afin de constituer des réseaux petits mondes de documents homogènes. L’homogénéité des classes est valuée suivant les propriétés des réseaux petits mondes. Les classes, ainsi que leurs propriétés, nous servent au ré-ordonnancement de documents résultats de recherche. Quelques expérimentations ont été menées sur un corpus issu de TREC 1 et les résultats obtenus montrent l’apport des réseaux petits mondes en recherche d’information

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Episciences.org

HAL Descartes

Hal-Diderot

XML Retrieval

Author: Abid Mohamed
Ben Aouicha Mohamed
Boughanem Mohand
Tmar Mohamed
Publication venue: 'IntechOpen'
Publication date: 01/11/2009
Field of study

Non

IntechOpen

A Re-Ranking Method Based on Irrelevant Documents in Ad-Hoc Retrieval

Author: Boughanem Mohand
Hattab Hawete
Mbarek Rabeb
Tmar Mohamed
Publication venue: OASIcs - OpenAccess Series in Informatics. 5th Symposium on Languages, Applications and Technologies (SLATE\u2716)
Publication date: 01/01/2016
Field of study

In this paper, we propose a novel approach for document re-ranking, which relies on the concept of negative feedback represented by irrelevant documents. In a previous paper, a pseudo-relevance feedback method is introduced using an absorbing document ~d which best fits the user\u27s need. The document ~d is orthogonal to the majority of irrelevant documents. In this paper, this document is used to re-rank the initial set of ranked documents in Ad-hoc retrieval. The evaluation carried out on a standard document collection shows the effectiveness of the proposed approach

Dagstuhl Research Online Publication Server

Automatic Diagnosis of Breast Tissue

Author: Atef Boujelben
Hedi Tmar
Jameleddine Mnif
Mohamed Abid
Publication venue: 'IntechOpen'
Publication date: 27/01/2012
Field of study

IntechOpen

Modèle auto-adaptatif de filtrage d'information (apprentissage incrémental du profil et de la fonction de décision)

Author: CHRISMENT Claude
TMAR Mohamed
Publication venue
Publication date: 01/01/2002
Field of study

TOULOUSE3-BU Sciences (315552104) / SudocSudocFranceF

OpenGrey Repository

Investigating the combination of structural and textual information about multimedia retrieval Sana FAKHFAKH

Author: Mohamed Tmar
Walid Mahdi
Publication venue
Publication date
Field of study

Abstract—The expansion of structured information in different applications introduces a new ambiguity in multimedia retrieval in semi-structured documents. We investigate in this paper the combination of textual and structural context for multimedia retrieval in XML document thus we present a indexing model which combines textual and structural information. We propose a geometric method who use implicitly of textual and structural context of XML elements and we are particularly interested by improve the effectiveness of various structural factors for multimedia retrieval. Using a geometric metric, we can represent structural information in XML document with a vector for each element. Given a textual query, our model lets us combine scores obtained from each sources of evidence and return a list of relevant retrieved multimedia element. Experimental evaluation is carried out using the INEX Ad Hoc Task 2007 and the Image CLEF Wikipedia Retrieval Task 2010. The results show that combination of scores of textual modality and structural modality significantly improves compared results of using a single modality. Keywords—Geometric distance; multimedia retrieval; element; structure; document modeling I

CiteSeerX