Search CORE

28 research outputs found

A Graph-based approach to derive the geodesic distance on Statistical manifolds: Application to Multimedia Information Retrieval

Author: Abbad Zakariae
Alaoui Said Ouatik El
Hassouni Mohammed El
Maliani Ahmed Drissi El
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 26/06/2021
Field of study

In this paper, we leverage the properties of non-Euclidean Geometry to define the Geodesic distance (GD) on the space of statistical manifolds. The Geodesic distance is a real and intuitive similarity measure that is a good alternative to the purely statistical and extensively used Kullback-Leibler divergence (KLD). Despite the effectiveness of the GD, a closed-form does not exist for many manifolds, since the geodesic equations are hard to solve. This explains that the major studies have been content to use numerical approximations. Nevertheless, most of those do not take account of the manifold properties, which leads to a loss of information and thus to low performances. We propose an approximation of the Geodesic distance through a graph-based method. This latter permits to well represent the structure of the statistical manifold, and respects its geometrical properties. Our main aim is to compare the graph-based approximation to the state of the art approximations. Thus, the proposed approach is evaluated for two statistical manifolds, namely the Weibull manifold and the Gamma manifold, considering the Content-Based Texture Retrieval application on different databases

arXiv.org e-Print Archive

Crossref

No todas las preguntas son (igualmente) difíciles, una aproximación híbrida a la CQA en árabe

Author: El Alaoui Said Ouatik
Lahbari Imane
Rodríguez Hontoria Horacio
Publication venue: Sociedad Española para el Procesamiento del Lenguaje Natural
Publication date: 01/01/2018
Field of study

In the past we faced the problem of Community Question Answering using an unified approach. Some of the questions, however, are easier to be approached by a conventional rule-based system. In this paper we explore this direction.En el pasado hemos abordado la búsqueda de respuestas en comunidades usando un enfoque uniforme. Sin embargo, algunas preguntas pueden ser respondidas utilizando métodos basados en reglas. En este trabajo exploramos esta dirección.Dr. Rodríguez has been partially funded by Spanish project "GraphMed" (TIN2016-77820-C3-3R)

Repositorio Institucional de la Universidad de Alicante

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

UPC-USMBA at SemEval-2017 Task 3: Combining multiple approaches for CQA for Arabic

Author: El Adlouni Yassine
El Alaoui Said Ouatik
Ennahnahi Noureddine
Lahbari Imane
Meknassi Mohammed
Rodríguez Hontoria Horacio
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

This paper presents a description of the participation of the UPC-USMBA team in the SemEval 2017 Task 3, subtask D, Arabic. Our approach for facing the task is based on a performance of a set of atomic classifiers (lexical string-based, vectorial, and rule-based) whose results are later combined. Our primary submission has obtained good results: 2nd (from 3 participants) in MAP, and 1st in inaccuracy.Peer ReviewedPostprint (published version

Crossref

UPCommons. Portal del coneixement obert de la UPC

Une mesure de similarité par une approche noyau pour l'indexation et la recherche par le contenu dans les bases de données hétérogènes

Author: Daoudi Imane
Idrissi Khalid
Ouatik Said
Publication venue: HAL CCSD
Publication date: 19/03/2009
Field of study

National audienceDans ce papier, nous proposons une méthode de recherche d'images basée sur une mesure de similarité par une approche noyau. Notre méthode apporte une amélioration au niveau de la mesure de similarité des descripteurs d'attribut hétérogènes (couleurs, texture, forme, etc.) par rapport à une simple distance Euclidienne. L'idée de base repose sur la transposition des vecteurs de données dans un espace à noyau. Une approche de similarité qui consiste à utiliser des produits scalaires comme fonction de similarité a été utilisée pour comparer les vecteurs de donnés hétérogènes. Plusieurs expérimentations ont été effectuées pour la recherche des k plus proches voisins. Les résultats obtenus sur la qualité de la recherche montrent l'intérêt de notre propo

Hal-Diderot

Une mesure de similarité par une approche noyau pour l'indexation et la recherche par le contenu dans les bases de données hétérogènes

Author: Daoudi Imane
Idrissi Khalid
Ouatik Said
Publication venue: HAL CCSD
Publication date: 19/03/2009
Field of study

Hal-Diderot

Une mesure de similarité par une approche noyau pour l'indexation et la recherche par le contenu dans les bases de données hétérogènes

Author: Daoudi Imane
Idrissi Khalid
Ouatik Said
Publication venue: HAL CCSD
Publication date: 19/03/2009
Field of study

HAL

ARABIC TEXT SUMMARIZATION BASED ON LATENT SEMANTIC ANALYSIS TO ENHANCE ARABIC DOCUMENTS CLUSTERING

Author: Abdelmonaime Lachkar
Hanane Froud
Said Alaoui Ouatik
Publication venue
Publication date: 24/04/2020
Field of study

ABSTRACT Arabic Documents Clustering is an important task for obtaining good results with the traditional Information Retrieval (IR) systems especially with the rapid growth of the number of online documents present in Arabic language. Documents clustering aim to automatically group similar documents in one cluster using different similarity/distance measures. This task is often affected by the documents length, useful information on the documents is often accompanied by a large amount of noise, and therefore it is necessary to eliminate this noise while keeping useful information to boost the performance of Documents clustering. In this paper, we propose to evaluate the impact of text summarization using the Latent Semantic Analysis Model on Arabic Documents Clustering in order t

CiteSeerX

Kernel Based Approach for High Dimensional Heterogeneous Image Features Management in CBIR Context

Author: Daoudi Imane
El Alaoui Ouatik Said
Idrissi Khalid
Publication venue: HAL CCSD
Publication date: 01/01/2008
Field of study

International audienceIn this paper we address a challenge of the problem of the dimensionality curse and the semantic gap reduction for content based image retrieval in large and heterogeneous databases. The strength of our idea resides in building an effective multidimensional indexing method based on kernel principal component analysis (KPCA) which supports efficiently similarity search of the heterogeneous vectors (color, texture, shape) and maps data vectors on a low feature space that is partitioned into regions. An efficient approach to approximate feature space regions is proposed with the corresponding upper and lower distance bounds. Finally, relevance feedback mechanism is exploited to create a flexible retrieval metric in order to reduce the semantic gap between the user need and the data representation. Experimental evaluations show that the use of region approximation approach with relevance feedback can significantly improve both the quality and the CPU time of the result

Hal-Diderot