Search CORE

224 research outputs found

Searching by approximate personal-name matching

Author: Camps Pare Rafael
Daude Ventura Jordi
Publication venue
Publication date: 01/01/2003
Field of study

We discuss the design, building and evaluation of a method to access theinformation of a person, using his name as a search key, even if it has deformations. We present a similarity function, the DEA function, based on the probabilities of the edit operations accordingly to the involved letters and their position, and using a variable threshold. The efficacy of DEA is quantitatively evaluated, without human relevance judgments, very superior to the efficacy of known methods. A very efficient approximate search technique for the DEA function is also presented based on a compacted trie-tree structure.Postprint (published version

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Smart detection of offensive words in social media using the soundex algorithm and permuterm index

Author: Abukaraki Anas
Abukhalil Tamer
Al Rawashdeh Tawfiq
Al-Jaafreh Moha'med
Alksasbeh Malek Z.
Alqaralleh Bassam A. Y.
Publication venue: Institute of Advanced Engineering and Science
Publication date: 01/10/2021
Field of study

Offensive posts in the social media that are inappropriate for a specific age, level of maturity, or impression are quite often destined more to unadult than adult participants. Nowadays, the growth in the number of the masked offensive words in the social media is one of the ethically challenging problems. Thus, there has been growing interest in development of methods that can automatically detect posts with such words. This study aimed at developing a method that can detect the masked offensive words in which partial alteration of the word may trick the conventional monitoring systems when being posted on social media. The proposed method progresses in a series of phases that can be broken down into a pre-processing phase, which includes filtering, tokenization, and stemming; offensive word extraction phase, which relies on using the soundex algorithm and permuterm index; and a post-processing phase that classifies the users’ posts in order to highlight the offensive content. Accordingly, the method detects the masked offensive words in the written text, thus forbidding certain types of offensive words from being published. Results of evaluation of performance of the proposed method indicate a 99% accuracy of detection of offensive words

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Institute of Advanced Engineering and Science

Matching health information seekers' queries to medical terms

Author: A Gaudinat
A Keselman
A Mykowiecka
A Stanier
AT McCray
C Boyer
C Grouin
C Senger
E Brill
Elise Prieur-Gaston
F Abad Garcia
F Brouard
G Stoilos
J Crowell
JW Wilbur
K Kuckich
L Peters
L Yujian
LF Soualmia
Lina F Soualmia
LJ Peterson
M Douyère
M Kernigham
P Ruch
SJ Grannis
SJ Nelson
SM Meystre
Stéfan J Darmoni
T Koch
T Yarkoni
Thierry Lecroq
VI Levenshtein
VJ Hodge
W Winkler
Zied Moalla
Ö Uzuner
Ö Uzuner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref