55 research outputs found
Statistical Features for Image Retrieval: A Quantitative Comparison
In this paper we present a comparison between various statistical descriptors and analyze their goodness in
classifying textural images. The chosen statistical descriptors have been proposed by Tamura, Battiato and
Haralick. In this work we also test a combination of the three descriptors for texture analysis. The databases
used in our study are the well-known Brodatz’s album and DDSM(Heath et al., 1998). The computed features
are classified using the Naive Bayes, the RBF, the KNN, the Random Forest and Random Tree models. The
results obtained from this study show that we can achieve a high classification accuracy if the descriptors are
used all together
Shape matching by curve modelling and alignment
Automatic information retrieval in the eld of shape recognition has been widely covered by many
research elds. Various techniques have been developed using different approaches such as intensity-based, modelbased
and shape-based methods. Whichever is the way to represent the objects in images, a recognition method
should be robust in the presence of scale change, translation and rotation. In this paper we present a new recognition
method based on a curve alignment technique, for planar image contours. The method consists of various phases
including extracting outlines of images, detecting signicant points and aligning curves. The dominant points can
be manually or automatically detected. The matching phase uses the idea of calculating the overlapping indices
between shapes as similarity measures. To evaluate the effectiveness of the algorithm, two databases of 216 and
99 images have been used. A performance analysis and comparison is provided by precision-recall curves
Snarci at SemEval-2024 Task 4: Themis Model for Binary Classification of Memes
This paper introduces an approach developed for multimodal meme analysis, specifically targeting the identification of persuasion techniques embedded within memes. Our methodology integrates Large Language Models (LLMs) and contrastive learning image encoders to discern the presence of persuasive elements in memes across diverse platforms. By capitalizing on the contextual understanding facilitated by LLMs and the discriminative power of contrastive learning for image encoding, our framework provides a robust solution for detecting and classifying memes with persuasion techniques. The system was used in Task 4 of Semeval 2024, precisely for Substask 2b (binary classification of presence of persuasion techniques). It showed promising results overall, achieving a Macro-F1=0.7986 on the English test data (i.e., the language the system was trained on) and Macro-F1=0.66777/0.47917/0.5554, respectively, on the other three “surprise” languages proposed by the task organizers, i.e., Bulgarian, North Macedonian and Arabic. The paper provides an overview of the system, along with a discussion of the results obtained and its main limitations
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024
The 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 addresses maritime
computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface
Vehicles (USV). Three challenges categories are considered: (i) UAV-based
Maritime Object Tracking with Re-identification, (ii) USV-based Maritime
Obstacle Segmentation and Detection, (iii) USV-based Maritime Boat Tracking.
The USV-based Maritime Obstacle Segmentation and Detection features three
sub-challenges, including a new embedded challenge addressing efficicent
inference on real-world embedded devices. This report offers a comprehensive
overview of the findings from the challenges. We provide both statistical and
qualitative analyses, evaluating trends from over 195 submissions. All
datasets, evaluation code, and the leaderboard are available to the public at
https://macvi.org/workshop/macvi24.Comment: Part of 2nd Workshop on Maritime Computer Vision (MaCVi) 2024 IEEE
Xplore submission as part of WACV 202
1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results
The 1 Workshop on Maritime Computer Vision (MaCVi) 2023 focused
on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned
Surface Vehicle (USV), and organized several subchallenges in this domain: (i)
UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking,
(iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime
Obstacle Detection. The subchallenges were based on the SeaDronesSee and MODS
benchmarks. This report summarizes the main findings of the individual
subchallenges and introduces a new benchmark, called SeaDronesSee Object
Detection v2, which extends the previous benchmark by including more classes
and footage. We provide statistical and qualitative analyses, and assess trends
in the best-performing methodologies of over 130 submissions. The methods are
summarized in the appendix. The datasets, evaluation code and the leaderboard
are publicly available at https://seadronessee.cs.uni-tuebingen.de/macvi.Comment: MaCVi 2023 was part of WACV 2023. This report (38 pages) discusses
the competition as part of MaCV
Recognition of Shapes by Attributed Skeletal Graphs
In this paper, we propose a framework to address the problem of generic 2-D shape recognition. The aim is mainly on using the potential strength of skeleton of discrete objects in computer vision and pattern recognition where features of objects are needed for classification. We propose to represent the medial axis characteristic points as an attributed skeletal graph to model the shape. The information about the object shape and its topology is totally embedded in them and this allows the comparison of different objects by graph matching algorithms. The experimental results demonstrate the correctness in detecting its characteristic points and in computing a more regular and effective representation for a perceptual indexing. The matching process, based on a revised graduated assignment algorithm, has produced encouraging results, showing the potential of the developed method in a variety of computer vision and pattern recognition domains. The results demonstrate its robustness in the presence of scale, reflection and rotation transformations and prove the ability to handle noise and occlusions
Diagnostica per immagini: un modello di percezione
Dottorato di ricerca in matematica applicata ed informatica. 6. ciclo. Coordinatore L. M. RicciardiConsiglio Nazionale delle Ricerche - Biblioteca Centrale - P.le Aldo Moro, 7, Rome; Biblioteca Nazionale Centrale - P.za Cavalleggeri, 1, Florence / CNR - Consiglio Nazionale delle RichercheSIGLEITItal
- …