Search CORE

16 research outputs found

Stamp detection in scanned documents

Author: Forczmański Paweł
Publication venue: 'Uniwersytetu Marii Curie-Sklodowskiej w Lublinie'
Publication date: 01/01/2010
Field of study

The article presents current challenges in stamp detection problem. It is a crucial topic these days since more and more traditional paper documents are being scanned in order to be archived, sent through the net or just printed. Moreover, an electronic version of paper document stored on a hard drive can be taken as forensic evidence of possible crime. The main purpose of the method presented in the paper is to detect, localize and segment stamps (imprints) from the scanned document. The problem is not trivial since there is no such thing like stamp standard. There are many variations in size, shape, complexity and ink color. It should be remembered that the scanned document may be degraded in quality and the stamp can be placed on a relatively complicated background. The algorithm consists of several steps: color segmentation and pixel classification, regular shapes detection, candidates segmentation and verification. The paper includes also the initial results of selected experiments on real documents having different types of stamps

Biblioteka Nauki - repozytorium artykuÅÃ³w

Crossref

University of Maria Curie-Skłodowska (UMCS): Scientific e-Journals / Uniwersytet Marii Curie-Skłodowskiej: e-czasopisma naukowe

A novel method for extracting and recognizing logos

Author: Asefnezhad Arash
Faez Karim
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 18/09/2012
Field of study

Nowadays, the high volume of archival documents has made it exigent to store documents in electronic databases. A text logo represents the ownership of the text, and different texts can be categorized by it; for this reason, different methods have been presented for extracting and recognizing logos. The methods presented earlier, suffer problems such as, error of logo detection and recognition and slow speed. The proposed method of this study is composed of three sections: In the first section, the exact position of the logo can be identified by the pyramidal tree structure and horizontal and vertical analysis, and in the second section, the logo can be extracted through the algorithm of the boundary extension of feature rectangles. In the third section, after normalizing the size of the logo and eliminating the skew angle, for feature extraction, we first blocked the region encompassing the logo, and then we extract a particular feature by the parameter of the center of gravity of connected component each block. Finally, we use the KNN classification for the recognition of the logo.DOI:http://dx.doi.org/10.11591/ijece.v2i5.129

IAES journal

Institute of Advanced Engineering and Science

A novel shape descriptor based on salient keypoints detection for binary image matching and retrieval

Author: Chatbri Houssem
Kameyama Keisuke
Kwan Paul
Little Suzanne
O'Connor Noel E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/04/2018
Field of study

We introduce a shape descriptor that extracts keypoints from binary images and automatically detects the salient ones among them. The proposed descriptor operates as follows: First, the contours of the image are detected and an image transformation is used to generate background information. Next, pixels of the transformed image that have specific characteristics in their local areas are used to extract keypoints. Afterwards, the most salient keypoints are automatically detected by filtering out redundant and sensitive ones. Finally, a feature vector is calculated for each keypoint by using the distribution of contour points in its local area. The proposed descriptor is evaluated using public datasets of silhouette images, handwritten math expressions, hand-drawn diagram sketches, and noisy scanned logos. Experimental results show that the proposed descriptor compares strongly against state of the art methods, and that it is reliable when applied on challenging images such as fluctuated handwriting and noisy scanned images. Furthermore, we integrate our descripto

Research UNE

Crossref

DCU Online Research Access Service

Deep Learning for Logo Detection: A Survey

Author: Hou Qiang
Hou Sujuan
Jiang Shuqiang
Li Jiacheng
Min Weiqing
Zhao Yanna
Zheng Yuanjie
Publication venue
Publication date: 09/10/2022
Field of study

When logos are increasingly created, logo detection has gradually become a research hotspot across many domains and tasks. Recent advances in this area are dominated by deep learning-based solutions, where many datasets, learning strategies, network architectures, etc. have been employed. This paper reviews the advance in applying deep learning techniques to logo detection. Firstly, we discuss a comprehensive account of public datasets designed to facilitate performance evaluation of logo detection algorithms, which tend to be more diverse, more challenging, and more reflective of real life. Next, we perform an in-depth analysis of the existing logo detection strategies and the strengths and weaknesses of each learning strategy. Subsequently, we summarize the applications of logo detection in various fields, from intelligent transportation and brand monitoring to copyright and trademark compliance. Finally, we analyze the potential challenges and present the future directions for the development of logo detection to complete this survey

arXiv.org e-Print Archive

Use of language models for document stream segmentation

Author: Belaïd Abdel
Belaïd Yolande
Neche Chems,
Publication venue: HAL CCSD
Publication date: 01/01/2020
Field of study

International audiencePage stream segmentation into single documents is a very common task which is practiced in companies and administrations when processing their incoming mail. It is not a straightforward task because the limits of the documents are not always obvious, and it is not always easy to find common features between the pages of the same document. In this paper, we seek to compare existing segmentation models and propose a new segmentation one based on GRUs (Gated Recurrent Unit) and an attention mechanism, named AGRU. This model uses the text content of the previous page and the current page to determine if both pages belong to the same document. So, due to its attention mechanism, this model is capable to recognize words that define the first page of a document. Training and evaluation are carried out on two datasets: Tobacco-800 and READ-Corpus. The former is a public dataset on which our model reaches an F1 score equal to 90%, and the later is private for which our model reaches an F1 score equal to 96%

Crossref

INRIA a CCSD electronic archive server

スケッチ問い合わせを用いた文書画像内容検索

Author: Houssem Chatbri
Publication venue
Publication date: 01/01/2016
Field of study

この博士論文は、全文公表に適さないやむを得ない事由があり要約のみを公表していましたが、解消したため、令和２（2020）年4月20日に全文を公表しました。筑波大学 (University of Tsukuba)201

Tsukuba Repository

A Hierarchical Cluster Tree Approach Leveraging Delaunay Triangulation

Author: Costin-Anton Boiangiu
Cristian Avatavului
Publication venue: EduSoft publishing
Publication date: 01/10/2023
Field of study

This research introduces a robust and reliable technique for structuring document image pages hierarchically, harnessing the power of Delaunay triangulation. Central to our approach is the formation of a cluster tree, which encapsulates the page's content through strategically exploiting layout elements arrangements and their relative distances. By applying our technique, we proficiently categorize the page into distinct clusters encompassing images, titles, and paragraphs. The consequent hierarchical framework, founded on the cluster tree, establishes a durable and trustworthy blueprint of the document layout, thereby accelerating document comprehension and examination.</p

Directory of Open Access Journals