3,535 research outputs found

Rotation-invariant features for multi-oriented text detection in natural images.

Author: Bai Xiang
Liu Wenyu
Ma Yi
Tu Zhuowen
Yao Cong
Zhang Xin
Publication venue: eScholarship, University of California
Publication date: 01/01/2013

Texts in natural scenes carry rich semantic information, which can be used to assist a wide range of applications, such as object recognition, image/video retrieval, mapping/navigation, and human-computer interaction. However, most existing systems are designed to detect and recognize horizontal (or near-horizontal) texts. Due to the increasing popularity of mobile-computing devices and applications, detecting texts of varying orientations from natural images under less controlled conditions has become an important but challenging task. In this paper, we propose a new algorithm to detect texts of varying orientations. Our algorithm is based on a two-level classification scheme and two sets of features specially designed for capturing the intrinsic characteristics of texts. To better evaluate the proposed method and compare it with the competing algorithms, we generate a comprehensive dataset with various types of texts in diverse real-world scenes. We also propose a new evaluation protocol, which is more suitable for benchmarking algorithms for detecting texts in varying orientations. Experiments on benchmark datasets demonstrate that our system compares favorably with the state-of-the-art algorithms when handling horizontal texts and achieves significantly enhanced performance on variant texts in complex natural scenes.

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Automatic Palaeographic Exploration of Genizah Manuscripts

Author: Choueka Yaakov
Dershowitz Nachum
German Tanya
Potikha Liza
Shweka Roni
Wolf Lior
Publication venue: Books on Demand (BoD)
Publication date: 01/01/2011

The Cairo Genizah is a collection of hand-written documents containing approximately 350,000 fragments of mainly Jewish texts discovered in the late 19th century. The fragments are today spread out in some 75 libraries and private collections worldwide, but there is an ongoing effort to document and catalogue all extant fragments. Palaeographic information plays a key role in the study of the Genizah collection. Script style, and more specifically handwriting, can be used to identify fragments that might originate from the same original work. Such matched fragments, commonly referred to as “joins”, are currently identified manually by experts, and presumably only a small fraction of existing joins have been discovered to date. In this work, we show that automatic handwriting matching functions, obtained from non-specific features using a corpus of writing samples, can perform this task quite reliably. In addition, we explore the problem of grouping various Genizah documents by script style, without being provided any prior information about the relevant styles. The automatically obtained grouping agrees, for the most part, with the palaeographic taxonomy. In cases where the method fails, it is due to apparent similarities between related scripts.

Kölner UniversitätsPublikationsServer

Predicting tropical forest stand structure parameters from Fourier transform of very high-resolution remotely sensed canopy images

Author: Couteron Pierre
Nicolini Eric
Paget P.
Pélissier Raphaël
Publication venue: Wiley
Publication date: 01/01/2005

1. Predicting stand structure parameters for tropical forests from remotely sensed data has numerous important applications, such as estimating above-ground biomass and carbon stocks and providing spatial information for forest mapping and management planning, as well as detecting potential ecological determinants of plant species distributions. As an alternative to direct measurement of physical attributes of the vegetation and individual tree crown delineation, we present a powerful holistic approach using an index of canopy texture that can be extracted from either digitized air photographs or satellite images by means of two-dimensional spectral analysis by Fourier transform.
2. We defined an index of canopy texture from the ordination of the Fourier spectra computed for 3545 1-ha square images of an undisturbed tropical rain forest in French Guiana. This index expressed a gradient of coarseness vs. fineness resulting from the relative importance of small, medium and large spatial frequencies in the Fourier spectra.
3. Based on 12 1-ha control plots, the canopy texture index showed highly significant correlations with tree density (R² = 0.80), diameter of the tree of mean basal area (R² = 0.71), distribution of trees into d.b.h. classes (R² = 0.64) and mean canopy height (R² = 0.57), which allowed us to produce reasonable predictive maps of stand structure parameters from digital aerial photographs.
4. Synthesis and applications. Two-dimensional Fourier analysis is a powerful method for obtaining quantitative characterization of canopy texture, with good predictive ability on stand structure parameters. Forest departments should use routine forest inventory operations to set up and feed regional databases, featuring both tree diameter figures and digital canopy images, with the ultimate aims of calibrating robust regression relationships and deriving predictive maps of stand structure parameters over large areas of tropical forests. Such maps would be particularly useful for forest classification and to guide field assessment of tropical forest resources and biodiversity.
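
The abstract above does not say how the Fourier spectra are summarized before ordination. As a rough illustration only, the sketch below assumes one common summary: the share of spectral power falling in each integer wavenumber ring (a radially binned power spectrum). The function names `dft2_power`, `radial_spectrum` and `stripes` are hypothetical, the DFT is a naive pure-Python version suitable only for tiny images, and the ordination step is not shown.

```python
import cmath
import math


def dft2_power(image):
    """Naive 2-D DFT power spectrum of a square grayscale image (mean removed)."""
    n = len(image)
    mean = sum(map(sum, image)) / (n * n)
    power = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            acc = 0j
            for x in range(n):
                for y in range(n):
                    angle = -2 * math.pi * (u * x + v * y) / n
                    acc += (image[x][y] - mean) * cmath.exp(1j * angle)
            power[u][v] = abs(acc) ** 2
    return power


def radial_spectrum(image):
    """Share of total power per integer wavenumber r (cycles per image side)."""
    n = len(image)
    power = dft2_power(image)
    bins = [0.0] * (n // 2 + 1)
    for u in range(n):
        for v in range(n):
            fu = min(u, n - u)  # fold index to frequency magnitude
            fv = min(v, n - v)
            r = int(round(math.hypot(fu, fv)))
            if 0 < r <= n // 2:
                bins[r] += power[u][v]
    total = sum(bins) or 1.0
    return [b / total for b in bins]


def stripes(n, period):
    """Toy 'canopy': vertical stripes with the given period in pixels (assumed even)."""
    half = period // 2
    return [[1.0 if (y // half) % 2 == 0 else 0.0 for y in range(n)]
            for _ in range(n)]


coarse = radial_spectrum(stripes(8, 8))  # one wide stripe pair per image side
fine = radial_spectrum(stripes(8, 2))    # four narrow stripe pairs per image side
```

In this toy case the coarse pattern concentrates its power at a low wavenumber and the fine pattern at a high one, which is the coarseness vs. fineness gradient the abstract refers to.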

HAL-IRD

HAL-CIRAD

Evaluation of a Change Detection Methodology by Means of Binary Thresholding Algorithms and Informational Fusion Processes

Author: Agueda Arquero
Estibaliz Martinez
Gonzalo Pajares
Iñigo Molina
Javier Sanchez
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/01/2012

Land cover is subject to continuous changes on a wide variety of temporal and spatial scales. Those changes produce significant effects on human and natural activities. Maintaining a spatial database that is kept up to date with these changes allows better monitoring of the Earth’s resources and management of the environment. Change detection (CD) techniques using images from different sensors, such as satellite imagery and aerial photographs, have proven to be suitable and secure data sources from which updated information can be extracted efficiently, so that changes can also be inventoried and monitored. In this paper, a multisource CD methodology for multiresolution datasets is applied. First, different change indices are processed. Then, different thresholding algorithms for change/no_change are applied to these indices in order to better estimate the statistical parameters of these categories. Finally, the indices are integrated into a change detection multisource fusion process, which generates a single CD result from several combinations of indices. This methodology has been applied to datasets with different spectral and spatial resolution properties. The obtained results are then evaluated by means of a quality control analysis, as well as with complementary graphical representations. The suggested methodology has also proved efficient for identifying the change detection index with the highest contribution.
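
The abstract above does not name the thresholding algorithms it applies. As a hedged illustration of the change/no_change step, the sketch below assumes Otsu's method, a widely used binary thresholding algorithm, applied to a toy change index. The function name `otsu_threshold` and the sample values are hypothetical, not the authors' data or code.

```python
def otsu_threshold(values, bins=256):
    """Return a threshold that maximizes between-class variance (Otsu's method)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return lo
    width = (hi - lo) / bins
    hist = [0] * bins
    for v in values:
        k = min(int((v - lo) / width), bins - 1)  # clamp the maximum into the last bin
        hist[k] += 1
    total = len(values)
    sum_all = sum(k * h for k, h in enumerate(hist))
    best_k, best_var = 0, -1.0
    w0, sum0 = 0, 0.0
    for k in range(bins - 1):
        w0 += hist[k]
        sum0 += k * hist[k]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        between = w0 * w1 * (m0 - m1) ** 2  # proportional to between-class variance
        if between > best_var:
            best_var, best_k = between, k
    return lo + (best_k + 1) * width  # upper edge of the last "no change" bin


# Toy change index, e.g. absolute difference between two image dates (made-up values).
change_index = [0.01, 0.02, 0.03, 0.02, 0.05, 0.90, 0.85, 0.95]
t = otsu_threshold(change_index)
change_mask = [v >= t for v in change_index]
```

Here `change_mask` marks the three large index values as change and the rest as no change. A fusion step such as the one the abstract describes would then combine masks or class statistics from several indices, which this sketch does not attempt.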

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

PubMed Central

Archivo Digital UPM

The Effects of Character-Level Data Augmentation on Style-Based Dating of Historical Manuscripts

Author: Dhali Maruf
Koopmans Lisa
Schomaker Lambert
Publication venue: arXiv
Publication date: 15/12/2022

Identifying the production dates of historical manuscripts is one of the main goals for paleographers when studying ancient documents. Automated methods can provide paleographers with objective tools to estimate dates more accurately. Previously, statistical features have been used to date digitized historical manuscripts based on the hypothesis that handwriting styles change over periods. However, the sparse availability of such documents poses a challenge in obtaining robust systems. Hence, the research of this article explores the influence of data augmentation on the dating of historical manuscripts. Linear Support Vector Machines were trained with k-fold cross-validation on textural and grapheme-based features extracted from historical manuscripts of different collections, including the Medieval Paleographical Scale, early Aramaic manuscripts, and the Dead Sea Scrolls. Results show that training models with augmented data improves the performance of historical manuscript dating by 1% - 3% in cumulative scores. Additionally, this indicates further enhancement possibilities by considering models specific to the features and the documents’ script.
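
The abstract above reports results in cumulative scores without defining them. The sketch below assumes the usual definition in manuscript dating work: the percentage of samples whose absolute dating error is at most α years. The function name `cumulative_score` and the sample years are hypothetical.

```python
def cumulative_score(true_years, predicted_years, alpha):
    """Percentage of samples dated within +/- alpha years of the true year."""
    if not true_years or len(true_years) != len(predicted_years):
        raise ValueError("need two non-empty lists of equal length")
    hits = sum(1 for t, p in zip(true_years, predicted_years) if abs(t - p) <= alpha)
    return 100.0 * hits / len(true_years)


# Made-up example: absolute errors are 0, 25, 10 and 75 years.
true_years = [1300, 1325, 1350, 1375]
predicted_years = [1300, 1350, 1340, 1450]
cs_0 = cumulative_score(true_years, predicted_years, alpha=0)
cs_25 = cumulative_score(true_years, predicted_years, alpha=25)
```

Under this assumed definition, a 1% - 3% gain means that many more manuscripts fall inside the chosen error tolerance.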

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Text extraction in natural scenes using region-based method

Author: Huang Zhihu
Leng Jinsong
Publication venue: Edith Cowan University, Research Online, Perth, Western Australia
Publication date: 01/01/2014

Text in images is a very important clue for image indexing and retrieval. Unfortunately, it is challenging to extract text accurately and robustly from an image with a complex background. In this paper, a novel region-based text extraction method is proposed. Candidate text regions are first detected by an 8-connected object detection algorithm applied to the edge image. Non-text regions are then filtered out using shape, texture and stroke width rules. Finally, the remaining regions are grouped into text lines. Since stroke width is an intrinsic and distinctive characteristic of text, the accuracy of the non-text filter is notably improved. The improved Stroke Width Transform presented in the paper has lower computational complexity and is more accurate. Experimental results on a sample of the ICDAR competition dataset and on our own dataset show that the proposed method performs best among the six methods compared.
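
The candidate-region step in the abstract above can be sketched as generic 8-connected component labelling on a binary edge map. This is a minimal illustration under that assumption, not the authors' implementation. The function name `label_8_connected` and the toy edge map are hypothetical, and the shape, texture and stroke width filters are not shown.

```python
from collections import deque


def label_8_connected(mask):
    """Label 8-connected foreground regions; return (labels, bounding boxes).

    Each bounding box is (top, left, bottom, right), inclusive.
    """
    rows, cols = len(mask), len(mask[0])
    labels = [[0] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c] or labels[r][c]:
                continue
            label = len(boxes) + 1
            labels[r][c] = label
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                # Visit all eight neighbours, diagonals included.
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and mask[ny][nx] and not labels[ny][nx]):
                            labels[ny][nx] = label
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return labels, boxes


# Toy edge map: a diagonal pair (joined only under 8-connectivity) and a vertical pair.
edges = [
    [1, 0, 0, 0, 0],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 0, 1],
    [0, 0, 0, 0, 0],
]
labels, boxes = label_8_connected(edges)
```

Each bounding box is a candidate region that later rules could accept or reject. With 4-connectivity the diagonal pair would split into two regions, which is why the connectivity choice matters for thin, slanted strokes.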

Research Online @ ECU