Search CORE

16,061 research outputs found

Unconstrained Scene Text and Video Text Recognition for Arabic Script

Author: Jain Mohit
Jawahar C. V.
Mathew Minesh
Publication venue
Publication date: 07/11/2017
Field of study

Building robust recognizers for Arabic has always been challenging. We demonstrate the effectiveness of an end-to-end trainable CNN-RNN hybrid architecture in recognizing Arabic text in videos and natural scenes. We outperform previous state-of-the-art on two publicly available video text datasets - ALIF and ACTIV. For the scene text recognition task, we introduce a new Arabic scene text dataset and establish baseline results. For scripts like Arabic, a major challenge in developing robust recognizers is the lack of large quantity of annotated data. We overcome this by synthesising millions of Arabic text images from a large vocabulary of Arabic words and phrases. Our implementation is built on top of the model introduced here [37] which is proven quite effective for English scene text recognition. The model follows a segmentation-free, sequence to sequence transcription approach. The network transcribes a sequence of convolutional features from the input image to a sequence of target labels. This does away with the need for segmenting input image into constituent characters/glyphs, which is often difficult for Arabic script. Further, the ability of RNNs to model contextual dependencies yields superior recognition results.Comment: 5 page

arXiv.org e-Print Archive

Crossref

Print-Scan Resilient Text Image Watermarking Based on Stroke Direction Modulation for Chinese Document Authentication

Author: Sun G.
Sun X.
Tan L.
Publication venue: Společnost pro radioelektronické inženýrství
Publication date: 01/04/2012
Field of study

Print-scan resilient watermarking has emerged as an attractive way for document security. This paper proposes an stroke direction modulation technique for watermarking in Chinese text images. The watermark produced by the idea offers robustness to print-photocopy-scan, yet provides relatively high embedding capacity without losing the transparency. During the embedding phase, the angle of rotatable strokes are quantized to embed the bits. This requires several stages of preprocessing, including stroke generation, junction searching, rotatable stroke decision and character partition. Moreover, shuffling is applied to equalize the uneven embedding capacity. For the data detection, denoising and deskewing mechanisms are used to compensate for the distortions induced by hardcopy. Experimental results show that our technique attains high detection accuracy against distortions resulting from print-scan operations, good quality photocopies and benign attacks in accord with the future goal of soft authentication

Directory of Open Access Journals

Digital library of Brno University of Technology

Deep Adaptive Learning for Writer Identification based on Single Handwritten Word Images

Author: He Sheng
Schomaker Lambert
Publication venue: 'Elsevier BV'
Publication date: 28/09/2018
Field of study

There are two types of information in each handwritten word image: explicit information which can be easily read or derived directly, such as lexical content or word length, and implicit attributes such as the author's identity. Whether features learned by a neural network for one task can be used for another task remains an open question. In this paper, we present a deep adaptive learning method for writer identification based on single-word images using multi-task learning. An auxiliary task is added to the training process to enforce the emergence of reusable features. Our proposed method transfers the benefits of the learned features of a convolutional neural network from an auxiliary task such as explicit content recognition to the main task of writer identification in a single procedure. Specifically, we propose a new adaptive convolutional layer to exploit the learned deep features. A multi-task neural network with one or several adaptive convolutional layers is trained end-to-end, to exploit robust generic features for a specific main task, i.e., writer identification. Three auxiliary tasks, corresponding to three explicit attributes of handwritten word images (lexical content, word length and character attributes), are evaluated. Experimental results on two benchmark datasets show that the proposed deep adaptive learning method can improve the performance of writer identification based on single-word images, compared to non-adaptive and simple linear-adaptive approaches.Comment: Under view of Pattern Recognitio

arXiv.org e-Print Archive

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

A Bottom Up Procedure for Text Line Segmentation of Latin Script

Author: Jain Himanshu
Kumar Archana Praveen
Publication venue
Publication date: 09/10/2017
Field of study

In this paper we present a bottom up procedure for segmentation of text lines written or printed in the Latin script. The proposed method uses a combination of image morphology, feature extraction and Gaussian mixture model to perform this task. The experimental results show the validity of the procedure.Comment: Accepted and presented at the IEEE conference "International Conference on Advances in Computing, Communications and Informatics (ICACCI) 2017

arXiv.org e-Print Archive

Crossref

From Physics Model to Results: An Optimizing Framework for Cross-Architecture Code Generation

Author: Blazewicz Marek
Brandt Steven R.
Ciznicki Milosz
Hinder Ian
Kierzynka Michal
Koppelman David M.
Löffler Frank
Schnetter Erik
Tao Jian
Publication venue: 'IOS Press'
Publication date: 01/01/2013
Field of study

Starting from a high-level problem description in terms of partial differential equations using abstract tensor notation, the Chemora framework discretizes, optimizes, and generates complete high performance codes for a wide range of compute architectures. Chemora extends the capabilities of Cactus, facilitating the usage of large-scale CPU/GPU systems in an efficient manner for complex applications, without low-level code tuning. Chemora achieves parallelism through MPI and multi-threading, combining OpenMP and CUDA. Optimizations include high-level code transformations, efficient loop traversal strategies, dynamically selected data and instruction cache usage strategies, and JIT compilation of GPU code tailored to the problem characteristics. The discretization is based on higher-order finite differences on multi-block domains. Chemora's capabilities are demonstrated by simulations of black hole collisions. This problem provides an acid test of the framework, as the Einstein equations contain hundreds of variables and thousands of terms.Comment: 18 pages, 4 figures, accepted for publication in Scientific Programmin

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

Louisiana State University

MPG.PuRe

Cytoplasmic p53 couples oncogene-driven glucose metabolism to apoptosis and is a therapeutic target in glioblastoma.

Author: A Magi
AM Spence
Anthony Letai
AV Follis
AV Vaseva
BA Tannous
BJ Altman
Brian Higgins
C Tovar
CW Brennan
D Jiang
D Nathanson
DA Nathanson
DA Reardon
David A Nathanson
DR Green
E Cerami
E Strom
E Tasdemir
EH Cheng
EQ Lee
G Lessene
H Dai
H Takanaga
Harley I Kornblum
I Babic
I Vivanco
J Deng
J Gao
J Lee
J Lehár
J Montero
J Pomerantz
Jason T Lee
JC Liu
JE Chipuk
JE Chipuk
JE Chipuk
JIJ Leu
Jonathan E Tsang
JP Kruse
K Masui
L Qu
Laura Gosa
Lisa Ta
M Mihara
M-K Han
MG Vander Heiden
Mitra Dehghan Harati
MJ Lee
Nicholas A Bayley
OD Maddocks
P Nagesh Rao
Paul S Mischel
Peter M Clark
PM Clark
PY Wen
Q Ding
R Haq
RGW Verhaak
RJ DeBerardinis
Steven J Bensinger
TF Cloughesy
Timothy F Cloughesy
Veerle W Daniels
W Blake Gilmore
W Wei
WH Yang
William H Yong
Wilson X Mai
Y Zhang
Y Zhao
Publication venue: eScholarship, University of California
Publication date: 01/11/2017
Field of study

Cross-talk among oncogenic signaling and metabolic pathways may create opportunities for new therapeutic strategies in cancer. Here we show that although acute inhibition of EGFR-driven glucose metabolism induces only minimal cell death, it lowers the apoptotic threshold in a subset of patient-derived glioblastoma (GBM) cells. Mechanistic studies revealed that after attenuated glucose consumption, Bcl-xL blocks cytoplasmic p53 from triggering intrinsic apoptosis. Consequently, targeting of EGFR-driven glucose metabolism in combination with pharmacological stabilization of p53 with the brain-penetrant small molecule idasanutlin resulted in synthetic lethality in orthotopic glioblastoma xenograft models. Notably, neither the degree of EGFR-signaling inhibition nor genetic analysis of EGFR was sufficient to predict sensitivity to this therapeutic combination. However, detection of rapid inhibitory effects on [18F]fluorodeoxyglucose uptake, assessed through noninvasive positron emission tomography, was an effective predictive biomarker of response in vivo. Together, these studies identify a crucial link among oncogene signaling, glucose metabolism, and cytoplasmic p53, which may potentially be exploited for combination therapy in GBM and possibly other malignancies

Crossref

eScholarship - University of California