Search CORE

362 research outputs found

Extracting textual overlays from social media videos using neural networks

Author: Bielski Adam
Cyrta Paweł
Słucki Adam
Trzcinski Tomasz
Publication venue
Publication date: 01/05/2018
Field of study

Textual overlays are often used in social media videos as people who watch them without the sound would otherwise miss essential information conveyed in the audio stream. This is why extraction of those overlays can serve as an important meta-data source, e.g. for content classification or retrieval tasks. In this work, we present a robust method for extracting textual overlays from videos that builds up on multiple neural network architectures. The proposed solution relies on several processing steps: keyframe extraction, text detection and text recognition. The main component of our system, i.e. the text recognition module, is inspired by a convolutional recurrent neural network architecture and we improve its performance using synthetically generated dataset of over 600,000 images with text prepared by authors specifically for this task. We also develop a filtering method that reduces the amount of overlapping text phrases using Levenshtein distance and further boosts system's performance. The final accuracy of our solution reaches over 80A% and is au pair with state-of-the-art methods.Comment: International Conference on Computer Vision and Graphics (ICCVG) 201

arXiv.org e-Print Archive

Crossref

Recognition of Characters from Streaming Videos

Author: Aniruddha Sinha
Arpan Pal
Tanushyam Chattopadhyay
Publication venue: 'IntechOpen'
Publication date: 17/08/2010
Field of study

Non

IntechOpen

Segmenting characters from license plate images with little prior knowledge

Author: He X
Jia W
Wu Q
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2010
Field of study

In this paper, to enable a fast and robust system for automatically recognizing license plates with various appearances, new and simple but efficient algorithms are developed to segment characters from extracted license plate images. Our goal is to segment characters properly from a license plate image region. Different from existing methods for segmenting degraded machine-printed characters, our algorithms are based on very weak assumptions and use no prior knowledge about the format of the plates, in order for them to be applicable to wider applications. Experimental results demonstrate promising efficiency and flexibility of the proposed scheme. © 2010 IEEE

OPUS - University of Technology Sydney

Automatic fine-grained area detection for thin client systems

Author: De Turck Filip
Demeester Piet
Develder Chris
Dhoedt Bart
Simoens Pieter
Staelens Nicolas
Vankeirsbilck Bert
Verslype Dieter
Publication venue: 'Elsevier BV'
Publication date: 01/01/2012
Field of study

Ghent University Academic Bibliography

Archivsystem Ask23

Text Extraction in Video Images

Author: Yen Shwu-huey
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

[[abstract]]We propose a method to extract text information from video sequences. First the frequency of high horizontal energy in a video frame is examined to extract text blocks. Structural operations are then performed to remove the background so that the text can be extracted for later recognition. Experiments show that the method is efficient and effective for extracting text from various video documents.[[notice]]補正完畢[[conferencetype]]國際[[conferencedate]]20080714~20080717[[booktype]]紙本[[booktype]]電子版[[iscallforpapers]]Y[[conferencelocation]]Yokohama, Japa

Tamkang University Institutional Repository

InfraPhenoGrid: A scientific workflow infrastructure for Plant Phenomics on the Grid

Author: Artzet Simon
Chopard Jérôme
Cohen-Boulakia Sarah
Dupuis Dimitri
Fournier Christian
Mielewczik Michael
Negre Vincent
Neveu Pascal
Parigot Didier
Pradal Christophe
Valduriez Patrick
Publication venue: 'Elsevier BV'
Publication date: 01/06/2016
Field of study

International audiencePlant phenotyping consists in the observation of physical and biochemical traits of plant genotypes in response to environmental conditions. Challenges , in particular in context of climate change and food security, are numerous. High-throughput platforms have been introduced to observe the dynamic growth of a large number of plants in different environmental conditions. Instead of considering a few genotypes at a time (as it is the case when phenomic traits are measured manually), such platforms make it possible to use completely new kinds of approaches. However, the data sets produced by such widely instrumented platforms are huge, constantly augmenting and produced by increasingly complex experiments, reaching a point where distributed computation is mandatory to extract knowledge from data. In this paper, we introduce InfraPhenoGrid, the infrastructure we designed and deploy to efficiently manage data sets produced by the PhenoArch plant phenomics platform in the context of the French Phenome Project. Our solution consists in deploying scientific workflows on a Grid using a middle-ware to pilot workflow executions. Our approach is user-friendly in the sense that despite the intrinsic complexity of the infrastructure, running scientific workflows and understanding results obtained (using provenance information) is kept as simple as possible for end-users

HAL-CentraleSupelec

INRIA a CCSD electronic archive server