101,344 research outputs found
AON: Towards Arbitrarily-Oriented Text Recognition
Recognizing text from natural images is a hot research topic in computer
vision due to its various applications. Despite the enduring research of
several decades on optical character recognition (OCR), recognizing texts from
natural images is still a challenging task. This is because scene texts are
often in irregular (e.g. curved, arbitrarily-oriented or seriously distorted)
arrangements, which have not yet been well addressed in the literature.
Existing methods on text recognition mainly work with regular (horizontal and
frontal) texts and cannot be trivially generalized to handle irregular texts.
In this paper, we develop the arbitrary orientation network (AON) to directly
capture the deep features of irregular texts, which are combined into an
attention-based decoder to generate character sequence. The whole network can
be trained end-to-end by using only images and word-level annotations.
Extensive experiments on various benchmarks, including the CUTE80,
SVT-Perspective, IIIT5k, SVT and ICDAR datasets, show that the proposed
AON-based method achieves the-state-of-the-art performance in irregular
datasets, and is comparable to major existing methods in regular datasets.Comment: Accepted by CVPR201
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
In the past decade, Convolutional Neural Networks (CNNs) have demonstrated
state-of-the-art performance in various Artificial Intelligence tasks. To
accelerate the experimentation and development of CNNs, several software
frameworks have been released, primarily targeting power-hungry CPUs and GPUs.
In this context, reconfigurable hardware in the form of FPGAs constitutes a
potential alternative platform that can be integrated in the existing deep
learning ecosystem to provide a tunable balance between performance, power
consumption and programmability. In this paper, a survey of the existing
CNN-to-FPGA toolflows is presented, comprising a comparative study of their key
characteristics which include the supported applications, architectural
choices, design space exploration methods and achieved performance. Moreover,
major challenges and objectives introduced by the latest trends in CNN
algorithmic research are identified and presented. Finally, a uniform
evaluation methodology is proposed, aiming at the comprehensive, complete and
in-depth evaluation of CNN-to-FPGA toolflows.Comment: Accepted for publication at the ACM Computing Surveys (CSUR) journal,
201
Generating Innovations in Economic Variables
Stock prices should respond only to unpredictable components of economic news (âinnovationsâ) in efficient markets. While innovations used in empirical investigations of the economic underpinnings of stock market risk should at least satisfy this basic requirement this may not guarantee satisfactory research results. Three methods of generating innovations are evaluated for a variety of economic variables. First differencing produces unsatisfactory serially correlated innovations in general. Both ARIMA and Kalman Filter innovations are unpredictable, but in a further evaluation the component scores from Principal Components Analysis are regressed against economic innovations using PcGets. The results are far less noisy when Kalman Filter innovations are used.Macroeconomic variables, Innovations, stock returns, principal components analysis
DeepSoft: A vision for a deep model of software
Although software analytics has experienced rapid growth as a research area,
it has not yet reached its full potential for wide industrial adoption. Most of
the existing work in software analytics still relies heavily on costly manual
feature engineering processes, and they mainly address the traditional
classification problems, as opposed to predicting future events. We present a
vision for \emph{DeepSoft}, an \emph{end-to-end} generic framework for modeling
software and its development process to predict future risks and recommend
interventions. DeepSoft, partly inspired by human memory, is built upon the
powerful deep learning-based Long Short Term Memory architecture that is
capable of learning long-term temporal dependencies that occur in software
evolution. Such deep learned patterns of software can be used to address a
range of challenging problems such as code and task recommendation and
prediction. DeepSoft provides a new approach for research into modeling of
source code, risk prediction and mitigation, developer modeling, and
automatically generating code patches from bug reports.Comment: FSE 201
Voronoi-Like grid systems for tall buildings
In the context of innovative patterns for tall buildings, Voronoi tessellation is certainly worthy of interest. It is an irregular biomimetic pattern based on the Voronoi diagram, which derives from the direct observation of natural structures. The paper is mainly focused on the application of this nature-inspired typology to load-resisting systems for tall buildings, investigating the potential of non-regular grids on the global mechanical response of the structure. In particular, the study concentrates on the periodic and non-periodic Voronoi tessellation, describing the procedure for generating irregular patterns through parametric modeling and illustrates the homogenization-based approach proposed in the literature for dealing with unconventional patterns. To appreciate the consistency of preliminary design equations, numerical and analytical results are compared. Moreover, since the mechanical response of the building strongly depends on the parameters of the microstructure, the paper focuses on the influence of the grid arrangement on the global lateral stiffness, therefore on the displacement constraint, which is an essential requirement in the design of tall buildings. To this end, five case studies, accounting for different levels of irregularity and relative density, are generated and analyzed through static and modal analysis in the elastic field. In addition, the paper focuses on the mechanical response of a pattern with gradual rarefying density to evaluate its applicability to tall buildings. Displacement based optimizations are carried out to assess the adequate member cross sections that provide the maximum contribution in restraining deflection with the minimum material weight. The results obtained for all the models generated are compared and discussed to outline a final evaluation of the Voronoi structures. In addition to the wind loading scenario, the efficiency of the building model with varying density Voronoi pattern, is tested for seismic ground motion through a response spectrum analysis. The potential applications of Voronoi tessellation for tall buildings is demonstrated for both regions with high wind load conditions and areas of high seismicity
Mesh-based Autoencoders for Localized Deformation Component Analysis
Spatially localized deformation components are very useful for shape analysis
and synthesis in 3D geometry processing. Several methods have recently been
developed, with an aim to extract intuitive and interpretable deformation
components. However, these techniques suffer from fundamental limitations
especially for meshes with noise or large-scale deformations, and may not
always be able to identify important deformation components. In this paper we
propose a novel mesh-based autoencoder architecture that is able to cope with
meshes with irregular topology. We introduce sparse regularization in this
framework, which along with convolutional operations, helps localize
deformations. Our framework is capable of extracting localized deformation
components from mesh data sets with large-scale deformations and is robust to
noise. It also provides a nonlinear approach to reconstruction of meshes using
the extracted basis, which is more effective than the current linear
combination approach. Extensive experiments show that our method outperforms
state-of-the-art methods in both qualitative and quantitative evaluations
- âŠ