1,384 research outputs found
A survey on mouth modeling and analysis for Sign Language recognition
© 2015 IEEE.Around 70 million Deaf worldwide use Sign Languages (SLs) as their native languages. At the same time, they have limited reading/writing skills in the spoken language. This puts them at a severe disadvantage in many contexts, including education, work, usage of computers and the Internet. Automatic Sign Language Recognition (ASLR) can support the Deaf in many ways, e.g. by enabling the development of systems for Human-Computer Interaction in SL and translation between sign and spoken language. Research in ASLR usually revolves around automatic understanding of manual signs. Recently, ASLR research community has started to appreciate the importance of non-manuals, since they are related to the lexical meaning of a sign, the syntax and the prosody. Nonmanuals include body and head pose, movement of the eyebrows and the eyes, as well as blinks and squints. Arguably, the mouth is one of the most involved parts of the face in non-manuals. Mouth actions related to ASLR can be either mouthings, i.e. visual syllables with the mouth while signing, or non-verbal mouth gestures. Both are very important in ASLR. In this paper, we present the first survey on mouth non-manuals in ASLR. We start by showing why mouth motion is important in SL and the relevant techniques that exist within ASLR. Since limited research has been conducted regarding automatic analysis of mouth motion in the context of ALSR, we proceed by surveying relevant techniques from the areas of automatic mouth expression and visual speech recognition which can be applied to the task. Finally, we conclude by presenting the challenges and potentials of automatic analysis of mouth motion in the context of ASLR
Consideration of some factors affecting low-frequency fuselage noise transmission for propeller aircraft
Possible reasons for disagreement between measured and predicted trends of sidewall noise transmission at low frequency are investigated using simplified analysis methods. An analytical model combining incident plane acoustic waves with an infinite flat panel is used to study the effects of sound incidence angle, plate structural properties, frequency, absorption, and the difference between noise reduction and transmission loss. Analysis shows that these factors have significant effects on noise transmission but they do not account for the differences between measured and predicted trends at low frequencies. An analytical model combining an infinite flat plate with a normally incident acoustic wave having exponentially decaying magnitude along one coordinate is used to study the effect of a localized source distribution such as is associated with propeller noise. Results show that the localization brings the predicted low-frequency trend of noise transmission into better agreement with measured propeller results. This effect is independent of low-frequency stiffness effects that have been previously reported to be associated with boundary conditions
Acoustic fatigue: Overview of activities at NASA Langley
A number of aircraft and spacecraft configurations are being considered for future development. These include high-speed turboprop aircraft, advanced vertical take-off and landing fighter aircraft, and aerospace planes for hypersonic intercontinental cruise or flight to orbit and return. Review of the acoustic environment expected for these vehicles indicates levels high enough that acoustic fatigue must be considered. Unfortunately, the sonic fatique design technology used for current aircraft may not be adequate for these future vehicles. This has resulted in renewed emphasis on acoustic fatigue research at the NASA Langley Research Center. The overall objective of the Langley program is to develop methods and information for design of aerospace vehicles that will resist acoustic fatigue. The program includes definition of the acoustic loads acting on structures due to exhaust jets of boundary layers, and subsequent determination of the stresses within the structure due to these acoustic loads. Material fatigue associated with the high frequency structural stress reversal patterns resulting from acoustic loadings is considered to be an area requiring study, but no activity is currently underway
Face flow
In this paper, we propose a method for the robust and efficient computation of multi-frame optical flow in an expressive sequence of facial images. We formulate a novel energy minimisation problem for establishing dense correspondences between a neutral template and every frame of a sequence. We exploit the highly correlated nature of human expressions by representing dense facial motion using a deformation basis. Furthermore, we exploit the even higher correlation between deformations in a given input sequence by imposing a low-rank prior on the coefficients of the deformation basis, yielding temporally consistent optical flow. Our proposed model-based formulation, in conjunction with the inverse compositional strategy and low-rank matrix optimisation that we adopt, leads to a highly efficient algorithm for calculating facial flow. As experimental evaluation, we show quantitative experiments on a challenging novel benchmark of face sequences, with dense ground truth optical flow provided by motion capture data. We also provide qualitative results on a real sequence displaying fast motion and occlusions. Extensive quantitative and qualitative comparisons demonstrate that the proposed method outperforms state-of-the-art optical flow and dense non-rigid registration techniques, whilst running an order of magnitude faster
Structure tensor total variation
This is the final version of the article. Available from Society for Industrial and Applied Mathematics via the DOI in this record.We introduce a novel generic energy functional that we employ to solve inverse imaging problems
within a variational framework. The proposed regularization family, termed as structure tensor
total variation (STV), penalizes the eigenvalues of the structure tensor and is suitable for both
grayscale and vector-valued images. It generalizes several existing variational penalties, including
the total variation seminorm and vectorial extensions of it. Meanwhile, thanks to the structure
tensor’s ability to capture first-order information around a local neighborhood, the STV functionals
can provide more robust measures of image variation. Further, we prove that the STV regularizers
are convex while they also satisfy several invariance properties w.r.t. image transformations. These
properties qualify them as ideal candidates for imaging applications. In addition, for the discrete
version of the STV functionals we derive an equivalent definition that is based on the patch-based
Jacobian operator, a novel linear operator which extends the Jacobian matrix. This alternative
definition allow us to derive a dual problem formulation. The duality of the problem paves the
way for employing robust tools from convex optimization and enables us to design an efficient
and parallelizable optimization algorithm. Finally, we present extensive experiments on various
inverse imaging problems, where we compare our regularizers with other competing regularization
approaches. Our results are shown to be systematically superior, both quantitatively and visually
Towards longitudinal data analytics in Parkinson's Disease
The CloudUPDRS app has been developed as a Class I med- ical device to assess the severity of motor symptoms for Parkinson’s Disease using a fully automated data capture and signal analysis pro- cess based on the standard Unified Parkinson’s Disease Rating Scale. In this paper we report on the design and development of the signal pro- cessing and longitudinal data analytics microservices developed to carry out these assessments and to forecast the long-term development of the disease. We also report on early findings from the application of these techniques in the wild with a cohort of early adopters
A Two-Player Game of Life
We present a new extension of Conway's game of life for two players, which we
call p2life. P2life allows one of two types of token, black or white, to
inhabit a cell, and adds competitive elements into the birth and survival rules
of the original game. We solve the mean-field equation for p2life and determine
by simulation that the asymptotic density of p2life approaches 0.0362.Comment: 7 pages, 3 figure
A 3D morphable model learnt from 10,000 faces
This is the final version of the article. It is the open access version, provided by the Computer Vision Foundation. Except for the watermark, it is identical to the IEEE published version. Available from IEEE via the DOI in this record.We present Large Scale Facial Model (LSFM) - a 3D Morphable Model (3DMM) automatically constructed from 9,663 distinct facial identities. To the best of our knowledge LSFM is the largest-scale Morphable Model ever constructed, containing statistical information from a huge variety of the human population. To build such a large model we introduce a novel fully automated and robust Morphable Model construction pipeline. The dataset that LSFM is trained on includes rich demographic information about each subject, allowing for the construction of not only a global 3DMM but also models tailored for specific age, gender or ethnicity groups. As an application example, we utilise the proposed model to perform age classification from 3D shape alone. Furthermore, we perform a systematic analysis of the constructed 3DMMs that showcases their quality and descriptive power. The presented extensive qualitative and quantitative evaluations reveal that the proposed 3DMM achieves state-of-the-art results, outperforming existing models by a large margin. Finally, for the benefit of the research community, we make publicly available the source code of the proposed automatic 3DMM construction pipeline. In addition, the constructed global 3DMM and a variety of bespoke models tailored by age, gender and ethnicity are available on application to researchers involved in medically oriented research.J. Booth is funded by an EPSRC
DTA from Imperial College London, and holds a Qualcomm
Innovation Fellowship. A. Roussos is funded by
the Great Ormond Street Hospital Childrens Charity (Face
Value: W1037). The work of S. Zafeiriou was partially
funded by the EPSRC project EP/J017787/1 (4D-FAB)
Estimating Correspondences of Deformable Objects “In-the-wild”
During the past few years we have witnessed the development of many methodologies for building and fitting Statistical Deformable Models (SDMs). The construction of accurate SDMs requires careful annotation of images with regards to a consistent set of landmarks. However, the manual annotation of a large amount of images is a tedious, laborious and expensive procedure. Furthermore, for several deformable objects, e.g. human body, it is difficult to define a consistent set of landmarks, and, thus, it becomes impossible to train humans in order to accurately annotate a collection of images. Nevertheless, for the majority of objects, it is possible to extract the shape by object segmentation or even by shape drawing. In this paper, we show for the first time, to the best of our knowledge, that it is possible to construct SDMs by putting object shapes in dense correspondence. Such SDMs can be built with much less effort for a large battery of objects. Additionally, we show that, by sampling the dense model, a part-based SDM can be learned with its parts being in correspondence. We employ our framework to develop SDMs of human arms and legs, which can be used for the segmentation of the outline of the human body, as well as to provide better and more consistent annotations for body joints
- …