39 research outputs found

GEMINI: A Generic Multi-Modal Natural Interface Framework for Videogames

Author: G. Saon
H. Sakoe
J. Lockman
L.A. Schwarz
M. Arantes
P.Y. Shih
T. Yamada
Publication date: 01/01/2013

In recent years, videogame companies have recognized player engagement as a major factor in user experience and enjoyment. This has encouraged greater investment in new types of game controllers such as the WiiMote, Rock Band instruments and the Kinect. However, the native software of these controllers was not originally designed to be used in other game applications. This work addresses the issue by building a middleware framework that maps body poses or voice commands to actions in any game. This not only provides a more natural and customized user experience but also defines an interoperable virtual controller. In this version of the framework, body poses and voice commands are recognized through the Kinect's built-in cameras and microphones, respectively. The acquired data is then translated into the native interaction scheme in real time using a lightweight method based on spatial restrictions. The system is also prepared to use Nintendo's Wiimote as an auxiliary and unobtrusive gamepad for physically or verbally impractical commands. System validation was performed by analyzing the performance of certain tasks and examining user reports. Both confirmed this approach as a practical and appealing alternative to the game's native interaction scheme. In sum, this framework provides a game-controlling tool that is fully customizable and very flexible, thus expanding the market of game consumers.
Comment: WorldCIST'13 Internacional Conferenc
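The abstract does not detail the "lightweight method based on spatial restrictions", so the following is only a minimal sketch of what such a pose-to-action mapping might look like. The joint names, the 0.15 m threshold, and the action labels are hypothetical and are not taken from the GEMINI framework.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical skeleton format: joint name -> (x, y, z) in metres, y pointing up.
Joint = Tuple[float, float, float]
Skeleton = Dict[str, Joint]


@dataclass
class PoseRule:
    """Pairs a game action with a spatial restriction on the skeleton."""
    action: str
    restriction: Callable[[Skeleton], bool]


def hand_above_head(s: Skeleton) -> bool:
    # Restriction: the right hand is higher than the head.
    return s["right_hand"][1] > s["head"][1]


def leaning_left(s: Skeleton) -> bool:
    # Restriction: shoulders are shifted at least 0.15 m left of the hips
    # (the threshold is an assumed value).
    return s["shoulder_center"][0] < s["hip_center"][0] - 0.15


# Assumed rule table; a real configuration would be user-defined.
RULES = [
    PoseRule("jump", hand_above_head),
    PoseRule("strafe_left", leaning_left),
]


def actions_for(s: Skeleton, rules: List[PoseRule] = RULES) -> List[str]:
    """Return every action whose spatial restriction holds for this frame."""
    return [r.action for r in rules if r.restriction(s)]
```

Each incoming skeleton frame would be passed to `actions_for`, and the returned labels forwarded to the game as its native inputs (for example, simulated key presses).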

arXiv.org e-Print Archive

Crossref

Use of Markov processes in writing recognition

Author: BELAÏD (A.)
SAON (G.)
Publication venue: GRETSI, Saint Martin d'Hères, France
Publication date: 01/01/1997

In this paper, we present a brief survey of the use of different types of Markov models in writing recognition. Recognition is performed by computing the a posteriori probability of the pattern class. This computation involves several terms which, depending on the dependency hypotheses appropriate to the application considered, can be decomposed into elementary conditional probabilities. Under the assumption that the pattern may be modeled as a one- or two-dimensional stochastic process (random field) with Markovian properties, local maximisation of these probabilities yields the maximum pattern likelihood. Throughout the article we study several cases of subpattern probability conditioning. Each case is accompanied by practical illustrations from the field of printed and/or handwritten writing recognition.
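As an illustration of the decomposition described above, the one-dimensional, first-order Markov case can be written as follows. The notation is assumed here and does not come from the abstract: $c$ is a pattern class, $x = (x_1, \dots, x_T)$ is the sequence of subpatterns, and $\hat{c}$ is the recognised class.

```latex
\hat{c} = \arg\max_{c} P(c \mid x)
        = \arg\max_{c} P(x \mid c)\, P(c),
\qquad
P(x \mid c) = P(x_1 \mid c) \prod_{t=2}^{T} P(x_t \mid x_{t-1}, c)
```

The first equality is Bayes' rule with the class-independent term $P(x)$ dropped; the product follows from the first-order Markov assumption.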

I-Revues

Deep Learning for Single-Molecule Science

Author: Bengio Y
Bishop C M
Chang S
Chang S
Coates A
Deng L
Eduardo Alonso
Glorot X
Goodfellow I
Gregory Slabaugh
Hebb D O
Hinton G E
Hinton G E
Minsky M
Mitchell T
Nair V
Saon G
Schwenk H
Simonyan K
Simonyan K
SM Masudur R Al-Arif
Tim Albrecht
Werbos P
Widrow B
Yosinski J
Zeiler M D
Publication venue: IOP Publishing
Publication date: 18/09/2017

Exploring and making predictions based on single-molecule data can be challenging, not only due to the sheer size of the datasets, but also because a priori knowledge about the signal characteristics is typically limited and the signal-to-noise ratio is poor. For example, hypothesis-driven data exploration, informed by an expectation of the signal characteristics, can lead to interpretation bias or loss of information. Equally, even when the different data categories are known, e.g., the four bases in DNA sequencing, it is often difficult to know how to make best use of the available information content. The latest developments in Machine Learning (ML), so-called Deep Learning (DL), offer interesting new avenues to address such challenges. In some applications, such as speech and image recognition, DL has been able to outperform conventional Machine Learning strategies and even human performance. However, to date DL has not been applied much in single-molecule science, presumably in part because relatively little is known about the 'internal workings' of such DL tools within single-molecule science as a field. In this Tutorial, we attempt to illustrate in a step-by-step guide how one of those tools, a Convolutional Neural Network, may be used for base calling in DNA sequencing applications. We compare it with a Support Vector Machine as a more conventional ML method, and discuss some of the strengths and weaknesses of the approach. In particular, a 'deep' neural network has many features of a 'black box', which has important implications for how we look at and interpret data

City Research Online

Crossref

University of Birmingham Research Portal

Critical weather limits for paddy rice under diverse ecosystems of India

Author: Abburi Venkata M. Subba Rao
Abdus Sattar
Chandra B. Singh
Jawahar L. Choudhary
Malamal Alickal Sarath Chandran
Narayanan Manikandan
Nidhi
Sandeep S. Sandhu
Santanu Kumar Bal
Saon Banerjee
Vijay G. More
Vinod Kumar Singh
Publication venue: Frontiers Media S.A.
Publication date: 01/08/2023

Rice yields are largely influenced by variability in weather. Here, we demonstrate the effect of weather variables viz., maximum and minimum temperatures, rainfall, morning and evening relative humidity, bright sunshine hours on the yield of rice cv. Swarna, grown across five rice ecologies of India through field experiments during kharif (wet) season (Jun-Sept.). Critical thresholds of weather elements were identified for achieving above average, average and below average yield for each ecology. The investigation could determine how different weather elements individually and collectively affect rice yield in different rice ecosystems of India. While a sudden increase in minimum temperature by 8-10 °C (> 30 °C) during reproductive period resulted in 40-50 per cent yield reduction at Mohanpur, a sudden decrease (< 20 °C) caused yield decline at Dapoli. The higher yields may be attributed to a significant difference in bright sunshine hours between reproductive phases of above-average and below-average yield years (ranging from 2.8 to 7.8 hours during P5 stages and 1.7 to 5.1 during P4 stages). Rice cultivar Swarna performed differently at various sowing dates in a location as well as across locations (6650 kg ha-1 at Dapoli to 1101 kg ha-1 at Samastipur). It was also found that across all locations, the above average yield could be associated with higher range of maximum temperature compared to that of below average yield. Principal component analysis explained 77 per cent of cumulative variance among the variables at first growth stage, whereas 70 per cent at second growth stage followed by 74 per cent and 66 per cent at subsequent growth stages. We found that coastal locations, in contrast to inland ones, could maximize the yield potential of the cultivar Swarna, due to the longer duration of days between panicle initiation to physiological maturity. 
We anticipate that the location-specific thresholds of weather factors will encourage rice production techniques that are climate resilient

Directory of Open Access Journals

Bayesian group sparse learning for music source separation

Author: A Cichocki
A Lefevre
C Fevotte
CM Bishop
G Saon
H Lee
H Lee
H-L Hsieh
Hsin-Lung Hsieh
J Yoo
J-T Chien
J-T Chien
J-T Chien
J-T Chien
J-T Chien
Jen-Tzung Chien
M Kim
M Kim
M Marlin
M Zhong
MD Hoffman
MD Plumbley
ME Tipping
MN Schmidt
MN Schmidt
PJ Garrigues
PO Hoyer
R Jenatton
R Kompass
R Salakhutdinov
S Bengio
S Chib
S Moussaoui
SD Babacan
Z Duan
Publication venue: Springer Science and Business Media LLC

Crossref

Automatic summarization of voicemail messages using lexical and prosodic features

Author: Chen F.
Cordoba R.
Garofolo J.
Gotoh Y.
Hakkani-Tür D.
Hirschberg J.
Hirschberg J.
Hori C.
Huang J.
Jansche M.
Kato Y.
Konstantinos Koumpis
Koumpis K.
Koumpis K.
Koumpis K.
Koumpis K.
Koumpis K.
Kubala F.
Maclay H.
Makhoul J.
Medan Y.
Morgan N.
Morgan N.
Padmanabhan M.
Paksoy E.
Rohlicek J. R.
Saon G.
Scott M.
Shriberg E.
Steve Renals
Stevenson M.
Valenza R.
Walker M. A.
Warnke V.
Williams G.
Zechner K.
Zweig M. H.
Publication venue: Association for Computing Machinery (ACM)
Publication date: 01/01/2005

This article presents trainable methods for extracting principal content words from voicemail messages. The short text summaries generated are suitable for mobile messaging applications. The system uses a set of classifiers to identify the summary words with each word described by a vector of lexical and prosodic features. We use an ROC-based algorithm, Parcel, to select input features (and classifiers). We have performed a series of objective and subjective evaluations using unseen data from two different speech recognition systems as well as human transcriptions of voicemail speech

CiteSeerX

Crossref

Edinburgh Research Archive

Edinburgh Research Explorer

Off-line Handwritten Word Recognition Using a Mixed HMM-MRF Approach

Author: A. Belaïd
G. Saon
Publication date: 18/08/1997

In this paper we present a two-dimensional stochastic method for the recognition of unconstrained handwritten words in a small lexicon. The method is based on an efficient combination of hidden Markov models (HMMs) and causal Markov random fields (MRFs). It operates in a holistic manner, at the pixel level, on scaled binary word images which are assumed to be random field realizations. The state-related random fields act as smooth local estimators of specific writing strokes by merging conditional pixel probabilities along the columns of the image. The HMM component of our model provides an optimal switching mechanism between sets of MRF distributions in order to adapt dynamically to the features encountered during the left-to-right image scan. Experiments performed on a French omni-scriptor, omni-bank database of handwritten legal check amounts provided by the A2iA company are described at length

CiteSeerX

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Recognition of unconstrained handwritten words using Markov random fields and HMMs

Author: A. Belaïd
G. Saon

In this paper we present a system for the recognition of handwritten words on literal check amounts which advantageously combines HMMs and Markov random fields (MRFs). It operates in a holistic manner, at the pixel level, on height-normalised word images which are viewed as random field realizations. The HMM analyses the image along the horizontal writing direction, with the observation probability in a given state being the column product of causal MRF-like pixel conditional probabilities. Aspects concerning definition, training and recognition with this type of model, together with the experiments performed, are developed throughout the paper
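The "column product" mentioned above can be sketched as the following formula. The notation is assumed for illustration and is not given in the abstract: $H$ is the normalised image height, $o_t$ the $t$-th pixel column, $x_{i,t}$ the pixel at row $i$ of that column, $\mathcal{N}(i,t)$ a causal neighbourhood of already-scanned pixels, and $s$ an HMM state.

```latex
b_s(o_t) = \prod_{i=1}^{H} P_s\!\left( x_{i,t} \,\middle|\, x_{\mathcal{N}(i,t)} \right)
```

Under this reading, each HMM state $s$ carries its own set of pixel conditional distributions $P_s$, and the usual HMM recursions run over the columns $t = 1, \dots, T$ using $b_s(o_t)$ as the observation probability.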

CiteSeerX

Stochastic Trajectory Modeling for Recognition of Unconstrained Handwritten Words

Author: A. Belaïd
G. Saon
Y. Gong

In this paper we describe an off-line handwritten word recognition (HWR) system applied to the identification of literal French check amounts. It consists of three successive levels denoted as character, word and phrase level, each of them being related to the previous ones via conditional probability distributions. Training is done on character samples extracted from amount images which are modeled as trajectories in some feature space. At word level, guided by a dictionary, an internal character segmentation algorithm is used in order to maximize a global word probability measure. A stochastic grammar for a priori grammar generation probability of a phrase is proposed at the last level. Results obtained on a 1779 amounts data base provided by the SRTP are encouraging, showing our system open to further improvements.
1 Introduction
Because of the large variety of handwriting styles, the recognition is very difficult. Different categories of styles (handprinted, pure cursive) may ..

CiteSeerX