822 research outputs found
Human Shape-Motion Analysis In Athletics Videos for Coarse To Fine Action/Activity Recognition Using Transferable BeliefModel
We present an automatic human shape-motion analysis method based on a fusion architecture for human action and activity recognition in athletic videos. Robust shape and motion features are extracted from human detection and tracking. The features are combined within the Transferable Belief Model (TBM framework for two levels of recognition. The TBM-based modelling of the fusion process allows to take into account imprecision, uncertainty and conflict inherent to the features. First, in a coarse step, actions are roughly recognized. Then, in a fine step, an action sequence recognition method is used to discriminate activities. Belief on actions are made smooth by a Temporal Credal Filter and action sequences, i.e. activities, are recognized using a state machine, called belief scheduler, based on TBM. The belief scheduler is also exploited for feedback information extraction in order to improve tracking results. The system is tested on real videos of athletics meetings to recognize four types of actions (running, jumping, falling and standing) and four types of activities (high jump, pole vault, triple jump and long jump). Results on actions, activities and feedback demonstrate the relevance of the proposed features and as well the efficiency of the proposed recognition approach based on TBM
Quarterly Research Output Reports
These reports paper summarize research outputs published in each quarter by academic staff at the University of Lincoln. The lists include substantive research outputs first appearing "in published form" (or equivalent for non-textual outputs) during this period. The lists have been generated automatically from data stored in the Lincoln Repository (http://eprints.lincoln.ac.uk/). Tables summarize the volume of outputs recorded by School
A new track for technology: Can ICT take care for healthier lifestyles?
The paper takes a look on potential contribution of Information and Communication Technologies to abate public health challenges caused by demographics and lifestyle. From the current convergence of mhealth, and sport market products emerge targeting normal athletes to control their training in a quantified manner. The resulting feedback and transparency foster a healthier lifestyle. These products and services help overcome limitations to innovation typical to the health care market. The paper is based on research by the European Commission's Institute for Prospective Technological Studies on Integrated Personal Health/Care services. --eHealth,Integrated Personal Health/Care services,sport,training,lifestyle related disease,innovation
REPRESENTATION LEARNING FOR ACTION RECOGNITION
The objective of this research work is to develop discriminative representations for human
actions. The motivation stems from the fact that there are many issues encountered while
capturing actions in videos like intra-action variations (due to actors, viewpoints, and duration),
inter-action similarity, background motion, and occlusion of actors. Hence, obtaining
a representation which can address all the variations in the same action while maintaining
discrimination with other actions is a challenging task. In literature, actions have been represented
either using either low-level or high-level features. Low-level features describe
the motion and appearance in small spatio-temporal volumes extracted from a video. Due
to the limited space-time volume used for extracting low-level features, they are not able
to account for viewpoint and actor variations or variable length actions. On the other hand,
high-level features handle variations in actors, viewpoints, and duration but the resulting
representation is often high-dimensional which introduces the curse of dimensionality. In
this thesis, we propose new representations for describing actions by combining the advantages
of both low-level and high-level features. Specifically, we investigate various linear
and non-linear decomposition techniques to extract meaningful attributes in both high-level
and low-level features. In the first approach, the sparsity of high-level feature descriptors is leveraged to build
action-specific dictionaries. Each dictionary retains only the discriminative information
for a particular action and hence reduces inter-action similarity. Then, a sparsity-based
classification method is proposed to classify the low-rank representation of clips obtained
using these dictionaries. We show that this representation based on dictionary learning improves
the classification performance across actions. Also, a few of the actions consist of
rapid body deformations that hinder the extraction of local features from body movements.
Hence, we propose to use a dictionary which is trained on convolutional neural network
(CNN) features of the human body in various poses to reliably identify actors from the
background. Particularly, we demonstrate the efficacy of sparse representation in the identification
of the human body under rapid and substantial deformation.
In the first two approaches, sparsity-based representation is developed to improve discriminability
using class-specific dictionaries that utilize action labels. However, developing
an unsupervised representation of actions is more beneficial as it can be used to both
recognize similar actions and localize actions. We propose to exploit inter-action similarity
to train a universal attribute model (UAM) in order to learn action attributes (common and
distinct) implicitly across all the actions. Using maximum aposteriori (MAP) adaptation,
a high-dimensional super action-vector (SAV) for each clip is extracted. As this SAV contains
redundant attributes of all other actions, we use factor analysis to extract a novel lowvi
dimensional action-vector representation for each clip. Action-vectors are shown to suppress
background motion and highlight actions of interest in both trimmed and untrimmed
clips that contributes to action recognition without the help of any classifiers.
It is observed during our experiments that action-vector cannot effectively discriminate
between actions which are visually similar to each other. Hence, we subject action-vectors
to supervised linear embedding using linear discriminant analysis (LDA) and probabilistic
LDA (PLDA) to enforce discrimination. Particularly, we show that leveraging complimentary
information across action-vectors using different local features followed by discriminative
embedding provides the best classification performance. Further, we explore
non-linear embedding of action-vectors using Siamese networks especially for fine-grained
action recognition. A visualization of the hidden layer output in Siamese networks shows
its ability to effectively separate visually similar actions. This leads to better classification
performance than linear embedding on fine-grained action recognition.
All of the above approaches are presented on large unconstrained datasets with hundreds
of examples per action. However, actions in surveillance videos like snatch thefts are
difficult to model because of the diverse variety of scenarios in which they occur and very
few labeled examples. Hence, we propose to utilize the universal attribute model (UAM)
trained on large action datasets to represent such actions. Specifically, we show that there
are similarities between certain actions in the large datasets with snatch thefts which help
in extracting a representation for snatch thefts using the attributes from the UAM. This
representation is shown to be effective in distinguishing snatch thefts from regular actions
with high accuracy.In summary, this thesis proposes both supervised and unsupervised approaches for representing
actions which provide better discrimination than existing representations. The
first approach presents a dictionary learning based sparse representation for effective discrimination
of actions. Also, we propose a sparse representation for the human body based
on dictionaries in order to recognize actions with rapid body deformations. In the next
approach, a low-dimensional representation called action-vector for unsupervised action
recognition is presented. Further, linear and non-linear embedding of action-vectors is
proposed for addressing inter-action similarity and fine-grained action recognition, respectively.
Finally, we propose a representation for locating snatch thefts among thousands of
regular interactions in surveillance videos
Advanced photonic and electronic systems - WILGA 2017
WILGA annual symposium on advanced photonic and electronic systems has been organized by young scientist for young scientists since two decades. It traditionally gathers more than 350 young researchers and their tutors. Ph.D students and graduates present their recent achievements during well attended oral sessions. Wilga is a very good digest of Ph.D. works carried out at technical universities in electronics and photonics, as well as information sciences throughout Poland and some neighboring countries. Publishing patronage over Wilga keep Elektronika technical journal by SEP, IJET by PAN and Proceedings of SPIE. The latter world editorial series publishes annually more than 200 papers from Wilga. Wilga 2017 was the XL edition of this meeting. The following topical tracks were distinguished: photonics, electronics, information technologies and system research. The article is a digest of some chosen works presented during Wilga 2017 symposium. WILGA 2017 works were published in Proc. SPIE vol.10445
Modelos de aprendizaje automático en la detección e identificación de personas: una revisión de literatura
Introduction: This article is the result of research entitled "Development of a prototype to optimize access conditions to the SENA-Pescadero using artificial intelligence and open-source tools", developed at the Servicio Nacional de Aprendizaje in 2020.
Problem: How to identify Machine Learning Techniques applied to computer vision processes through a literature review?
Objective: Determine the application, as well as advantages and disadvantages of machine learning techniques focused on the detection and identification of people.
Methodology: Systematic literature review in 4 high-impact bibliographic and scientific databases, using search filters and information selection criteria.
Results: Machine Learning techniques defined as Principal Component Analysis, Weak Label Regularized Local Coordinate Coding, Support Vector Machines, Haar Cascade Classifiers and EigenFaces and FisherFaces, as well as their applicability in detection and identification processes.
Conclusion: The research led to the identification of the main computational intelligence techniques based on machine learning, applied to the detection and identification of people. Their influence was shown in several application cases, but most of them were focused on the implementation and optimization of access control systems, or tasks in which the identification of people was required for the execution of processes.
Originality: Through this research, we studied and defined the main machine learning techniques currently used for the detection and identification of people.
Limitations: The systematic review is limited to information available in the 4 databases consulted, and the amount of information is variable as articles are deposited in the databases.Introducción: Este artículo es el resultado de la investigación titulada " Desarrollo de un prototipo para optimizar las condiciones de acceso al SENA-Pescadero utilizando inteligencia artificial y herramientas de código abierto", desarrollada en el Servicio Nacional de Aprendizaje en 2020.
Problema: ¿Cómo identificar las técnicas de aprendizaje automático aplicadas a los procesos de visión por computador a través de una revisión bibliográfica?
Objetivo: Determinar la aplicación, así como las ventajas y desventajas de las técnicas de aprendizaje automático enfocadas a la detección e identificación de personas.
Metodología: Revisión sistemática de la literatura en 4 bases de datos bibliográficas y científicas de alto impacto, utilizando filtros de búsqueda y criterios de selección de información.
Resultados: Técnicas de aprendizaje automático definidas como Análisis de Componentes Principales, Codificación Local de Coordenadas Regularizada de Etiquetas Débiles, Máquinas de Vectores de Soporte, Clasificadores en Cascada de Haar y EigenFaces y FisherFaces, así como su aplicabilidad en procesos de detección e identificación.
Conclusiones: La investigación permitió identificar las principales técnicas de inteligencia computacional basadas en machine learning aplicadas a la detección e identificación de personas. Su influencia se mostró en varios casos de aplicación, pero la mayoría de ellos se centraron en la implementación y optimización de sistemas de control de acceso, o tareas en las que se requería la identificación de personas para la ejecución de procesos
Originalidad: A través de esta investigación se estudiaron y definieron las principales técnicas de machine learning utilizadas actualmente para la detección e identificación de personas
Eye quietness and quiet eye in expert and novice golf performance: an electrooculographic analysis
Quiet eye (QE) is the final ocular fixation on the target of an action (e.g., the ball in golf putting). Camerabased eye-tracking studies have consistently found longer QE durations in experts than novices; however, mechanisms underlying QE are not known. To offer a new perspective we examined the feasibility of measuring the QE using electrooculography (EOG) and developed an index to assess ocular activity across time: eye quietness (EQ). Ten expert and ten novice golfers putted 60 balls to a 2.4 m distant hole. Horizontal EOG (2ms resolution) was recorded from two electrodes placed on the outer sides of the eyes. QE duration was measured using a EOG voltage threshold and comprised the sum of the pre-movement and post-movement initiation components. EQ was computed as the standard deviation of the EOG in 0.5 s bins from –4 to +2 s, relative to backswing initiation: lower values indicate less movement of the eyes, hence greater quietness. Finally, we measured club-ball address and swing durations. T-tests showed that total QE did not differ between groups (p = .31); however, experts had marginally shorter pre-movement QE (p = .08) and longer post-movement QE (p < .001) than novices. A group × time ANOVA revealed that experts had less EQ before
backswing initiation and greater EQ after backswing initiation (p = .002). QE durations were inversely correlated with EQ from –1.5 to 1 s (rs = –.48 - –.90, ps = .03 - .001). Experts had longer swing durations than novices (p = .01) and, importantly, swing durations correlated positively with post-movement QE (r = .52, p = .02) and negatively with EQ from 0.5 to 1s (r = –.63, p = .003). This study demonstrates the feasibility of measuring ocular activity using EOG and validates EQ as an index of ocular activity. Its findings challenge the dominant perspective on QE and provide new evidence that expert-novice differences in ocular activity may reflect differences in the kinematics of how experts and novices execute skills
The role of the biomechanics analyst in swimming training and competition analysis
Swimming analysts aid coaches and athletes in the decision-making by providing evidence-based recommendations. The aim of this narrative review was to report the best practices of swimming analysts that have been supporting high-performance athletes. It also aims to share how swimming analysts can translate applied research into practice. The role of the swimming analyst, as part of a holistic team supporting high-performance athletes, has been expanding and is needed to be distinguished from the job scope of a swimming researcher. As testing can be time-consuming, analysts must decide what to test and when to conduct the evaluation sessions. Swimming analysts engage in the modelling and forecast of the performance, that in short- and mid-term can help set races target-times, and in the long-term provide insights on talent and career development. Races can be analysed by manual, semi-automatic or fully automatic video analysis with single or multi-cameras set-ups. The qualitative and quantitative analyses of the swim strokes, start, turns, and finish are also part of the analyst job scope and associated with race performance goals. Land-based training is another task that can be assigned to analysts and aims to enhance the performance, prevent musculoskeletal injuries and monitor its risk factors.This project was funded by the FCT - Portuguese Foundation for Science and Technology (UIDB/DTP/04045/2020).info:eu-repo/semantics/publishedVersio
- …