68 research outputs found

    Caption-guided patent image segmentation

    Full text link

    VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval.

    Get PDF
    This paper presents VERGE interactive search engine, which is capable of browsing and searching into video content. The system integrates content-based analysis and retrieval modules such as video shot segmentation, concept detection, clustering, as well as visual similarity and object-based search

    COST292 experimental framework for TRECVID 2008

    Get PDF
    In this paper, we give an overview of the four tasks submitted to TRECVID 2008 by COST292. The high-level feature extraction framework comprises four systems. The first system transforms a set of low-level descriptors into the semantic space using Latent Semantic Analysis and utilises neural networks for feature detection. The second system uses a multi-modal classifier based on SVMs and several descriptors. The third system uses three image classifiers based on ant colony optimisation, particle swarm optimisation and a multi-objective learning algorithm. The fourth system uses a Gaussian model for singing detection and a person detection algorithm. The search task is based on an interactive retrieval application combining retrieval functionalities in various modalities with a user interface supporting automatic and interactive search over all queries submitted. The rushes task submission is based on a spectral clustering approach for removing similar scenes based on eigenvalues of frame similarity matrix and and a redundancy removal strategy which depends on semantic features extraction such as camera motion and faces. Finally, the submission to the copy detection task is conducted by two different systems. The first system consists of a video module and an audio module. The second system is based on mid-level features that are related to the temporal structure of videos

    The COST292 experimental framework for TRECVID 2007

    Get PDF
    In this paper, we give an overview of the four tasks submitted to TRECVID 2007 by COST292. In shot boundary (SB) detection task, four SB detectors have been developed and the results are merged using two merging algorithms. The framework developed for the high-level feature extraction task comprises four systems. The first system transforms a set of low-level descriptors into the semantic space using Latent Semantic Analysis and utilises neural networks for feature detection. The second system uses a Bayesian classifier trained with a “bag of subregions”. The third system uses a multi-modal classifier based on SVMs and several descriptors. The fourth system uses two image classifiers based on ant colony optimisation and particle swarm optimisation respectively. The system submitted to the search task is an interactive retrieval application combining retrieval functionalities in various modalities with a user interface supporting automatic and interactive search over all queries submitted. Finally, the rushes task submission is based on a video summarisation and browsing system comprising two different interest curve algorithms and three features

    The COST292 experimental framework for TRECVID 2007

    Get PDF
    In this paper, we give an overview of the four tasks submitted to TRECVID 2007 by COST292. In shot boundary (SB) detection task, four SB detectors have been developed and the results are merged using two merging algorithms. The framework developed for the high-level feature extraction task comprises four systems. The first system transforms a set of low-level descriptors into the semantic space using Latent Semantic Analysis and utilises neural networks for feature detection. The second system uses a Bayesian classifier trained with a "bag of subregions". The third system uses a multi-modal classifier based on SVMs and several descriptors. The fourth system uses two image classifiers based on ant colony optimisation and particle swarm optimisation respectively. The system submitted to the search task is an interactive retrieval application combining retrieval functionalities in various modalities with a user interface supporting automatic and interactive search over all queries submitted. Finally, the rushes task submission is based on a video summarisation and browsing system comprising two different interest curve algorithms and three features

    The development of a video retrieval system using a clinician-led approach

    Get PDF
    Patient video taken at home can provide valuable insights into the recovery progress during a programme of physical therapy, but is very time consuming for clinician review. Our work focussed on (i) enabling any patient to share information about progress at home, simply by sharing video and (ii) building intelligent systems to support Physical Therapists (PTs) in reviewing this video data and extracting the necessary detail. This paper reports the development of the system, appropriate for future clinical use without reliance on a technical team, and the clinician involvement in that development. We contribute an interactive content-based video retrieval system that significantly reduces the time taken for clinicians to review videos, using human head movement as an example. The system supports query-by-movement (clinicians move their own body to define search queries) and retrieves the essential fine-grained movements needed for clinical interpretation. This is done by comparing sequences of image-based pose estimates (here head rotations) through a distance metric (here Fréchet distance) and presenting a ranked list of similar movements to clinicians for review. In contrast to existing intelligent systems for retrospective review of human movement, the system supports a flexible analysis where clinicians can look for any movement that interests them. Evaluation by a group of PTs with expertise in training movement control showed that 96% of all relevant movements were identified with time savings of as much as 99.1% compared to reviewing target videos in full. The novelty of this contribution includes retrospective progress monitoring that preserves context through video, and content-based video retrieval that supports both fine-grained human actions and query-by-movement. Future research, including large clinician-led studies, will refine the technical aspects and explore the benefits in terms of patient outcomes, PT time, and financial savings over the course of a programme of therapy. It is anticipated that this clinician-led approach will mitigate the reported slow clinical uptake of technology with resulting patient benefit

    Awakening to languages in the training of the Greek teachers : towards a dynamic model of action-research

    No full text
    L'éducation interculturelle dans la formation des enseignants ne se limite pas à l'idée de la tolérance et de l'acceptation de l'autre. elle comporte trois principes cohérents : l'éveil et le renforcement de la réflexion critique chez l'enseignant, son intérêt et sa flexibilité à gérer des innovations dans l'éducation, la construction d'une conception plus globale et d'une gestion plus efficace de la complexité sociale et humaine. notre travail présente une recherche-action qui a duré deux ans et qui a visé à la formation (longue durée) des enseignants grecs. elle s'est basée sur l'hypothèse générale que l'éveil aux langues est susceptible de créer chez les enseignants des savoirs, des attitudes et des aptitudes qui leur permettent de mieux valoriser le capital linguistique et culturel de leurs élèves ainsi que de leur donner un ensemble de pratiques et une typologie de compétences qui leur permettraient de faire des choses avec les langues dans tous les domaines disciplinaires. de plus nous considérons qu'une formation de type recherche-action sur l'innovation éveil aux langues est susceptible d'aider les enseignants à mieux valoriser le capital linguistique et culturel de leurs élèves, à développer l'interculturalité sous ses divers aspects dans leurs pratiques éducatives ainsi que de mener à bien une éducation langagière en fonction des besoins et des capacités de petits locuteurs de langues variées et sur le plan d'une sensibilisation systématique aux compétences métalinguistiques, métacognitives et interculturelles.Intercultural Training in teacher education is not limited to the idea of tolerance and acceptance of others. It consists of three integrated principles: the awakening and strengthening of critical thinking among the teacher, his interest in the implementation of educational innovation and the ability to build a more holistic view and more effective management of human and social complexity. Our work presents an action-research project that lasted two years and was aimed at training (long-term) of the Greek teachers. The final sample who participated in our research is 10 persons, all early childhood, primary and high school teachers who are working in multilingual classes. The training model called "Evolutionary training model" is based on the general assumption that the innovation of the Awakening to Languages, when en-golfed by teachers education, may create among teachers knowledge, attitudes and skills that enable them to make better use of the linguistic and cultural capital of their students and provide them a set of practices and a typology of skills that can facilitate them to work with languages throughout the curriculum. To test our hypothesis we chose a triangular approach. Research tools in part have been developed by us, in part from comparable research. These are two types of questionnaires, group interviews recorded and transcribed. In addition, we have based on our own observations as well as the experiment conducted by teachers in multilingual early childhood and primary school classes. In our participatory and action-oriented training, a second set of assumptions has emerged : our long group discussions, individual interviews, our observations have led us to ask whether a dynamic and systemic approach to the type of action-research training, as has been the training at the Awakening Languages, may create the necessary conditions, intra psychic and intra groupal so that the teachers develop a reflexive attitude towards their own, representations, manage their own social and professional problems in a dynamic way and stop feeling professional isolation. The main conclusion is that before talking about an effective intercultural education, we need to modify some elements in the socio-professional and personal identity of the teachers because the innovation of Awakening to Languages can help teachers realize their own representations of linguistic and cultural diversity in the classroorn, as well as their teaching practices and renegotiate with them

    L'éveil aux langues dans la formation des enseignant/es grec/ques : vers un modèle dynamique de formation-action

    No full text
    Intercultural Training in teacher education is not limited to the idea of tolerance and acceptance of others. It consists of three integrated principles: the awakening and strengthening of critical thinking among the teacher, his interest in the implementation of educational innovation and the ability to build a more holistic view and more effective management of human and social complexity. Our work presents an action-research project that lasted two years and was aimed at training (long-term) of the Greek teachers. The final sample who participated in our research is 10 persons, all early childhood, primary and high school teachers who are working in multilingual classes. The training model called "Evolutionary training model" is based on the general assumption that the innovation of the Awakening to Languages, when en-golfed by teachers education, may create among teachers knowledge, attitudes and skills that enable them to make better use of the linguistic and cultural capital of their students and provide them a set of practices and a typology of skills that can facilitate them to work with languages throughout the curriculum. To test our hypothesis we chose a triangular approach. Research tools in part have been developed by us, in part from comparable research. These are two types of questionnaires, group interviews recorded and transcribed. In addition, we have based on our own observations as well as the experiment conducted by teachers in multilingual early childhood and primary school classes. In our participatory and action-oriented training, a second set of assumptions has emerged : our long group discussions, individual interviews, our observations have led us to ask whether a dynamic and systemic approach to the type of action-research training, as has been the training at the Awakening Languages, may create the necessary conditions, intra psychic and intra groupal so that the teachers develop a reflexive attitude towards their own, representations, manage their own social and professional problems in a dynamic way and stop feeling professional isolation. The main conclusion is that before talking about an effective intercultural education, we need to modify some elements in the socio-professional and personal identity of the teachers because the innovation of Awakening to Languages can help teachers realize their own representations of linguistic and cultural diversity in the classroorn, as well as their teaching practices and renegotiate with them.L'éducation interculturelle dans la formation des enseignants ne se limite pas à l'idée de la tolérance et de l'acceptation de l'autre. elle comporte trois principes cohérents : l'éveil et le renforcement de la réflexion critique chez l'enseignant, son intérêt et sa flexibilité à gérer des innovations dans l'éducation, la construction d'une conception plus globale et d'une gestion plus efficace de la complexité sociale et humaine. notre travail présente une recherche-action qui a duré deux ans et qui a visé à la formation (longue durée) des enseignants grecs. elle s'est basée sur l'hypothèse générale que l'éveil aux langues est susceptible de créer chez les enseignants des savoirs, des attitudes et des aptitudes qui leur permettent de mieux valoriser le capital linguistique et culturel de leurs élèves ainsi que de leur donner un ensemble de pratiques et une typologie de compétences qui leur permettraient de faire des choses avec les langues dans tous les domaines disciplinaires. de plus nous considérons qu'une formation de type recherche-action sur l'innovation éveil aux langues est susceptible d'aider les enseignants à mieux valoriser le capital linguistique et culturel de leurs élèves, à développer l'interculturalité sous ses divers aspects dans leurs pratiques éducatives ainsi que de mener à bien une éducation langagière en fonction des besoins et des capacités de petits locuteurs de langues variées et sur le plan d'une sensibilisation systématique aux compétences métalinguistiques, métacognitives et interculturelles

    Fusion of Compound Queries with Multiple Modalities for Known Item Video Search

    No full text
    Multimedia collections are ubiquitous and very often contain hundreds of hours of video information. The retrieval of a particular scene of a video (Known Item Search) in a large collection is a difficult problem, considering the multimodal character of all video shots and the complexity of the query, either visual or textual. We tackle these challenges by fusing, first, multiple modalities in a nonlinear graph-based way for each subtopic of the query. In addition, we fuse the top retrieved video shots per sub-query to provide the final list of retrieved shots, which is then re-ranked using temporal information. The framework is evaluated in popular Known Item Search tasks in the context of video shot retrieval and provides the largest Mean Reciprocal Rank scores
    corecore