4,669 research outputs found

Searching Multimedia Data using MPEG-7 Descriptions in a Broadcast Terminal

Author: Lalmas Mounia
Mory Benoit
Moutogianni Katerina
Putz Wolfgang
Rölleke Thomas
Publication date: 30/12/2013

Spoken content retrieval: A survey of techniques and technologies

Author: Ani Nenkova
C A. Nenkova
K. Mckeown
Kathleen Mckeown
Publication venue: Now Publishers
Publication date: 01/01/2012

Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings. This growth has motivated extensive research on techniques and technologies that facilitate reliable indexing and retrieval. Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR). SCR research initially investigated planned speech structured in document-like units, but has subsequently shifted focus to more informal spoken content produced spontaneously, outside of the studio and in conversational settings. This survey provides an overview of the field of SCR encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition, and user interaction issues. It is aimed at researchers with backgrounds in speech technology or IR who are seeking deeper insight on how these fields are integrated to support research and development, thus addressing the core challenges of SCR.

CiteSeerX

Crossref

Irish Universities

DCU Online Research Access Service

Augmenting conversations through context-aware multimedia retrieval based on speech recognition

Author: Balcısoy Selim
Cansoy Murat Çelik
Erçil Aytül
Yüksel Kamer Ali
Publication venue: Google
Publication date: 24/10/2011

Future environments will be sensitive and responsive to the presence of people, supporting them in carrying out their everyday activities, tasks and rituals in an easy and natural way. Such interactive spaces will use information and communication technologies to bring computation into the physical world in order to enhance the ordinary activities of their users. This paper describes a speech-based multimedia retrieval system that can present relevant video-podcast (vodcast) footage in response to spontaneous speech and conversations during daily life activities. The proposed system allows users to search the spoken content of multimedia files rather than their associated meta-information, and lets them navigate to the portion where the queried words are spoken by facilitating within-medium searches of multimedia content through a bag-of-words approach. Finally, we have studied the proposed system in different scenarios, using English-language vodcasts from various categories as the targeted multimedia, and discussed how it would enhance people’s everyday activities in scenarios including education, entertainment, marketing, news and the workplace.

Sabanci University Research Database

An architecture for life-long user modelling

Author: Elliott D.
Hopfgartner F.
Jose J.M.
Leelanupab T.
Moshfeghi Y.
Publication date: 01/01/2009

In this paper, we propose a united architecture for the creation of life-long user profiles. Our architecture combines the different steps required for a user profile, including feature extraction and representation, reasoning, recommendation and presentation. We discuss various issues that arise in the context of life-long profiling.

CiteSeerX

Enlighten