4,069 research outputs found
Exploiting multimedia in creating and analysing multimedia Web archives
The data contained on the web and the social web are inherently multimedia and consist of a mixture of textual, visual and audio modalities. Community memories embodied on the web and social web contain a rich mixture of data from these modalities. In many ways, the web is the greatest resource ever created by human-kind. However, due to the dynamic and distributed nature of the web, its content changes, appears and disappears on a daily basis. Web archiving provides a way of capturing snapshots of (parts of) the web for preservation and future analysis. This paper provides an overview of techniques we have developed within the context of the EU funded ARCOMEM (ARchiving COmmunity MEMories) project to allow multimedia web content to be leveraged during the archival process and for post-archival analysis. Through a set of use cases, we explore several practical applications of multimedia analytics within the realm of web archiving, web archive analysis and multimedia data on the web in general
Information scraps: how and why information eludes our personal information management tools
In this paper we describe information scraps -- a class of personal information whose content is scribbled on Post-it notes, scrawled on corners of random sheets of paper, buried inside the bodies of e-mail messages sent to ourselves, or typed haphazardly into text files. Information scraps hold our great ideas, sketches, notes, reminders, driving directions, and even our poetry. We define information scraps to be the body of personal information that is held outside of its natural or We have much still to learn about these loose forms of information capture. Why are they so often held outside of our traditional PIM locations and instead on Post-its or in text files? Why must we sometimes go around our traditional PIM applications to hold on to our scraps, such as by e-mailing ourselves? What are information scraps' role in the larger space of personal information management, and what do they uniquely offer that we find so appealing? If these unorganized bits truly indicate the failure of our PIM tools, how might we begin to build better tools? We have pursued these questions by undertaking a study of 27 knowledge workers. In our findings we describe information scraps from several angles: their content, their location, and the factors that lead to their use, which we identify as ease of capture, flexibility of content and organization, and avilability at the time of need. We also consider the personal emotive responses around scrap management. We present a set of design considerations that we have derived from the analysis of our study results. We present our work on an application platform, jourknow, to test some of these design and usability findings
Spoken content retrieval: A survey of techniques and technologies
Speech media, that is, digital audio and video containing spoken content, has blossomed in recent years. Large collections are accruing on the Internet as well as in private and enterprise settings. This growth has motivated extensive research on techniques and technologies that facilitate reliable indexing and retrieval. Spoken content retrieval (SCR) requires the combination of audio and speech processing technologies with methods from information retrieval (IR). SCR research initially investigated planned speech structured in document-like units, but has subsequently shifted focus to more informal spoken content produced spontaneously, outside of the studio and in conversational settings. This survey provides an overview of the field of SCR encompassing component technologies, the relationship of SCR to text IR and automatic speech recognition and user interaction issues. It is aimed at researchers with backgrounds in speech technology or IR who are seeking deeper insight on how these fields are integrated to support research and development, thus addressing the core challenges of SCR
An Indian Based MOOC: An Overview
A massive open online course (MOOC) is a model for delivering learning content online to a large number of people. These are online courses with unrestricted numbers of participation and open accessibility. MOOC gained popularity during the early 2010s and have been presented as an alternative to the traditional instructional model, focused on the lecturing instructor and relying on digital content that is available to all students. The present study attempt to introduce all the MOOC available in India, compare them using Similarweb pro, explain why country like India needs MOOC, its challenges and provide few points to address for proper success of MOOC
MPEG-7 Based Image Retrieval on the World Wide Web
Due to the rapid growth of the number of digital media elements like image, video, audio, graphics on
Internet, there is an increasing demand for effective search and retrieval techniques. Recently, many search
engines have made image search as an option like Google, AlltheWeb, AltaVista, Freenet. In addition to this,
Ditto, Picsearch, can search only the images on Internet. There are also other domain specific search engines
available for graphics and clip art, audio, video, educational images, artwork, stock photos, science and nature
[www.faganfinder.com/img]. These entire search engines are directory based. They crawls the entire Internet and
index all the images in certain categories. They do not display the images in any particular order with respect to
the time and context. With the availability of MPEG-7, a standard for describing multimedia content, it is now
possible to store the images with its metadata in a structured format. This helps in searching and retrieving the
images. The MPEG-7 standard uses XML to describe the content of multimedia information objects. These
objects will have metadata information in the form of MPEG-7 or any other similar format associated with them. It
can be used in different ways to search the objects. In this paper we propose a system, which can do content
based image retrieval on the World Wide Web. It displays the result in user-defined order
- âŠ