
    A spoken document retrieval application in the oral history domain

    The application of automatic speech recognition in the broadcast news domain is well studied. Recognition performance is generally high and accordingly, spoken document retrieval can successfully be applied in this domain, as demonstrated by a number of commercial systems. In other domains, a similar recognition performance is hard to obtain, or even far out of reach, for example due to lack of suitable training material. This is a serious impediment for the successful application of spoken document retrieval techniques for data other than news. This paper outlines our first steps towards a retrieval system that can automatically be adapted to new domains. We discuss our experience with a recently implemented spoken document retrieval application attached to a web portal that aims at the disclosure of a multimedia data collection in the oral history domain. The paper illustrates that simply deploying an off-the-shelf broadcast news system in this task domain will produce error rates that are too high to be useful for retrieval tasks. By applying adaptation techniques on the acoustic level and language model level, system performance can be improved considerably, but additional research on unsupervised adaptation and search interfaces is required to create an adequate search environment based on speech transcripts.

    Overview of GINIX and Top-k Method

    Today, more and more applications are web based, and people communicate with each other over the Internet. This involves ever more data retrieval from database systems on user demand. An inverted index is a search structure in which lookups proceed sequentially through the index, so searching takes more time. Ginix, by contrast, searches by word and finds all files or related documents that contain that word. However, it searches only document files saved in the database system, not multimedia files. A more capable technique is therefore the top-k method, in which the whole database is scanned to find the appropriate results for a given query. Data on web pages is searched as well. It provides more accurate results in less time than Ginix. DOI: 10.17762/ijritcc2321-8169.15011
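
As a rough illustration of the two ideas named in this abstract, the sketch below builds a plain inverted index and answers a query with a top-k selection. It is not the paper's method and not Ginix itself: the function names `build_inverted_index` and `top_k`, the whitespace tokenizer, and the term-frequency score are all assumptions made for this example.

```python
import heapq
from collections import Counter, defaultdict


def build_inverted_index(docs):
    """Map each term to a Counter of {doc_id: term frequency}.

    `docs` is a dict of doc_id -> text. Tokenization is a plain
    lowercase whitespace split (an assumption for this sketch).
    """
    index = defaultdict(Counter)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term][doc_id] += 1
    return index


def top_k(index, query, k):
    """Return up to k (doc_id, score) pairs, best first.

    The score is the summed term frequency of the query terms
    (a hypothetical scoring choice, not taken from the paper).
    Ties are broken by doc_id so the result is deterministic.
    """
    scores = Counter()
    for term in query.lower().split():
        # .get avoids creating empty entries in the defaultdict
        for doc_id, tf in index.get(term, {}).items():
            scores[doc_id] += tf
    return heapq.nlargest(k, scores.items(), key=lambda item: (item[1], item[0]))


if __name__ == "__main__":
    docs = {
        "a": "web search index",
        "b": "index index search",
        "c": "multimedia files",
    }
    index = build_inverted_index(docs)
    print(top_k(index, "index search", 2))
```

Only the postings of the query terms are visited, and `heapq.nlargest` keeps the k best candidates without sorting every scored document.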

    Management of e-Resources in R&D Centers: A Case Study of the Information Center at NAL

    The developments in information technology and their applications to library and information services have given a new dimension to the entire spectrum of information management. The information generated is usually stored in four physical media: paper, film, optical, and magnetic disks. The e-document, be it a book, journal, technical report, or conference proceedings, is portable; has random access to its contents; and can also be a multimedia object, in that it may contain not only text, but also graphics, drawings, photographs or video. Now we have the emergence of publications over the electronic networks, and the activity took off in a big way following the invention of the World Wide Web. The Open Access movement is becoming the order of the day. More than 3000 journals are free on the net for anybody to access. A number of institutional repositories and e-Prints archives have thrown a challenge to the publishing industry. The consortium approach, through different pricing, management and licensing models, is enabling libraries to provide access to thousands of e-journals, e-books and other kinds of e-documents. The Information Center at NAL, with its state-of-the-art library, has progressed a good deal in this direction by acquiring different kinds of documents, especially in e-form, cataloguing and processing them appropriately, storing them, and giving access to its patrons not only on the library premises, but also on the desktops spread across three different campuses through the laboratory LAN, and by extending selected services through the Internet for the benefit of anybody from any part of the world.
    Created and maintained by ICAST, the portal ‘AeroInfo’ (www.aeroinfo.org.in) serves as a one-window information search facility for Web sources in aerospace science and technology. This virtual library offers multiple approaches to information seekers, as the web sources are indexed and organised using different schemes of classification, including NASA subject categories. Care is taken to cover Indian aerospace sources exhaustively. The ICAST site (www.icast.org.in), apart from giving detailed information about library sources including books, journals, e-journals, databases and technical reports, makes available different search tools for its users. Other details like working hours, library rules, staff details, contact persons, etc. are provided. One can submit an online query and suggest documents for acquisition using the online forms provided. The Library Database (OPAC) is probably the single largest in the country, with more than 3.25 lakh bibliographic records of books, technical reports, patents, standards, journals, etc. ICAST users can search international databases like Aerospace Database, NTIS, J-Gate, Medline, etc. through the campus LAN. Users can access more than 2500 full-text journals covering titles published by Elsevier (ScienceDirect), ASME, AIAA, Springer, John Wiley, OUP, CUP, AMS, World Scientific, a few Annual Series, etc. An e-journals gateway created by ICAST, with a browse and search (alphabetical and subject-wise) facility for titles, provides access to more than 700 journals available free on the net. The Centre provides a number of innovative web/e-mail based information services, including Journal Contents Service, News Clipping Service, Monthly Documents Additions Lists covering both Books and Technical Reports, Web Alert Service and Union Catalogue of Journals (CSIR and Aerospace Libraries), etc.

    Information extraction from multimedia web documents: an open-source platform and testbed

    The LivingKnowledge project aimed to enhance the current state of the art in search, retrieval and knowledge management on the web by advancing the use of sentiment and opinion analysis within multimedia applications. To achieve this aim, a diverse set of novel and complementary analysis techniques have been integrated into a single, but extensible software platform on which such applications can be built. The platform combines state-of-the-art techniques for extracting facts, opinions and sentiment from multimedia documents, and unlike earlier platforms, it exploits both visual and textual techniques to support multimedia information retrieval. Foreseeing the usefulness of this software in the wider community, the platform has been made generally available as an open-source project. This paper describes the platform design, gives an overview of the analysis algorithms integrated into the system and describes two applications that utilise the system for multimedia information retrieval

    Multimedia Answering and Retrieval System based on CQA with Media Query Generation

    Question answering systems (QAS) have recently received attention from the information retrieval, machine learning, information extraction, and natural language processing communities. The goal of a QAS is to retrieve the answer to a question rather than full documents. Such a system works through modules for question processing, document processing, and answer processing. A QAS does not work properly when its main module, question processing, fails to categorize the questions correctly. To overcome this limitation, community question answering (CQA) has gained popularity. Compared with QAS and automated QA sites, CQA sites are more effective. A drawback of community question answering, however, is that it provides only textual answers. In this paper, we propose a scheme that enhances the textual answer with multimedia data. The outline of the scheme mainly consists of three components: selection of the answer medium, query generation for multimedia search, and selection and presentation of multimedia data. The approach automatically determines which type of media information should be added to the textual answer, and then automatically collects data from the web to supplement the answer. By processing an available dataset of QA pairs and adding them to a pool, our approach enables a new multimedia question answering (MMQA) approach in which users can find multimedia answers by matching their questions against those in the pool. Users can therefore use MMQA to obtain answers from web information in different media formats (text, video, and image), as selected by the users.

    Never the Same Stream: netomat, XLink, and Metaphors of Web Documents

    Document engineering employs practices of modeling and representation. Enactment of these practices relies on shared metaphors. However, choices driven by metaphor often receive less attention than those driven by factors critical to developing working systems, such as performance and usability. One way to remedy this issue is to take a historical approach, studying cases without a guiding concern for their ongoing development and maintenance. In this paper, we compare two historical case studies of "failed" designs for hypertext on the Web. The first case is netomat (1999), a Web browser created by the artist Maciej Wisniewski, which responded to search queries with dynamic multimedia streams culled from across the Web and structured by a custom markup language. The second is the XML Linking Language (XLink), a W3C standard to express hypertext links within and between XML documents. Our analysis focuses on the relationship between the metaphors used to make sense of Web documents and the hypermedia structures they compose. The metaphors offered by netomat and XLink stand as alternatives to metaphors of the "page" or the "app." Our intent here is not to argue that any of these metaphors are superior, but to consider how designers' and engineers' metaphorical choices are situated within a complex of already existing factors shaping Web technology and practice. The results provide insight into underexplored interconnections between art and document engineering at a critical moment in the history of the Web, and demonstrate the value for designers and engineers of studying "paths not taken" during the history of the technologies we work on today

    CHORUS Deliverable 2.2: Second report - identification of multi-disciplinary key issues for gap analysis toward EU multimedia search engines roadmap

    After addressing the state of the art during the first year of Chorus and establishing the existing landscape in multimedia search engines, we identified and analyzed gaps within the European research effort during our second year. In this period we focused on three directions, notably technological issues, user-centred issues and use cases, and socio-economic and legal aspects. These were assessed by two central studies: firstly, a concerted vision of the functional breakdown of a generic multimedia search engine, and secondly, representative use-case descriptions with the related discussion of requirements for technological challenges. Both studies have been carried out in cooperation and consultation with the community at large through EC concertation meetings (multimedia search engines cluster), several meetings with our Think-Tank, presentations at international conferences, and surveys addressed to EU project coordinators as well as national initiative coordinators. Based on the obtained feedback we identified two types of gaps, namely core technological gaps that involve research challenges, and “enablers”, which are not necessarily technical research challenges, but have an impact on innovation progress. New socio-economic trends are presented as well as emerging legal challenges.

    SMIL State: an architecture and implementation for adaptive time-based web applications

    In this paper we examine adaptive time-based web applications (or presentations). These are interactive presentations where time dictates which parts of the application are presented (providing the major structuring paradigm), and that require interactivity and other dynamic adaptation. We investigate the current technologies available to create such presentations and their shortcomings, and suggest a mechanism for addressing these shortcomings. This mechanism, SMIL State, can be used to add user-defined state to declarative time-based languages such as SMIL or SVG animation, thereby enabling the author to create control flows that are difficult to realize within the temporal containment model of the host languages. In addition, SMIL State can be used as a bridging mechanism between languages, enabling easy integration of external components into the web application. Finally, SMIL State enables richer expressions for content control. This paper defines SMIL State in terms of an introductory example, followed by a detailed specification of the State model. Next, the implementation of this model is discussed. We conclude with a set of potential use cases, including dynamic content adaptation and delayed insertion of custom content such as advertisements. © 2009 Springer Science+Business Media, LLC

    Multimedia search without visual analysis: the value of linguistic and contextual information

    This paper addresses the focus of this special issue by analyzing the potential contribution of linguistic content and other non-image aspects to the processing of audiovisual data. It summarizes the various ways in which linguistic content analysis contributes to enhancing the semantic annotation of multimedia content, and, as a consequence, to improving the effectiveness of conceptual media access tools. A number of techniques are presented, including the time-alignment of textual resources, audio and speech processing, content reduction and reasoning tools, and the exploitation of surface features