10 research outputs found

    Dynamic Multimodal Fusion in Video Search

    Full text link

    Die Sphere-Search-Suchmaschine zur graphbasierten Suche auf heterogenen, semistrukturierten Daten

    Get PDF
    In dieser Arbeit wird die neuartige SphereSearch-Suchmaschine vorgestellt, die ein einheitliches ranglistenbasiertes Retrieval auf heterogenen XML- und Web-Daten ermöglicht. Ihre FĂ€higkeiten umfassen die Auswertung von vagen Struktur- und Inhaltsbedingungen sowie ein auf IR-Statistiken und einem graph-basierten Datenmodell basierendes Relevanz-Ranking. Web-Dokumente im HTML- und PDFFormat werden zunĂ€chst automatisch in ein XML-Zwischenformat konvertiert und anschließend mit Hilfe von Annotations-Tools durch zusĂ€tzliche Tags semantisch angereichtert. Die graph-basierte Suchmaschine bietet auf semi-strukturierten Daten vielfĂ€ltige Suchmöglichkeiten, die von keiner herkömmlichen Web- oder XMLSuchmaschine ausgedrĂŒckt werden können: konzeptbewusste und kontextbewusste Suche, die sowohl die implizite Struktur von Daten als auch ihren Kontext berĂŒcksichtigt. Die Vorteile der SphereSearch-Suchmaschine werden durch Experimente auf verschiedenen Dokumentenkorpora demonstriert. Diese umfassen eine große, vielfĂ€ltige Tags beinhaltende, nicht-schematische EnzyklopĂ€die, die um externe Dokumente erweitert wurde, sowie einen Standard-XML-Benchmark.This thesis presents the novel SphereSearch Engine that provides unified ranked retrieval on heterogeneous XML andWeb data. Its search capabilities include vague structure and text content conditions, and relevance ranking based on IR statistics and a graph-based data model. Web pages in HTML or PDF are automatically converted into an intermediate XML format, with the option of generating semantic tags by means of linguistic annotation tools. For semi-structured data the graphbased query engine is leveraged to provide very rich search options that cannot be expressed in traditional Web or XML search engines: concept-aware and linkaware querying that takes into account the implicit structure and context of Web pages. The benefits of the SphereSearch engine are demonstrated by experiments with a large and richly tagged but non-schematic open encyclopedia extended with external documents and a standard XML benchmark

    ModÚle flexible pour la Recherche d'Information dans des corpus de documents semi-structurés

    Get PDF
    Structural information contained in semi-structured documents can be used to focus on relevant information. The aim of Information Retrieval System is then to retrieve relevant information units instead of whole documents. We propose here the XFIRM model (XML Flexible Information Retrieval model), which is based on: (i) a generic data representation model, allowing the modelling of documents having heterogeneous structures; (ii) a flexible query language that allows the expression of users needs according to many precision degrees, by expressing (or not) conditions on the documents structure; (iii) a retrieval model based on a relevance propagation method, which aims at finding the most exhaustive and specific information units answering the query. The interest of our propositions has been shown thanks to the prototype we developedLa nature de sources d'information Ă©volue, et les documents numĂ©riques traditionnels plats ne contenant que du texte s'enrichissent d'information structurelle et multimĂ©dia. Cette Ă©volution est accĂ©lĂ©rĂ©e par l'expansion du Web, et les documents semi-structurĂ©s de type XML (eXtensible Markup Language) tendent Ă  former la majoritĂ© des documents numĂ©riques mis Ă  disposition des utilisateurs. Le dĂ©veloppement d'outils automatisĂ©s permettant un accĂšs efficace Ă  ce nouveau type d'information numĂ©rique apparaĂźt comme une nĂ©cessitĂ©. Afin de valoriser au mieux l'ensemble des informations disponibles, les mĂ©thodes existantes de Recherche d'Information (RI) doivent ĂȘtre adaptĂ©es. L'information structurelle des documents peut en effet servir Ă  affiner le concept de granule documentaire. Le but pour les SystĂšmes de Recherche d'Information (SRI) est alors de retrouver des unitĂ©s d'information (et non plus de documents) pertinentes Ă  des requĂȘtes utilisateur. Afin de rĂ©pondre Ă  cette problĂ©matique fondamentale, de nouveaux modĂšles prenant en compte l'information structurelle des documents, tant au niveau de l'indexation, de l'interrogation que de la recherche doivent ĂȘtre construits. L'objectif de nos travaux est de proposer un modĂšle permettant d'effectuer des recherches flexibles dans des corpus de document semi-structurĂ©s. Ceci nous a conduit Ă  proposer le modĂšle XFIRM (XML Flexible Information Retrieval Model ) reposant sur : (i) Un modĂšle de reprĂ©sentation des donnĂ©es gĂ©nĂ©rique, permettant de modĂ©liser des documents possĂ©dant des structures diffĂ©rentes ; (ii) Un langage de requĂȘte flexible, permettant Ă  l'utilisateur d'exprimer son besoin selon divers degrĂ©s de prĂ©cision, en exprimant ou non des conditions sur la structure des documents ; (iii) Un modĂšle de recherche basĂ©e sur une mĂ©thode de propagation de la pertinence. Ce modĂšle a pour but de trouver les unitĂ©s d'information les plus exhaustives et spĂ©cifiques rĂ©pondant Ă  une requĂȘte utilisateur, que celle-ci contienne ou non des conditions de structure. Les documents semi-structurĂ©s peuvent ĂȘtre reprĂ©sentĂ©s sous forme arborescente, et le but est alors de trouver les sous-arbres de taille minimale rĂ©pondant Ă  la requĂȘte. Les recherches sur le contenu seul des documents sont effectuĂ©es en prenant en compte les importances diverses des feuilles des sous-arbres, et en plaçant ces derniers dans leur contexte, c'est Ă  dire, en tenant compte de la pertinence du document. Les recherches portant Ă  la fois sur le contenu et la structure des documents sont effectuĂ©es grĂące Ă  plusieurs propagations de pertinence dans l'arbre du document, et ce afin d'effectuer une correspondance vague entre l'arbre du document et l'arbre de la requĂȘte. L'Ă©valuation de notre modĂšle, grĂące au prototype que nous avons dĂ©veloppĂ©, montre l'intĂ©rĂȘt de nos propositions, que ce soit pour effectuer des recherches sur le contenu seul des documents que sur le contenu et la structure

    Focused Retrieval

    Get PDF
    Traditional information retrieval applications, such as Web search, return atomic units of retrieval, which are generically called ``documents''. Depending on the application, a document may be a Web page, an email message, a journal article, or any similar object. In contrast to this traditional approach, focused retrieval helps users better pin-point their exact information needs by returning results at the sub-document level. These results may consist of predefined document components~---~such as pages, sections, and paragraphs~---~or they may consist of arbitrary passages, comprising any sub-string of a document. If a document is marked up with XML, a focused retrieval system might return individual XML elements or ranges of elements. This thesis proposes and evaluates a number of approaches to focused retrieval, including methods based on XML markup and methods based on arbitrary passages. It considers the best unit of retrieval, explores methods for efficient sub-document retrieval, and evaluates formulae for sub-document scoring. Focused retrieval is also considered in the specific context of the Wikipedia, where methods for automatic vandalism detection and automatic link generation are developed and evaluated

    Un modÚle de recherche d'information agrégée basée sur les réseaux bayésiens dans des documents semi-structurés

    Get PDF
    Nous proposons un modĂšle de recherche d'information basĂ© sur les rĂ©seaux bayĂ©siens. Dans ce modĂšle, la requĂȘte de l'utilisateur dĂ©clenche un processus de propagation pour sĂ©lectionner les Ă©lĂ©ments pertinents. Dans notre modĂšle, nous cherchons Ă  renvoyer Ă  l'utilisateur un agrĂ©gat au lieu d'une liste d'Ă©lĂ©ments. En fait, l'agrĂ©gat formulĂ© Ă  partir d'un document est considĂ©rĂ© comme Ă©tant un ensemble d'Ă©lĂ©ments ou une unitĂ© d'information (portion d'un document) qui rĂ©pond le mieux Ă  la requĂȘte de l'utilisateur. Cet agrĂ©gat doit rĂ©pondre Ă  trois aspects Ă  savoir la pertinence, la non-redondance et la complĂ©mentaritĂ© pour qu'il soit qualifiĂ© comme une rĂ©ponse Ă  cette requĂȘte. L'utilitĂ© des agrĂ©gats retournĂ©s est qu'ils donnent Ă  l'utilisateur un aperçu sur le contenu informationnel de cette requĂȘte dans la collection de documents. Afin de valider notre modĂšle, nous l'avons Ă©valuĂ© dans le cadre de la campagne d'Ă©valuation INEX 2009 (utilisant plus que 2 666 000 documents XML de l'encyclopĂ©die en ligne WikipĂ©dia). Les expĂ©rimentations montrent l'intĂ©rĂȘt de cette approche en mettant en Ă©vidence l'impact de l'agrĂ©gation de tels Ă©lĂ©ments.The work described in this thesis are concerned with the aggregated search on XML elements. We propose new approaches to aggregating and pruning using different sources of evidence (content and structure). We propose a model based on Bayesian networks. The dependency relationships between query-terms and terms-elements are quantified by probability measures. In this model, the user's query triggers a propagation process to find XML elements. In our model, we search to return to the user an aggregate instead of a list of XML elements. In fact, the aggregate made from a document is considered an information unit (or a portion of this document) that best meets the user's query. This aggregate must meet three aspects namely relevance, non-redundancy and complementarity in order to answer the query. The value returned aggregates is that they give the user an overview of the information need in the collection

    Semantics of video shots for content-based retrieval

    Get PDF
    Content-based video retrieval research combines expertise from many different areas, such as signal processing, machine learning, pattern recognition, and computer vision. As video extends into both the spatial and the temporal domain, we require techniques for the temporal decomposition of footage so that specific content can be accessed. This content may then be semantically classified - ideally in an automated process - to enable filtering, browsing, and searching. An important aspect that must be considered is that pictorial representation of information may be interpreted differently by individual users because it is less specific than its textual representation. In this thesis, we address several fundamental issues of content-based video retrieval for effective handling of digital footage. Temporal segmentation, the common first step in handling digital video, is the decomposition of video streams into smaller, semantically coherent entities. This is usually performed by detecting the transitions that separate single camera takes. While abrupt transitions - cuts - can be detected relatively well with existing techniques, effective detection of gradual transitions remains difficult. We present our approach to temporal video segmentation, proposing a novel algorithm that evaluates sets of frames using a relatively simple histogram feature. Our technique has been shown to range among the best existing shot segmentation algorithms in large-scale evaluations. The next step is semantic classification of each video segment to generate an index for content-based retrieval in video databases. Machine learning techniques can be applied effectively to classify video content. However, these techniques require manually classified examples for training before automatic classification of unseen content can be carried out. Manually classifying training examples is not trivial because of the implied ambiguity of visual content. We propose an unsupervised learning approach based on latent class modelling in which we obtain multiple judgements per video shot and model the users' response behaviour over a large collection of shots. This technique yields a more generic classification of the visual content. Moreover, it enables the quality assessment of the classification, and maximises the number of training examples by resolving disagreement. We apply this approach to data from a large-scale, collaborative annotation effort and present ways to improve the effectiveness for manual annotation of visual content by better design and specification of the process. Automatic speech recognition techniques along with semantic classification of video content can be used to implement video search using textual queries. This requires the application of text search techniques to video and the combination of different information sources. We explore several text-based query expansion techniques for speech-based video retrieval, and propose a fusion method to improve overall effectiveness. To combine both text and visual search approaches, we explore a fusion technique that combines spoken information and visual information using semantic keywords automatically assigned to the footage based on the visual content. The techniques that we propose help to facilitate effective content-based video retrieval and highlight the importance of considering different user interpretations of visual content. This allows better understanding of video content and a more holistic approach to multimedia retrieval in the future

    JuruXML - an XML retrieval system at INEX 02

    No full text
    XML documents represent a middle range between unstructured data such as textual documents and fully structured data encoded in databases. Typically, information retrieval techniques are used to support search on the “unstructured ” end of this scale, while database techniques are used for the other end. To date, most of the work on XML query and search has stemmed from the structured side and is strongly inspired by database techniques. We describe here an approach that originates from the “unstructured ” end and is based on augmentation of information retrieval techniques. It is specifically targeted to support the information needs of end-users, more specifically a generic querying mechanism, and ranking of results for approximate needs. We describe our query format and ranking mechanism and demonstrate how it was used to run the INEX topics
    corecore