6,973 research outputs found

    Retrieving descriptive phrases from large amounts of free text

    Get PDF
    This paper presents a system that retrieves descriptive phrases of proper nouns from free text. Sentences holding the specified noun are ranked using a technique based on pattern matching, word counting, and sentence location. No domain specific knowledge is used. Experiments show the system able to rank highly those sentences that contain phrases describing or defining the query noun. In contrast to existing methods, this system does not use parsing techniques but still achieves high levels of accuracy. From the results of a large-scale experiment, it is speculated that the success of this simpler method is due to the high quantities of free text being searched. Parallels between this work and recent findings in the very large corpus track of TREC are drawn

    Concept-based Interactive Query Expansion Support Tool (CIQUEST)

    Get PDF
    This report describes a three-year project (2000-03) undertaken in the Information Studies Department at The University of Sheffield and funded by Resource, The Council for Museums, Archives and Libraries. The overall aim of the research was to provide user support for query formulation and reformulation in searching large-scale textual resources including those of the World Wide Web. More specifically the objectives were: to investigate and evaluate methods for the automatic generation and organisation of concepts derived from retrieved document sets, based on statistical methods for term weighting; and to conduct user-based evaluations on the understanding, presentation and retrieval effectiveness of concept structures in selecting candidate terms for interactive query expansion. The TREC test collection formed the basis for the seven evaluative experiments conducted in the course of the project. These formed four distinct phases in the project plan. In the first phase, a series of experiments was conducted to investigate further techniques for concept derivation and hierarchical organisation and structure. The second phase was concerned with user-based validation of the concept structures. Results of phases 1 and 2 informed on the design of the test system and the user interface was developed in phase 3. The final phase entailed a user-based summative evaluation of the CiQuest system. The main findings demonstrate that concept hierarchies can effectively be generated from sets of retrieved documents and displayed to searchers in a meaningful way. The approach provides the searcher with an overview of the contents of the retrieved documents, which in turn facilitates the viewing of documents and selection of the most relevant ones. Concept hierarchies are a good source of terms for query expansion and can improve precision. The extraction of descriptive phrases as an alternative source of terms was also effective. With respect to presentation, cascading menus were easy to browse for selecting terms and for viewing documents. In conclusion the project dissemination programme and future work are outlined

    The Eurovision St Andrews collection of photographs

    Get PDF
    This report describes the Eurovision image collection compiled for the ImageCLEF (Cross Language Evaluation Forum) evaluation exercise. The image collection consists of around 30,000 photographs from the collection provided by the University of St Andrews Library. The construction and composition of this unique image collection are described, together with the necessary information to obtain and use the image collection

    Automatically organising images using concept hierarchies

    Get PDF
    In this paper we discuss the use of concept hierarchies, an approach to automatically organize a set of documents based upon a set of concepts derived from the documents themselves for image retrieval. Co-occurrence between terms associated with image captions and a statistical relation called subsumption are used to generate term clusters which are organized hierarchically. Previously, the approach has been studied for document retrieval and results have shown that automatically generating hierarchies can help users with their search task. In this paper we present an implementation of concept hierarchies for image retrieval, together with preliminary ad-hoc evaluation. Although our approach requires more investigation, initial results from a prototype system are promising and would appear to provide a useful summary of the search results

    Aspects of planar, oblique and interacting shock waves in an ideal dissociating gas

    Get PDF
    We develop a compact dimensionless framework for the analysis of canonical thermo-chemical nonequilibrium flow fields involving normal, oblique and interacting shock waves. Discontinuous solutions of the conservation equations are coupled with thermodynamic and kinetic models for an ideal dissociating gas. Convenient forms are provided for the variation of the relevant dimensionless parameters across shock waves in dissociating gases. The treatment is carried through in a consistent manner for the pressure–flow deflection angle plane representation of shock wave interaction problems. The contribution of the current paper is a careful nondimensionalization of the problem that yields a tractable formulation and allows results with considerable generality to be obtained

    Spatio-textual indexing for geographical search on the web

    Get PDF
    Many web documents refer to specific geographic localities and many people include geographic context in queries to web search engines. Standard web search engines treat the geographical terms in the same way as other terms. This can result in failure to find relevant documents that refer to the place of interest using alternative related names, such as those of included or nearby places. This can be overcome by associating text indexing with spatial indexing methods that exploit geo-tagging procedures to categorise documents with respect to geographic space. We describe three methods for spatio-textual indexing based on multiple spatially indexed text indexes, attaching spatial indexes to the document occurrences of a text index, and merging text index access results with results of access to a spatial index of documents. These schemes are compared experimentally with a conventional text index search engine, using a collection of geo-tagged web documents, and are shown to be able to compete in speed and storage performance with pure text indexing

    The influence of non-equilibrium dissociation on the flow produced by shock impingement on a blunt body

    Get PDF
    We describe an investigation of the effects of non-equilibrium thermochemistry on the interaction between a weak oblique shock and the strong bow shock formed by a blunt body in hypersonic flow. This type of shock-on-shock interaction, also known as an Edney type IV interaction, causes locally intense enhancement of the surface heat transfer rate. A supersonic jet is formed by the nonlinear interaction that occurs between the two shock waves and elevated heat transfer rates and surface pressures are produced by the impingement of the supersonic jet on the body. The current paper is motivated by previous studies suggesting that real gas effects would significantly increase the severity of the phenomenon. Experiments are described in which a free-piston shock tunnel is used to produce shock interaction flows with significant gas dissociation. Surprisingly, the data that are obtained show no significant stagnation enthalpy dependence of the ratio of the peak heat transfer rates with and without shock interaction, in contrast to existing belief. The geometry investigated is the nominally two-dimensional flow about a cylinder with coplanar impinging shock wave. Holographic interferometry is used to visualize the flow field and to quantify increases in the stagnation density caused by shock interaction. Time-resolved heat transfer measurements are obtained from surface junction thermocouples about the model forebody. An improved model is developed to elucidate the finite-rate thermochemical processes occurring in the interaction region. It is shown that severe heat transfer intensification is a result of a jet shock structure that minimizes the entropy rise of the supersonic jet fluid whereas strong thermochemical effects are promoted by conditions that maximize the entropy rise (and hence temperature). This dichotomy underlies the smaller than anticipated influence of real gas effects on the heat transfer intensification. The model accurately predicts the measured heat transfer rates. Improved understanding of the influence of real gas effects on the shock interaction phenomenon reduces a significant element of risk in the design of hypersonic vehicles. The peak heat transfer rate for the Edney type IV interaction is shown to be well-correlated, in the weak impinging shock regime, by an expression of the form [equation] for use in practical design calculations

    Document frequency and term specificity

    Get PDF
    Document frequency is used in various applications in Information Retrieval and other related fields. An assumption frequently made is that the document frequency represents a level of the term’s specificity. However, empirical results to support this assumption are limited. Therefore, a large-scale experiment was carried out, using multiple corpora, to gain further insight into the relationship between the document frequency and terms specificity. The results show that the assumption holds only at the very specific levels that cover the majority of vocabulary. The results also show that a larger corpus is more accurate at estimating the specificity. However, the co-occurrence information is shown to be effective for improving the accuracy when only a small corpus is available

    Alien Registration- O\u27Reilly, Margaret H. (Bath, Sagadahoc County)

    Get PDF
    https://digitalmaine.com/alien_docs/8835/thumbnail.jp
    • …
    corecore