    Spatio-textual indexing for geographical search on the web

    Many web documents refer to specific geographic localities, and many people include geographic context in queries to web search engines. Standard web search engines treat geographical terms in the same way as other terms. This can result in failure to find relevant documents that refer to the place of interest using alternative related names, such as those of included or nearby places. This can be overcome by associating text indexing with spatial indexing methods that exploit geo-tagging procedures to categorise documents with respect to geographic space. We describe three methods for spatio-textual indexing, based respectively on multiple spatially indexed text indexes, on attaching spatial indexes to the document occurrences of a text index, and on merging text index access results with the results of access to a spatial index of documents. These schemes are compared experimentally with a conventional text index search engine, using a collection of geo-tagged web documents, and are shown to compete in speed and storage performance with pure text indexing.

    Extending Yioop! With Geographical Location Local Search

    When doing an internet search, it is often useful to get results based on our current location. For example, we might want such results when we search for restaurants, car service centers, or hospitals. Current open source search engines, like those based on Nutch, do not provide this facility. Commercial engines like Google and Yahoo! do provide it, so it would be useful to incorporate it in an open source alternative. The goal of this project is to include location-aware search in Yioop! (Pollett, 2012) by using geographical data from OpenStreetMap (“Open Street map wiki”, 2012) and the hostip.info (“DMOZ”, n.d.) database to geolocate IP addresses.

    Deep Metric Learning via Facility Location

    Learning the representation and the similarity metric in an end-to-end fashion with deep networks has demonstrated outstanding results for clustering and retrieval. However, these recent approaches still suffer from performance degradation stemming from the local metric training procedure, which is unaware of the global structure of the embedding space. We propose a global metric learning scheme for optimizing the deep metric embedding with the learnable clustering function and the clustering metric (NMI) in a novel structured prediction framework. Our experiments on the CUB200-2011, Cars196, and Stanford online products datasets show state-of-the-art performance on both the clustering and retrieval tasks, measured by the NMI and Recall@K evaluation metrics. Comment: Submission accepted at CVPR 201

    Use of Subimages in Fish Species Identification: A Qualitative Study

    Many scholarly tasks involve working with subdocuments, or contextualized fine-grain information, i.e., with information that is part of some larger unit. A digital library (DL) facilitates management, access, retrieval, and use of collections of data and metadata through services. However, most DLs do not provide infrastructure or services to support working with subdocuments. Superimposed information (SI) refers to new information that is created to reference subdocuments in existing information resources. We combine this idea of SI with traditional DL services to define and develop a DL with SI (SI-DL). We explored the use of subimages and evaluated the use of a prototype SI-DL (SuperIDR) in fish species identification, a scholarly task that involves working with subimages. The contexts and strategies of working with subimages in SuperIDR suggest new and enhanced support (SI-DL services) for scholarly tasks that involve working with subimages, including new ways of querying and searching for subimages and associated information. The main contributions of our work are the insights gained from these findings on the use of subimages and of SuperIDR (a prototype SI-DL), which lead to recommendations for the design of digital libraries with superimposed information.

    Wildfire Smoke Particle Properties and Evolution, from Space-Based Multi-Angle Imaging

    Emitted smoke composition is determined by properties of the biomass burning source and ambient ecosystem. However, conditions that mediate the partitioning of black carbon (BC) and brown carbon (BrC) formation, as well as the spatial and temporal factors that drive particle evolution, are not understood adequately for many climate and air-quality related modeling applications. In situ observations provide considerable detail about aerosol microphysical and chemical properties, although sampling is extremely limited. Satellites offer the frequent global coverage that would allow for statistical characterization of emitted and evolved smoke, but generally lack microphysical detail. However, once properly validated, data from the National Aeronautics and Space Administration (NASA) Earth Observing System's Multi-Angle Imaging Spectroradiometer (MISR) instrument can create at least a partial picture of smoke particle properties and plume evolution. We use in situ data from the Department of Energy's Biomass Burning Observation Project (BBOP) field campaign to assess the strengths and limitations of smoke particle retrieval results from the MISR Research Aerosol (RA) retrieval algorithm. We then use MISR to characterize wildfire smoke particle properties and to identify the relevant aging factors in several cases, to the extent possible. The RA successfully maps qualitative changes in effective particle size, light absorption, and its spectral dependence, when compared to in situ observations. By observing the entire plume uniformly, the satellite data can be interpreted in terms of smoke plume evolution, including size-selective deposition, new-particle formation, and locations within the plume where BC or BrC dominates.

    ImageSieve: Exploratory search of museum archives with named entity-based faceted browsing

    Over the last few years, faceted search has emerged as an attractive alternative to the traditional "text box" search and has become one of the standard ways of interaction on many e-commerce sites. However, these applications of faceted search are limited to domains where the objects of interest have already been classified along several independent dimensions, such as price, year, or brand. While automatic approaches to generating faceted search interfaces have been proposed, it is not yet clear to what extent the automatically produced interfaces will be useful to real users, and whether their quality can match or surpass that of their manually produced predecessors. The goal of this paper is to introduce an exploratory search interface called ImageSieve, which shares many features with traditional faceted browsing but can function without the use of traditional faceted metadata. ImageSieve uses automatically extracted and classified named entities, which play important roles in many domains (such as news collections, image archives, etc.). We describe one specific application of ImageSieve for image search. Here, named entities extracted from the descriptions of the retrieved images are used to organize a faceted browsing interface, which then helps users to make sense of and further explore the retrieved images. The results of a user study of ImageSieve demonstrate that a faceted search system based on named entities can help users explore large collections and find relevant information more effectively.

    Analysis of systems hardware flown on LDEF. Results of the systems special investigation group

    The Long Duration Exposure Facility (LDEF) was retrieved after spending 69 months in low Earth orbit (LEO). LDEF carried a remarkable variety of mechanical, electrical, thermal, and optical systems, subsystems, and components. The Systems Special Investigation Group (Systems SIG) was formed to investigate the effects of the long-duration exposure to LEO on systems-related hardware and to coordinate and collate all systems analysis of LDEF hardware. Discussed here is the status of the LDEF Systems SIG investigation through the end of 1991.

    Comparison of cloud top heights derived from MISR stereo and MODIS CO2-slicing
