345 research outputs found

    Uncovering the unarchived web

    Get PDF
    htmlabstractMany national and international heritage institutes realize the importance of archiving the web for future culture heritage. Web archiving is currently performed either by harvesting a national domain, or by crawling a pre-defined list of websites selected by the archiving institution. In either method, crawling results in more information being harvested than just the websites intended for preservation; which could be used to reconstruct impressions of pages that existed on the live web of the crawl date, but would have been lost forever. We present a method to create representations of what we will refer to as a web collection's (aura): the web documents that were not included in the archived collection, but are known to have existed --- due to their mentions on pages that were included in the archived web collection. To create representations of these unarchived pages, we exploit the information about the unarchived URLs that can be derived from the crawls by combining crawl date distribution, anchor text and link structure. We illustrate empirically that the size of the aura can be substantial: in 2012, the Dutch Web archive contained 12.3M unique pages, while we uncover references to 11.9M additional (unarchived) pages

    Diagnostic Strategies for Postmenopausal Bleeding

    Get PDF
    Postmenopausal bleeding (PMB) is a common clinical problem. Patients with PMB have 10%-15% chance of having endometrial carcinoma and therefore the diagnostic workup is aimed at excluding malignancy. Patient characteristics can alter the probability of having endometrial carcinoma in patients with PMB; in certain groups of patients the incidence has been reported to be as high as 29%. Transvaginal sonography (TVS) is used as a first step in the diagnostic workup, but different authors have come to different conclusions assessing the accuracy of TVS for excluding endometrial carcinoma. Diagnostic procedures obtaining material for histological assessment (e.g., dilatation and curettage, hysteroscopy, and endometrial biopsy) can be more accurate but are also more invasive. The best diagnostic strategy for diagnosing endometrial carcinoma in patients with PMB still remains controversial. Future research should be focussed on achieving a higher accuracy of different diagnostic strategies

    Uncovering the unarchived web

    Get PDF
    Many national and international heritage institutes realize the importance of archiving the web for future culture heritage. Web archiving is currently performed either by harvesting a national domain, or by crawling a pre-defined list of websites selected by the archiving institution. In either method, crawling results in more information being harvested than just the websites intended for preservation; which could be used to reconstruct impressions of pages that existed on the live web of the crawl date, but would have been lost forever. We present a method to create representations of what we will refer to as a web collection's (aura): the web documents that were not included in the archived collection, but are known to have existed --- due to their mentions on pages that were included in the archived web collection. To create representations of these unarchived pages, we exploit the information about the unarchived URLs that can be derived from the crawls by combining crawl date distribution, anchor text and link structure. We illustrate empirically that the size of the aura can be substantial: in 2012, the Dutch Web archive contained 12.3M unique pages, while we uncover references to 11.9M additional (unarchived) pages

    Lubricating Bacteria Model for Branching growth of Bacterial Colonies

    Full text link
    Various bacterial strains (e.g. strains belonging to the genera Bacillus, Paenibacillus, Serratia and Salmonella) exhibit colonial branching patterns during growth on poor semi-solid substrates. These patterns reflect the bacterial cooperative self-organization. Central part of the cooperation is the collective formation of lubricant on top of the agar which enables the bacteria to swim. Hence it provides the colony means to advance towards the food. One method of modeling the colonial development is via coupled reaction-diffusion equations which describe the time evolution of the bacterial density and the concentrations of the relevant chemical fields. This idea has been pursued by a number of groups. Here we present an additional model which specifically includes an evolution equation for the lubricant excreted by the bacteria. We show that when the diffusion of the fluid is governed by nonlinear diffusion coefficient branching patterns evolves. We study the effect of the rates of emission and decomposition of the lubricant fluid on the observed patterns. The results are compared with experimental observations. We also include fields of chemotactic agents and food chemotaxis and conclude that these features are needed in order to explain the observations.Comment: 1 latex file, 16 jpeg files, submitted to Phys. Rev.

    Lost but not forgotten: finding pages on the unarchived web

    Get PDF
    Web archives attempt to preserve the fast changing web, yet they will always be incomplete. Due to restrictions in crawling depth, crawling frequency, and restrictive selection policies, large parts of the Web are unarchived and, therefore, lost to posterity. In this paper, we propose an approach to uncover unarchived web pages and websites and to reconstruct different types of descriptions for these pages and sites, based on links and anchor text in the set of crawled pages. We experiment with this approach on the Dutch Web Archive and evaluate the usefulness of page and host-level representations of unarchived content. Our main findings are the following: First, the crawled web contains evidence of a remarkable number of unarchived pages and websites, potentially dramatically increasing the coverage of a Web archive. Second, the link and anchor text have a highly skewed distribution: popular pages such as home pages have more links pointing to them and more terms in the anchor text, but the richness tapers off quickly. Aggregating web page evidence to the host-level leads to significantly richer representations, but the distribution remains skewed. Third, the succinct representation is generally rich enough to uniquely identify pages on the unarchived web: in a known-item search setting we can retrieve unarchived web pages within the first ranks on average, with host-level representations leading to further improvement of the retrieval effectiveness for websites

    Cosmic Microwave Background Observables of Small Field Models of Inflation

    Full text link
    We construct a class of single small field models of inflation that can predict, contrary to popular wisdom, an observable gravitational wave signal in the cosmic microwave background anisotropies. The spectral index, its running, the tensor to scalar ratio and the number of e-folds can cover all the parameter space currently allowed by cosmological observations. A unique feature of models in this class is their ability to predict a negative spectral index running in accordance with recent cosmic microwave background observations. We discuss the new class of models from an effective field theory perspective and show that if the dimensionless trilinear coupling is small, as required for consistency, then the observed spectral index running implies a high scale of inflation and hence an observable gravitational wave signal. All the models share a distinct prediction of higher power at smaller scales, making them easy targets for detection.Comment: 13 pages, 3 figures, added numerical analysis and discussion on the properties of the spectra. Version to be published in JCA

    Analysis of chaotic motion and its shape dependence in a generalized piecewise linear map

    Full text link
    We analyse the chaotic motion and its shape dependence in a piecewise linear map using Fujisaka's characteristic function method. The map is a generalization of the one introduced by R. Artuso. Exact expressions for diffusion coefficient are obtained giving previously obtained results as special cases. Fluctuation spectrum relating to probability density function is obtained in a parametric form. We also give limiting forms of the above quantities. Dependence of diffusion coefficient and probability density function on the shape of the map is examined.Comment: 4 pages,4 figure

    Glycosphingolipids are required for sorting melanosomal proteins in the Golgi complex

    Get PDF
    A;lthough glycosphingolipids are ubiquitously expressed and essential for multicellular organisms, surprisingly little is known about their intracellular functions. To explore the role of glycosphingolipids in membrane transport, we used the glycosphingolipid-deficient GM95 mouse melanoma cell line. We found that GM95 cells do not make melanin pigment because tyrosinase, the first and rate-limiting enzyme in melanin synthesis, was not targeted to melanosomes but accumulated in the Golgi complex. However, tyrosinase-related protein 1 still reached melanosomal structures via the plasma membrane instead of the direct pathway from the Golgi. Delivery of lysosomal enzymes from the Golgi complex to endosomes was normal, suggesting that this pathway is not affected by the absence of glycosphingolipids. Loss of pigmentation was due to tyrosinase mislocalization, since transfection of tyrosinase with an extended transmembrane domain, which bypassed the transport block, restored pigmentation. Transfection of ceramide glucosyltransferase or addition of glucosylsphingosine restored tyrosinase transport and pigmentation. We conclude that protein transport from Golgi to melanosomes via the direct pathway requires glycosphingolipids

    A Structured Assessment to Decrease the Amount of Inconclusive Endometrial Biopsies in Women with Postmenopausal Bleeding

    Get PDF
    Objective. To determine whether structured assessment of outpatient endometrial biopsies decreases the number of inconclusive samples. Design. Retrospective cohort study. Setting. Single hospital pathology laboratory. Population. Endometrial biopsy samples of 66 women with postmenopausal bleeding, collected during the usual diagnostic work-up an

    An Agent-Based Approach to Self-Organized Production

    Full text link
    The chapter describes the modeling of a material handling system with the production of individual units in a scheduled order. The units represent the agents in the model and are transported in the system which is abstracted as a directed graph. Since the hindrances of units on their path to the destination can lead to inefficiencies in the production, the blockages of units are to be reduced. Therefore, the units operate in the system by means of local interactions in the conveying elements and indirect interactions based on a measure of possible hindrances. If most of the units behave cooperatively ("socially"), the blockings in the system are reduced. A simulation based on the model shows the collective behavior of the units in the system. The transport processes in the simulation can be compared with the processes in a real plant, which gives conclusions about the consequencies for the production based on the superordinate planning.Comment: For related work see http://www.soms.ethz.c
    corecore