5,322 research outputs found

    Evaluating the SiteStory Transactional Web Archive With the ApacheBench Tool

    Full text link
    Conventional Web archives are created by periodically crawling a web site and archiving the responses from the Web server. Although easy to implement and common deployed, this form of archiving typically misses updates and may not be suitable for all preservation scenarios, for example a site that is required (perhaps for records compliance) to keep a copy of all pages it has served. In contrast, transactional archives work in conjunction with a Web server to record all pages that have been served. Los Alamos National Laboratory has developed SiteSory, an open-source transactional archive written in Java solution that runs on Apache Web servers, provides a Memento compatible access interface, and WARC file export features. We used the ApacheBench utility on a pre-release version of to measure response time and content delivery time in different environments and on different machines. The performance tests were designed to determine the feasibility of SiteStory as a production-level solution for high fidelity automatic Web archiving. We found that SiteStory does not significantly affect content server performance when it is performing transactional archiving. Content server performance slows from 0.076 seconds to 0.086 seconds per Web page access when the content server is under load, and from 0.15 seconds to 0.21 seconds when the resource has many embedded and changing resources.Comment: 13 pages, Technical Repor

    Shatter cones in Illinois: Evidence for metoeritic impacts at Glasford and Des Plaines

    Get PDF
    Shatter cone fragments were recovered from rock cores at two previously suspected, but heretofore unverified, impact structures in Illinois. Both sites are buried features known from geophysical surveys and drill holes. Shatter cones are accepted widely as field criteria of meteoritic impact. Detection of these shock indicators in both the Glasford Structure and the Des Plains Disturbance upgrades these sites in Earth's inventory of known and suspected impact structures from possible impact sites with compatible structure and morphology to probable impact structures which possess also evidence of shock metamorphism

    Free Energies of Isolated 5- and 7-fold Disclinations in Hexatic Membranes

    Full text link
    We examine the shapes and energies of 5- and 7-fold disclinations in low-temperature hexatic membranes. These defects buckle at different values of the ratio of the bending rigidity, κ\kappa, to the hexatic stiffness constant, KAK_A, suggesting {\em two} distinct Kosterlitz-Thouless defect proliferation temperatures. Seven-fold disclinations are studied in detail numerically for arbitrary κ/KA\kappa/K_A. We argue that thermal fluctuations always drive κ/KA\kappa/K_A into an ``unbuckled'' regime at long wavelengths, so that disclinations should, in fact, proliferate at the {\em same} critical temperature. We show analytically that both types of defects have power law shapes with continuously variable exponents in the ``unbuckled'' regime. Thermal fluctuations then lock in specific power laws at long wavelengths, which we calculate for 5- and 7-fold defects at low temperatures.Comment: LaTeX format. 17 pages. To appear in Phys. Rev.

    Population genetic structure of N. American and European \u3ci\u3ePhalaris arundinacea\u3c/i\u3e L. as inferred from inter-simple sequence repeat markers

    Get PDF
    Phalaris arundinacea L. (reed canarygrass) has become one of the most aggressive invaders of North American wetlands. P. arundinacea is native to temperate N. America, Europe, and Asia, but repeated introductions of European genotypes to N. America, recent range expansions, and the planting of forage and ornamental cultivars complicate the resolution of its demographic history. Molecular tools can help to unravel the demographic and invasion history of populations of invasive species. In this study, inter-simple sequence repeat markers were used to analyze the population genetic structure of European and N. American populations of reed canary grass as well as forage and ornamental cultivars. We found that P. arundinacea harbors a high amount of genetic diversity with most of the diversity located within, as opposed to among, populations. Cluster analyses suggested that current populations are admixtures of two formerly distinct genetic groups

    Lexical Semantic Recognition

    Full text link
    In lexical semantics, full-sentence segmentation and segment labeling of various phenomena are generally treated separately, despite their interdependence. We hypothesize that a unified lexical semantic recognition task is an effective way to encapsulate previously disparate styles of annotation, including multiword expression identification / classification and supersense tagging. Using the STREUSLE corpus, we train a neural CRF sequence tagger and evaluate its performance along various axes of annotation. As the label set generalizes that of previous tasks (PARSEME, DiMSUM), we additionally evaluate how well the model generalizes to those test sets, finding that it approaches or surpasses existing models despite training only on STREUSLE. Our work also establishes baseline models and evaluation metrics for integrated and accurate modeling of lexical semantics, facilitating future work in this area.Comment: 11 pages, 3 figures; to appear at MWE 202

    Combining Heritrix and PhantomJS for Better Crawling of Pages with Javascript

    Get PDF
    PDF of a powerpoint presentation from the International Internet Preservation Consortium (IIPC) 2016 Conference in Reykjavik, Iceland, April 11, 2016. Also available on Slideshare.https://digitalcommons.odu.edu/computerscience_presentations/1003/thumbnail.jp

    Structure-based stabilization of insulin as a therapeutic protein assembly via enhanced aromatic-aromatic interactions

    Get PDF
    Key contributions to protein structure and stability are provided by weakly polar interactions, which arise from asymmetric electronic distributions within amino acids and peptide bonds. Of particular interest are aromatic side chains whose directional π-systems commonly stabilize protein interiors and interfaces. Here, we consider aromatic-aromatic interactions within a model protein assembly: the dimer interface of insulin. Semi-classical simulations of aromatic-aromatic interactions at this interface suggested that substitution of residue TyrB26 by Trp would preserve native structure while enhancing dimerization (and hence hexamer stability). The crystal structure of a [TrpB26]insulin analog (determined as a T3Rf3 zinc hexamer at a resolution of 2.25 Å) was observed to be essentially identical to that of WT insulin. Remarkably and yet in general accordance with theoretical expectations, spectroscopic studies demonstrated a 150-fold increase in the in vitro lifetime of the variant hexamer, a critical pharmacokinetic parameter influencing design of long-acting formulations. Functional studies in diabetic rats indeed revealed prolonged action following subcutaneous injection. The potency of the TrpB26-modified analog was equal to or greater than an unmodified control. Thus, exploiting a general quantum-chemical feature of protein structure and stability, our results exemplify a mechanism-based approach to the optimization of a therapeutic protein assembly

    A Method for Identifying Personalized Representations in Web Archives

    Get PDF
    Web resources are becoming increasingly personalized — two different users clicking on the same link at the same time can see content customized for each individual user. These changes result in multiple representations of a resource that cannot be canonicalized in Web archives. We identify characteristics of this problem by presenting a potential solution to generalize personalized representations in archives. We also present our proof-of-concept prototype that analyzes WARC (Web ARChive) format files, inserts metadata establishing relationships, and provides archive users the ability to navigate on the additional dimension of environment variables in a modified Wayback Machine
    • …
    corecore