1,088 research outputs found

    Node.js based Document Store for Web Crawling

    WARC files are central to internet preservation projects. They contain the raw resources of web-crawled data and can be used to create windows into the past of web pages as they were when accessed. Yet few tools manipulate WARC files beyond basic parsing. Our tool, WARC-KIT, gives users in the Node.js JavaScript environment a toolkit to interact with and manipulate WARC files. Included with WARC-KIT is a WARC parsing tool known as WARCFilter, which can be used as a standalone tool to parse, filter, and create new WARC files. WARCFilter can also create CDX index files on WARC files, parse existing CDX files, or generate webgraph datasets for graph analysis algorithms. Aside from WARCFilter, WARC-KIT includes a custom on-disk database system implemented with an underlying Linear Hash Table data structure. The database system is the first of its kind as a JavaScript-only on-disk document store. The main application of WARC-KIT is that it allows users to create custom indices on collections of WARC files. After creating an index on a WARC collection, users can then query their collection using the GraphQL query language to retrieve desired WARC records. Experiments with WARCFilter on a WARC dataset of 238,000 WARC records demonstrate that using CDX index files makes WARC record filtering around ten to twenty times faster than raw WARC parsing. Database timing tests with the JavaScript Linear Hash Table database system showed insertion and retrieval operations twice as fast as those of a similar Linear Hash Table database implemented in Rust. Experiments with the overall WARC-KIT application on the same 238,000 WARC record dataset exhibited consistent query times for different complex queries.
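
    The abstract names a Linear Hash Table as the structure underneath the document store. As a rough illustration of that technique only, the sketch below shows linear hashing in memory, in Python rather than JavaScript. It is not WARC-KIT's code: the class name, the `max_load` threshold, and the use of Python's built-in `hash` are all assumptions made for the example, and a real on-disk store would keep each bucket in a file page instead of a list.

```python
class LinearHashTable:
    """In-memory linear hashing sketch (hypothetical; not WARC-KIT's implementation)."""

    def __init__(self, initial_buckets=2, max_load=2.0):
        self.n0 = initial_buckets   # bucket count at the start of round 0
        self.level = 0              # number of completed doubling rounds
        self.split = 0              # next bucket to split in the current round
        self.max_load = max_load    # assumed threshold: average entries per bucket
        self.buckets = [[] for _ in range(initial_buckets)]
        self.count = 0

    def _index(self, key):
        h = hash(key)
        i = h % (self.n0 << self.level)
        if i < self.split:
            # This bucket was already split in the current round,
            # so address it with the next round's (doubled) modulus.
            i = h % (self.n0 << (self.level + 1))
        return i

    def get(self, key, default=None):
        for k, v in self.buckets[self._index(key)]:
            if k == key:
                return v
        return default

    def put(self, key, value):
        bucket = self.buckets[self._index(key)]
        for pos, (k, _) in enumerate(bucket):
            if k == key:
                bucket[pos] = (key, value)  # overwrite an existing key
                return
        bucket.append((key, value))
        self.count += 1
        if self.count / len(self.buckets) > self.max_load:
            self._split_one()

    def _split_one(self):
        # Grow by exactly one bucket: redistribute only the bucket at `split`.
        old = self.buckets[self.split]
        self.buckets[self.split] = []
        self.buckets.append([])
        self.split += 1
        if self.split == self.n0 << self.level:
            # Every bucket of this round has been split: start the next round.
            self.level += 1
            self.split = 0
        for k, v in old:
            self.buckets[self._index(k)].append((k, v))
```

    The point of the design is that the table grows one bucket at a time, so no single insertion ever triggers a full rehash. That property is what makes the structure attractive for data kept on disk.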

    Student Debt

    On January 25, President Barack Obama presented the world with his State of the Union address, informing Americans what his national plans and priorities would be for the next four years. Once the topic turned to college affordability, he made a statement that piqued the interest of university students and faculty. “Let me put colleges and universities on notice: If you can’t stop tuition from going up, the funding you get from taxpayers will go down,” Obama said. “Higher education can’t be a luxury – it is an economic imperative that every family in America should be able to afford.” Iowa State is ranked third in the U.S. for highest debt rate.

    Working Futures 2017-2027 : Long-run labour market and skills projections headline report

    This report provides a concise overview of Working Futures 2017-2027 results for the UK. It presents historical trends and future prospects by sector for the UK and its constituent nations and the English regions. The prime focus of Working Futures is on the demand for skills as measured by employment by occupation and qualification, although the supply side is also considered. Its prime objective is to provide useful labour market information that can help to inform policy development and strategy around skills, careers and employment, for both policy makers and a much wider audience. The results are intended to provide a sound statistical foundation for reflection and debate among all those with an interest in the demand for and supply of skills. The report is aimed at the general reader and focuses on the key messages from this very detailed study. It complements the more detailed outputs and results from the project available from the gov.uk website, which cover sectors, occupations, geography and qualifications.

    High Reynolds number tests of a Douglas DLBA 032 airfoil in the Langley 0.3-meter transonic cryogenic tunnel

    A wind-tunnel investigation of a Douglas advanced-technology airfoil was conducted in the Langley 0.3-Meter Transonic Cryogenic Tunnel (0.3-m TCT). The temperature was varied from 227 K (409 R) to 100 K (180 R) at pressures ranging from about 159 kPa (1.57 atm) to about 514 kPa (5.07 atm). Mach number was varied from 0.50 to 0.78. These variables provided a Reynolds number range (based on airfoil chord) from 6.0 × 10^6 to 30.0 × 10^6. This investigation was specifically designed to: (1) test a Douglas airfoil from moderately low to flight-equivalent Reynolds numbers, and (2) evaluate sidewall-boundary-layer effects on transonic airfoil performance characteristics by a systematic variation of Mach number, Reynolds number, and sidewall-boundary-layer removal. Data are included which demonstrate the effects of fixing transition, Mach number, Reynolds number, and sidewall-boundary-layer removal on the aerodynamic characteristics of the airfoil. Also included are remarks on model design and model structural integrity.
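
    The abstract quotes each test condition in two unit systems. As a quick arithmetic check, the short sketch below reproduces the parenthesized values from the SI ones. The function names are hypothetical; the conversion factors are the standard definitions (1 K = 9/5 R, 1 atm = 101.325 kPa).

```python
KPA_PER_ATM = 101.325  # standard atmosphere, in kilopascals


def kelvin_to_rankine(temp_k):
    """Convert an absolute temperature from kelvin to degrees Rankine."""
    return temp_k * 9.0 / 5.0


def kpa_to_atm(pressure_kpa):
    """Convert a pressure from kilopascals to standard atmospheres."""
    return pressure_kpa / KPA_PER_ATM


# Conditions quoted in the abstract, rounded as the abstract rounds them.
temperatures_r = [round(kelvin_to_rankine(t)) for t in (227, 100)]
pressures_atm = [round(kpa_to_atm(p), 2) for p in (159, 514)]
```

    Rounded this way, the converted values agree with the figures given in parentheses in the abstract.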