30 research outputs found
ΠΠΈΡΠΎΠ²ΡΠ΅ ΡΠ΅Π½Π΄Π΅Π½ΡΠΈΠΈ ΡΠ°Π·Π²ΠΈΡΠΈΡ Π²Π΅Π±-Π°ΡΡ ΠΈΠ²ΠΎΠ² Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊ
The need for studying and promoting web-archiving for longterm information preservation and accessibility in future is substantiated. The existing technologies of web-archiving are specified and the problems related to the web dynamic character, errors, content complexity, are revealed. Successful experience in the world librariesβ web-archiving is discussed (selection, search, description technologies, access terms, etc.). The study findings demonstrate that web-archives are selected to supplement the librariesβ digital collections on hot topics, like COVID-19, or to meet the demands of specific user groups. For the purpose of cultural heritage preservation, the national libraries often focus of acquiring web-sites by the domains in the corresponding country. The university libraries focus on acquiring web-archives that meet research and educational demands of their users; and public libraries prefer the resources of interest to their local community. The findings may be used by world libraries for developing their digital collections.ΠΠ±ΠΎΡΠ½ΠΎΠ²Π°Π½Π° Π½Π΅ΠΎΠ±Ρ
ΠΎΠ΄ΠΈΠΌΠΎΡΡΡ ΠΈΠ·ΡΡΠ΅Π½ΠΈΡ ΠΈ ΡΠ°ΡΠΏΡΠΎΡΡΡΠ°Π½Π΅Π½ΠΈΡ ΠΏΡΠ°ΠΊΡΠΈΠΊΠΈ Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊ ΠΌΠΈΡΠ° Π² ΠΎΠ±Π»Π°ΡΡΠΈ Π²Π΅Π±-Π°ΡΡ
ΠΈΠ²ΠΈΡΠΎΠ²Π°Π½ΠΈΡ Π² ΡΠ΅Π»ΡΡ
Π΄ΠΎΠ»Π³ΠΎΡΡΠΎΡΠ½ΠΎΠ³ΠΎ ΡΠΎΡ
ΡΠ°Π½Π΅Π½ΠΈΡ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΠΈ ΠΈ ΠΎΠ±Π΅ΡΠΏΠ΅ΡΠ΅Π½ΠΈΡ Π΅Ρ Π΄ΠΎΡΡΡΠΏΠ½ΠΎΡΡΠΈ Π² Π±ΡΠ΄ΡΡΠ΅ΠΌ. ΠΡΡΠ²Π»Π΅Π½Ρ ΡΠΎΠ²ΡΠ΅ΠΌΠ΅Π½Π½ΡΠ΅ ΡΠ΅Ρ
Π½ΠΎΠ»ΠΎΠ³ΠΈΠΈ, ΠΈΡΠΏΠΎΠ»ΡΠ·ΡΠ΅ΠΌΡΠ΅ Π² Π²Π΅Π±-Π°ΡΡ
ΠΈΠ²ΠΈΡΠΎΠ²Π°Π½ΠΈΠΈ, Π° ΡΠ°ΠΊΠΆΠ΅ ΠΏΡΠΎΠ±Π»Π΅ΠΌΡ, ΡΠ²ΡΠ·Π°Π½Π½ΡΠ΅ Ρ Π΄ΠΈΠ½Π°ΠΌΠΈΡΠ½ΠΎΠΉ ΠΏΡΠΈΡΠΎΠ΄ΠΎΠΉ ΡΠ°ΠΉΡΠΎΠ², ΠΎΡΠΈΠ±ΠΊΠ°ΠΌΠΈ, ΡΠ»ΠΎΠΆΠ½ΠΎΡΡΡΡ ΠΊΠΎΠ½ΡΠ΅Π½ΡΠ° Π΄Π»Ρ ΡΠΎΡ
ΡΠ°Π½Π΅Π½ΠΈΡ ΠΈ Π΄Ρ. Π Π°ΡΡΠΌΠΎΡΡΠ΅Π½Ρ ΠΏΡΠΈΠΌΠ΅ΡΡ Π°ΠΊΡΠΈΠ²Π½ΠΎ ΡΠ°Π·Π²ΠΈΠ²Π°ΡΡΠΈΡ
ΡΡ Π²Π΅Π±-Π°ΡΡ
ΠΈΠ²ΠΎΠ² Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊ ΠΌΠΈΡΠ° (ΡΠ΅Ρ
Π½ΠΎΠ»ΠΎΠ³ΠΈΠΈ ΠΎΡΠ±ΠΎΡΠ°, ΠΏΠΎΠΈΡΠΊΠ°, ΠΎΠΏΠΈΡΠ°Π½ΠΈΡ, ΡΡΠ»ΠΎΠ²ΠΈΡ Π΄ΠΎΡΡΡΠΏΠ° ΠΈ Π΄Ρ.). Π Π΅Π·ΡΠ»ΡΡΠ°ΡΡ ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΡ ΠΏΠΎΠΊΠ°Π·ΡΠ²Π°ΡΡ, ΡΡΠΎ Π²Π΅Π±-Π°ΡΡ
ΠΈΠ²Ρ ΠΎΡΠ±ΠΈΡΠ°ΡΡΡΡ Π΄Π»Ρ Π΄ΠΎΠΏΠΎΠ»Π½Π΅Π½ΠΈΡ ΡΡΡΠ΅ΡΡΠ²ΡΡΡΠΈΡ
ΡΠΈΡΡΠΎΠ²ΡΡ
ΠΊΠΎΠ»Π»Π΅ΠΊΡΠΈΠΉ Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊ ΠΏΠΎ Π°ΠΊΡΡΠ°Π»ΡΠ½ΡΠΌ ΠΏΡΠΎΠ±Π»Π΅ΠΌΠ°ΠΌ, Π½Π°ΠΏΡΠΈΠΌΠ΅Ρ, ΠΏΠΎ ΠΊΠΎΡΠΎΠ½Π°Π²ΠΈΡΡΡΠ½ΠΎΠΉ Π±ΠΎΠ»Π΅Π·Π½ΠΈ (COVID-19), Π»ΠΈΠ±ΠΎ Π² ΠΈΠ½ΡΠ΅ΡΠ΅ΡΠ°Ρ
ΠΎΠΏΡΠ΅Π΄Π΅Π»ΡΠ½Π½ΠΎΠΉ Π³ΡΡΠΏΠΏΡ ΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°ΡΠ΅Π»Π΅ΠΉ. Π ΡΠ΅Π»ΡΡ
ΡΠΎΡ
ΡΠ°Π½Π΅Π½ΠΈΡ ΠΊΡΠ»ΡΡΡΡΠ½ΠΎΠ³ΠΎ Π½Π°ΡΠ»Π΅Π΄ΠΈΡ Π½Π°ΡΠΈΠΎΠ½Π°Π»ΡΠ½ΡΠ΅ Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊΠΈ ΡΠ°ΡΠ΅ ΡΠΎΡΡΠ΅Π΄ΠΎΡΠΎΡΠ΅Π½Ρ Π½Π° ΡΠ±ΠΎΡΠ΅ ΡΠ°ΠΉΡΠΎΠ² ΠΏΠΎ Π΄ΠΎΠΌΠ΅Π½Π°ΠΌ, ΠΎΡΡΠ°ΠΆΠ°ΡΡΠΈΡ
ΠΏΡΠΈΠ½Π°Π΄Π»Π΅ΠΆΠ½ΠΎΡΡΡ ΠΊ ΡΠΎΠΌΡ ΠΈΠ»ΠΈ ΠΈΠ½ΠΎΠΌΡ Π³ΠΎΡΡΠ΄Π°ΡΡΡΠ²Ρ. Π£Π½ΠΈΠ²Π΅ΡΡΠΈΡΠ΅ΡΡΠΊΠΈΠ΅ Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊΠΈ ΠΊΠΎΠ½ΡΠ΅Π½ΡΡΠΈΡΡΡΡΡΡ Π½Π° ΡΠ±ΠΎΡΠ΅ Π²Π΅Π±-Π°ΡΡ
ΠΈΠ²ΠΎΠ², ΠΊΠΎΡΠΎΡΡΠ΅ ΡΠ»ΡΠΆΠ°Ρ ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°ΡΠ΅Π»ΡΡΠΊΠΈΠΌ ΠΈΠ»ΠΈ ΠΎΠ±ΡΠ°Π·ΠΎΠ²Π°ΡΠ΅Π»ΡΠ½ΡΠΌ ΠΏΠΎΡΡΠ΅Π±Π½ΠΎΡΡΡΠΌ ΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°ΡΠ΅Π»Π΅ΠΉ ΠΊΠΎΠ½ΠΊΡΠ΅ΡΠ½ΠΎΠ³ΠΎ ΡΡΡΠ΅ΠΆΠ΄Π΅Π½ΠΈΡ, Π° ΠΏΡΠ±Π»ΠΈΡΠ½ΡΠ΅ Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊΠΈ β Π½Π° ΡΠ΅ΡΡΡΡΠ°Ρ
, ΠΎΡΡΠ°ΠΆΠ°ΡΡΠΈΡ
ΠΆΠΈΠ·Π½Ρ ΠΌΠ΅ΡΡΠ½ΠΎΠ³ΠΎ ΡΠΎΠΎΠ±ΡΠ΅ΡΡΠ²Π°. Π Π°ΡΡΠΌΠΎΡΡΠ΅Π½Π½ΡΠΉ ΠΎΠΏΡΡ ΠΌΠΎΠΆΠ΅Ρ Π±ΡΡΡ ΡΠ°ΡΠΏΡΠΎΡΡΡΠ°Π½ΡΠ½ ΡΡΠ΅Π΄ΠΈ Π΄ΡΡΠ³ΠΈΡ
Π±ΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊ ΠΌΠΈΡΠ° Π΄Π»Ρ ΡΠ°Π·Π²ΠΈΡΠΈΡ ΡΠΈΡΡΠΎΠ²ΡΡ
ΠΊΠΎΠ»Π»Π΅ΠΊΡΠΈΠΉ
Recommended from our members
Using Web Archives to Model Academic Migration and Identify Brain Drain
Presentation for the IIPC General Assembly and Web Archiving Conference held on May 10-12, 2023 in Hilversum, Netherlands. This presentation discusses academic migration at Historically Black Colleges & Universities through analysis of web archives and introduces challenges faced throughout the project
Avoiding Zombies in Archival Replay Using ServiceWorker
[First paragraph] A Composite Memento is an archived representation of a web page with all the page requisites such as images and stylesheets. All embedded resources have their own URIs, hence, they are archived independently. For a meaningful archival replay, it is important to load all the page requisites from the archive within the temporal neighborhood of the base HTML page. To achieve this goal, archival replay systems try to rewrite all the resource references to appropriate archived versions before serving HTML, CSS, or JS. However, an effective server-side URL rewriting is difficult when URLs are generated dynamically using JavaScript. A failure of correct URL rewriting might yield an invalid/unintended URI or resolve to a live resource. Such live resources, leaking into a composite memento, are called zombies
Client-Assisted Memento Aggregation Using The Prefer Header
[First paragraph] Preservation of the Web ensures that future generations have a picture of how the web was. Web archives like Internet Archive\u27s Wayback Machine, WebCite, and archive.is allow individuals to submit URIs to be archived, but the captures they preserve then reside at the archives. Traversing these captures in time as preserved by multiple archive sources (using Memento [8]) provides a more comprehensive picture of the past Web than relying on a single archive. Some content on the Web, such as content behind authentication, may be unsuitable or inaccessible for preservation by these organizations. Furthermore, this content may be inappropriate for the organizations to preserve due to reasons of privacy or exposure of personally identifiable information [4]. However, preserving this content would ensure an even-more comprehensive picture of the web and may be useful for future historians who wish to analyze content beyond the capability or suitability of archives created to preserve the public Web
MementoEmbed and Raintale for Web Archive Storytelling
For traditional library collections, archivists can select a representative sample from a collection and display it in a featured physical or digital library space. Web archive collections may consist of thousands of archived pages, or mementos. How should an archivist display this sample to drive visitors to their collection? Search engines and social media platforms often represent web pages as cards consisting of text snippets, titles, and images. Web storytelling is a popular method for grouping these cards in order to summarize a topic. Unfortunately, social media platforms are not archive-aware and fail to consistently create a good experience for mementos. They also allow no UI alterations for their cards. Thus, we created MementoEmbed to generate cards for individual mementos and Raintale for creating entire stories that archivists can export to a variety of formats
It is Hard to Compute Fixity on Archived Web Pages
[Introduction] Checking fixity in web archives is performed to ensure archived resources, or mementos (denoted by URI-M) have remained unaltered since when they were captured. The final report of the PREMIS Working Group [2] defines information used for fixity as information used to verify whether an object has been altered in an undocumented or unauthorized way. The common technique for checking fixity is to generate a current hash value (i.e., a message digest or a checksum) for a file using a cryptographic hash function (e.g., SHA-256) and compare it to the hash value generated originally. If they have different hash values, then the file has been changed, either maliciously or not. We implicitly trust content delivered by web archives, but with the current trend of extended use of other public and private web archives, we should consider the question of validity of archived web pages. Most web archives do not allow users to retrieve fixity information. More importantly, even if fixity information is accessible, it is provided by the same archive delivering the content. A part of our research is dedicated to establishing and checking the fixity of archived resources with the following requirements: Any user can generate fixity information, not only the archive Fixity information can be generated on the mementos playbac
SHARI- An Integration of Tools to Visualize the Story of the Day
Tools such as google news and flipboard exist to convey daily news, but what about the news of the past? In this paper, we describe how to combine several existing tools and web archive holdings to convey the βbiggest storyβ for a given date in the past. StoryGraph clusters news articles together to identify a common news story. Hypercane leverages ArchiveNow to store URLs produced by Story-Graph in web archives. Hypercane analyzes these URLs to identify the most common terms, entities, and highest quality images for social media storytelling. Raintale then takes the output of these tools to produce a visualization of the news story for a given day. We name this process SHARI (StoryGraph Hypercane ArchiveNow Raintale Integration). With SHARI, a user can visualize the articles belonging to a past dateβs news story