30 research outputs found
Virtual unrolling and information recovery from scanned scrolled historical documents
The objective of our work is to enable the reading of fragile scrolled historical parchments without the need to physically unravel them, thus providing valuable information to a wide range of scholarly disciplines. This problem has not been investigated by the computer vision community properly yet due to the need for parchment scanning technology: standard X-ray equipment is not sufficient as there is a requirement to extract out parchment ink in addition to the parchment's underlying structure. Effective data recovery is also compromised as content from historical scrolled documents is inaccessible due to the deterioration of the parchment. We create a 3D volumetric model of a scrolled parchment's underlying geometry and perform digital unwrapping of the parchment, producing a readable image of the text as an output. The proposed recovery framework consists of structure preserving anisotropic filtering in combination with robust segmentation, surface modelling and ink projection. We demonstrate with real examples how our algorithm is able to recover the underlying text and to solve the major challenge for scrolled parchment analysis, namely segmentation of connected layers and processing the data without user interaction
Robust virtual unrolling of historical parchment XMT images
We develop a framework to virtually unroll fragile historical parchment scrolls, which cannot be physically unfolded via a sequence of X-ray tomographic slices, thus providing easy access to those parchments whose contents have remained hidden for centuries. The first step is to produce a topologically correct segmentation, which is challenging as the parchment layers vary significantly in thickness, contain substantial interior textures and can often stick together in places. For this purpose, our method starts with linking the broken layers in a slice using the topological structure propagated from its previous processed slice. To ensure topological correctness, we identify fused regions by detecting junction sections, and then match them using global optimization efficiently solved by the blossom algorithm, taking into account the shape energy of curves separating fused layers. The fused layers are then separated using as-parallel-as-possible curves connecting junction section pairs. To flatten the segmented parchment, pixels in different frames need to be put into alignment. This is achieved via a dynamic programming-based global optimization, which minimizes the total matching distances and penalizes stretches. Eventually, the text of the parchment is revealed by ink projection. We demonstrate the effectiveness of our approach using challenging real-world data sets, including the water damaged fifteenth century Bressingham scroll
Virtual unrolling and deciphering of Herculaneum papyri by X-ray phase-contrast tomography
A collection of more than 1800 carbonized papyri, discovered in the Roman 'Villa dei Papiri' at Herculaneum is the unique classical library survived from antiquity. These papyri were charred during 79 A.D. Vesuvius eruption, a circumstance which providentially preserved them until now. This magnificent collection contains an impressive amount of treatises by Greek philosophers and, especially, Philodemus of Gadara, an Epicurean thinker of 1st century BC. We read many portions of text hidden inside carbonized Herculaneum papyri using enhanced X-ray phase-contrast tomography non-destructive technique and a new set of numerical algorithms for 'virtual-unrolling'. Our success lies in revealing the largest portion of Greek text ever detected so far inside unopened scrolls, with unprecedented spatial resolution and contrast, all without damaging these precious historical manuscripts. Parts of text have been decoded and the 'voice' of the Epicurean philosopher Philodemus is brought back again after 2000 years from Herculaneum papyri
From Damage to Discovery Via Virtual Unwrapping: Reading the Scroll from En-Gedi
Computer imaging techniques are commonly used to preserve and share readable manuscripts, but capturing writing locked away in ancient, deteriorated documents poses an entirely different challenge. This software pipeline—referred to as “virtual unwrapping”—allows textual artifacts to be read completely and noninvasively. The systematic digital analysis of the extremely fragile En-Gedi scroll (the oldest Pentateuchal scroll in Hebrew outside of the Dead Sea Scrolls) reveals the writing hidden on its untouchable, disintegrating sheets. Our approach for recovering substantial ink-based text from a damaged object results in readable columns at such high quality that serious critical textual analysis can occur. Hence, this work creates a new pathway for subsequent textual discoveries buried within the confines of damaged materials
Robust segmentation of historical parchment XMT images for virtual unrolling
Historical parchment scrolls are fragile, and prone
to damage from a variety of causes such as fire, water, and
general mistreatment. Consequently many of these scrolls cannot
be unrolled, so that their contents have remained hidden for
centuries. To overcome these difficulties, we have developed a
method of segmenting X-ray tomographic scans of parchment
which enables a “virtual unrolling” of these documents. After an
initial segmentation we link the broken layers of the parchment.
Then, junction sections are extracted from the boundaries of
the parchment. Subsequently, we find the fused regions which
are formed by layers stuck together, and separate them into
several layers by reconstructing the missing boundaries using
parallel connecting curves. Experiments on the fifteenth century
Bressingham scroll validate the effectiveness of our segmentation
method
Virtual Recovery of Content from X-Ray Micro-Tomography Scans of Damaged Historic Scrolls
Part of this work was carried out with funding from the EPSRC (project EP/G010110/1, High defnition X-ray
microtomography and advanced visualisation techniques for information recovery from unopenable historical
documents), the China Postdoctoral Innovation Program (No. 230210342) and the China Scholarship Council
(File No. 201406020068