1 research outputs found
Using Binary File Format Description Languages for Documenting, Parsing, and Verifying Raw Data in TAIGA Experiment
The paper is devoted to the issues of raw binary data documenting, parsing
and verifying in astroparticle data lifecycle. The long-term preservation of
raw data of astroparticle experiments as originally generated is essential for
re-running analyses and reproducing research results. The selected high-quality
raw data should have detailed documentation and accompanied by open software
tools for access to them. We consider applicability of binary file format
description languages to specify, parse and verify raw data of the Tunka
Advanced Instrument for cosmic rays and Gamma Astronomy (TAIGA) experiment. The
formal specifications are implemented for five data formats of the experiment
and provide automatic generation of source code for data reading libraries in
target programming languages (e.g. C++, Java, and Python). These libraries were
tested on TAIGA data. They showed a good performance and help us to locate the
parts with corrupted data. The format specifications can be used as metadata
for exchanging of astroparticle raw data. They can also simplify software
development for data aggregation from various sources for the multi-messenger
analysis