52 research outputs found

    OCR Quality Affects Perceived Usefulness of Historical Newspaper Clippings. A User Study

    Publisher Copyright: © 2022 Copyright for this paper by its authors. Effects of Optical Character Recognition (OCR) quality on historical information retrieval have so far been studied in data-oriented scenarios regarding the effectiveness of retrieval results. Such studies have either focused on the effects of artificially degraded OCR quality (see, e.g., [1-2]) or utilized test collections containing texts based on authentic low-quality OCR data (see, e.g., [3]). In this paper the effects of OCR quality are studied in a user-oriented information retrieval setting. Thirty-two users subjectively evaluated the query results of six topics each (out of 30 topics) based on pre-formulated queries in a simulated work task setting. To the best of our knowledge, our simulated work task experiment is the first to show empirically that users' subjective relevance assessments of retrieved documents are affected by a change in the quality of optically read text. Users of historical newspaper collections have so far commented on the effects of OCR data quality mainly in impressionistic ways, and controlled user environments for studying the effects of OCR quality on users' relevance assessments of retrieval results have so far been missing. To remedy this, the National Library of Finland (NLF) set up an experimental query environment for the contents of one Finnish historical newspaper, Uusi Suometar 1869-1918, to compare users' evaluations of search results at two different OCR qualities for digitized newspaper articles. The query interface could present the same underlying document to the user in one of two versions, based either on the lower or on the higher OCR quality, and the choice was randomized. The users did not know about the quality differences in the article texts they evaluated.
    The main result of the study is that improved optical character recognition quality significantly affects the perceived usefulness of historical newspaper articles. The mean average evaluation score for the improved OCR results was 7.94% higher than the mean average evaluation score of the old OCR results. Peer reviewed

    Task information types related to data gathering in media studies

    Purpose: The purpose of this paper is to examine what types of task information media scholars need while gathering research data to create new knowledge. Design/methodology/approach: The research design is qualitative and user-oriented. A total of 25 media scholars were interviewed about their research processes and interactions with their research data. The interviews were semi-structured, complemented by critical incident interviews. The analysis focused on the activity of gathering research data. A typology of information (task, domain and task-solving information) guided the analysis of information types related to data gathering, with further analysis focusing only on task information types. Findings: Media scholars needed the following task information types while gathering research data to create new knowledge: (1) information about research data (aboutness of data, characteristics of data, metadata and secondary information about data), (2) information about sources of research data (characteristics of sources, local media landscapes) and (3) information about cases and their contexts (case information, contextual information). All the task information types should be considered when building data services and tools to support media scholars' work. Originality/value: The paper increases understanding of the concept of task information in the context of gathering research data to create new knowledge and thereby informs the providers of research data services about the task information types that researchers need. Peer reviewed

    Analyzing gender clues in war-time letters

    Many historians struggle with information needs that cannot be directly served by information access systems. Satisfying these needs often requires reasoning about and interpreting pieces of information in context, from user-specific viewpoints. One common need in studying historical phenomena is to identify what indicates gender in historical text. We call such textual indicators ‘gender clues’ because they help satisfy information needs regarding the concept of gender. In this article, we analyze gender clues qualitatively and present a typology of them based on a set of private letters from the Second World War in Finland. We also discuss the general need to create metadata to support the historian’s explorations from specific viewpoints, especially in small and noisy collections that are common in the historical domain. Peer reviewed

    Targeted Query Expansions as a Method for Searching Mixed Quality Digitized Cultural Heritage Documents

    Digitization of cultural heritage is a huge ongoing effort in many countries. In digitized historical documents, words may occur in different surface forms due to three types of variation: morphological variation, historical variation, and errors in optical character recognition (OCR). Because individual documents may differ significantly from each other in the level of such variation, digitized collections may contain documents of mixed quality. Such different types of documents may require different types of retrieval methods. We suggest using targeted query expansions (QE) to access documents in mixed-quality text collections. In QE the user-given search term is replaced by a set of expansion keys (search words); in targeted QE the selection of expansion terms is based on the type of surface-level variation occurring in the particular text searched. We illustrate our approach in a highly inflectional compounding language, Finnish, although such variation occurs across all natural languages. We report a minimal-scale experiment based on the QE method and discuss the need to support targeted QEs in the search interface.
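The idea of targeted QE described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the toy suffix list, the v/w spelling rule, and the character-confusion table are all assumptions made for the example, and keeping the original term among the expansion keys is likewise an assumption.

```python
from typing import Callable, Dict, List


def morphological_keys(term: str) -> List[str]:
    # Toy assumption: append a few Finnish case endings; real morphology needs a generator.
    return [term + suffix for suffix in ("n", "ssa", "sta")]


def historical_keys(term: str) -> List[str]:
    # Toy assumption: older spelling with "w" where the modern form has "v".
    variant = term.replace("v", "w")
    return [variant] if variant != term else []


def ocr_keys(term: str) -> List[str]:
    # Toy assumption: a tiny table of characters an OCR engine might confuse.
    confusions = {"l": "1", "u": "n"}
    return [
        term[:i] + confusions[ch] + term[i + 1:]
        for i, ch in enumerate(term)
        if ch in confusions
    ]


# Hypothetical registry mapping a variation type to its key generator.
GENERATORS: Dict[str, Callable[[str], List[str]]] = {
    "morphological": morphological_keys,
    "historical": historical_keys,
    "ocr": ocr_keys,
}


def targeted_expansion(term: str, variation_types: List[str]) -> List[str]:
    """Expand a term using only the variation types present in the text searched."""
    keys = [term]
    for variation in variation_types:
        for key in GENERATORS[variation](term):
            if key not in keys:  # keep order, drop duplicates
                keys.append(key)
    return keys


if __name__ == "__main__":
    # A document with historical spelling and OCR noise, but no need for inflected forms.
    print(targeted_expansion("vaali", ["historical", "ocr"]))
```

The point of the sketch is the selection step: the set of generators applied depends on the kind of variation judged to occur in the target document, rather than every expansion being applied to every query.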

    Luonnon uraanisarjan isotooppisuhteiden massaspektrometrinen määritys [Mass spectrometric determination of isotope ratios in the natural uranium series]

    No full text

    Mineral and colloid surface phenomena in the final disposal of radioactive

    No full text

    Hapetus-pelkistysilmiöt ja niiden merkitys käytetyn ydinpolttoaineen loppusijoitustilassa [Redox phenomena and their significance in the final repository for spent nuclear fuel]

    No full text

    Actinide speciation by laser spectroscopic methods

    No full text

    Kivinäytteiden diffusiviteetti ja sähkönjohtavuusmittaukset [Diffusivity and electrical conductivity measurements of rock samples]

    No full text