Search CORE

13 research outputs found

Optimizing the neural network training for OCR error correction of historical Hebrew texts

Author: Elmalech Avshalom
Suissa Omri
Zhitomirsky-Geffet Maayan
Publication venue: 'iSchools'
Publication date: 23/03/2020
Field of study

Over the past few decades, large archives of paper-based documents such as books and newspapers have been digitized using Optical Character Recognition. This technology is error-prone, especially for historical documents. To correct OCR errors, post-processing algorithms have been proposed based on natural language analysis and machine learning techniques such as neural networks. Neural network's disadvantage is the vast amount of manually labeled data required for training, which is often unavailable. This paper proposes an innovative method for training a light-weight neural network for Hebrew OCR post-correction using significantly less manually created data. The main research goal is to develop a method for automatically generating language and task-specific training data to improve the neural network results for OCR post-correction, and to investigate which type of dataset is the most effective for OCR post-correction of historical documents. To this end, a series of experiments using several datasets was conducted. The evaluation corpus was based on Hebrew newspapers from the JPress project. An analysis of historical OCRed newspapers was done to learn common language and corpus-specific OCR errors. We found that training the network using the proposed method is more effective than using randomly generated errors. The results also show that the performance of the neural net-work for OCR post-correction strongly depends on the genre and area of the training data. Moreover, neural networks that were trained with the proposed method outperform other state-of-the-art neural networks for OCR post-correction and complex spellcheckers. These results may have practical implications for many digital humanities projects

Illinois Digital Environment for Access to Learning and Scholarship Repository

Recommended from our members

Problem restructuring for better decision making in recurring decision situations

Author: Elmalech Avshalom
Grosz Barbara J.
Sarne David
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/07/2014
Field of study

This paper proposes the use of restructuring information about choices to improve the performance of computer agents on recurring sequentially dependent decisions. The intended situations of use for the restructuring methods it defines are website platforms such as electronic marketplaces in which agents typically engage in sequentially dependent decisions. With the proposed methods, such platforms can improve agents’ experience, thus attracting more customers to their sites. In sequentially-dependent-decisions settings, decisions made at one time may affect decisions made later; hence, the best choice at any point depends not only on the options at that point, but also on future conditions and the decisions made in them. This “problem restructuring” approach was tested on sequential economic search, which is a common type of recurring sequentially dependent decision-making problem that arises in a broad range of areas. The paper introduces four heuristics for restructuring the choices that are available to decision makers in economic search applications. Three of these heuristics are based on characteristics of the choices, not of the decision maker. The fourth heuristic requires information about a decision-makers prior decision-making, which it uses to classify the decision-maker. The classification type is used to choose the best of the three other heuristics. The heuristics were extensively tested on a large number of agents designed by different people with skills similar to those of a typical agent developer. The results demonstrate that the problem-restructuring approach is a promising one for improving the performance of agents on sequentially dependent decisions. Although there was a minor degradation in performance for a small portion of the agents, the overall and average individual performance improved substantially. Complementary experimentation with people demonstrated that the methods carry over, to some extent, also to human decision makers. Interestingly, the heuristic that adapts based on a decision-maker’s history achieved the best results for computer agents, but not for people.Engineering and Applied Science

Harvard University - DASH

Toward the Optimized Crowdsourcing Strategy for OCR Post-Correction

Author: Elmalech Avshalom
Suissa Omri
Zhitomirsky-Geffet Maayan
Publication venue: 'Emerald'
Publication date: 12/06/2021
Field of study

Digitization of historical documents is a challenging task in many digital humanities projects. A popular approach for digitization is to scan the documents into images, and then convert images into text using Optical Character Recognition (OCR) algorithms. However, the outcome of OCR processing of historical documents is usually inaccurate and requires post-processing error correction. This study investigates how crowdsourcing can be utilized to correct OCR errors in historical text collections, and which crowdsourcing methodology is the most effective in different scenarios and for various research objectives. A series of experiments with different micro-task's structures and text lengths was conducted with 753 workers on the Amazon's Mechanical Turk platform. The workers had to fix OCR errors in a selected historical text. To analyze the results, new accuracy and efficiency measures have been devised. The analysis suggests that in terms of accuracy, the optimal text length is medium (paragraph-size) and the optimal structure of the experiment is two-phase with a scanned image. In terms of efficiency, the best results were obtained when using longer text in the single-stage structure with no image. The study provides practical recommendations to researchers on how to build the optimal crowdsourcing task for OCR post-correction. The developed methodology can also be utilized to create golden standard historical texts for automatic OCR post-correction. This is the first attempt to systematically investigate the influence of various factors on crowdsourcing-based OCR post-correction and propose an optimal strategy for this process.Comment: 25 pages, 12 figures, 1 tabl

arXiv.org e-Print Archive

Recommended from our members

"But you Promised": Methods to Improve Crowd Engagement In Non-Ground Truth Tasks

Author: Elmalech Avshalom
Grosz Barbara J.
Publication venue
Publication date: 21/09/2017
Field of study

Crowdsourcing platforms were initially designed to recruit people to perform tasks that were simple cognitively but difficult for computers. One challenge in these settings is to identify an incentive mechanism for motivating workers to complete tasks and do high-quality work. Previous research has studied the use of financial incentive mechanisms and social comparison as motivators. These mechanisms can only be applied to ground truth tasks, tasks for which there is an objective performance scale. In this paper, we define and compare three innovative methods for improving worker engagement on non-ground truth tasks drawing on a psychological theory of commitment. The three methods are similar in asking participants to promise they will complete a task, but they differ in terms of how the commitment is made. In the first method, participants commit by signing a contract; in the second, by listening to a recording; in the third, by recording a personal commitment. The last two methods significantly improved the task completion rate when compared to two baseline conditions. The methods we propose can be implemented simply, can be used for any task, and do not affect participants' behavior other than by improving their engagement.Engineering and Applied Science

Harvard University - DASH

Association for the Advancement of Artificial Intelligence: AAAI Publications

Recommended from our members

Less is more: Restructuring decisions to improve agent search

Author: Elmalech Avshalom
Geva Moti
Grosz Barbara J.
Sarne David
Publication venue: International Foundation for Autonomous Agents and Multiagent Systems
Publication date: 06/05/2016
Field of study

In many settings and for various reasons, people fail to make optimal decisions. These factors also influence the agents people design to act on their behalf in such virtual environments as eCommerce and distributed operating systems, so that the agents also act sub-optimally despite their greater computational capabilities. In some decision-making situations it is theoretically possible to supply the optimal strategy to people or their agents, but this optimal strategy may be non-intuitive, and providing a convincing explanation of optimality may be complex. This paper explores an alternative approach to improving the performance of a decision-maker in such settings: the data on choices is manipulated to guide searchers to a strategy that is closer to optimal. This approach was tested for sequential search, which is a classical sequential decision-making problem with broad areas of applicability (e.g., product search, partnership search). The paper introduces three heuristics for manipulating choices, including one for settings in which repeated interaction or access to a decision-maker's past history is available. The heuristics were evaluated on a large population of computer agents, each of which embodies a search strategy programmed by a different person. Extensive tests on thousands of search settings demonstrate the promise of the problem-restructuring approach: despite a minor degradation in performance for a small portion of the population, the overall and average individual performance improve substantially. The heuristic that adapts based on a decision-maker's history achieved the best results.Engineering and Applied Science

Harvard University - DASH

Problem restructuring for better decision making in recurring decision situations

Author: A Burgess
A Grosfeld-Nir
A Klos
A Rapoport
A Rosenfeld
A Schotter
A Tversky
AN Thorndike
Avshalom Elmalech
Barbara J. Grosz
C Harries
C Starmer
D Kahneman
D Kahneman
D Kahneman
D Kleinmuntz
D Sarne
D Sarne
D Schunk
DA Redelmeier
David Sarne
DH Wedell
EH Simpson
G Barron
G Dushnitsky
G Keren
G Keren
G Paolacci
G Shackle
GC Blackhart
GI Kempen
H Liu
H Montgomery
HA Simon
HJ Einhorn
J Quiggin
J Sonnemans
JD Hey
JD Hey
JH Roberts
JH Roberts
JO Kephart
K Natarajan
L Hogg
LR Klein
M Bertrand
M Brown
M Chalamish
M Rabin
MD Lee
ML Dekay
ML Weitzman
P Bharati
P Moon
RA Drake
RH Thaler
RH Thaler
S Kraus
S Kraus
S Shamoun
T Bench-Capon
T Dudey
T Ferguson
T Hills
T Langer
T Tanaka
VI Danilov
WP Dommermuth
Y Gal
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Trust and attitude toward information presented using augmented reality and other technological means

Author: Avshalom Elmalech
Gilad Taub
Noa Aharony
Publication venue: Elsevier
Publication date: 01/02/2024
Field of study

In recent years, augmented reality (AR) technology has grown, and its use has become widespread among smartphone users. People are consuming more and more digital information from various sources and in different presentation modes. Therefore, in this study, we investigate the extent to which different presentation modes relate to the level of trust in information, while considering demographic variables, as well as personality traits and thinking styles. The participants in our experiments were asked to indicate whether certain statements that were presented in various presentation methods (image + text, image + audio, AR + text, AR + audio) were true or false. The results indicate that users are more likely to trust statements that are accompanied by AR than statements that are accompanied by a static image. In addition, younger participants have greater trust in audio-presented information than text-presented information. As AR is expected to grow considerably in popularity in the next few years, users should be cautious of the potential impact on their trust in digital information while using AR

Directory of Open Access Journals

When Suboptimal Rules

Author: Elmalech Avshalom
Erez Eden
Rosenfeld Avi
Sarne David
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 16/02/2015
Field of study

This paper represents a paradigm shift in what advice agents should provide people. Contrary to what was previously thought, we empirically show that agents that dispense optimal advice will not necessary facilitate the best improvement in people's strategies. Instead, we claim that agents should at times suboptimally advise. We provide results demonstrating the effectiveness of a suboptimal advising approach in extensive experiments in two canonical mixed agent-human advice-giving domains. Our proposed guideline for suboptimal advising is to rely on the level of intuitiveness of the optimal advice as a measure for how much the suboptimal advice presented to the user should drift from the optimal value

CiteSeerX

Association for the Advancement of Artificial Intelligence: AAAI Publications

Enhancing Crowdworkers' Vigilance

Author: David Esther
Elmalech Avshalom
Hajaj Chen
Sarne David
Publication venue: 'International Joint Conferences on Artificial Intelligence'
Publication date: 06/02/2018
Field of study

This paper presents methods for improving the attention span of workers in tasks that heavily rely on their attention to the occurrence of rare events. The underlying idea in our approach is to dynamically augment the task with some dummy (artificial) events at different times throughout the task, rewarding the worker upon identifying and reporting them. The proposed approach is an alternative to the traditional approach of exclusively relying on rewarding the worker for successfully identifying the event of interest itself. We propose three methods for timing the dummy events throughout the task. Two of these methods are static and determine the timing of the dummy events at random or uniformly throughout the task. The third method is dynamic and uses the identification (or misidentification) of dummy events as a signal for the worker's attention to the task, adjusting the rate of dummy events generation accordingly.Engineering and Applied Science

Crossref

Harvard University - DASH

Monetary Compensation and Private Information Sharing in Augmented Reality Applications

Author: Ariel Rosenfeld
Avshalom Elmalech
Gilad Taub
Noa Aharony
Publication venue: 'MDPI AG'
Publication date: 01/06/2023
Field of study

This research studied people’s responses to requests that ask for accessing their personal information when using augmented reality (AR) technology. AR is a new technology that superimposes digital information onto the real world, creating a unique user experience. As such, AR is often associated with the collection and use of personal information, which may lead to significant privacy concerns. To investigate these potential concerns, we adopted an experimental approach and examined people’s actual responses to real-world requests for various types of personal information while using a designated AR application on their personal smartphones. Our results indicate that the majority (57%) of people are willing to share sensitive personal information with an unknown third party without any compensation other than using the application. Moreover, there is variability in the individuals’ willingness to allow access to various kinds of personal information. For example, while 75% of participants were open to granting access to their microphone, only 35% of participants agreed to allow access to their contacts. Lastly, monetary compensation is linked with an increased willingness to share personal information. When no compensation was offered, only 35% of the participants agreed to grant access to their contacts, but when a low compensation was offered, 57.5% of the participants agreed. These findings combine to suggest several practical implications for the development and distribution of AR technologies

Directory of Open Access Journals