Search CORE

39 research outputs found

Measuring and improving data quality of media collections for professional tasks

Author: Traub M.C. (Myriam)
Publication venue: ACM New York, NY, USA
Publication date: 01/08/2014
Field of study

Carrying out research tasks on data collections is hampered, or even made impossible, by data quality issues of different types, such as incompleteness or inconsistency, and severity. We identify research tasks carried out by professional users of data collections that are hampered by inherent quality issues. We investigate what types of issues exist and how they influence these research tasks. To measure the quality perceived by professional users, we develop a quality metric. This allows us to measure the suitability of the data quality for a chosen user ta

CWI's Institutional Repository

Measuring Tool Bias and Improving Data Quality for Digital Humanities Research

Author: Traub M.C. (Myriam)
Publication venue
Publication date: 11/05/2020
Field of study

CWI's Institutional Repository

Impact Analysis of OCR Quality on Research Tasks in Digital Archives

Author: Hardman L. (Lynda)
Ossenbruggen J.R. (Jacco) van
Traub M.C. (Myriam)
Publication venue
Publication date: 01/09/2015
Field of study

Humanities scholars increasingly rely on digital archives for their research in place of time-consuming visits to physical archives. This shift in research methodology has the hidden cost of working with digi- tally processed historical documents: how much trust can a scholar place in noisy representations of source texts? In a series of interviews with historians about their use of digital archives, we found that scholars are aware that optical character recognition (OCR) errors may bias their results. They were, however, unable to quantify this bias or to indicate what information they would need to estimate it. Based on the interviews and a literature study, we provide a classification scheme relating schol- arly research tasks to their specific OCR-induced uncertainty and the data required for more reliable uncertainty estimations. We conducted a use case study on a national newspaper archive with example research tasks. From this we learned what data is typically available in digital archives and how it could be used to reduce and/or assess the uncer- tainty in result sets. We conclude that the current knowledge situation on the users’ side as well as on the tool makers and data providers’ side is insufficient and needs further research to be improved

CWI's Institutional Repository

Second Screen Interactions for Automatically Web-Enriched Broadcast Video

Author: Hardman L. (Lynda)
Leyssen M.H.R. (Mieke)
Pérez Romero L. (Lilia)
Traub M.C. (Myriam)
Publication venue
Publication date: 01/01/2013
Field of study

Including hypermedia in broadcast video combines content formatted for a lean-forward medium (the Web) with a lean-back one (TV) to form a hybrid medium. We identify four challenges for interacting with and experiencing this new medium. We discuss the role a second screen may play in addressing these challenges

CWI's Institutional Repository

Is it a bird or is it a crow? The influence of presented tags on image tagging by non-expert users

Author: Hardman L. (Lynda)
Leyssen M.H.R. (Mieke)
Ossenbruggen J.R. (Jacco) van
Traub M.C. (Myriam)
Publication venue: CWI
Publication date: 01/12/2012
Field of study

Cultural heritage institutes often make use of tags to facilitate searching their collections. While professionals associated with these institutes are able to add high quality descriptions to objects in the collections, both their time and their areas of expertise are limited. As a result, online tagging by non-professional users is more frequently becoming deployed to increase the number of tags. When these users are asked to tag objects in the collection, they can be confronted with tags submitted by other users. These tags may be of varying quality and present in differing numbers, both of which may influence users' tagging behavior. We report on a study on the impact of presenting different types of tags on the quality and quantity of tags added by users. We conclude that there is no difference in the quality and quantity of added tags in all experimental conditions, with the exception of the condition in which incorrect tags were presented. In this condition, the quality of the tags added by users decreased. We discuss the implications of these findings on the design of tagging interfaces

CWI's Institutional Repository

Measuring the Effectiveness of Gamesourcing Expert Oil Painting Annotations

Author: Hardman L. (Lynda)
He J. (Jiyin)
Ossenbruggen J.R. (Jacco) van
Traub M.C. (Myriam)
Publication venue
Publication date: 01/04/2014
Field of study

Tasks that require users to have expert knowledge are diffi- cult to crowdsource. They are mostly too complex to be carried out by non-experts and the available expert

CWI's Institutional Repository

Intricate visibility effects from resolved emission of young stellar objects: the case of MWC158 observed with the VLTI

Author: Benisty Myriam
Berger Jean-Philippe
Bouquin Jean-Baptiste Le
Kluska Jacques
Lazareff Bernard
Malbet Fabien
Menard François
Millan-Gabet Rafael
Pinte Christophe
Traub Wesley
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 01/08/2012
Field of study

In the course of our VLTI young stellar object PIONIER imaging program, we have identified a strong visibility chromatic dependency that appeared in certain sources. This effect, rising value of visibilities with decreasing wavelengths over one base, is also present in previous published and archival AMBER data. For Herbig AeBe stars, the H band is generally located at the transition between the star and the disk predominance in flux for Herbig AeBe stars. We believe that this phenomenon is responsible for the visibility rise effect. We present a method to correct the visibilities from this effect in order to allow "gray" image reconstruction software, like Mira, to be used. In parallel we probe the interest of carrying an image reconstruction in each spectral channel and then combine them to obtain the final broadband one. As an illustration we apply these imaging methods to MWC158, a (possibly Herbig) B[e] star intensively observed with PIONIER. Finally, we compare our result with a parametric model fitted onto the data.Comment: 14 pages, 7 figure

arXiv.org e-Print Archive

Crossref

Caltech Authors

Gamesourcing Expert Painting Annotations

Author: Hardman L. (Lynda)
He J. (Jiyin)
Ossenbruggen J.R. (Jacco) van
Traub M.C. (Myriam)
Publication venue
Publication date: 01/04/2014
Field of study

Online collections provided by museums are increasingly opened for contributions from users outside the museum. These initiatives are mostly targeted at obtaining tags describing aspects of artworks that are common knowledge. This does not require the contributors to have specific skills or knowledge. Museums, however, are also interested in obtaining very specific information about the subject matter of their artworks. We present a game that can help to collect expert knowledge by enabling non-expert users to perform an expert annotation task. This is achieved by simplifying the expert task and providing a sufficient level of annotation support to the users. In a user study we could prove the usefulness of our approach

CWI's Institutional Repository

Crowd vs Experts: Nichesourcing for Knowledge Intensive Tasks in Cultural Heritage

Author: Aroyo L. (Lora)
Bozzon A.
Dijkshoorn C. (Chris)
Houben G.J.
Leyssen M.H.R. (Mieke)
Nottamkandath A.
Oosterman J.
Traub M.C. (Myriam)
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 01/04/2014
Field of study

The results of our exploratory study provide new insights to crowdsourcing knowledge intensive tasks. We designed and performed an annotation task on a print collection of the Rijksmuseum Amsterdam, involving experts and crowd workers in the domain-specific description of depicted flowers. We created a testbed to collect annotations from flower experts and crowd workers and analyzed these in regard to user agreement. The findings show promising results, demonstrating how, for given categories, nichesourcing can provide useful annotations by connecting crowdsourcing to domain expertise

CWI's Institutional Repository

Sculpting the disk around T Cha: an interferometric view

Author: Augereau Jean-Charles
Benisty Myriam
Berger Jean-Philippe
Bouquin Jean-Baptiste Le
Burtscher Leonard
Crida Aurélien
Henning Thomas
Lacour Sylvestre
Lazareff Bernard
Malbet Fabien
Meeus Gwendolyn
Ménard François
Olofsson Johan
Pinte Christophe
Ratzka Thorsten
Traub Wesley A.
Publication venue: 'EDP Sciences'
Publication date: 01/01/2013
Field of study

(Abridged) Circumstellar disks are believed to be the birthplace of planets and are expected to dissipate on a timescale of a few Myr. The processes responsible for the removal of the dust and gas will strongly modify the radial distribution of the dust and consequently the SED. In particular, a young planet will open a gap, resulting in an inner disk dominating the near-IR emission and an outer disk emitting mostly in the far-IR. We analyze a full set of data (including VLTI/Pionier, VLTI/Midi, and VLT/NaCo/Sam) to constrain the structure of the transition disk around TCha. We used the Mcfost radiative transfer code to simultaneously model the SED and the interferometric observations. We find that the dust responsible for the emission in excess in the near-IR must have a narrow temperature distribution with a maximum close to the silicate sublimation temperature. This translates into a narrow inner dusty disk (0.07-0.11 AU). We find that the outer disk starts at about 12 AU and is partially resolved by the Pionier, Sam, and Midi instruments. We show that the Sam closure phases, interpreted as the signature of a candidate companion, may actually trace the asymmetry generated by forward scattering by dust grains in the upper layers of the outer disk. These observations help constrain the inclination and position angle of the outer disk. The presence of matter inside the gap is difficult to assess with present-day observations. Our model suggests the outer disk contaminates the interferometric signature of any potential companion that could be responsible for the gap opening, and such a companion still has to be unambiguously detected. We stress the difficulty to observe point sources in bright massive disks, and the consequent need to account for disk asymmetries (e.g. anisotropic scattering) in model-dependent search for companions.Comment: Removed the word "first" in the abstract of the paper: "obtained with the first 4-telescope combiner (VLTI/Pionier)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Hal - Université Grenoble Alpes

EDP Sciences OAI-PMH repository (1.2.0)

HAL Descartes

HAL Université de Savoie

Repositorio Académico de la Universidad de Chile

MPG.PuRe

arXiv.org e-Print Archive