30 research outputs found

    Characteristics of Duplicate Records in OCLC's Online Union Catalog

    Duplicate records in the Online Union Catalog of the OCLC Online Computer Library Center, Inc., were analyzed. Bibliographic elements comprise information found in one or more fields of a bibliographic record; e.g., the author element comprises the main and added author entry fields. Bibliographic element mismatches in duplicate record pairs were considered relative to the number of records in which each element was present. When a single element differed in a duplicate record pair, that element was most often publication date. This finding shows that a difference in the date of publication is not a reliable indicator of bibliographic uniqueness. General cataloging and data entry patterns such as variations in title transcription and form of name, typographical errors, mistagged fields, misplaced subfield codes, omissions, and inconsistencies between fixed and variable fields often caused records that were duplicates to appear different. These factors can make it extremely difficult for catalogers to retrieve existing bibliographic records and thus avoid creating duplicate records. They also prevent duplicate detection algorithms used for tape-loading records from achieving desired results. An awareness of particularly problematic bibliographic elements and general factors contributing to the creation of duplicate records should help catalogers identify and accept existing records more often. This awareness should also help to direct system designers in their development of more sensitive algorithms to be used for tape loading. The resulting general reduction in the number of duplicate records in union catalogs will be a major step toward increased cataloger productivity, user satisfaction, and overall online database quality.

    An Exploration of Relationships in Female Prisons- Content Analysis from the Original TV Series Orange Is the New Black

    The researcher examined the Netflix Original Series, Orange Is the New Black, looking at the media's portrayal of female relationships inside a women's federal institution. This research compiled data over all thirteen episodes using content analysis research to explain a constant comparative narrative over characters and their relationships throughout this TV series. This research compares a literature review on female prisoners and their relationships to the media's portrayal in this series, and considers whether or not this data depicts them accurately.

    Attitudes of parents toward certain aspects of family life education in a Kansas high school

    Digitized by Kansas State University Libraries

    Misinformation and Bias in Metadata Processing: Matching in Large Databases

    This article discusses structural, systems, and other types of bias that arise in matching new records to large databases. The focus is databases for bibliographic utilities, but other related database concerns will be discussed. Problems of satisfying a “match” with sufficient flexibility and rigor in an environment of imperfect data are presented, and sources of unintentional variance are discussed.

    GLIMIR: Manifestation and Content Clustering within WorldCat

    The GLIMIR project at OCLC clusters and assigns an identifier to WorldCat records representing the same manifestation. These include parallel records in different languages (e.g., a record with English descriptive notes and subject headings and one for the same book with French equivalents). It also clusters records that probably represent the same manifestation, but which could not be safely merged by OCLC's Duplicate Detection and Resolution (DDR) program for various reasons. As the project progressed, it became clear that it would also be useful to create content-based clusters for groups of manifestations that are generally equivalent from the end user perspective (e.g., the original print text with its microform, ebook and reprint versions, but not new editions). Lessons from the GLIMIR project have improved OCLC's duplicate detection program through the introduction of new matching techniques. GLIMIR has also had unexpected benefits for OCLC's FRBR algorithm by providing new methods for identifying outliers, thus enabling more records to be included in the correct work cluster.