3,198 research outputs found
A Comparison of Blocking Methods for Record Linkage
Record linkage seeks to merge databases and to remove duplicates when unique
identifiers are not available. Most approaches use blocking techniques to
reduce the computational complexity associated with record linkage. We review
traditional blocking techniques, which typically partition the records
according to a set of field attributes, and consider two variants of a method
known as locality sensitive hashing, sometimes referred to as "private
blocking." We compare these approaches in terms of their recall, reduction
ratio, and computational complexity. We evaluate these methods using different
synthetic datafiles and conclude with a discussion of privacy-related issues.Comment: 22 pages, 2 tables, 7 figure
John Jenkins and \u3ci\u3eThe Art of Writing\u3c/i\u3e: Handwriting and Identity in the Early American Republic
A literary critique of the 1791 and 1813 editions of the American penmanship and handwriting text The Art of Writing, Reduced to a Plain and Easy System, by John Jenkins is presented. It considers aspects of national identity and individual identity, noting Jenkins\u27 concern for national unity in the first edition. The author explores social and economic facets of handwriting, reflecting on gentility, opportunity, and aesthetics
Expression of the DNA mismatch repair proteins hMLH1 and hPMS2 in normal human tissues.
hMLH1 and hPMS2 are part of the DNA mismatch repair complex. Mutations in these genes have been linked to hereditary non-polyposis colon cancer; they also occur in a variety of sporadic cancers. Western blot analysis and immunohistochemistry demonstrated that hMLH1 and hPMS2 are widely expressed nuclear proteins with a distribution pattern very similar to that previously described for hMSH2. These observations showing similar localization of hMLH1 and hPMS2 with hMSH2 are consistent with the biochemical function of these proteins in DNA mismatch repair
Low Frequency Quantum Transport in a Three-probe Mesoscopic Conductor
The low frequency quantum transport properties of a three-probe mesoscopic
conductor are studied using B\"uttiker's AC transport formalism. The static
transmission coefficients and emittance matrix of the system were computed by
explicitly evaluating the various partial density of states (PDOS). We have
investigated the finite size effect of the scattering volume on the global
PDOS. By increasing the scattering volume we observed a gradual improvement in
the agreement of the total DOS as computed externally or locally. Our numerical
data permits a particular fitting form of the finite size effect.Comment: 13 pages, LaTeX, submitted to Phys. Rev.
Unusual Closed Traumatic Avulsion of Both Flexor Tendons in Zones 1 and 3 of the Little Finger.
Closed tendon avulsion of both flexor tendons in the same finger is an extremely rare condition. We encountered the case of a patient who presented a rupture of the flexor digitorum profundus in zone 1 and flexor digitorum superficialis in zone 3 in the little finger. This occurrence has not been reported previously. We hereby present our case, make a review of the literature of avulsion of both flexor tendons of the same finger, and propose a treatment according to the site of the ruptures
ERBlox: Combining Matching Dependencies with Machine Learning for Entity Resolution
Entity resolution (ER), an important and common data cleaning problem, is
about detecting data duplicate representations for the same external entities,
and merging them into single representations. Relatively recently, declarative
rules called matching dependencies (MDs) have been proposed for specifying
similarity conditions under which attribute values in database records are
merged. In this work we show the process and the benefits of integrating three
components of ER: (a) Classifiers for duplicate/non-duplicate record pairs
built using machine learning (ML) techniques, (b) MDs for supporting both the
blocking phase of ML and the merge itself; and (c) The use of the declarative
language LogiQL -an extended form of Datalog supported by the LogicBlox
platform- for data processing, and the specification and enforcement of MDs.Comment: To appear in Proc. SUM, 201
On narrowing coated conductor film: emergence of granularity-induced field hysteresis of transport critical current
Critical current density Jc in polycrystalline or granular superconducting
material is known to be hysteretic with applied field H due to the focusing of
field within the boundary between adjacent grains. This is of concern in the
so-called coated conductors wherein superconducting film is grown on a
granular, but textured surface of a metal substrate. While previous work has
mainly been on Jc determined using induced or magnetization currents, the
present work utilizes transport current via an applied potential in strip
geometry. It is observed that the effect is not as pronounced using transport
current, probably due to a large difference in criterion voltage between the
two types of measurements. However, when the films are narrowed by patterning
into 200-, 100-, or 80-micron, the hysteresis is clearly seen, because of the
forcing of percolation across higher-angle grain boundaries. This effect is
compared for films grown on ion-beam-assisted-deposited (IBAD) YSZ substrate
and those grown on rolling-assisted-biaxially-textures substrates (RABiTS)
which have grains that are about ten times larger. The hysteresis is more
pronounced for the latter, which is more likely to have a weak grain boundary
spanning the width of the microbridge. This is also of concern to applications
in which coated conductors will be striated in order to reduce of AC losses.Comment: text-only: 10 pages, plus 5 figures on 5 page
- …