415 research outputs found

    Improving Record Linkage Accuracy with Hierarchical Feature Level Information and Parsed Data

    Get PDF
    Probabilistic record linkage is a well established topic in the literature. Fellegi-Sunter probabilistic record linkage and its enhanced versions are commonly used methods, which calculate match and non- match weights for each pair of records. Bayesian network classifiers – naive Bayes classifier and TAN have also been successfully used here. Recently, an extended version of TAN (called ETAN) has been developed and proved superior in classification accuracy to conventional TAN. However, no previous work has applied ETAN to record linkage and investigated the benefits of using naturally existing hierarchical feature level information and parsed fields of the datasets. In this work, we ex- tend the naive Bayes classifier with such hierarchical feature level information. Finally we illustrate the benefits of our method over previously proposed methods on 4 datasets in terms of the linkage performance (F1 score). We also show the results can be further improved by evaluating the benefit provided by additionally parsing the fields of these datasets

    Brane-World Gravity

    Get PDF
    The observable universe could be a 1+3-surface (the "brane") embedded in a 1+3+\textit{d}-dimensional spacetime (the "bulk"), with Standard Model particles and fields trapped on the brane while gravity is free to access the bulk. At least one of the \textit{d} extra spatial dimensions could be very large relative to the Planck scale, which lowers the fundamental gravity scale, possibly even down to the electroweak (\sim TeV) level. This revolutionary picture arises in the framework of recent developments in M theory. The 1+10-dimensional M theory encompasses the known 1+9-dimensional superstring theories, and is widely considered to be a promising potential route to quantum gravity. At low energies, gravity is localized at the brane and general relativity is recovered, but at high energies gravity "leaks" into the bulk, behaving in a truly higher-dimensional way. This introduces significant changes to gravitational dynamics and perturbations, with interesting and potentially testable implications for high-energy astrophysics, black holes, and cosmology. Brane-world models offer a phenomenological way to test some of the novel predictions and corrections to general relativity that are implied by M theory. This review analyzes the geometry, dynamics and perturbations of simple brane-world models for cosmology and astrophysics, mainly focusing on warped 5-dimensional brane-worlds based on the Randall--Sundrum models. We also cover the simplest brane-world models in which 4-dimensional gravity on the brane is modified at \emph{low} energies -- the 5-dimensional Dvali--Gabadadze--Porrati models. Then we discuss co-dimension two branes in 6-dimensional models.Comment: A major update of Living Reviews in Relativity 7:7 (2004) "Brane-World Gravity", 119 pages, 28 figures, the update contains new material on RS perturbations, including full numerical solutions of gravitational waves and scalar perturbations, on DGP models, and also on 6D models. A published version in Living Reviews in Relativit

    Structure Discovery in Mixed Order Hyper Networks

    Get PDF
    Background  Mixed Order Hyper Networks (MOHNs) are a type of neural network in which the interactions between inputs are modelled explicitly by weights that can connect any number of neurons. Such networks have a human readability that networks with hidden units lack. They can be used for regression, classification or as content addressable memories and have been shown to be useful as fitness function models in constraint satisfaction tasks. They are fast to train and, when their structure is fixed, do not suffer from local minima in the cost function during training. However, their main drawback is that the correct structure (which neurons to connect with weights) must be discovered from data and an exhaustive search is not possible for networks of over around 30 inputs.  Results  This paper presents an algorithm designed to discover a set of weights that satisfy the joint constraints of low training error and a parsimonious model. The combined structure discovery and weight learning process was found to be faster, more accurate and have less variance than training an MLP.  Conclusions  There are a number of advantages to using higher order weights rather than hidden units in a neural network but discovering the correct structure for those weights can be challenging. With the method proposed in this paper, the use of high order networks becomes tractable

    Stochastic Gravity: Theory and Applications

    Get PDF
    Whereas semiclassical gravity is based on the semiclassical Einstein equation with sources given by the expectation value of the stress-energy tensor of quantum fields, stochastic semiclassical gravity is based on the Einstein-Langevin equation, which has in addition sources due to the noise kernel. In the first part, we describe the fundamentals of this new theory via two approaches: the axiomatic and the functional. In the second part, we describe three applications of stochastic gravity theory. First, we consider metric perturbations in a Minkowski spacetime, compute the two-point correlation functions of these perturbations and prove that Minkowski spacetime is a stable solution of semiclassical gravity. Second, we discuss structure formation from the stochastic gravity viewpoint. Third, we discuss the backreaction of Hawking radiation in the gravitational background of a black hole and describe the metric fluctuations near the event horizon of an evaporating black holeComment: 100 pages, no figures; an update of the 2003 review in Living Reviews in Relativity gr-qc/0307032 ; it includes new sections on the Validity of Semiclassical Gravity, the Stability of Minkowski Spacetime, and the Metric Fluctuations of an Evaporating Black Hol

    Unique establishment of procephalic head segments is supported by the identification of cis-regulatory elements driving segment-specific segment polarity gene expression in Drosophila

    Get PDF
    Anterior head segmentation is governed by different regulatory mechanisms than those that control trunk segmentation in Drosophila. For segment polarity genes, both initial mode of activation as well as cross-regulatory interactions among them differ from the typical genetic circuitry in the trunk and are unique for each of the procephalic segments. In order to better understand the segment-specific gene network responsible for the procephalic expression of the earliest active segment polarity genes wingless and hedgehog, we started to identify and analyze cis-regulatory DNA elements of these genes. For hedgehog, we could identify a cis-regulatory element, ic-CRE, that mediates expression specifically in the posterior part of the intercalary segment and requires promoter-specific interaction for its function. The intercalary stripe is the last part of the metameric hedgehog expression pattern that appears during embryonic development, which probably reflects the late and distinct establishment of this segment. The identification of a cis-regulatory element that is specific for one head segment supports the mutant-based observation that the expression of segment polarity genes is governed by a unique gene network in each of the procephalic segments. This provides further indication that the anterior-most head segments represent primary segments, which are set up independently, in contrast to the secondary segments of the trunk, which resemble true repetitive units

    Sequence Variations of Latent Membrane Protein 2A in Epstein-Barr Virus-Associated Gastric Carcinomas from Guangzhou, Southern China

    Get PDF
    Latent membrane protein 2A (LMP2A), expressed in most Epstein-Barr virus (EBV)-associated malignancies, has been demonstrated to be responsible for the maintenance of latent infection and epithelial cell transformation. Besides, it could also act as the target for a CTL-based therapy for EBV-associated malignancies. In the present study, sequence variations of LMP2A in EBV-associated gastric carcinoma (EBVaGC) and healthy EBV carriers from Guangzhou, southern China, where nasopharyngeal carcinoma (NPC) is endemic, were investigated. Widespread sequence variations in the LMP2A gene were found, with no sequence identical to the B95.8 prototype. No consistent mutation was detected in all isolates. The immunoreceptor tyrosine-based activation motif (ITAM) and PY motifs in the amino terminus of LMP2A were strictly conserved, suggesting their important roles in virus infection; while 8 of the 17 identified CTL epitopes in the transmembrane region of LMP2A were affected by at least one point mutation, which may implicate that the effect of LMP2A polymorphisms should be considered when LMP2A-targeted immunotherapy is conducted. The polymorphisms of LMP2A in EBVaGC in gastric remnant carcinoma (GRC) were for the first time investigated in the world. The LMP2A sequence variations in EBVaGC in GRC were somewhat different from those in EBVaGC in conventional gastric carcinoma. The sequence variations of LMP2A in EBVaGC were similar to those in throat washing of healthy EBV carriers, indicating that these variations are due to geographic-associated polymorphisms rather than EBVaGC-associated mutations. This, to our best knowledge, is the first detailed investigation of LMP2A polymorphisms in EBVaGC in Guangzhou, southern China, where NPC is endemic
    corecore