130 research outputs found

    Data-Oriented Parsing with Discontinuous Constituents and Function Tags

    Get PDF
    Statistical parsers are e ective but are typically limited to producing projective dependencies or constituents. On the other hand, linguisti- cally rich parsers recognize non-local relations and analyze both form and function phenomena but rely on extensive manual grammar development. We combine advantages of the two by building a statistical parser that produces richer analyses. We investigate new techniques to implement treebank-based parsers that allow for discontinuous constituents. We present two systems. One system is based on a string-rewriting Linear Context-Free Rewriting System (LCFRS), while using a Probabilistic Discontinuous Tree Substitution Grammar (PDTSG) to improve disambiguation performance. Another system encodes the discontinuities in the labels of phrase structure trees, allowing for efficient context-free grammar parsing. The two systems demonstrate that tree fragments as used in tree-substitution grammar improve disambiguation performance while capturing non-local relations on an as-needed basis. Additionally, we present results of models that produce function tags, resulting in a more linguistically adequate model of the data. We report substantial accuracy improvements in discontinuous parsing for German, English, and Dutch, including results on spoken Dutch

    The Visual Extent of an Object

    Get PDF
    The visual extent of an object reaches beyond the object itself. This is a long standing fact in psychology and is reflected in image retrieval techniques which aggregate statistics from the whole image in order to identify the object within. However, it is unclear to what degree and how the visual extent of an object affects classification performance. In this paper we investigate the visual extent of an object on the Pascal VOC dataset using a Bag-of-Words implementation with (colour) SIFT descriptors. Our analysis is performed from two angles. (a) Not knowing the object location, we determine where in the image the support for object classification resides. We call this the normal situation. (b) Assuming that the object location is known, we evaluate the relative potential of the object and its surround, and of the object border and object interior. We call this the ideal situation. Our most important discoveries are: (i) Surroundings can adequately distinguish between groups of classes: furniture, animals, and land-vehicles. For distinguishing categories within one group the surroundings become a source of confusion. (ii) The physically rigid plane, bike, bus, car, and train classes are recognised by interior boundaries and shape, not by texture. The non-rigid animals dog, cat, cow, and sheep are recognised primarily by texture, i.e. fur, as their projected shape varies greatly. (iii) We confirm an early observation from human psychology (Biederman in Perceptual Organization, pp. 213-263, 1981): in the ideal situation with known object locations, recognition is no longer improved by considering surroundings. In contrast, in the normal situation with unknown object locations, the surroundings significantly contribute to the recognition of most classes

    A non-perturbative determination of Z_V and b_V for O(a) improved quenched and unquenched Wilson fermions

    Get PDF
    By considering the local vector current between nucleon states and imposing charge conservation we determine, for O(a)O(a) improved Wilson fermions, its renormalisation constant and quark mass improvement coefficient. The computation is performed for both quenched and two flavour unquenched fermions.Comment: 3 pages, 4 figures, Lattice(2002)(improve

    Probabilistic Topic Modeling of the Russian Text Corpus on Musicology

    Get PDF
    The paper describes the results of experiments on the development of a statistical model of the Russian text corpus on musicology. We construct a topic model based on Latent Dirichlet Allocation and process corpus data with the help of the GenSim statistical toolkit. Results achieved in course of experiments allow us to distinguish general and special topics which describe conceptual structure of the corpus in question and to analyze paradigmatic and syntagmatic relations between lemmata within topics.The research discussed in the paper is supported by the grant of St.-Petersburg State University № 30.38.305.2014 «Quantitative linguistic parameters for defining stylistic characteristics and subject area of texts»

    Quenched QCD with fixed-point and chirally improved fermion

    Get PDF
    In this contribution we present results from quenched QCD simulations with the parameterized fixed-point (FP) and the chirally improved (CI) Dirac operator. Both these operators are approximate solutions of the Ginsparg-Wilson equation and have good chiral properties. We focus our discussion on observables sensitive to chirality. In particular we explore pion masses down to 210 MeV in light hadron spectroscopy, quenched chiral logs, the pion decay constant and the pion scattering length. We discuss finite volume effects, scaling properties of the FP and CI operators and performance issues in their numerical implementation.Comment: Lattice2002(chiral), 17 pages, 21 figures, (LaTeX style file espcrc2.sty and AMS style files

    Cumulative readings of every do not provide evidence for events and thematic roles

    Get PDF
    An argument by Kratzer (2000) based on Schein (1986, 1993) does not conclusively show that events and thematic roles are necessary ingredients of the logical representation of natural language sentences. The argument claims that cumulative readings of every can be represented only with these ingredients. But scope-splitting accounts make it possible to represent cumulative readings of every in an eventless framework. Such accounts are motivated by obligatory reconstruction effects of every and by crosslinguistic considerations. Kratzer proposes that agent but not theme occurs in the logical representation of sentences because this allows her to model subject-object asymmetries in the distribution of cumulative every. But the reason for these asymmetries seems to be that every must be c-commanded by another quantifier in order to cumulate with it, no matter what its thematic role is. So the distribution of cumulative every does not provide support for Kratzer’s proposal

    The role of sulfoglucuronosyl glycosphingolipids in the pathogenesis of monoclonal IgM paraproteinemia and peripheral neuropathy

    Get PDF
    In IgM paraproteinemia and peripheral neuropathy, IgM M-protein secretion by B cells leads to a T helper cell response, suggesting that it is antibody-mediated autoimmune disease involving carbohydrate epitopes in myelin sheaths. An immune response against sulfoglucuronosyl glycosphingolipids (SGGLs) is presumed to participate in demyelination or axonal degeneration in the peripheral nervous system (PNS). SGGLs contain a 3-sulfoglucuronic acid residue that interacts with anti-myelin-associated glycoprotein (MAG) and the monoclonal antibody anti-HNK-1. Immunization of animals with sulfoglucuronosyl paragloboside (SGPG) induced anti-SGPG antibodies and sensory neuropathy, which closely resembles the human disease. These animal models might help to understand the disease mechanism and lead to more specific therapeutic strategies. In an in vitro study, destruction or malfunction of the blood-nerve barrier (BNB) was found, resulting in the leakage of circulating antibodies into the PNS parenchyma, which may be considered as the initial key step for development of disease
    corecore