257,274 research outputs found

    Mining Entity Synonyms with Efficient Neural Set Generation

    Full text link
    Mining entity synonym sets (i.e., sets of terms referring to the same entity) is an important task for many entity-leveraging applications. Previous work either rank terms based on their similarity to a given query term, or treats the problem as a two-phase task (i.e., detecting synonymy pairs, followed by organizing these pairs into synonym sets). However, these approaches fail to model the holistic semantics of a set and suffer from the error propagation issue. Here we propose a new framework, named SynSetMine, that efficiently generates entity synonym sets from a given vocabulary, using example sets from external knowledge bases as distant supervision. SynSetMine consists of two novel modules: (1) a set-instance classifier that jointly learns how to represent a permutation invariant synonym set and whether to include a new instance (i.e., a term) into the set, and (2) a set generation algorithm that enumerates the vocabulary only once and applies the learned set-instance classifier to detect all entity synonym sets in it. Experiments on three real datasets from different domains demonstrate both effectiveness and efficiency of SynSetMine for mining entity synonym sets.Comment: AAAI 2019 camera-ready versio

    The Synonym management process in SAREL

    Get PDF
    The specification phase is one of the most important and least supported parts of the software development process. The SAREL system has been conceived as a knowledge-based tool to improve the specification phase. The purpose of SAREL (Assistance System for Writing Software Specifications in Natural Language) is to assist engineers in the creation of software specifications written in Natural Language (NL). These documents are divided into several parts. We can distinguish the Introduction and the Overall Description as parts that should be used in the Knowledge Base construction. The information contained in the Specific Requirements Section corresponds to the information represented in the Requirements Base. In order to obtain high-quality software requirements specification the writing norms that define the linguistic restrictions required and the software engineering constraints related to the quality factors have been taken into account. One of the controls performed is the lexical analysis that verifies the words belong to the application domain lexicon which consists of the Required and the Extended lexicon. In this sense a synonym management process is needed in order to get a quality software specification. The aim of this paper is to present the synonym management process performed during the Knowledge Base construction. Such process makes use of the Spanish Wordnet developed inside the Eurowordnet project. This process generates both the Required lexicon and the Extended lexicon that will be used during the Requirements Base construction.Postprint (published version

    Nomenclatorial corrections for Dasytidae and Malachiidae (Coleoptera)

    Get PDF
    Nomenclatorial corrections are proposed for 9 cases of homonymy and 5 cases of synonymy in the Dasytidae and Malachiidae. For the homonyms, the following new names are proposed: Aplocnemus montbabor Mayor, new name, for A. baborensis Pic 1922; Dasytes loboensis Mayor, new name, for D. nevadensis Pic 1954; Dasyte sminor Mayor, new name, for D. minutus Casey 1895; Amalthocuspici Mayor, new name, for A. metallicus (Pic 1955); Attalus tribandipennis Mayor, new name, for A. tricoloripennis Pic 1927; Attalus ulkei Mayor, new name, for A. laevicollis (Horn 1872); Laiuscephalus Mayor, new name, for L. verticalis Fairmaire; Sphinginopalpus mpumalangaensis Mayor, new name, for S. nigriceps Wittmer 1994; Sphinginopalpus rufinotus Mayor, new name, for S. rufithorax Wittmer 1994. Scelopristis Mayor new genus, is proposed for the species of Pristocelis LeConte 1862, a junior synonym of Trichochrous Motschulsky 1860; the available name Hadrocnemus Kraatz 1895 is proposed for the species of Apalochrus Erichson 1840 a senior objective synonym of Paritinus Abeille de Perrin 1891. The following new subjective synonyms are proposed: Pristoscelis Le Conte 1862 is a junior synonym of Trichochrous Motschulsky 1860; Anthocomus (Paremballus) Abeille de Perrin 1891 is a junior synonym of Anthocomus (Anthocomus) Erichson 1840; Anthocomus (Neotrotus) Abeille de Perrin 1891 is a juniorsynonym of Anthocomus (Celidus) Mulsant and Rey 1867; Amanicollops Pic 1908 is a junior synonym of Hadrocnemus Kraatz 1895. Paratinus Abeille de Perrin 1891 is a junior objective synonym of Apalochrus Erichson 1840). New combinations for North American species from Attalus Erichson, Anthocomus Erichson, Microlipus LeConte and Tanaops LeConte formalized here were suggested by Mayor (2002). The Characters indicating the close relationship of Neadasytes Hatch 1962 to Dasytastes Casey 1895, and of Paradasytes Hatch 1962 to Dasytellus Casey 1895 are discussed. Afrocolotes Wittmer 1960 and Olistherarthrus Champion 1922, listed as synonyms of Temnopsophus Horn 1872 by Evers (1989: 6), are here considered distinct genera. These changes are proposed here so that they can be included in a world catalog in preparation

    New records, nomenclatural changes, and taxonomic notes for select North American leaf beetles (Coleoptera: Chrysomelidae)

    Get PDF
    New records, nomenclatural changes and taxonomic notes are presented for select North American leaf beetles (Coleoptera: Chrysomelidae). The following genera are newly recorded from the United States: Nesaecrepida Blake, 1964; Acallepitrix Bechyne, 1959; Margaridisa Bechyne, 1958; Parchicola Bechyne and Springlova de Bechyne, 1975; (all Galerucinae: Alticini); and Demotina Baly, 1874 (Eumolpinae). The following species are newly recorded from the United States: Neolema dorsalis (Olivier, 1791) (Criocerinae); Charidotella bifossulata Boheman, 1855 (Hispinae: Cassidini); Syphrea flauicollis (Jacoby, 1884) (Galerucinae: Alticini); and Promecosoma inflatum Lefevre, 1877, and Demotina modesta Baly, 1874 (Eumolpinae). The following new synonymies are proposed: Deloyala clauata, var. diuersicollis Schaeffer, 1925, transferred from synonymy with Plagiometriona clauata (Fabricius, 1798) to synonymy withP. clauata testudinaria (Boheman, 1855); Chrysomela hybrida Say, 1824, downgraded from subspecies of Calligrapha lunata (Fabricius, 1787) to full synonym of CaZZigrapha lunata (Fabricius); Chrysomela casta Rogers, 1856, downgraded from subspecies of Zygogramma suturalis (Fabricius, 1775) to full synonym of Zygogrammasuturalis (Fabricius); Trirhabdagurneyi Blake, 1951, downgraded to synonym of Trirhabda caduca Horn, 1893; Scelida mimula Wilcox, 1965, downgraded to synonym of Scelida nigricornis (Jacoby, 1888); Exoracalifornica Wilcox, 1953, downgraded to synonym of Pte leon breuicornis (Jacoby, 1887); alaeothonaarizonensis Blake, 1950, downgraded to synonym ofLupraea discrepans (Schaeffer, 1932); Haltica nigritula Linell, 1898, downgraded to synonym of Nesaecrepida asphalt ina (Suffrian, 1868); Cryptocephalus reinhardi Sundman, 1965, downgraded to synonym of Cryptocephalus nwtabilis Melsheimer, 1847. The tribes Chalepini Wiese, 1910, and Uroplatini Weise, 1910 (Hispinae), are synonymized, and both of these family-group names are recognized as nomina protecta. The family-group name Octotomites Chapuis, 1875, is recognized as a nomen oblitum. Coptocycla testudinaria Boheman, 1855, is downgraded from specific status to a subspecies of Plagiometriona clauata (Fabricius, 1798); Cassida bicolor (Fabricius, 1798) is recognized as a valid subspecies of Charidotellasexpunctata (Fabricius). The genus Hemiphrynus Horn, 1889, is removed from synonymy with Phrynocepha and reinstated as a valid genus; Rhabdopterus weisei (Schaeffer, 1920) is removed from synonymy with Rhabdopterus praetextus (Say) and reinstated as a valid species. The following new combinations are proposed: Synetocephalus wallacei (Wilcox, 1965), transferred from Pseudoluperus; Nesaecrepida infuscata (Schaeffer, 1906), transferred from Monomacra; Acallepitrix nitens (Horn, 1889), transferred from Epitrix; Margaridisa atriuentris (Melsheimer, 1847), transferred from Hornaltica; Parchicola iris (Olivier, 1808) and P. tibialis (Olivier, 1808), transferred from Monomacra; and Coleothorpa panochensis (Gilbert, 1981), transferred from Coscinoptera; Promecosoma arizonae (Crotch, 1873), transferred from Metaxyonycha; Tymnes chrysis (Olivier, 1808), T. oregonensis (Crotch, 1873), and T. thaleia (Blake, 1977), transferred from Colaspis. The following new replacement name is proposed: Triarius nigroflaulls, for Luperodes flauoniger Blake, 1942 (not Laboissiere, 1925). The identities of Griburius equestris (Olivier, 1808) and G. laruatlls (Newman, 1840) are discussed

    Bryophytes of Uganda : 6., new and additional records, 3.

    Get PDF
    12 hepatics and 32 mosses are reported new to Uganda, 1 moss being also new to Africa, and 1 liverwort new to mainland Africa. Ectropothecium plumigerum (Broth.) Hedenäs is a new combination (basionym: Isopterygium plumigerum Broth.) with a new synonym Taxicaulis plumirameus Müll.Hal. nom. nud., and Taxiphyllum maniae (Renauld & Paris) M. Fleisch. is a new synonym of Taxiphyllum taxirameum (Mitt.) M.Fleisch. Three mosses are removed from the Uganda list

    Ontogeny and Systematics of the Genus \u3ci\u3eCerophagus\u3c/i\u3e (Acari: Gaudiellidae), Mites Associated With Bumblebees

    Get PDF
    Nymphs and adults of Cerophagus nearcticus n. sp. are described from the nest of Bombus terricola occidentalis in California and phoretic associations with B. impatiens in Michigan and New York. On the basis of adult morphology, the genus Cerophagopsis is removed from synonymy with Cerophagus and retained in the family Acaridae, while Cerophagus is transferred to the family Gaudiellidae. The genus Rhypoglyphus is considered a junior subjec­tive synonym of Cerophagopsis. The genera of Gaudiellidae are noted and the genus Trigonacoptes is considered a junior subjective synonym of Gaudiella

    A synonym in the genus Fluda (Araneae: Salticidae)

    Get PDF
    A synonym in the genus Fluda (Araneae: Salticidae

    Entity Synonym Discovery via Multipiece Bilateral Context Matching

    Full text link
    Being able to automatically discover synonymous entities in an open-world setting benefits various tasks such as entity disambiguation or knowledge graph canonicalization. Existing works either only utilize entity features, or rely on structured annotations from a single piece of context where the entity is mentioned. To leverage diverse contexts where entities are mentioned, in this paper, we generalize the distributional hypothesis to a multi-context setting and propose a synonym discovery framework that detects entity synonyms from free-text corpora with considerations on effectiveness and robustness. As one of the key components in synonym discovery, we introduce a neural network model SYNONYMNET to determine whether or not two given entities are synonym with each other. Instead of using entities features, SYNONYMNET makes use of multiple pieces of contexts in which the entity is mentioned, and compares the context-level similarity via a bilateral matching schema. Experimental results demonstrate that the proposed model is able to detect synonym sets that are not observed during training on both generic and domain-specific datasets: Wiki+Freebase, PubMed+UMLS, and MedBook+MKG, with up to 4.16% improvement in terms of Area Under the Curve and 3.19% in terms of Mean Average Precision compared to the best baseline method.Comment: In IJCAI 2020 as a long paper. Code and data are available at https://github.com/czhang99/SynonymNe

    Synonym Discovery with Etymology-based Word Embeddings

    Full text link
    We propose a novel approach to learn word embeddings based on an extended version of the distributional hypothesis. Our model derives word embedding vectors using the etymological composition of words, rather than the context in which they appear. It has the strength of not requiring a large text corpus, but instead it requires reliable access to etymological roots of words, making it specially fit for languages with logographic writing systems. The model consists on three steps: (1) building an etymological graph, which is a bipartite network of words and etymological roots, (2) obtaining the biadjacency matrix of the etymological graph and reducing its dimensionality, (3) using columns/rows of the resulting matrices as embedding vectors. We test our model in the Chinese and Sino-Korean vocabularies. Our graphs are formed by a set of 117,000 Chinese words, and a set of 135,000 Sino-Korean words. In both cases we show that our model performs well in the task of synonym discovery.Comment: 6 pages, IEEE Symposium Series on Computational Intelligence (IEEE SSCI 2017
    corecore