8,595 research outputs found

    Compositional Morphology for Word Representations and Language Modelling

    This paper presents a scalable method for integrating compositional morphological representations into a vector-based probabilistic language model. Our approach is evaluated in the context of log-bilinear language models, rendered suitably efficient for implementation inside a machine translation decoder by factoring the vocabulary. We perform both intrinsic and extrinsic evaluations, presenting results on a range of languages which demonstrate that our model learns morphological representations that both perform well on word similarity tasks and lead to substantial reductions in perplexity. When used for translation into morphologically rich languages with large vocabularies, our models obtain improvements of up to 1.2 BLEU points relative to a baseline system using back-off n-gram models. Comment: Proceedings of the 31st International Conference on Machine Learning (ICML 2014).
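
    The core idea is easy to sketch: a word's representation is its own vector plus the sum of its morpheme vectors, fed into a log-bilinear scorer. The segmentation, dimensionality, and uniform context weighting below are illustrative assumptions, not the paper's actual configuration.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 4

# Toy morpheme and word embeddings; the segmentation is hypothetical.
morpheme_vecs = {m: rng.normal(size=dim) for m in ["un", "friend", "li", "ness"]}
word_vecs = {w: rng.normal(size=dim) for w in ["unfriendliness", "cat", "sat"]}

def compose(word, morphemes):
    """Compositional representation: the word's own vector plus the
    sum of its morpheme vectors (additive composition)."""
    return word_vecs[word] + sum(morpheme_vecs[m] for m in morphemes)

def lbl_next_word_probs(context_reps, candidate_reps, biases):
    """Log-bilinear scoring: a linear combination of context vectors
    (uniform scalar weights here, for brevity) dotted with each
    candidate representation, then softmax-normalised."""
    p = sum(context_reps) / len(context_reps)
    scores = np.array([p @ r + b for r, b in zip(candidate_reps, biases)])
    e = np.exp(scores - scores.max())
    return e / e.sum()

r_word = compose("unfriendliness", ["un", "friend", "li", "ness"])
context = [word_vecs["cat"], word_vecs["sat"]]
print(lbl_next_word_probs(context, [r_word, word_vecs["cat"]], [0.0, 0.0]))
```

    Factoring the vocabulary, as the abstract mentions, would replace this flat softmax with a class-then-word softmax so that normalisation only ever touches a fraction of the vocabulary.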

    Reusable Knowledge-based Components for Building Software Applications: A Knowledge Modelling Approach

    In computer science, different types of reusable components for building software applications have been proposed as a direct consequence of the emergence of new software programming paradigms. The success of these components for building applications depends on factors such as the flexibility of their combination or the ease of their selection in centralised or distributed environments such as the internet. In this article, we propose a general type of reusable component, called a primitive of representation, inspired by a knowledge-based approach that can promote reusability. The proposal can be understood as a generalisation of existing partial solutions that is applicable to both software and knowledge engineering for the development of hybrid applications that integrate conventional and knowledge-based techniques. The article presents the structure and use of the component and describes our recent experience in the development of real-world applications based on this approach.
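
    As a rough illustration only: a primitive of representation can be pictured as a component that packages a problem-solving method together with the metadata needed to select and combine it. The structure below is a hypothetical sketch, not the article's actual specification.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

@dataclass
class PrimitiveOfRepresentation:
    """Hypothetical reusable component: an encapsulated method plus the
    metadata needed to find and combine it in a catalogue."""
    name: str
    task: str                                   # problem the component addresses
    requirements: Dict[str, Any] = field(default_factory=dict)
    solve: Callable[[Dict[str, Any]], Any] = lambda inputs: None

def select(catalogue: List[PrimitiveOfRepresentation], task: str,
           constraints: Dict[str, Any]) -> List[PrimitiveOfRepresentation]:
    """Select matching components from a centralised or distributed
    catalogue by task and declared requirements."""
    return [p for p in catalogue
            if p.task == task
            and all(p.requirements.get(k) == v for k, v in constraints.items())]
```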

    Improving Language Modelling with Noise-Contrastive Estimation

    Neural language models do not scale well when the vocabulary is large. Noise-contrastive estimation (NCE) is a sampling-based method that allows for fast learning with large vocabularies. Although NCE has shown promising performance in neural machine translation, it was considered to be an unsuccessful approach for language modelling. A sufficient investigation of the hyperparameters of NCE-based neural language models was also missing. In this paper, we showed that NCE can be a successful approach to neural language modelling when the hyperparameters of the neural network are tuned appropriately. We introduced the 'search-then-converge' learning rate schedule for NCE and designed a heuristic that specifies how to use this schedule. The impact of the other important hyperparameters, such as the dropout rate and the weight-initialisation range, was also demonstrated. We showed that appropriately tuned NCE-based neural language models outperform the state-of-the-art single-model methods on a popular benchmark.
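
    Two ingredients named here are straightforward to sketch generically: the binary-classification form of the NCE objective and a 'search-then-converge' learning-rate schedule. The defaults and the exact formulation below are illustrative assumptions, not the paper's recipe.

```python
import numpy as np

def stc_lr(t, lr0=1.0, tau=1000.0):
    """'Search-then-converge' schedule: roughly constant while t << tau
    (the search phase), decaying like 1/t afterwards (the converge
    phase). lr0 and tau are illustrative defaults."""
    return lr0 / (1.0 + t / tau)

def nce_loss(s_data, s_noise, q_data, q_noise, k):
    """NCE objective for one target word: classify the observed word
    against k noise samples drawn from a noise distribution q.
    s_* are unnormalised model log-scores; q_* are noise probabilities."""
    sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))
    loss = -np.log(sigmoid(s_data - np.log(k * q_data)))
    loss -= np.sum(np.log(sigmoid(-(s_noise - np.log(k * q_noise)))))
    return loss

# Example: a data word scored 2.1 against k = 5 unigram noise samples.
print(nce_loss(2.1, np.array([0.3, -1.2, 0.8, -0.5, 0.1]),
               1e-4, np.full(5, 1e-4), k=5))
print(stc_lr(np.arange(0, 5000, 1000)))
```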

    Semantic query languages for knowledge-based web services in a construction context

    Since the early 2000s, different frameworks have been set up to enable web-based collaboration in building projects. Unfortunately, none of these initiatives was granted a long life. Recently, however, the use of web technologies in the building industry has been gaining momentum again, as some of these technologies are considered promising for reaching a more interoperable BIM practice. Specifically, this relates to (1) Linked Data and Semantic Web technologies, and (2) cloud-based applications. In order to combine these into a network of interlinked applications and datastores, an agreed-upon mechanism for automatic communication and data retrieval needs to be used. Apart from SPARQL, the W3C standard that is often considered too high a threshold for developers to implement, there are some recent GraphQL-based solutions that simplify the querying process and its implementation in web services. In this paper, we review two recent open-source technologies based on GraphQL that enable querying Linked Data on the web: GraphQL-LD and HyperGraphQL.
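
    To make this concrete, the sketch below pairs a plain GraphQL query with the JSON-LD context that grounds its fields in RDF terms, in the style of GraphQL-LD. The field names and the mapped predicates (borrowed here from the BOT and RDFS vocabularies) are illustrative assumptions, and the engine that compiles the pair into SPARQL is not shown.

```python
import json

# Illustrative GraphQL-LD input: an ordinary GraphQL query...
query = """
{
  element {
    label
  }
}
"""

# ...plus a JSON-LD context mapping its fields onto RDF predicates.
# A GraphQL-LD engine would compile query + context into a SPARQL
# query against a Linked Data source; the mapping below is invented.
context = {
    "@context": {
        "element": "https://w3id.org/bot#containsElement",
        "label": "http://www.w3.org/2000/01/rdf-schema#label",
    }
}

print(query.strip())
print(json.dumps(context, indent=2))
```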

    Behavior change interventions: the potential of ontologies for advancing science and practice

    A central goal of behavioral medicine is the creation of evidence-based interventions for promoting behavior change. Scientific knowledge about behavior change could be more effectively accumulated using "ontologies." In information science, an ontology is a systematic method for articulating a "controlled vocabulary" of agreed-upon terms and their inter-relationships. It involves three core elements: (1) a controlled vocabulary specifying and defining existing classes; (2) specification of the inter-relationships between classes; and (3) codification in a computer-readable format to enable knowledge generation, organization, reuse, integration, and analysis. This paper introduces ontologies, reviews current efforts to create ontologies related to behavior change interventions, and suggests future work. It was written by behavioral medicine and information science experts and was developed in partnership between the Society of Behavioral Medicine's Technology Special Interest Group (SIG) and the Theories and Techniques of Behavior Change Interventions SIG. In recent years, significant progress has been made in the foundational work needed to develop ontologies of behavior change. Ontologies of behavior change could facilitate a transformation of behavioral science from a field in which data from different experiments are siloed into one in which data across experiments can be compared and/or integrated. This could open up new approaches to hypothesis generation and knowledge discovery in behavioral science.
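
    The three core elements can be demonstrated with off-the-shelf Semantic Web tooling. The miniature vocabulary below, built with Python's rdflib, is an invented example rather than any of the ontologies the paper reviews.

```python
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import OWL, RDF, RDFS

BCIO = Namespace("http://example.org/bcio#")  # hypothetical namespace
g = Graph()
g.bind("bcio", BCIO)

# (1) Controlled vocabulary: define and label the classes.
g.add((BCIO.BehaviorChangeTechnique, RDF.type, OWL.Class))
g.add((BCIO.GoalSetting, RDF.type, OWL.Class))
g.add((BCIO.GoalSetting, RDFS.label, Literal("Goal setting")))

# (2) Inter-relationships between classes.
g.add((BCIO.GoalSetting, RDFS.subClassOf, BCIO.BehaviorChangeTechnique))

# (3) Computer-readable codification, ready for reuse and analysis.
print(g.serialize(format="turtle"))
```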

    Annotations for Rule-Based Models

    The chapter reviews the syntax for storing machine-readable annotations and describes the mapping between rule-based modelling entities (e.g., agents and rules) and these annotations. In particular, we review an annotation framework and the associated guidelines for annotating rule-based models of molecular interactions, encoded in the commonly used Kappa and BioNetGen languages, and present prototypes that can be used to extract and query the annotations. An ontology is used to annotate the models and facilitate their description.
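
    A toy extractor conveys the flavour of the prototypes mentioned above. The '@qualifier: value' comment syntax, the comment marker, and the BioNetGen-style rule are illustrative assumptions, not the framework's actual grammar.

```python
import re

# Hypothetical annotation syntax: "@qualifier: value" pairs inside the
# comment block immediately preceding a model entity.
ANNOTATION = re.compile(r"@(?P<qualifier>\w+):\s*(?P<value>\S+)")

def extract_annotations(model_text, comment_marker="#"):
    """Map each annotated entity (the first non-comment line after a
    comment block) to the qualifier/value pairs found in that block."""
    annotations, pending = {}, []
    for line in model_text.splitlines():
        stripped = line.strip()
        if stripped.startswith(comment_marker):
            pending += ANNOTATION.findall(stripped)
        elif stripped and pending:
            annotations[stripped] = dict(pending)
            pending = []
    return annotations

toy_model = """
# @source: uniprot/P00533  @type: protein
A(b) + B(a) -> A(b!1).B(a!1)  k_on
"""
print(extract_annotations(toy_model))
```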