Search CORE

21,876 research outputs found

Unsupervised continuous-valued word features for phrase-break prediction without a part-of-speech tagger.

Author: King Simon
Watts Oliver
Yamagishi Junichi
Publication venue
Publication date: 01/08/2011
Field of study

Edinburgh Research Explorer

Thoughts about disordered thinking: measuring and quantifying the laws of order and disorder

Author: Cohen Alex
Deyne Simon De
Elvevaag Brita
Ferrer Cancho Ramon
Foltz Peter W.
Mizraji Eduardo
Rosenstein Mark
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2017
Field of study

Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

UPCommons. Portal del coneixement obert de la UPC

Louisiana State University

Neural Motifs: Scene Graph Parsing with Global Context

Author: Choi Yejin
Thomson Sam
Yatskar Mark
Zellers Rowan
Publication venue
Publication date: 29/03/2018
Field of study

We investigate the problem of producing structured graph representations of visual scenes. Our work analyzes the role of motifs: regularly appearing substructures in scene graphs. We present new quantitative insights on such repeated structures in the Visual Genome dataset. Our analysis shows that object labels are highly predictive of relation labels but not vice-versa. We also find that there are recurring patterns even in larger subgraphs: more than 50% of graphs contain motifs involving at least two relations. Our analysis motivates a new baseline: given object detections, predict the most frequent relation between object pairs with the given labels, as seen in the training set. This baseline improves on the previous state-of-the-art by an average of 3.6% relative improvement across evaluation settings. We then introduce Stacked Motif Networks, a new architecture designed to capture higher order motifs in scene graphs that further improves over our strong baseline by an average 7.1% relative gain. Our code is available at github.com/rowanz/neural-motifs.Comment: CVPR 2018 camera read

arXiv.org e-Print Archive

Crossref

Generation of folk song melodies using Bayes transforms

Author: Bent I.
Bod R.
Chaitin G.
Chomsky N.
Chris Thornton
Cope D.
Cope D.
Held G.
Huron D.
Kohonen T.
Kohonen T.
Kohonen T.
Lerdahl A
Li M.
Manning C
Meyer L.
Pearce M.
Rissanen J.
Schaffrath H.
Schenker H.
Schenker H.
Shannon C
Shannon C.
Shannon C.
Temperley D.
Temperley D.
Temperley D.
Wertheimer M.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2011
Field of study

The paper introduces the `Bayes transform', a mathematical procedure for putting data into a hierarchical representation. Applicable to any type of data, the procedure yields interesting results when applied to sequences. In this case, the representation obtained implicitly models the repetition hierarchy of the source. There are then natural applications to music. Derivation of Bayes transforms can be the means of determining the repetition hierarchy of note sequences (melodies) in an empirical and domain-general way. The paper investigates application of this approach to Folk Song, examining the results that can be obtained by treating such transforms as generative models

Crossref

Sussex Research Online

A Neural Network Model of Spatio-Temporal Pattern Recognition, Recall and Timing

Author: Mannes Christian
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/02/1992
Field of study

This paper describes the design of a self~organizing, hierarchical neural network model of unsupervised serial learning. The model learns to recognize, store, and recall sequences of unitized patterns, using either short-term memory (STM) or both STM and long-term memory (LTM) mechanisms. Timing information is learned and recall {both from STM and from LTM) is performed with a learned rhythmical structure. The network, bearing similarities with ART (Carpenter & Grossberg 1987a), learns to map temporal sequences to unitized patterns, which makes it suitable for hierarchical operation. It is therefore capable of self-organizing codes for sequences of sequences. The capacity is only limited by the number of nodes provided. Selected simulation results are reported to illustrate system properties.National Science Foundation (IRI-9024877

Boston University Institutional Repository (OpenBU)

Deepr: A Convolutional Net for Medical Records

Author: Nguyen Phuoc
Tran Truyen
Venkatesh Svetha
Wickramasinghe Nilmini
Publication venue
Publication date: 25/07/2016
Field of study

Feature engineering remains a major bottleneck when creating predictive systems from electronic medical records. At present, an important missing element is detecting predictive regular clinical motifs from irregular episodic records. We present Deepr (short for Deep record), a new end-to-end deep learning system that learns to extract features from medical records and predicts future risk automatically. Deepr transforms a record into a sequence of discrete elements separated by coded time gaps and hospital transfers. On top of the sequence is a convolutional neural net that detects and combines predictive local clinical motifs to stratify the risk. Deepr permits transparent inspection and visualization of its inner working. We validate Deepr on hospital data to predict unplanned readmission after discharge. Deepr achieves superior accuracy compared to traditional techniques, detects meaningful clinical motifs, and uncovers the underlying structure of the disease and intervention space

arXiv.org e-Print Archive

Deakin Research Online

Conditional Image-Text Embedding Networks

Author: A Gordo
A Rohrbach
BA Plummer
CL Zitnick
F Radenović
L Yu
M Wang
R Krishna
Publication venue
Publication date: 28/07/2018
Field of study

This paper presents an approach for grounding phrases in images which jointly learns multiple text-conditioned embeddings in a single end-to-end model. In order to differentiate text phrases into semantically distinct subspaces, we propose a concept weight branch that automatically assigns phrases to embeddings, whereas prior works predefine such assignments. Our proposed solution simplifies the representation requirements for individual embeddings and allows the underrepresented concepts to take advantage of the shared representations before feeding them into concept-specific layers. Comprehensive experiments verify the effectiveness of our approach across three phrase grounding datasets, Flickr30K Entities, ReferIt Game, and Visual Genome, where we obtain a (resp.) 4%, 3%, and 4% improvement in grounding performance over a strong region-phrase embedding baseline.Comment: ECCV 2018 accepted pape

arXiv.org e-Print Archive

Crossref

An investigation of speaker independent phrase break models in End-to-End TTS systems

Author: Vadapalli Anandaswarup
Publication venue
Publication date: 09/04/2023
Field of study

This paper presents our work on phrase break prediction in the context of end-to-end TTS systems, motivated by the following questions: (i) Is there any utility in incorporating an explicit phrasing model in an end-to-end TTS system?, and (ii) How do you evaluate the effectiveness of a phrasing model in an end-to-end TTS system? In particular, the utility and effectiveness of phrase break prediction models are evaluated in in the context of childrens story synthesis, using listener comprehension. We show by means of perceptual listening evaluations that there is a clear preference for stories synthesized after predicting the location of phrase breaks using a trained phrasing model, over stories directly synthesized without predicting the location of phrase breaks.Comment: Submitted for review to IEEE Acces

arXiv.org e-Print Archive