
    Towards zero-shot language modeling

    Can we construct a neural language model which is inductively biased towards learning human language? Motivated by this question, we aim to construct an informative prior for held-out languages on the task of character-level, open-vocabulary language modeling. We obtain this prior as the posterior over network weights conditioned on the data from a sample of training languages, approximated through Laplace’s method. On a large and diverse sample of languages, models using our prior outperform baselines with an uninformative prior in both zero-shot and few-shot settings, showing that the prior is imbued with universal linguistic knowledge. Moreover, we harness broad language-specific information available for most languages of the world, i.e., features from typological databases, as distant supervision for held-out languages. We explore several language modeling conditioning techniques, including concatenation and meta-networks for parameter generation. They appear beneficial in the few-shot setting, but ineffective in the zero-shot setting. Since the paucity of even plain digital text affects the majority of the world’s languages, we hope that these insights will broaden the scope of applications for language technology.
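
    As a concrete illustration of the core idea, here is a minimal PyTorch sketch (not the paper's actual implementation): approximate the posterior over weights at the multilingual MAP solution with a diagonal Fisher, then use it as a Gaussian prior penalty when adapting to a held-out language. The function names and the diagonal approximation are illustrative assumptions.

```python
import torch

def diag_fisher(model, data_loader, loss_fn):
    """Diagonal Fisher approximation of the posterior precision at the MAP."""
    fisher = {n: torch.zeros_like(p) for n, p in model.named_parameters()}
    for x, y in data_loader:
        model.zero_grad()
        loss_fn(model(x), y).backward()
        for n, p in model.named_parameters():
            if p.grad is not None:
                fisher[n] += p.grad.detach() ** 2
    return {n: f / max(len(data_loader), 1) for n, f in fisher.items()}

def laplace_prior_penalty(model, map_params, fisher, scale=1.0):
    """Log-density (up to a constant) of the Laplace posterior used as a prior:
    a Fisher-weighted quadratic penalty on deviation from the MAP weights."""
    penalty = 0.0
    for n, p in model.named_parameters():
        penalty = penalty + (fisher[n] * (p - map_params[n]) ** 2).sum()
    return scale * penalty

# Few-shot adaptation on a new language: ordinary LM loss plus the prior term
# (zero-shot corresponds to simply decoding with the MAP weights).
#   loss = loss_fn(model(x), y) + laplace_prior_penalty(model, map_params, fisher)
```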

    On the relation between linguistic typology and (limitations of) multilingual language modeling

    A key challenge in cross-lingual NLP is developing general language-independent architectures that are equally applicable to any language. However, this ambition is largely hampered by the variation in the structural and semantic properties, i.e. the typological profiles, of the world's languages. In this work, we analyse the implications of this variation for the language modeling (LM) task. We present a large-scale study of state-of-the-art n-gram and neural language models on 50 typologically diverse languages covering a wide variety of morphological systems. Operating in the full-vocabulary LM setup focused on word-level prediction, we demonstrate that a coarse typology of morphological systems is predictive of absolute LM performance. Moreover, fine-grained typological features such as exponence, flexivity, fusion, and inflectional synthesis turn out to be responsible for the proliferation of low-frequency phenomena, which are inherently difficult for statistical architectures to model, or for the meaning ambiguity of character n-grams. Our study strongly suggests that these features have to be taken into consideration during the construction of next-level language-agnostic LM architectures, capable of handling morphologically complex languages such as Tamil or Korean.
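
    A toy sketch of the kind of analysis behind the first finding, assuming the scikit-learn API; the morphological categories and perplexities below are invented placeholders, not the paper's data:

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# One coarse feature per language: its morphological system, one-hot
# encoded by hand. Perplexities are invented for illustration only.
categories = ["isolating", "fusional", "agglutinative", "introflexive"]
morph = ["isolating", "isolating", "fusional", "fusional",
         "agglutinative", "agglutinative", "introflexive", "introflexive"]
ppl = np.array([130.0, 110.0, 320.0, 300.0, 560.0, 520.0, 470.0, 450.0])

X = np.array([[1.0 if m == c else 0.0 for c in categories] for m in morph])
reg = LinearRegression().fit(X, ppl)
print("R^2:", reg.score(X, ppl))  # how much variance coarse typology explains
```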

    Adversarial propagation and zero-shot cross-lingual transfer of word vector specialization

    Semantic specialization is a process of fine-tuning pre-trained distributional word vectors using external lexical knowledge (e.g., WordNet) to accentuate a particular semantic relation in the specialized vector space. While post-processing specialization methods are applicable to arbitrary distributional vectors, they are limited to updating only the vectors of words occurring in external lexicons (i.e., seen words), leaving the vectors of all other words unchanged. We propose a novel approach to specializing the full distributional vocabulary. Our adversarial post-specialization method propagates the external lexical knowledge to the full distributional space. We exploit words seen in the resources as training examples for learning a global specialization function. This function is learned by combining a standard L2-distance loss with an adversarial loss: the adversarial component produces more realistic output vectors. We show the effectiveness and robustness of the proposed method across three languages and on three tasks: word similarity, dialog state tracking, and lexical simplification. We report consistent improvements over distributional word vectors and vectors specialized by other state-of-the-art specialization frameworks. Finally, we also propose a cross-lingual transfer method for zero-shot specialization which successfully specializes a full target distributional space without any lexical knowledge in the target language and without any bilingual data.
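
    A compact sketch of the adversarial post-specialization objective as described: a generator maps distributional vectors of seen words towards their gold specialized vectors with an L2 term, while a discriminator encourages realistic outputs. Layer sizes, optimizer settings, and the training loop below are illustrative assumptions, not the authors' exact architecture.

```python
import torch
import torch.nn as nn

dim = 300  # illustrative vector dimensionality
G = nn.Sequential(nn.Linear(dim, 512), nn.ReLU(), nn.Linear(512, dim))
D = nn.Sequential(nn.Linear(dim, 512), nn.ReLU(), nn.Linear(512, 1))
opt_g = torch.optim.Adam(G.parameters(), lr=1e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=1e-4)
bce = nn.BCEWithLogitsLoss()

def train_step(x_dist, x_spec, lam=1.0):
    """One update on a batch of (distributional, specialized) vector pairs."""
    # Discriminator: real specialized vectors vs. generated ones.
    fake = G(x_dist).detach()
    d_loss = (bce(D(x_spec), torch.ones(len(x_spec), 1))
              + bce(D(fake), torch.zeros(len(fake), 1)))
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()
    # Generator: stay close to the gold specialized vector (L2 term)
    # while fooling the discriminator (adversarial term).
    out = G(x_dist)
    g_loss = (((out - x_spec) ** 2).sum(dim=1).mean()
              + lam * bce(D(out), torch.ones(len(out), 1)))
    opt_g.zero_grad(); g_loss.backward(); opt_g.step()
    return d_loss.item(), g_loss.item()
```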

    Cross-lingual semantic specialization via lexical relation induction

    Semantic specialization integrates structured linguistic knowledge from external resources (such as lexical relations in WordNet) into pretrained distributional vectors in the form of constraints. However, this technique cannot be leveraged in many languages, because their structured external resources are typically incomplete or non-existent. To bridge this gap, we propose a novel method that transfers specialization from a resource-rich source language (English) to virtually any target language. Our specialization transfer comprises two crucial steps: 1) inducing noisy constraints in the target language through automatic word translation; and 2) filtering the noisy constraints via a state-of-the-art relation prediction model trained on the source language constraints. This allows us to specialize any set of distributional vectors in the target language with the refined constraints. We demonstrate the effectiveness of our method through intrinsic word similarity evaluation in 8 languages, and with 3 downstream tasks in 5 languages: lexical simplification, dialog state tracking, and semantic textual similarity. The gains over the previous state-of-the-art specialization methods are substantial and consistent across languages. Our results also suggest that the transfer method is effective even for lexically distant source-target language pairs. Finally, as a by-product, our method produces lists of WordNet-style lexical relations in resource-poor languages.
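
    The two-step transfer lends itself to a short sketch; `translate`, `embed`, and `relation_model` below are stand-ins for whatever translation lexicon, embedding table, and source-trained relation classifier (assumed scikit-learn-style) one has available:

```python
import numpy as np

def induce_constraints(en_pairs, translate):
    """Step 1: project English constraints via word translation (noisy)."""
    noisy = []
    for w1, w2, rel in en_pairs:              # rel in {"synonym", "antonym"}
        t1, t2 = translate(w1), translate(w2)
        if t1 and t2:
            noisy.append((t1, t2, rel))
    return noisy

def filter_constraints(noisy, relation_model, embed, threshold=0.9):
    """Step 2: keep pairs the source-trained relation classifier confirms."""
    classes = list(relation_model.classes_)
    kept = []
    for t1, t2, rel in noisy:
        feats = np.concatenate([embed(t1), embed(t2)]).reshape(1, -1)
        if relation_model.predict_proba(feats)[0][classes.index(rel)] >= threshold:
            kept.append((t1, t2, rel))
    return kept
```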

    Decoding sentiment from distributed representations of sentences

    Distributed representations of sentences have been developed recently to represent their meaning as real-valued vectors. However, it is not clear how much information such representations retain about the polarity of sentences. To study this question, we decode sentiment from unsupervised sentence representations learned with different architectures (sensitive to the order of words, the order of sentences, or neither) in 9 typologically diverse languages. Sentiment results from the (recursive) composition of lexical items and grammatical strategies such as negation and concession. The results are manifold: we show that there is no 'one-size-fits-all' representation architecture outperforming the others across the board. Rather, the top-ranking architectures depend on the language and data at hand. Moreover, we find that in several cases the additive composition model based on skip-gram word vectors may surpass supervised state-of-the-art architectures such as bidirectional LSTMs. Finally, we provide a possible explanation of the observed variation based on the type of negative constructions in each language.
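
    The additive composition baseline mentioned above is simple enough to sketch end-to-end, assuming a pretrained `word_vecs` mapping and the scikit-learn API (all names are illustrative):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def sentence_vector(tokens, word_vecs, dim=300):
    """Additive composition: the sentence vector is the sum of its
    skip-gram word vectors (zeros if no token is in the vocabulary)."""
    vecs = [word_vecs[t] for t in tokens if t in word_vecs]
    return np.sum(vecs, axis=0) if vecs else np.zeros(dim)

def decode_sentiment(train_sents, train_labels, test_sents, word_vecs):
    """Train a linear probe that 'decodes' polarity from sentence vectors."""
    X_tr = np.stack([sentence_vector(s, word_vecs) for s in train_sents])
    X_te = np.stack([sentence_vector(s, word_vecs) for s in test_sents])
    clf = LogisticRegression(max_iter=1000).fit(X_tr, train_labels)
    return clf.predict(X_te)
```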

    Composition of the pericellular matrix modulates the deformation behaviour of chondrocytes in articular cartilage under static loading

    The aim was to assess the role of composition changes in the pericellular matrix (PCM) in chondrocyte deformation. To that end, a three-dimensional finite element model with depth-dependent collagen density, fluid fraction, fixed charge density and collagen architecture, including parallel planes representing the split-lines, was created to model the extracellular matrix (ECM). The PCM was constructed similarly to the ECM, but with the collagen fibrils oriented parallel to the chondrocyte surfaces. The chondrocytes were modelled as poroelastic with swelling properties. The deformation behaviour of the cells was studied under 15% static compression. Due to the depth-dependent structure and composition of cartilage, axial cell strains were highly depth-dependent. An increase in the collagen content and fluid fraction in the PCMs increased the lateral cell strains, while an increase in the fixed charge density induced the inverse behaviour. Axial cell strains were only slightly affected by the changes in PCM composition. We conclude that the PCM composition plays a significant role in the deformation behaviour of chondrocytes, possibly modulating cartilage development, adaptation and degeneration. The development of cartilage repair materials could benefit from this information.
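
    The depth dependence of the axial strains has a simple mechanical intuition: under a prescribed overall compression, layers of different stiffness stacked in series carry one stress, so softer layers take larger local strains. The toy calculation below (emphatically not the paper's poroelastic finite element model; the moduli are invented) illustrates this effect.

```python
import numpy as np

# Ten layers from superficial (soft) to deep (stiff); values are invented.
E = np.linspace(0.5, 2.0, 10)   # layer moduli, MPa
h = np.full(10, 0.1)            # layer thicknesses, mm
total_strain = 0.15             # 15% static compression of the whole stack

# Springs in series carry the same stress sigma = E_i * eps_i, and the layer
# compressions eps_i * h_i must sum to the prescribed total displacement.
sigma = total_strain * h.sum() / (h / E).sum()
eps = sigma / E
print(eps)  # largest local strain in the softest (most superficial) layer
```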

    Zircon ages in granulite facies rocks: decoupling from geochemistry above 850 °C?

    Granulite facies rocks frequently show a large spread in their zircon ages, the interpretation of which raises questions: Has the isotopic system been disturbed? By what process(es), and under what conditions, did the alteration occur? Can the dates be regarded as real ages, reflecting several growth episodes? Furthermore, under some circumstances of (ultra-)high-temperature metamorphism, decoupling of zircon U–Pb dates from their trace element geochemistry has been reported. Understanding these processes is crucial for interpreting such dates in the context of the P–T history. Our study presents evidence for decoupling in zircon from the highest-grade metapelites (> 850 °C) taken along a continuous high-temperature metamorphic field gradient in the Ivrea Zone (NW Italy). These rocks represent a well-characterised segment of Permian lower continental crust with a protracted high-temperature history. Cathodoluminescence images reveal that zircons in the mid-amphibolite facies preserve mainly detrital cores with narrow overgrowths. In the upper amphibolite and granulite facies, preserved detrital cores decrease and metamorphic zircon increases in quantity. Across all samples we document a sequence of four rim generations based on textures. U–Pb dates, Th/U ratios and Ti-in-zircon concentrations show an essentially continuous evolution with increasing metamorphic grade, except in the samples from the granulite facies, which display significant scatter in age and chemistry. We associate the observed decoupling of zircon systematics in high-grade, non-metamict zircon with disturbance processes related to differences in the behaviour of non-formula elements (i.e. Pb, Th, U, Ti) at high-temperature conditions, notably differences in compatibility within the crystal structure.
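
    For context, the U–Pb dates discussed here derive from the standard decay equation t = ln(1 + 206Pb*/238U) / λ238. A worked example with an illustrative (made-up) isotope ratio:

```python
import math

LAMBDA_238 = 1.55125e-10  # 238U decay constant, 1/yr (Jaffey et al., 1971)

def pb206_u238_age_ma(ratio):
    """Apparent age in Ma from a radiogenic 206Pb/238U ratio."""
    return math.log(1.0 + ratio) / LAMBDA_238 / 1e6

print(pb206_u238_age_ma(0.046))  # ~290 Ma, i.e. a Permian date as in the Ivrea Zone
```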

    Functional Roles of the N- and C-Terminal Regions of the Human Mitochondrial Single-Stranded DNA-Binding Protein

    Biochemical studies of the mitochondrial DNA (mtDNA) replisome demonstrate that the mtDNA polymerase and the mtDNA helicase are stimulated by the mitochondrial single-stranded DNA-binding protein (mtSSB). Unlike Escherichia coli SSB, bacteriophage T7 gp2.5 and bacteriophage T4 gp32, mtSSBs lack a long, negatively charged C-terminal tail. Furthermore, additional residues at the N-terminus (notwithstanding the mitochondrial presequence) are present in the sequence of species across the animal kingdom. We sought to analyze the functional importance of the N- and C-terminal regions of human mtSSB in the context of mtDNA replication. We produced the mature wild-type human mtSSB and three terminal deletion variants, and examined their physical and biochemical properties. We demonstrate that the recombinant proteins adopt a tetrameric form and bind single-stranded DNA with similar affinities. They also stimulate the DNA unwinding activity of the human mtDNA helicase to a similar extent (up to 8-fold). Notably, we find that unlike the high level of stimulation that we observed previously in the Drosophila system, stimulation of DNA synthesis catalyzed by the human mtDNA polymerase is only moderate, and occurs over a narrow range of salt concentrations. Interestingly, each of the deletion variants of human mtSSB stimulates DNA synthesis at a higher level than the wild-type protein, indicating that the termini negatively modulate functional interactions with the mitochondrial replicase. We discuss our findings in the context of species-specific components of the mtDNA replisome, and in comparison with various prokaryotic DNA replication machineries.