Search CORE

37 research outputs found

Building and Evaluating Open-Vocabulary Language Models

Author: Mielke Sebastian J
Publication venue: 'The Busan Gyeongnam Mathematical Society'
Publication date: 03/01/2024
Field of study

Language models have always been a fundamental NLP tool and application. This thesis focuses on open-vocabulary language models, i.e., models that can deal with novel and unknown words at runtime. We will propose both new ways to construct such models as well as use such models in cross-linguistic evaluations to answer questions of difficulty and language-specificity in modern NLP tools. We start by surveying linguistic background as well as past and present NLP approaches to tokenization and open-vocabulary language modeling (Mielke et al., 2021). Thus equipped, we establish desirable principles for such models, both from an engineering mindset as well as a linguistic one and hypothesize a model based on the marriage of neural language modeling and Bayesian nonparametrics to handle a truly infinite vocabulary, boasting attractive theoretical properties and mathematical soundness, but presenting practical implementation difficulties. As a compromise, we thus introduce a word-based two-level language model that still has many desirable characteristics while being highly feasible to run (Mielke and Eisner, 2019). Unlike the more dominant approaches of characters or subword units as one-layer tokenization it uses words; its key feature is the ability to generate novel words in context and in isolation. Moving on to evaluation, we ask: how do such models deal with the wide variety of languages of the world---are they struggling with some languages? Relating this question to a more linguistic one, are some languages inherently more difficult to deal with? Using simple methods, we show that indeed they are, starting with a small pilot study that suggests typological predictors of difficulty (Cotterell et al., 2018). Thus encouraged, we design a far bigger study with more powerful methodology, a principled and highly feasible evaluation and comparison scheme based again on multi-text likelihood (Mielke et al., 2019). This larger study shows that the earlier conclusion of typological predictors is difficult to substantiate, but also offers a new insight on the complexity of Translationese. Following that theme, we end by extending this scheme to machine translation models to answer questions traditional evaluation metrics like BLEU cannot (Bugliarello et al., 2020)

Johns Hopkins University

Are All Languages Equally Hard to Language-Model?

Author: Cotterell Ryan
Eisner Jason
Mielke Sebastian J
Roark Brian
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2018
Field of study

How cross-linguistically applicable are NLP models, specifically language models? A fair comparison between languages is tricky: not only do training corpora in different languages have different sizes and topics, some of which may be harder to predict than others, but standard metrics for language modeling depend on the orthography of a language. We argue for a fairer metric based on the bits per utterance using utterance-aligned multi-text. We conduct a study on 21 languages, training and testing both n-gram and LSTM language models on “the same” set of utterances in each language (modulo translation), demonstrating that in some languages, especially those with complex inflectional morphology, the textual expression of the information is harder to predict

arXiv.org e-Print Archive

Crossref

ScholarWorks@UMass Amherst

Recommended from our members

A Structured Variational Autoencoder for Contextual Morphological Inflection

Author: Cotterell Ryan
Mielke Sebastian J
Naradowsky Jason
Wolf-Sonkin Lawrence
Publication venue: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Publication date: 01/01/2018
Field of study

Apollo (Cambridge)

Towards resolution of the Fermi surface in underdoped high-Tc superconductors

Author: Anderson P W
Chen W-Q
Edwards H L
Ghiringhelli G
Gilbert G Lonzarich
Harrison N Sebastian S E
Haug D
Neil Harrison
Peets D C
Sebastian S E Harrison N Altarawneh M M Balakirev F F Mielke C H Liang R Bonn D A Hardy W N Lonzarich G G
Suchitra E Sebastian
Sushkov O P
Usher A
Wilson J A
Publication venue: 'IOP Publishing'
Publication date: 18/09/2012
Field of study

We survey recent experimental results including quantum oscillations and complementary measurements probing the electronic structure of underdoped cuprates, and theoretical proposals to explain them. We discuss quantum oscillations measured at high magnetic fields in the underdoped cuprates that reveal a small Fermi surface section comprising quasiparticles that obey Fermi-Dirac statistics, unaccompanied by other states of comparable thermodynamic mass at the Fermi level. The location of the observed Fermi surface section at the nodes is indicated by a body of evidence including the collapse in Fermi velocity measured by quantum oscillations, which is found to be associated with the nodal density of states observed in angular resolved photoemission, the persistence of quantum oscillations down to low fields in the vortex state, the small value of density of states from heat capacity and the multiple frequency quantum oscillation pattern consistent with nodal magnetic breakdown of bilayer-split pockets. A nodal Fermi surface pocket is further consistent with the observation of a density of states at the Fermi level concentrated at the nodes in photoemission experiments, and the antinodal pseudogap observed by photoemission, optical conductivity, nuclear magnetic resonance Knight shift, as well as other complementary diffraction, transport and thermodynamic measurements. One of the possibilities considered is that the small Fermi surface pockets observed at high magnetic fields can be understood in terms of Fermi surface reconstruction by a form of small wavevector charge order, observed over long lengthscales in experiments such as nuclear magnetic resonance and x-ray scattering, potentially accompanied by an additional mechanism to gap the antinodal density of states.Comment: 33 pages, 15 figures (this version updated with new figures and additional text, as published

arXiv.org e-Print Archive

Crossref

Post-transplant cyclophosphamide versus antithymocyte globulin in patients with acute myeloid leukemia in first complete remission undergoing allogeneic stem cell transplantation from 10/10 HLA-matched unrelated donors

Author: Blaise Didier
Brissot Eolia
Bulabois Claude Eric
Ciceri Fabio
Cornelissen J. J.
Deconinck Eric
Forcade Edouard
Giebel Sebastian
Griskevicius Laimonas
Labopin Myriam
Meijer Ellen
Mielke Stephan
Mistrik Martin
Mohty Mohamad
Moiseev Ian
Nagler Arnon
Niittyvuopio Riitta
Rovira Montserrat
Ruggeri Annalisa
Sanz Jaime
Savani Bipin
Spyridonidis Alexandros
Van Gorkom Gwendolyn
Publication venue
Publication date: 01/01/2020
Field of study

Background Graft-versus-host disease (GVHD) remains a major contributor to mortality and morbidity after allogeneic stem-cell transplantation (allo-HSCT). The updated recommendations suggest that rabbit antithymocyte globulin or anti-T-lymphocyte globulin (ATG) should be used for GVHD prophylaxis in patients undergoing matched-unrelated donor (MUD) allo-HSCT. More recently, using post-transplant cyclophosphamide (PTCY) in the haploidentical setting has resulted in low incidences of both acute (aGVHD) and chronic GVHD (cGVHD). Therefore, the aim of our study was to compare GVHD prophylaxis using either PTCY or ATG in patients with acute myeloid leukemia (AML) who underwent allo-HSCT in first remission (CR1) from a 10/10 HLA-MUD. Methods Overall, 174 and 1452 patients from the EBMT registry receiving PTCY and ATG were included. Cumulative incidence of aGVHD and cGVHD, leukemia-free survival, overall survival, non-relapse mortality, cumulative incidence of relapse, and refined GVHD-free, relapse-free survival were compared between the 2 groups. Propensity score matching was also performed in order to confirm the results of the main analysis Results No statistical difference between the PTCY and ATG groups was observed for the incidence of grade II-IV aGVHD. The same held true for the incidence of cGVHD and for extensive cGVHD. In univariate and multivariate analyses, no statistical differences were observed for all other transplant outcomes. These results were also confirmed using matched-pair analysis. Conclusion These results highlight that, in the10/10 HLA-MUD setting, the use of PTCY for GVHD prophylaxis may provide similar outcomes to those obtained with ATG in patients with AML in CR1.Peer reviewe

Maastricht University Research Portal

Crossref

HAL AMU

HAL-Inserm

EUR Research Repository

Erasmus University Digital Repository

Helsingin yliopiston digitaalinen arkisto

Vilnius University Institutional Repository

Innate immunodeficiency following genetic ablation of Mcl1 in natural killer cells

Author: Belz Gabrielle T.
Carotta Sebastian
Chopin Michael
Delconte Rebecca B.
Glaser Stefan P.
Huntington Nicholas D.
Kolesnik Tatiana B.
Mielke Lisa A.
Nicholson Sandra E.
Nutt Stephen L.
Rankin Lucille C.
Sathe Priyanka
Seillet Cyril
Smyth Mark J.
Souza-Fonseca-Guimaraes Fernando
Strasser Andreas
Vandenberg Cassandra J.
Vikstrom Ingela
Vivier Eric
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/08/2014
Field of study

The cytokine IL-15 is required for natural killer (NK) cell homeostasis; however, the intrinsic mechanism governing this requirement remains unexplored. Here we identify the absolute requirement for myeloid cell leukaemia sequence-1 (Mcl1) in the sustained survival of NK cells in vivo. Mcl1 is highly expressed in NK cells and regulated by IL-15 in a dose-dependent manner via STAT5 phosphorylation and subsequent binding to the 3'-UTR of Mcl1. Specific deletion of Mcl1 in NK cells results in the absolute loss of NK cells from all tissues owing to a failure to antagonize pro-apoptotic proteins in the outer mitochondrial membrane. This NK lymphopenia results in mice succumbing to multiorgan melanoma metastases, being permissive to allogeneic transplantation and being resistant to toxic shock following polymicrobial sepsis challenge. These results clearly demonstrate a non-redundant pathway linking IL-15 to Mcl1 in the maintenance of NK cells and innate immune responses in vivo

University of Queensland eSpace

Strong-coupling expansion and effective hamiltonians

Author: A Abendschein
A Abendschein
A Auerbach
A Luther
A Mielke
A Reischl
B Damski
BS Shastry
C Knetter
C Knetter
C Knetter
C Knetter
C Knetter
CJ Morningstar
CJ Morningstar
CP Heidbrink
DC Cabra
DC Mattis
DL Bergman
E Altman
E Berg
E Müller-Hartmann
F Levy
F Mila
F Mila
FDM Haldane
FJ Wegner
G Misguich
G Misguich
GS Uhrig
H Tsunetsugu
HA Bethe
I Rousochatzakis
J Dorier
J Piekarewicz
J Piekarewicz
J Stein
J-B Fouet
JN Kriel
K Kodama
K Onizuka
K Totsuka
KG Wilson
KP Schmidt
KP Schmidt
L Landau
M Ferrero
M Takahashi
M Takigawa
M Weinstein
MS Siu
P Fulde
P Lecheminant
P Lenz
PW Anderson
R Budnik
R Moessner
S Capponi
S Dusuel
S Dusuel
S Miyahara
S Miyahara
S Miyahara
S Miyahara
S Sachdev
S. Capponi
SD Głazek
SD Głazek
SE Sebastian
T Kato
T Momoi
V Subrahmanyam
VN Kotov
VN Kotov
W Brenig
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/05/2010
Field of study

When looking for analytical approaches to treat frustrated quantum magnets, it is often very useful to start from a limit where the ground state is highly degenerate. This chapter discusses several ways of deriving {effective Hamiltonians} around such limits, starting from standard {degenerate perturbation theory} and proceeding to modern approaches more appropriate for the derivation of high-order effective Hamiltonians, such as the perturbative continuous unitary transformations or contractor renormalization. In the course of this exposition, a number of examples taken from the recent literature are discussed, including frustrated ladders and other dimer-based Heisenberg models in a field, as well as the mapping between frustrated Ising models in a transverse field and quantum dimer models.Comment: To appear as a chapter in "Highly Frustrated Magnetism", Eds. C. Lacroix, P. Mendels, F. Mil

arXiv.org e-Print Archive

Crossref

CNS Involvement at Initial Diagnosis and Risk of Relapse After Allogeneic HCT for Acute Lymphoblastic Leukemia in First Complete Remission

Author: Aljurf Mahmoud
Balsat Marie
Bazarbachi Ali
Brissot Eolia
Chevallier Patrice
Cornelissen Jan J.
Giebel Sebastian
Huynh Anne
Kharfan-Dabaja Mohamed A.
Labopin Myriam
Menard Anne-Lise
Mielke Stephan
Mohty Mohamad
Nagler Arnon
Peric Zina
Pioltelli Pietro
Rodriguez Arancha Bermudez
Rubio Marie Therese
Salmenniemi Urpu
Schaap Nicolaas
Socie Gerard
Yakoub-Agha Ibrahim
Publication venue
Publication date: 12/10/2022
Field of study

Outcomes of allogeneic hematopoietic cell transplantation (allo-HCT) for adult acute lymphoblastic leukemia (ALL) have improved over time. Studies have shown that total body irradiation (TBI) is the preferable type of myeloablative conditioning (MAC). However, outcomes based on central nervous system (CNS) involvement, namely CNS-positive versus CNS-negative, have not been compared. Here, we evaluated outcomes of 547 patients (CNS-positive = 96, CNS-negative = 451) who were allografted in the first complete remission (CR1) between 2009 and 2019. Primary endpoint was leukemia-free survival (LFS). Median follow-up was not different between the CNS-positive and CNS-negative groups (79 versus 67.2 months, P = 0.58). The CNS-positive group were younger (median age 31.3 versus 39.7 years, P = 0.004) and were allografted more recently (median year 2012 versus 2010, P = 0.003). In both groups, MAC was the preferred approach (82.3% versus 85.6%, P = 0.41). On multivariate analysis, the CNS-positive group had higher incidence of relapse (RI) (hazard ratio [HR] = 1.58 [95% confidence interval (CI) = 1.06-2.35], P = 0.025), but no adverse effect on LFS (HR = 1.38 [95% CI = 0.99-1.92], P = 0.057) or overall survival (OS) (HR = 1.28 [95% CI = 0.89-1.85], P = 0.18). A subgroup multivariate analysis limited to CNS-positive patients showed that a TBI-based MAC regimen resulted in better LFS (HR = 0.43 [95% CI = 0.22-0.83], P = 0.01) and OS (HR = 0.44 [95% CI = 0.21-0.92], P = 0.03) and lower RI (HR = 0.35 [95% CI = 0.15-0.79], P = 0.01). Another subgroup analysis in CNS-negative patients showed that MAC-TBI preparative regimens also showed a lower RI without a benefit in LFS or OS. While a MAC-TBI allo-HCT regimen may not be suitable to all, particularly for older patients with comorbidities, this approach should be considered for patients who are deemed fit and able to tolerate.Peer reviewe

HAL-HCL

Directory of Open Access Journals

PubMed Central

Helsingin yliopiston digitaalinen arkisto

Multi-dimensional modeling and simulation of semiconductor nanophotonic devices

Author: A. Strittmatter
Aimi Abass
Alexander Mielke
Alexander Mielke
Alexander Wilms
André Strittmatter
Axel Fischer
Axel Fischer
Benjamin Wohlfeil
Bjorn Maes
Carlo Jacoboni
Christiane Becker
D. M. Richey
D. Peschka
D.L. Scharfetter
Daniel Werdehausen
Daniel Werdehausen
Dieter Bimberg
Dietmar Schroeder
Dirk Taillaert
E. M. Purcell
E. S. C. Ching
Ermin Malić
Franco Brezzi
Frank Schmidt
G.K. Wachutka
Günter Albinus
Günter Kewes
Günter Kewes
H. Gajewski
H. Gajewski
H. J. Kimble
H. Si
H. Wenzel
H.K. Gummel
Herbert Spohn
I. Magnusdottir
Igor Aharonovich
Ivo Babuška
J. Bethge
J.S. Blakemore
Jakob Rosenkrantz de Lasson
Jan Pomplun
Jan Pomplun
Jürgen Fuhrmann
K. Iga
Karl Hess
Kathy Ludge
Kinga Żołnacz
Lars Onsager
M. Grupen
M. Karl
M. Streiff
Maciej Dems
Marianne Bessemoulin-Chatard
Markus Kantner
Markus Kantner
Markus Kantner
Markus Mittnenzweig
Matteo Patriarca
Max Strauß
Miroslav Grmela
Morgane Bergot
N. Somaschi
N. Srocka
P. Bienstman
P. Lalanne
Pallab Bhattacharya
Patricio Farrell
Patricio Farrell
Paweł Mrowiński
Peter A. Markowich
Peter Lodahl
Peter Schnauber
Peter Schnauber
Philipp Gutsche
Philipp-Immanuel Schneider
Philipp-Immanuel Schneider
S Reitzenstein
S. Selberherr
S.M. Sze
Sebastian Steiger
Siegfried Selberherr
Sonia Buckley
Synopsys Inc
Theresa Hoehne
Thomas Koprucki
Thomas Koprucki
Thomas Koprucki
U. Lindefelt
Vassil Palankovski
Viktoriia E. Babicheva
Vitaly Shchukin
W. Unrau
W. Van Roosbroeck
W.L. Barnes
W.W. Chow
Weng W. Chow
Publication venue
Publication date: 01/01/2019
Field of study

Self-consistent modeling and multi-dimensional simulation of semiconductor nanophotonic devices is an important tool in the development of future integrated light sources and quantum devices. Simulations can guide important technological decisions by revealing performance bottlenecks in new device concepts, contribute to their understanding and help to theoretically explore their optimization potential. The efficient implementation of multi-dimensional numerical simulations for computer-aided design tasks requires sophisticated numerical methods and modeling techniques. We review recent advances in device-scale modeling of quantum dot based single-photon sources and laser diodes by self-consistently coupling the optical Maxwell equations with semiclassical carrier transport models using semi-classical and fully quantum mechanical descriptions of the optically active region, respectively. For the simulation of realistic devices with complex, multi-dimensional geometries, we have developed a novel hp-adaptive finite element approach for the optical Maxwell equations, using mixed meshes adapted to the multi-scale properties of the photonic structures. For electrically driven devices, we introduced novel discretization and parameter-embedding techniques to solve the drift-diffusion system for strongly degenerate semiconductors at cryogenic temperature. Our methodical advances are demonstrated on various applications, including vertical-cavity surface-emitting lasers, grating couplers and single-photon sources

Crossref

Publications Server of the Weierstrass Institute for Applied Analysis and Stochastics

Repositorium für Naturwissenschaften und Technik