Search CORE

927 research outputs found

A System for Real-Time Interactive Analysis of Deep Learning Training

Author: Bergstra James S.
Hastie Trevor
Stockinger K.
Sung Nako
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 07/01/2020
Field of study

Performing diagnosis or exploratory analysis during the training of deep learning models is challenging but often necessary for making a sequence of decisions guided by the incremental observations. Currently available systems for this purpose are limited to monitoring only the logged data that must be specified before the training process starts. Each time a new information is desired, a cycle of stop-change-restart is required in the training process. These limitations make interactive exploration and diagnosis tasks difficult, imposing long tedious iterations during the model development. We present a new system that enables users to perform interactive queries on live processes generating real-time information that can be rendered in multiple formats on multiple surfaces in the form of several desired visualizations simultaneously. To achieve this, we model various exploratory inspection and diagnostic tasks for deep learning training processes as specifications for streams using a map-reduce paradigm with which many data scientists are already familiar. Our design achieves generality and extensibility by defining composable primitives which is a fundamentally different approach than is used by currently available systems. The open source implementation of our system is available as TensorWatch project at https://github.com/microsoft/tensorwatch.Comment: Accepted at ACM SIGCHI Symposium on Engineering Interactive Computing Systems (EICS 2019). Code available as TensorWatch project at https://github.com/microsoft/tensorwatc

arXiv.org e-Print Archive

Crossref

Enabling semantic queries across federated bioinformatics databases

Author: Anisimova M
Dessimoz C
Gil M
Mendes de Farias T
Robinson-Rechavi M
Sima AC
Stockinger H
Stockinger K
Zbinden E
Publication venue
Publication date: 07/11/2019
Field of study

MOTIVATION: Data integration promises to be one of the main catalysts in enabling new insights to be drawn from the wealth of biological data available publicly. However, the heterogeneity of the different data sources, both at the syntactic and the semantic level, still poses significant challenges for achieving interoperability among biological databases. RESULTS: We introduce an ontology-based federated approach for data integration. We applied this approach to three heterogeneous data stores that span different areas of biological knowledge: (i) Bgee, a gene expression relational database; (ii) Orthologous Matrix (OMA), a Hierarchical Data Format 5 orthology DS; and (iii) UniProtKB, a Resource Description Framework (RDF) store containing protein sequence and functional information. To enable federated queries across these sources, we first defined a new semantic model for gene expression called GenEx. We then show how the relational data in Bgee can be expressed as a virtual RDF graph, instantiating GenEx, through dedicated relational-to-RDF mappings. By applying these mappings, Bgee data are now accessible through a public SPARQL endpoint. Similarly, the materialized RDF data of OMA, expressed in terms of the Orthology ontology, is made available in a public SPARQL endpoint. We identified and formally described intersection points (i.e. virtual links) among the three data sources. These allow performing joint queries across the data stores. Finally, we lay the groundwork to enable nontechnical users to benefit from the integrated data, by providing a natural language template-based search interface

UCL Discovery

Querying knowledge graphs in natural language.

Author: Anisimova M.
de Farias T.M.
Gil M.
Liang S.
Stockinger K.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Knowledge graphs are a powerful concept for querying large amounts of data. These knowledge graphs are typically enormous and are often not easily accessible to end-users because they require specialized knowledge in query languages such as SPARQL. Moreover, end-users need a deep understanding of the structure of the underlying data models often based on the Resource Description Framework (RDF). This drawback has led to the development of Question-Answering (QA) systems that enable end-users to express their information needs in natural language. While existing systems simplify user access, there is still room for improvement in the accuracy of these systems. In this paper we propose a new QA system for translating natural language questions into SPARQL queries. The key idea is to break up the translation process into 5 smaller, more manageable sub-tasks and use ensemble machine learning methods as well as Tree-LSTM-based neural network models to automatically learn and translate a natural language question into a SPARQL query. The performance of our proposed QA system is empirically evaluated using the two renowned benchmarks-the 7th Question Answering over Linked Data Challenge (QALD-7) and the Large-Scale Complex Question Answering Dataset (LC-QuAD). Experimental results show that our QA system outperforms the state-of-art systems by 15% on the QALD-7 dataset and by 48% on the LC-QuAD dataset, respectively. In addition, we make our source code available

Repository for Publications and Research Data

Serveur académique lausannois

ZHAW digitalcollection

Three-dimensional Models of Core-collapse Supernovae From Low-mass Progenitors With Implications for Crab

Author: Ertl T.
Gabler M.
Gessner A.
Heger A.
Janka H. -Th.
Kresse D.
Leung S. -C.
Melson T.
Nomoto K.
Stockinger G.
Tolstov A.
Wongwathanarat A.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 10/06/2020
Field of study

We present 3D full-sphere supernova simulations of non-rotating low-mass (~9 Msun) progenitors, covering the entire evolution from core collapse through bounce and shock revival, through shock breakout from the stellar surface, until fallback is completed several days later. We obtain low-energy explosions [~(0.5-1.0)x 10^{50} erg] of iron-core progenitors at the low-mass end of the core-collapse supernova (LMCCSN) domain and compare to a super-AGB (sAGB) progenitor with an oxygen-neon-magnesium core that collapses and explodes as electron-capture supernova (ECSN). The onset of the explosion in the LMCCSN models is modelled self-consistently using the Vertex-Prometheus code, whereas the ECSN explosion is modelled using parametric neutrino transport in the Prometheus-HOTB code, choosing different explosion energies in the range of previous self-consistent models. The sAGB and LMCCSN progenitors that share structural similarities have almost spherical explosions with little metal mixing into the hydrogen envelope. A LMCCSN with less 2nd dredge-up results in a highly asymmetric explosion. It shows efficient mixing and dramatic shock deceleration in the extended hydrogen envelope. Both properties allow fast nickel plumes to catch up with the shock, leading to extreme shock deformation and aspherical shock breakout. Fallback masses of <~5x10^{-3} Msun have no significant effects on the neutron star (NS) masses and kicks. The anisotropic fallback carries considerable angular momentum, however, and determines the spin of the newly-born NS. The LMCCSNe model with less 2nd dredge-up results in a hydrodynamic and neutrino-induced NS kick of >40 km/s and a NS spin period of ~30 ms, both not largely different from those of the Crab pulsar at birth.Comment: 47 pages, 27 figures, 6 tables; minor revisions, accepted by MNRA

arXiv.org e-Print Archive

Caltech Authors

MPG.PuRe

A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL [version 1; peer review: 1 approved, 2 approved with reservations]

Author: Dessimoz C
Mendes de Farias T
Sima AC
Stockinger K
Zahn-Zabal M
Publication venue: 'F1000 Research Ltd'
Publication date: 29/10/2019
Field of study

The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple sources. However, analyzing evolutionary data in RDF is not trivial, due to the steep learning curve required to understand both the data models adopted by different RDF data sources, as well as the SPARQL query language. In this article, we provide a hands-on introduction to querying evolutionary data across multiple sources that publish orthology information in RDF, namely: The Orthologous MAtrix (OMA), the European Bioinformatics Institute (EBI) RDF platform, the Database of Orthologous Groups (OrthoDB) and the Microbial Genome Database (MBGD). We present four protocols in increasing order of complexity. In these protocols, we demonstrate through SPARQL queries how to retrieve pairwise orthologs, homologous groups, and hierarchical orthologous groups. Finally, we show how orthology information in different sources can be compared, through the use of federated SPARQL queries

UCL Discovery

Neutrino oscillations and the effect of the finite lifetime of the neutrino source

Author: B. Kayser
B. Kayser
B. Pontecorvo
C. Athanassopoulos
G. Drexlin
I.S. Gradshteyn
J. Rich
J.E. Campagne
K. Eitel
K. Kiers
M. Zrałek
P. Stockinger
S. Mohanty
S.M. Bilenky
S.M. Bilenky
S.M. Bilenky
V. Gribov
W. Grimus
W. Grimus
W. Grimus
Publication venue: 'American Physical Society (APS)'
Publication date: 09/04/1999
Field of study

We consider a neutrino source at rest and discuss a condition for the existence of neutrino oscillations which derives from the finite lifetime

\tau_S

of the neutrino source particle. This condition is present if the neutrino source is a free particle such that its wave function is non-stationary. For a Gaussian wave function and with some simplifying assumptions, we study the modification of the usual oscillation probability stemming from

\tau_S

. In the present accelerator experiments the effect of

\tau_S

can be neglected. We discuss some experimental situations where the source lifetime becomes relevant in the oscillation formula.Comment: 13 pages latex file with 2 figure

arXiv.org e-Print Archive

Crossref

CERN Document Server

Effects of neutrino oscillations and neutrino magnetic moments on elastic neutrino-electron scattering

Author: A. M. Cooper-Sarkar
A. M. Mourão
A. V. Derbin
A. V. Kyuldjiev
B. Achkar
B. Kayser
B. Pontecorvo
C. Athanassopoulos
C. Broggini
C. Giunti
C. Giunti
D. A. Krakauer
D. Yu. Bardin
F. Boehm
F. Ould-Saada
J. Rich
J. Rich
J. Segura
K. S. Hirata
L. Borodovsky
P. Stockinger
P. Stockinger
P. Vogel
R. Barbieri
R. C. Allen
R. N. Mohapatra
S. M. Bilenky
S. M. Bilenky
W. Grimus
W. Grimus
Y. Fukuda
Publication venue: 'American Physical Society (APS)'
Publication date: 07/08/1997
Field of study

We consider elastic antineutrino-electron scattering taking into account possible effects of neutrino masses and mixing and of neutrino magnetic moments and electric dipole moments. Having in mind antineutrinos produced in a nuclear reactor we compute, in particular, the weak-electromagnetic interference terms which are linear in the magnetic (electric dipole) moments and also in the neutrino masses. We show that these terms are, however, suppressed compared to the pure weak and electromagnetic cross section. We also comment upon the possibility of using the electromagnetic cross section to investigate neutrino oscillations.Comment: 12 pages, REVTEX file, no figures, submitted to Phys.Rev.

arXiv.org e-Print Archive

Crossref

DIANA Scheduling Hierarchies for Optimizing Bulk Job Scheduling

Author: Ali A.
Alvi O.
Anjum A.
Hasham K.
McClatchey R.
Sagheer M.
Stockinger H.
Thomas M.
Willers I.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2006
Field of study

The use of meta-schedulers for resource management in large-scale distributed systems often leads to a hierarchy of schedulers. In this paper, we discuss why existing meta-scheduling hierarchies are sometimes not sufficient for Grid systems due to their inability to re-organise jobs already scheduled locally. Such a job re-organisation is required to adapt to evolving loads which are common in heavily used Grid infrastructures. We propose a peer-to-peer scheduling model and evaluate it using case studies and mathematical modelling. We detail the DIANA (Data Intensive and Network Aware) scheduling algorithm and its queue management system for coping with the load distribution and for supporting bulk job scheduling. We demonstrate that such a system is beneficial for dynamic, distributed and self-organizing resource management and can assist in optimizing load or job distribution in complex Grid infrastructures.Comment: 8 pages, 9 figures. Presented at the 2nd IEEE Int Conference on eScience & Grid Computing. Amsterdam Netherlands, December 200

arXiv.org e-Print Archive

Crossref

Caltech Authors

Conventional Dendritic Cells Mount a Type I IFN Response against Candida spp. Requiring Novel Phagosomal TLR7-Mediated IFN-β Signaling

Author: Akira S.
Bourgeois C.
Decker T.
Frohner I.
Glaser W.
Hildering K.
Kuchler K.
Lesiak-Markowicz I.
Majer O.
Müller M.
Stockinger S.
Publication venue: 'The American Association of Immunologists'
Publication date: 01/03/2011
Field of study

MPG.PuRe

Activation of the Aryl Hydrocarbon Receptor Interferes with Early Embryonic Development.

Author: Choudhary JS
Gialitakis M
Li Y
Niakan K
Papayannopoulos V
Pardo M
Stockinger B
Tolaini M
Toribio A
Yu L
Publication venue: 'Elsevier BV'
Publication date: 01/11/2017
Field of study

The transcriptional program of early embryonic development is tightly regulated by a set of well-defined transcription factors that suppress premature expression of differentiation genes and sustain the pluripotent identity. It is generally accepted that this program can be perturbed by environmental factors such as chemical pollutants; however, the precise molecular mechanisms remain unknown. The aryl hydrocarbon receptor (AHR) is a widely expressed nuclear receptor that senses environmental stimuli and modulates target gene expression. Here, we have investigated the AHR interactome in embryonic stem cells by mass spectrometry and show that ectopic activation of AHR during early differentiation disrupts the differentiation program via the chromatin remodeling complex NuRD (nucleosome remodeling and deacetylation). The activated AHR/NuRD complex altered the expression of differentiation-specific genes that control the first two developmental decisions without affecting the pluripotency program. These findings identify a mechanism that allows environmental stimuli to disrupt embryonic development through AHR signaling

Directory of Open Access Journals

Institute of Cancer Research Repository