dBaby: Grounded Language Teaching through Games and Efficient Reinforcement Learning

Author: Barzdins Guntis
Barzdins Paulis F.
Gosko Didzis
Liepins Renars

This paper outlines a project proposal to be submitted to EC H2020 call ICT-29-2018. The purpose of the project is to create a digital Baby (dBaby): an agent that perceives and interacts with the 3D world and communicates with its Teacher via natural language phrases to achieve the goals set by the Teacher. The novelty of the approach is that neither language nor visual capabilities are hard-coded in dBaby. Instead, the Teacher defines a language learning Game grounded in the 3D world, and dBaby learns the language as a byproduct of reinforcement learning from raw pixels and character strings while maximizing the rewards in the Game. So far, such an approach has been demonstrated successfully only in a virtual 3D world with pre-programmed Games, where it requires millions of episodes to learn a dozen words. Moving to a human Teacher and a real 3D environment requires an order-of-magnitude improvement in the data efficiency of reinforcement learning. A novel Episodic Control based pre-training is demonstrated as a promising approach for bootstrapping data-efficient reinforcement learning.

ZENODO

SUMMA: Integrating Multiple NLP Technologies into an Open-source Platform for Multilingual Media Monitoring

Author: Barzdins Guntis
Germann Ulrich
Gosko Didzis
Liepins Renars
Publication date: 01/07/2018

The open-source SUMMA Platform is a highly scalable distributed architecture for monitoring a large number of media broadcasts in parallel, with a lag behind actual broadcast time of at most a few minutes. It assembles numerous state-of-the-art NLP technologies into a fully automated media ingestion pipeline that can record live broadcasts, detect and transcribe spoken content, translate from several languages (original text or transcribed speech) into English, recognize Named Entities, detect topics, cluster and summarize documents across language barriers, and extract and store factual claims in these news items. This paper describes the intended use cases and discusses the system design decisions that allowed us to integrate state-of-the-art NLP modules into an effective workflow with comparatively little effort.

Edinburgh Research Explorer

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

The SUMMA Platform: A Scalable Infrastructure for Multi-lingual Multi-media Monitoring

Author: Barzdins Guntis
Germann Ulrich
Gosko Didzis
Liepins Renars
Miranda Sebastiao
Nogueira David
Publication date: 01/07/2018

The open-source SUMMA Platform is a highly scalable distributed architecture for monitoring a large number of media broadcasts in parallel, with a lag behind actual broadcast time of at most a few minutes. The Platform offers a fully automated media ingestion pipeline capable of recording live broadcasts, detecting and transcribing spoken content, translating all text (original or transcribed) into English, recognizing and linking Named Entities, detecting topics, clustering and cross-lingually summarizing related media items across multiple documents, and, last but not least, extracting and storing factual claims in these news items. Browser-based graphical user interfaces provide humans with aggregated information as well as structured access to individual news items stored in the Platform’s database. This paper describes the intended use cases and provides an overview of the system’s implementation.

Edinburgh Research Explorer

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Latvian-Americans in the Post-Soviet era: Cultural factors on return migration in oral history interviews

Author: Baltais Mirdza Kate
Bela-Krumina Baiba
Carpenter Inta Gāle
Cassarino Jean-Pierre
Clarke Mary Marshall
Hinkle Maija
Jasinskaja-Lahti Inga
Kirss Tiina
Kundera Milan
Liepins Guntis
Plakans A.
Rosenwald G.C.
Salinger Anne Grenn
Von Plato Alexander
Wyman Mark
Zirnite Mara
Publication venue: 'Informa UK Limited'

Crossref

The SUMMA Platform Prototype

We present the first prototype of the SUMMA Platform: an integrated platform for multilingual media monitoring. The platform contains a rich suite of low-level and high-level natural language processing technologies: automatic speech recognition of broadcast media, machine translation, automated tagging and classification of named entities, semantic parsing to detect relationships between entities, and automatic construction and augmentation of factual knowledge bases. Implemented on the Docker platform, it can easily be deployed, customised, and scaled to large volumes of incoming media streams.

Infoscience - École polytechnique fédérale de Lausanne

Crossref

UCL Discovery

Edinburgh Research Explorer