4,271 research outputs found
A Universal Scheme for WynerâZiv Coding of Discrete Sources
We consider the WynerâZiv (WZ) problem of lossy compression where the decompressor observes a noisy version of the source, whose statistics are unknown. A new family of WZ coding algorithms is proposed and their universal optimality is proven. Compression consists of sliding-window processing followed by LempelâZiv (LZ) compression, while the decompressor is based on a modification of the discrete universal denoiser (DUDE) algorithm to take advantage of side information. The new algorithms not only universally attain the fundamental limits, but also suggest a paradigm for practical WZ coding. The effectiveness of our approach is illustrated with experiments on binary images, and English text using a low complexity algorithm motivated by our class of universally optimal WZ codes
Speech-to-speech Translation between Untranscribed Unknown Languages
In this paper, we explore a method for training speech-to-speech translation
tasks without any transcription or linguistic supervision. Our proposed method
consists of two steps: First, we train and generate discrete representation
with unsupervised term discovery with a discrete quantized autoencoder. Second,
we train a sequence-to-sequence model that directly maps the source language
speech to the target language's discrete representation. Our proposed method
can directly generate target speech without any auxiliary or pre-training steps
with a source or target transcription. To the best of our knowledge, this is
the first work that performed pure speech-to-speech translation between
untranscribed unknown languages.Comment: Accepted in IEEE ASRU 2019. Web-page for more samples & details:
https://sp2code-translation-v1.netlify.com
The impact of ICT sophistication on geographically distant networks: the case of space physics as seen from France
This paper examines scientific collaboration between French public research teams and distant partners. We first analyse the role and the development of trust and then, the relation between the degree of sophistication of Information and Communication Technologies (ICT) and the constraint of geographical proximity. In that purpose, we present a typology of the different kinds of knowledge and a classification of technologies. A case study in the field of space physics allows us to confront our theoretical elements to real life. We study the evolution of ICT sophistication parallel to collaboration patterns. Finally, we give some recommendations for public funding of virtual networks.collaboratory, knowledge transfer, trust, ICT classification, space physics
Few shot font generation via transferring similarity guided global style and quantization local style
Automatic few-shot font generation (AFFG), aiming at generating new fonts
with only a few glyph references, reduces the labor cost of manually designing
fonts. However, the traditional AFFG paradigm of style-content disentanglement
cannot capture the diverse local details of different fonts. So, many
component-based approaches are proposed to tackle this problem. The issue with
component-based approaches is that they usually require special pre-defined
glyph components, e.g., strokes and radicals, which is infeasible for AFFG of
different languages. In this paper, we present a novel font generation approach
by aggregating styles from character similarity-guided global features and
stylized component-level representations. We calculate the similarity scores of
the target character and the referenced samples by measuring the distance along
the corresponding channels from the content features, and assigning them as the
weights for aggregating the global style features. To better capture the local
styles, a cross-attention-based style transfer module is adopted to transfer
the styles of reference glyphs to the components, where the components are
self-learned discrete latent codes through vector quantization without manual
definition. With these designs, our AFFG method could obtain a complete set of
component-level style representations, and also control the global glyph
characteristics. The experimental results reflect the effectiveness and
generalization of the proposed method on different linguistic scripts, and also
show its superiority when compared with other state-of-the-art methods. The
source code can be found at https://github.com/awei669/VQ-Font.Comment: Accepted by ICCV 202
Portuguese media discourse on nuclear energy before and after Fukushima: prepared for the EFDA Workprogramme 2012 WP12-SER-ACIF-1
This report presents the results of the analysis of nuclear related content in Portuguese print media. The report is submitted to the first step of the research task untitled Public Discourse about Nuclear Fusion before and after the Fukushima accident, as part of the Socio-Economic Research on Fusion under the general coordination of EFDA Workprogramme 2012
- âŠ