Towards better subtitles: A multilingual approach for punctuation restoration of speech transcripts
This paper proposes a flexible approach for punctuation prediction that can be used to produce state-of-the-art results in a multilingual scenario. We have performed experiments using transcripts of TED Talks from the IWSLT 2017 and IWSLT 2011 evaluation campaigns. Our experiments show that the recognition errors of the ASR output degrade the performance of our models, in line with related literature. Our monolingual models perform consistently on human-edited transcripts of German, Dutch, Portuguese and Romanian, and suggest that commas may be more difficult to predict than periods when using pre-trained contextual models. We have trained a single multilingual model that predicts punctuation in multiple languages and achieves results comparable with those achieved by monolingual models, revealing evidence of the potential of using a single multilingual model to solve the task for multiple languages. We then argue that current punctuation systems in the literature are implicitly dependent on correct segmentation of ASR outputs, because they rely on positional information to solve the punctuation task. This is too strong a requirement for use in a real-life application. Through several experiments, we show that our method to train and test models is more robust to different segmentation. These contributions are of particular importance in our multilingual pipeline, since they avoid training a different model for each of the involved languages, and they guarantee that the model will be more robust to incorrect segmentation of the ASR outputs in comparison with other methods in the literature. To the best of our knowledge, we report the first experiments using a single multilingual model for punctuation restoration in multiple languages.
Nonlinear interaction between two heralded single photons
Harnessing nonlinearities strong enough to allow two single photons to
interact with one another is not only a fascinating challenge but is central to
numerous advanced applications in quantum information science. Currently, all
known approaches are extremely challenging although a few have led to
experimental realisations with attenuated classical laser light. This has
included cross-phase modulation with weak classical light in atomic ensembles
and optical fibres, converting incident laser light into a non-classical stream
of photons or Rydberg blockades as well as all-optical switches with attenuated
classical light in various atomic systems. Here we report the observation of a
nonlinear parametric interaction between two true single photons. Single
photons are initially generated by heralding one photon from each of two
independent spontaneous parametric downconversion sources. The two heralded
single photons are subsequently combined in a nonlinear waveguide where they
are converted into a single photon with a higher energy. Our approach
highlights the potential for quantum nonlinear optics with integrated devices,
and as the photons are at telecom wavelengths, it is well adapted to
applications in quantum communication. Comment: 4 pages, 4 figures
Multilingual simultaneous sentence end and punctuation prediction
This paper describes the model and its corresponding setup, proposed by the Unbabel & INESC-ID team for the 1st Shared Task on Sentence End and Punctuation Prediction in NLG Text (SEPP-NLG 2021). The shared task covers 4 languages (English, German, French and Italian) and includes two subtasks: Subtask 1, detecting the end of a sentence, and Subtask 2, predicting a range of punctuation marks. Our team proposes a single multilingual and multitask model that is able to produce suitable results for all the languages and subtasks involved. The results show that it is possible to achieve state-of-the-art results using one single multilingual model for both tasks and multiple languages. Using a single multilingual model to solve the task for multiple languages is of particular importance, since training a different model for each language is a cumbersome and time-consuming process.
Dark matter from cosmic defects on galactic scales?
We discuss the possible dynamical role of extended cosmic defects on galactic
scales, specifically focusing on the possibility that they may provide the dark
matter suggested by the classical problem of galactic rotation curves. We
emphasize that the more standard defects (such as Goto-Nambu strings) are
unsuitable for this task, but show that more general models (such as transonic
wiggly strings) could in principle have a better chance. In any case, we show
that observational data severely restricts any such scenarios. Comment: Submitted to Phys. Rev. D (Brief Reports). v2: Reference added and
some typos corrected, matches published version
Technical guidelines for reduced-impact logging in Terra Firme forest operations in the Brazilian Amazon.
General presentation of the technical guidelines. Guidelines for pre-harvest activities. Guidelines for forest harvesting activities. Guidelines for post-harvest activities.
Yeasts vectored by migratory birds collected in the Mediterranean island of Ustica and description of Phaffomyces usticensis f.a. sp. nov., a new species related to the cactus ecoclade.
Nine yeast species belonging to the genera Candida, Cryptococcus, Phaffomyces,
Rhodotorula and Wickerhamomyces, and one species of the genus Aureobasidium
were isolated from the cloaca of migratory birds. Candida glabrata and
C. inconspicua were the species most frequently isolated and Wickerhamomyces
sylviae, which has recently been described as a new species isolated from bird
cloaca, was again found. The majority of isolates showed the ability to grow up
to 40 °C and/or at pH 3.0, two environmental conditions typical of the digestive
tract of birds. The phylogenetic analysis of the D1/D2 domain of the 26S
rRNA gene placed the cultures of Phaffomyces in a new lineage that differed
from the closest species, P. opuntiae, by 13 nucleotide substitutions. The new
species was able to grow at 40 °C and at pH 2.5, which suggests a possible
adaptation to the bird cloaca. Moreover, the ability to grow in the presence of
digitonin at pH 3.7 and the assimilation of ethyl acetate indicate a potential
cactophilic origin. For the first time, the presence of yeasts belonging to the
Phaffomyces clade in Europe and also in non-cactus environments is reported.
The new species is formally described as P. usticensis sp. nov. (PYCC
6346T = CBS 12958T)
Considerations for implanting speech brain computer interfaces based on functional magnetic resonance imaging
Objective. Brain-computer interfaces (BCIs) have the potential to reinstate lost communication faculties. Results from speech decoding studies indicate that a usable speech BCI based on activity in the sensorimotor cortex (SMC) can be achieved using subdurally implanted electrodes. However, the optimal characteristics for a successful speech implant are largely unknown. We address this topic in a high-field blood oxygenation level dependent functional magnetic resonance imaging (fMRI) study, by assessing the decodability of spoken words as a function of hemisphere, gyrus, sulcal depth, and position along the ventral/dorsal axis. Approach. Twelve subjects conducted a 7T fMRI experiment in which they pronounced 6 different pseudo-words over 6 runs. We divided the SMC by hemisphere, gyrus, sulcal depth, and position along the ventral/dorsal axis. Classification was performed in these SMC areas using a multiclass support vector machine (SVM). Main results. Significant classification was possible from the SMC, but no preference for the left or right hemisphere, nor for the precentral or postcentral gyrus for optimal word classification was detected. Classification while using information from the cortical surface was slightly better than when using information from deep in the central sulcus and was highest within the ventral 50% of SMC. Confusion matrices were highly similar across the entire SMC. An SVM-searchlight analysis revealed significant classification in the superior temporal gyrus and left planum temporale in addition to the SMC. Significance. The current results support a unilateral implant using surface electrodes, covering the ventral 50% of the SMC. The added value of depth electrodes is unclear. We did not observe evidence for variations in the qualitative nature of information across SMC. The current results need to be confirmed in paralyzed patients performing attempted speech.
Expression of genes related to nitrogen, phosphorus and potassium metabolism in coffee plants subjected to biotic stress.
Plants are frequently threatened by external agents, which may be biotic and/or abiotic factors. For each type of threat there is a refined type of response that characterizes the defense process. Defense processes include the activation or repression of related genes such as chitinases, glucanases, lipoxygenases and transferases, among others. Any biochemical modifications that occur during pathogen attacks are indispensable for understanding this biological mechanism. Correlating the biological mechanisms of ion regulation and transport with the different periods of infestation and infection is one way of elucidating the processes that occur during infestation and infection. In this study, the expression profiles of 5 genes (HAK-5, KEA, PAP-1, PII, NTR) related to the transport and regulation of potassium and nitrogen were analyzed under different biotic stimuli and at different stages of each of these processes. The results showed differential regulation patterns between susceptible and resistant plants for all genes analyzed. These preliminary analyses indicate that the regulation of nutrient uptake and/or transport plays an important role in the defense responses of coffee plants against the coffee leaf miner.