Error analysis in automatic speech recognition and machine translation
Automatic speech recognition and machine translation are well-known terms in
the translation world nowadays. Systems that carry out these processes are taking over the work
of humans more and more. Reasons for this are the speed at which the tasks are performed and
their costs. However, the quality of these systems is debatable. They are not yet capable of
delivering the same performance as human transcribers or translators. A lack of creativity,
of the ability to interpret texts, and of a feel for language is often cited as the reason why the
performance of machines is not yet at the level of human translation or transcription work.
Despite this, there are companies that use these machines in their production pipelines.
Unbabel, an online translation platform powered by artificial intelligence, is one of these
companies. Through a combination of human translators and machines, Unbabel tries to
provide its customers with a translation of good quality. This internship report was written with
the aim of gaining an overview of the performance of these systems and the errors they produce.
Based on this work, we try to get a picture of possible error patterns produced by both systems.
The present work consists of an extensive analysis of errors produced by automatic speech
recognition and machine translation systems after automatically transcribing and translating 10
English videos into Dutch. Different videos were deliberately chosen to see if there were
significant differences in the error patterns between videos. The generated data and results from
this work aim to suggest possible ways to improve the quality of the services already
mentioned.
Multispace & Multistructure. Neutrosophic Transdisciplinarity (100 Collected Papers of Sciences), Vol. IV
The fourth volume in my book series of "Collected Papers" includes 100 published and unpublished articles, notes, (preliminary) drafts containing just ideas to be further investigated, scientific souvenirs, scientific blogs, project proposals, small experiments, solved and unsolved problems and conjectures, updated or alternative versions of previous papers, short or long humanistic essays, and letters to the editors, all collected over the previous three decades (1980-2010). Most of them are from the last decade (2000-2010); some were lost and found, while others are extended, diversified, or improved versions. This is an eclectic tome of 800 pages with papers in various fields of science, alphabetically listed, such as: astronomy, biology, calculus, chemistry, computer programming codification, economics and business and politics, education and administration, game theory, geometry, graph theory, information fusion, neutrosophic logic and set, non-Euclidean geometry, number theory, paradoxes, philosophy of science, psychology, quantum physics, scientific research methods, and statistics. It reflects my long-standing preoccupation and collaboration, as author, co-author, translator, co-translator, and editor, with many scientists from around the world. Many topics in this book are incipient and need to be expanded in future explorations.
Keys to The Gift
Yuri Leving's Keys to The Gift: A Guide to Vladimir Nabokov's Novel is a new systematization of the main available data on Nabokov's most complex Russian novel, The Gift (1934–1939). From notes in Nabokov's private correspondence to scholarly articles accumulated during the seventy years since the novel's first appearance in print, this work draws from a broad spectrum of existing material in a succinct and coherent way and provides innovative analyses. The first part of the monograph, "The Novel," outlines the basic properties of The Gift (plot, characters, style, and motifs) and reconstructs its internal chronology. The second part, "The Text," describes the creation of the novel and the history of its publication, public and critical reaction, challenges of English translation, and post-Soviet reception. Along with annotations to all five chapters of The Gift, the commentary provides insight into problems of paleography, featuring a unique textological analysis of the novel.
A Statistical Approach to the Alignment of fMRI Data
Multi-subject functional magnetic resonance imaging (fMRI) studies are critical. Because anatomical and functional structure varies across subjects, image alignment is necessary. We define a probabilistic model to describe functional alignment. By imposing a prior distribution, the matrix von Mises-Fisher distribution, on the orthogonal transformation parameter, anatomical information is embedded in the estimation of the parameters, i.e., the combination of spatially distant voxels is penalized. Real applications show an improvement in the classification and interpretability of the results compared to various functional alignment methods.
A comparison of the CAR and DAGAR spatial random effects models with an application to diabetics rate estimation in Belgium
When hierarchically modelling an epidemiological phenomenon on a finite collection of sites in space, one must always take a latent spatial effect into account in order to capture the correlation structure that links the phenomenon to the territory. In this work, we compare two autoregressive spatial models that can be used for this purpose: the classical CAR model and the more recent DAGAR model. Unlike the former, the latter has a desirable property: its ρ parameter can be naturally interpreted as the average neighbor pair correlation and, in addition, this parameter can be directly estimated when the effect is modelled using a DAGAR rather than a CAR structure. As an application, we model the diabetics rate in Belgium in 2014 and show the adequacy of these models in predicting the response variable when no covariates are available.
Undergraduate catalog 2004-06, revised
The Undergraduate Catalog for the University of Missouri Columbia has been organized to enhance readability. The initial sections are related to University-wide programs, policies and procedures. The second section provides the listing of academic offerings, organized by the academic units (also may be called colleges or schools) that offer the courses and/or the degrees (major, minor or certificate) that students seek to earn. In addition to the Table of Contents, the Faculty listing and Index at the back of the catalog are invaluable for locating a person or topic quickly. An electronic version is also on the MU web site. Graduate and professional programs (Law, Medicine and Veterinary Medicine) have separate catalogs. -- Page 11. Revised October 2005
Undergraduate catalog 2004-06, original
The Undergraduate Catalog for the University of Missouri Columbia has been organized to enhance readability. The initial sections are related to University-wide programs, policies and procedures. The second section provides the listing of academic offerings, organized by the academic units (also may be called colleges or schools) that offer the courses and/or the degrees (major, minor or certificate) that students seek to earn. In addition to the Table of Contents, the Faculty listing and Index at the back of the catalog are invaluable for locating a person or topic quickly. An electronic version is also on the MU web site. Graduate and professional programs (Law, Medicine and Veterinary Medicine) have separate catalogs. -- Page 11. Origina