Research on Architectures for Integrated Speech/Language Systems in Verbmobil
The German joint research project Verbmobil (VM) aims at the development of a
speech to speech translation system. This paper reports on research done in our
group which belongs to Verbmobil's subproject on system architectures (TP15).
Our specific research areas are the construction of parsers for spontaneous
speech, the parallelization of parsing, and contributions to
the development of a flexible communication architecture with distributed
control.
Comment: 6 pages, 2 Postscript figures
Integrated speech and morphological processing in a connectionist continuous speech understanding for Korean
A new tightly coupled speech and natural language integration model is
presented for a TDNN-based continuous, possibly large-vocabulary speech
recognition system for Korean. Unlike the popular n-best techniques developed
mainly for integrating HMM-based speech recognition and natural language processing
at the word level, which is obviously inadequate for morphologically
complex agglutinative languages, our model constructs a spoken language system
based on morpheme-level speech and language integration. With this
integration scheme, the spoken Korean processing engine (SKOPE) is designed and
implemented using a TDNN-based diphone recognition module integrated with a
Viterbi-based lexical decoding and symbolic phonological/morphological
co-analysis. Our experimental results show that speaker-dependent continuous
eojeol (Korean word) recognition and integrated morphological analysis
can be achieved with a success rate of over 80.6% directly from speech input for
middle-level vocabularies.
Comment: latex source with a4 style, 15 pages, to be published in computer
processing of oriental language journal
Robust Processing of Natural Language
Previous approaches to robustness in natural language processing usually
treat deviant input by relaxing grammatical constraints whenever a successful
analysis cannot be provided by "normal" means. This scheme implies that
error detection always precedes error handling, a behaviour which can hardly
compete with its human model, where many erroneous situations are handled
without even being noticed.
The paper analyses the necessary preconditions for achieving a higher degree
of robustness in natural language processing and suggests a quite different
approach based on a procedure for structural disambiguation. It not only offers
the possibility to cope with robustness issues in a more natural way but
eventually might be suited to accommodate quite different aspects of robust
behaviour within a single framework.
Comment: 16 pages, LaTeX, uses pstricks.sty, pstricks.tex, pstricks.pro,
pst-node.sty, pst-node.tex, pst-node.pro. To appear in: Proc. KI-95, 19th
German Conference on Artificial Intelligence, Bielefeld (Germany), Lecture
Notes in Computer Science, Springer 199
Machine learning for the diagnosis of early stage diabetes using temporal glucose profiles
Machine learning shows remarkable success for recognizing patterns in data.
Here we apply machine learning (ML) to the diagnosis of early stage
diabetes, which is known to be a challenging task in medicine. Blood glucose
levels are tightly regulated by two counter-regulatory hormones, insulin and
glucagon, and the failure of glucose homeostasis leads to the common
metabolic disease, diabetes mellitus. It is a chronic disease with a long
latent period that complicates detection of the disease at an early stage. The
vast majority of diabetes cases result from diminished effectiveness of insulin
action. Insulin resistance must modify the temporal profile of blood
glucose. Thus we propose to use ML to detect the subtle change in the temporal
pattern of glucose concentration. Time series data of blood glucose with
sufficient resolution is currently unavailable, so we confirm the proposal
using synthetic data of glucose profiles produced by a biophysical model that
considers the glucose regulation and hormone action. Multi-layered perceptrons,
convolutional neural networks, and recurrent neural networks all identified the
degree of insulin resistance with high accuracy above .
Comment: 4 pages, 2 figures
An investigation of tightly coupled time synchronous speech language interfaces using a unification grammar
This paper reports on some experiments on time synchronous interfaces between word recognition and parsing, performed with a beam decoder and a chart parser. Using the same acoustic models, language model, and unification grammar, a bottom-up protocol and two interactive protocols were implemented and examined. Results show that close integration is possible without unbearable time penalties if restrictions from both modules are applied to focus the search process.
AI Extenders: The Ethical and Societal Implications of Humans Cognitively Extended by AI
Humans and AI systems are usually portrayed as separate systems that we need to align in values and goals. However, there is a great deal of AI technology found in non-autonomous systems that are used as cognitive tools by humans. Under the extended mind thesis, the functional contributions of these tools become as essential to our cognition as our brains. But AI can take cognitive extension towards totally new capabilities, posing new philosophical, ethical and technical challenges. To analyse these challenges better, we define and place AI extenders in a continuum between fully-externalized systems, loosely coupled with humans, and fully-internalized processes, with operations ultimately performed by the brain, making the tool redundant. We dissect the landscape of cognitive capabilities that can foreseeably be extended by AI and examine their ethical implications. We suggest that cognitive extenders using AI be treated as distinct from other cognitive enhancers by all relevant stakeholders, including developers, policy makers, and human users.
Mental states in communication
This paper is concerned with the mental processes involved in intentional communication. I describe an agent's cognitive architecture as the set of cognitive dynamics (i.e., sequences of mental states with contents) she may entertain. I then describe intentional communication as one such dynamics, arguing against the prevailing view that communication consists in playing a role in a socially shared script. The cognitive capabilities needed for such dynamics are mindreading (i.e., the ability to reason upon another individual's mental states) and communicative planning (i.e., the ability to dynamically represent and act in a communicative situation).