A system design for human factors studies of speech-enabled Web browsing
This paper describes the design of a system which will subsequently be used as the basis of a range of empirical studies aimed at discovering how best to harness speech recognition capabilities in multimodal multimedia computing. Initial work focuses on speech-enabled browsing of the World Wide Web, which was never designed for such use. System design is complete, and is being evaluated via usability testing.
Multimodality in Pervasive Environment
Future pervasive environments are expected to immerse users in a consistent
world of probes, sensors and actuators. Multimodal interfaces combined
with social computing interactions and high-performance networking can foster a
new generation of pervasive environments. However, much work is still needed to
harness the full potential of multimodal interaction. In this paper we discuss some
short-term research goals, including advanced techniques for joining and correlating
multiple data flows, each with its own approximations and uncertainty models.
We also discuss some longer-term objectives, like providing users with a mental
model of their own multimodal "aura", enabling them to collaborate with the network
infrastructure toward inter-modal correlation of multimodal inputs, much in
the same way as the human brain extracts a single self-conscious experience from
multiple sensorial data flows.
A Multi-channel Application Framework for Customer Care Service Using Best-First Search Technique
It has become imperative to address customers' dissatisfaction with the responses
they receive when interacting with mobile service providers' customer care centres.
Problems with Human to Human Interaction (H2H) between customer care centres and their
customers include delayed response times, inconsistent answers to questions or enquiries
and, in some cases, a lack of dedicated access channels for interacting with customer care centres.
This paper presents a framework and development techniques for a multi-channel
application providing Human to System (H2S) interaction for the customer care centre of a
mobile telecommunication provider. The proposed solution is called Interactive Customer
Service Agent (ICSA). Based on single-authoring, it will provide three media of interaction
with the customer care centre of a mobile telecommunication operator: voice, phone and
web browsing. A mathematical search technique called Best-First Search to generate
accurate results in a search environmen
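The abstract above is truncated and does not say how ICSA applies Best-First Search, so the following is only a minimal sketch of generic greedy best-first search, not the authors' implementation. The topic graph, the heuristic scores, and all names (`best_first_search`, `GRAPH`, `SCORE`) are hypothetical and chosen purely for illustration.

```python
import heapq


def best_first_search(start, is_goal, neighbors, heuristic):
    """Greedy best-first search.

    Always expands the frontier node with the lowest heuristic score.
    Returns the path from start to the first goal found, or None.
    Assumes nodes are hashable and that None is never used as a node.
    """
    counter = 0  # tie-breaker so the heap never compares nodes directly
    frontier = [(heuristic(start), counter, start)]
    parent = {start: None}  # also serves as the visited set
    while frontier:
        _, _, node = heapq.heappop(frontier)
        if is_goal(node):
            # Walk parent links back to the start to rebuild the path.
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for nxt in neighbors(node):
            if nxt not in parent:
                parent[nxt] = node
                counter += 1
                heapq.heappush(frontier, (heuristic(nxt), counter, nxt))
    return None


# Hypothetical customer-care topic graph (not taken from the paper).
GRAPH = {
    "start": ["billing", "network"],
    "billing": ["refund", "recharge"],
    "network": ["no_signal"],
    "refund": [],
    "recharge": [],
    "no_signal": [],
}
# Hypothetical heuristic: lower means "looks closer to a refund answer".
SCORE = {"start": 3, "billing": 1, "network": 2,
         "refund": 0, "recharge": 1, "no_signal": 2}

path = best_first_search(
    "start",
    lambda n: n == "refund",
    lambda n: GRAPH[n],
    lambda n: SCORE[n],
)
print(path)
```

Greedy best-first search follows the heuristic alone, so the path it returns is not guaranteed to be the shortest one; whether the paper uses this greedy variant or another is not stated in the abstract.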
Knowledge will Propel Machine Understanding of Content: Extrapolating from Current Examples
Machine Learning has been a big success story during the AI resurgence. One
particular standout success relates to learning from a massive amount of data.
In spite of early assertions of the unreasonable effectiveness of data, there
is increasing recognition of the value of using knowledge whenever it is available
or can be created purposefully. In this paper, we discuss the indispensable role
of knowledge for deeper understanding of content where (i) large amounts of
training data are unavailable, (ii) the objects to be recognized are complex,
(e.g., implicit entities and highly subjective content), and (iii) applications
need to use complementary or related data in multiple modalities/media. What
brings us to the cusp of rapid progress is our ability to (a) create relevant
and reliable knowledge and (b) carefully exploit knowledge to enhance ML/NLP
techniques. Using diverse examples, we seek to foretell unprecedented progress
in our ability for deeper understanding and exploitation of multimodal data and
continued incorporation of knowledge in learning techniques.
Comment: Pre-print of the paper accepted at 2017 IEEE/WIC/ACM International
Conference on Web Intelligence (WI). arXiv admin note: substantial text
overlap with arXiv:1610.0770
Using Sound to Enhance Users’ Experiences of Mobile Applications
The latest smartphones with GPS, electronic compass, directional audio, touch screens etc. hold potential for location-based services that are easier to use than traditional tools. Rather than interpreting maps, users may focus on their activities and the environment around them. Interfaces may be designed that let users search for information by simply pointing in a direction. Database queries can be created from GPS location and compass direction data. Users can get guidance to locations through pointing gestures, spatial sound and simple graphics. This article describes two studies testing prototype applications with multimodal user interfaces built on spatial audio, graphics and text. Tests show that users appreciated the applications for their ease of use, for being fun and effective to use and for allowing users to interact directly with the environment rather than with abstractions of it. The multimodal user interfaces contributed significantly to the overall user experience.
An information assistant system for the prevention of tunnel vision in crisis management
In the crisis management environment, tunnel vision is a set of biases in decision makers’ cognitive processes which often leads to incorrect understanding of the real crisis situation, biased perception of information, and improper decisions. The tunnel vision phenomenon is a consequence of both the challenges of the task and the natural limitations of human cognitive processes. An information assistant system is proposed with the purpose of preventing tunnel vision. The system serves as a platform for monitoring the ongoing crisis event. All information goes through the system before it arrives at the user. The system enhances the data quality, reduces the data quantity and presents the crisis information in a manner that prevents or repairs the user’s cognitive overload. While working with such a system, the users (crisis managers) are expected to be more likely to stay aware of the actual situation, stay open-minded to possibilities, and make proper decisions.
Multimodal and ubiquitous computing systems: supporting independent-living older users
We document the rationale and design of a multimodal interface to a pervasive/ubiquitous computing system that supports independent living by older people in their own homes. The Millennium Home system involves fitting a resident’s home with sensors – these sensors can be used to trigger sequences of interaction with the resident to warn them about dangerous events, or to check if they need external help. We draw lessons from the design process and conclude the paper with implications for the design of multimodal interfaces to ubiquitous systems developed for the elderly and in healthcare, as well as for more general ubiquitous computing applications