914 research outputs found
Proceedings: Voice Technology for Interactive Real-Time Command/Control Systems Application
Speech understanding among researchers and managers, current developments in voice technology, and an exchange of information concerning government voice technology efforts are discussed
Automatic translation of formal data specifications to voice data-input applications.
This thesis introduces a complete solution for automatic translation of formal data specifications to voice data-input applications. The objective of the research is to automatically generate applications for inputting data through speech from specifications of the structure of the data. The formal data specifications are XML DTDs. A new formalization called Grammar-DTD (G-DTD) is introduced as an extended DTD that contains grammars to describe valid values of the DTD elements and attributes. G-DTDs facilitate the automatic generation of Voice XML applications that correspond to the original DTD structure. The development of the automatic application-generator included identifying constraints on the G-DTD to ensure a feasible translation, using predicate calculus to build a knowledge base of inference rules that describes the mapping procedure, and writing an algorithm for the automatic translation based on the inference rules.Dept. of Computer Science. Paper copy at Leddy Library: Theses & Major Papers - Basement, West Bldg. / Call Number: Thesis2006 .H355. Source: Masters Abstracts International, Volume: 45-01, page: 0354. Thesis (M.Sc.)--University of Windsor (Canada), 2006
Recommended from our members
Voice user interfaces (VUIs) emerge
Human beings use speech as the primary means of communication. Therefore, voice user interfaces (VUIs) represent a natural way for people to interact with computers (Fournier, 1996; Hyde, 1979; Teja and Gonnella, 1983; Witten 1982). Voice input facilitates a much higher computer input rate than keyboard or mouse driven input. Voice output permits computer generated output in devices when output screens are not available (e.g., most phones) or in situations where the user\u27s eyes are busy elsewhere (e.g., driving a car, assembling a product, etc.). Thus, VUIs are viewed as the logical next generation in computer interfaces. Forecasts call for a rapid expansion of voice technology within our work environments over the next few years (Cone, 1997; Schwartz and Brier, 1997). This paper discusses the current state of and potential future of VUIs
Development of customized conversational interfaces with Deep Learning techniques
This Bachelor’s thesis will cover the end-to-end process of developing a personalized conversational interface for a specific domain, using Deep Learning techniques. In particular, it will focus on the study of the Dialog Manager module, which is in charge of deciding the next system response based on the current dialog state. AlthoughthereisplentyofliteratureaboutMachineLearningappliedtotheconstruction of dialog management models, there is very little reference to the utilization of Deep Learning for such task. As a result, this work analyzes the improvement that deep neural networks can bring to accuracy. Several models are created with TensorFlow, and comparisons are made with traditional Machine Learning solutions. Results show that Deep Learning is not the most recommended approach for this type of problems, yet further research is suggested for more complex datasets. After this, one of the Deep Learning models, based on a train scheduling domain, is used for the implementation of the dialog manager inside a real spoken dialog system. To integrate the rest of required components of such technology (automatic speech recognizer, natural language understanding module and text-to-speech service), a modern framework is used: DialogFlow. With this platform, a complete chatbot is built in the form of an assistant in the train scheduling domain. Evaluationof thespoken dialogsystemwith real users generatesavery positivefeedback, demonstrating that a Deep Learning based dialog manager is a valid solution in commercial conversational interfaces.IngenierĂa Informátic
The direction of technical change in AI and the trajectory effects of government funding
Government funding of innovation can have a significant impact not only on the rate of technical change, but also on its direction. In this paper, we examine the role that government grants and government departments played in the development of artificial intelligence (AI), an emergent general purpose technology with the potential to revolutionize many aspects of the economy and society. We analyze all AI patents filed at the US Patent and Trademark Office and develop network measures that capture each patent’s influence on all possible sequences of follow-on innovation. By identifying the effect of patents on technological trajectories, we are able to account for the long-term cumulative impact of new knowledge that is not captured by standard patent citation measures. We show that patents funded by government grants, but above all patents filed by federal agencies and state departments, profoundly influenced the development of AI. These long-term effects were especially significant in early phases, and weakened over time as private incentives took over. These results are robust to alternative specifications and controlling for endogeneity
Proceedings of the 3rd IUI Workshop on Interacting with Smart Objects
These are the Proceedings of the 3rd IUI Workshop on Interacting with Smart Objects. Objects that we use in our everyday life are expanding their restricted interaction capabilities and provide functionalities that go far beyond their original functionality. They feature computing capabilities and are thus able to capture information, process and store it and interact with their environments, turning them into smart objects
Dialogs Re-enacted Across Languages
To support machine learning of cross-language prosodic mappings and other
ways to improve speech-to-speech translation, we present a protocol for
collecting closely matched pairs of utterances across languages, a description
of the resulting data collection, and some observations and musings. This
report is intended for 1) people using the corpus, 2) people extending the
corpus, and 3) people designing similar collections of bilingual dialog data
- …