5,838 research outputs found
Towards Avatars with Artificial Minds: Role of Semantic Memory
The first step towards creating avatars with human-like artificial minds is to give them human-like memory structures with access to general knowledge about the world. This type of knowledge is stored in semantic memory. Although many approaches to modeling semantic memory have been proposed, they are not very useful in real-life applications because they lack knowledge comparable to human common sense and cannot be implemented in a computationally efficient way. The most drastic simplification of semantic memory, leading to the simplest knowledge representation that is sufficient for many applications, is based on Concept Description Vectors (CDVs), which store, for each concept, information on whether a given property applies to that concept. Unfortunately, even such simple information about real objects or concepts is not available. Experiments with automatic creation of concept description vectors from various sources, including ontologies, dictionaries, encyclopedias and unstructured text sources, are described. A Haptek-based talking head that has access to this memory has been created as an example of a humanized interface (HIT) that can interact with web pages and exchange information in a natural way. A few examples of applications of an avatar with semantic memory are given, including the twenty questions game and automatic creation of word puzzles.
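As a rough illustration of the CDV idea in the abstract above, the sketch below stores one binary vector per concept and uses it to play a twenty-questions style game. The property names, the four toy concepts, and the "most even split" question heuristic are all assumptions made for this example; the abstract does not say how the authors select questions or what their vectors contain.

```python
from typing import Dict, List

# Hypothetical property list; each CDV entry says whether the property applies.
PROPERTIES: List[str] = ["is_animal", "can_fly", "has_fur", "lives_in_water"]

# Hypothetical toy concept description vectors (1 = applies, 0 = does not).
CDV: Dict[str, List[int]] = {
    "dog":     [1, 0, 1, 0],
    "sparrow": [1, 1, 0, 0],
    "shark":   [1, 0, 0, 1],
    "kite":    [0, 1, 0, 0],
}


def best_question(candidates: List[str]) -> int:
    """Return the index of the property that splits the candidates most evenly."""
    def imbalance(prop: int) -> int:
        yes = sum(CDV[c][prop] for c in candidates)
        return abs(2 * yes - len(candidates))
    return min(range(len(PROPERTIES)), key=imbalance)


def filter_candidates(candidates: List[str], prop: int, answer: bool) -> List[str]:
    """Keep only the concepts whose CDV agrees with the given yes/no answer."""
    return [c for c in candidates if bool(CDV[c][prop]) == answer]


def play(secret: str) -> str:
    """Guess the secret concept by asking property questions answered from its CDV."""
    candidates = list(CDV)
    while len(candidates) > 1:
        prop = best_question(candidates)
        candidates = filter_candidates(candidates, prop, bool(CDV[secret][prop]))
    return candidates[0]
```

Calling play("dog") narrows the four toy concepts down to "dog". The loop relies on every concept having a distinct vector, which holds for this toy data but would need checking for automatically built vectors.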
Noisy-parallel and comparable corpora filtering methodology for the extraction of bi-lingual equivalent data at sentence level
Text alignment and text quality are critical to the accuracy of Machine
Translation (MT) systems, some NLP tools, and any other text processing tasks
requiring bilingual data. This research proposes a language-independent
bi-sentence filtering approach based on Polish-to-English experiments (Polish
is not a position-sensitive language). This cleaning approach was developed on the
TED Talks corpus and also initially tested on the Wikipedia comparable corpus,
but it can be used for any text domain or language pair. The proposed approach
implements various heuristics for sentence comparison. Some of them leverage
synonyms and semantic and structural analysis of text as additional
information. Minimization of data loss was ensured. An improvement in MT system
score with text processed using the tool is discussed.
Comment: arXiv admin note: text overlap with arXiv:1509.09093, arXiv:1509.0888
Design of a Controlled Language for Critical Infrastructures Protection
We describe a project for the construction of a controlled language for critical infrastructures protection (CIP). This project originates
from the need to coordinate and categorize the communications on CIP at the European level. These communications can be physically
represented by official documents, reports on incidents, informal communications and plain e-mail. We explore the application of
traditional library science tools for the construction of controlled languages in order to achieve our goal. Our starting point is an
analogous work done during the sixties in the field of nuclear science known as the Euratom Thesaurus.
JRC.G.6-Security technology assessmen
Natural User Interfaces (NUI): review
The article summarizes and systematizes knowledge concerning natural user interfaces. The most important facts related to this topic are supplemented with examples of possible practical uses of this type of human-computer communication. The article also describes the three most popular controllers: Microsoft Kinect, Nintendo Wii and Sony Move.
Managing the Development of Speech Recognition Systems: Performance Issues
Speech recognition enables the transformation of spoken words and sentences into text in digital form. This technology has been the subject of numerous studies and commercial development for many years. The aim of this paper is to examine performance issues of speech recognition and to manage development in this field. Through analysis of the performance limitations of speech recognition systems, we identified 11 main issues to overcome. They indicate the direction for managing the development of speech recognition systems.