1,361 research outputs found
Knowledge Representation and WordNets
Knowledge itself is a representation of “real facts”.
Knowledge is a logical model that presents facts from “the real world” witch can be expressed in a formal language. Representation means the construction of a model of some part of reality.
Knowledge representation is contingent to both cognitive science and artificial intelligence. In cognitive science it expresses the way people store and process the information. In the AI field the goal is to store knowledge in such way that permits intelligent programs to represent information as nearly as possible to human intelligence.
Knowledge Representation is referred to the formal representation of knowledge intended to be processed and stored by computers and to draw conclusions from this knowledge.
Examples of applications are expert systems, machine translation systems, computer-aided maintenance systems and information retrieval systems (including database front-ends).knowledge, representation, ai models, databases, cams
Challenges to knowledge representation in multilingual contexts
To meet the increasing demands of the complex inter-organizational processes and the demand for
continuous innovation and internationalization, it is evident that new forms of organisation are
being adopted, fostering more intensive collaboration processes and sharing of resources, in what
can be called collaborative networks (Camarinha-Matos, 2006:03). Information and knowledge are
crucial resources in collaborative networks, being their management fundamental processes to
optimize.
Knowledge organisation and collaboration systems are thus important instruments for the success of
collaborative networks of organisations having been researched in the last decade in the areas of
computer science, information science, management sciences, terminology and linguistics.
Nevertheless, research in this area didn’t give much attention to multilingual contexts of
collaboration, which pose specific and challenging problems. It is then clear that access to and
representation of knowledge will happen more and more on a multilingual setting which implies the
overcoming of difficulties inherent to the presence of multiple languages, through the use of
processes like localization of ontologies.
Although localization, like other processes that involve multilingualism, is a rather well-developed
practice and its methodologies and tools fruitfully employed by the language industry in the
development and adaptation of multilingual content, it has not yet been sufficiently explored as an
element of support to the development of knowledge representations - in particular ontologies -
expressed in more than one language. Multilingual knowledge representation is then an open
research area calling for cross-contributions from knowledge engineering, terminology, ontology
engineering, cognitive sciences, computational linguistics, natural language processing, and
management sciences.
This workshop joined researchers interested in multilingual knowledge representation, in a
multidisciplinary environment to debate the possibilities of cross-fertilization between knowledge
engineering, terminology, ontology engineering, cognitive sciences, computational linguistics,
natural language processing, and management sciences applied to contexts where multilingualism
continuously creates new and demanding challenges to current knowledge representation methods
and techniques.
In this workshop six papers dealing with different approaches to multilingual knowledge
representation are presented, most of them describing tools, approaches and results obtained in the
development of ongoing projects.
In the first case, Andrés Domínguez Burgos, Koen Kerremansa and Rita Temmerman present a
software module that is part of a workbench for terminological and ontological mining,
Termontospider, a wiki crawler that aims at optimally traverse Wikipedia in search of domainspecific
texts for extracting terminological and ontological information. The crawler is part of a tool
suite for automatically developing multilingual termontological databases, i.e. ontologicallyunderpinned
multilingual terminological databases. In this paper the authors describe the basic principles
behind the crawler and summarized the research setting in which the tool is currently tested.
In the second paper, Fumiko Kano presents a work comparing four feature-based similarity
measures derived from cognitive sciences. The purpose of the comparative analysis presented by the author is to verify the potentially most effective model that can be applied for mapping independent ontologies in a culturally influenced domain. For that, datasets based on standardized
pre-defined feature dimensions and values, which are obtainable from the UNESCO Institute for
Statistics (UIS) have been used for the comparative analysis of the similarity measures. The purpose
of the comparison is to verify the similarity measures based on the objectively developed datasets.
According to the author the results demonstrate that the Bayesian Model of Generalization provides
for the most effective cognitive model for identifying the most similar corresponding concepts
existing for a targeted socio-cultural community.
In another presentation, Thierry Declerck, Hans-Ulrich Krieger and Dagmar Gromann present an
ongoing work and propose an approach to automatic extraction of information from multilingual
financial Web resources, to provide candidate terms for building ontology elements or instances of
ontology concepts. The authors present a complementary approach to the direct
localization/translation of ontology labels, by acquiring terminologies through the access and
harvesting of multilingual Web presences of structured information providers in the field of finance,
leading to both the detection of candidate terms in various multilingual sources in the financial
domain that can be used not only as labels of ontology classes and properties but also for the
possible generation of (multilingual) domain ontologies themselves.
In the next paper, Manuel Silva, António Lucas Soares and Rute Costa claim that despite the
availability of tools, resources and techniques aimed at the construction of ontological artifacts,
developing a shared conceptualization of a given reality still raises questions about the principles
and methods that support the initial phases of conceptualization. These questions become, according
to the authors, more complex when the conceptualization occurs in a multilingual setting. To tackle
these issues the authors present a collaborative platform – conceptME - where terminological and
knowledge representation processes support domain experts throughout a conceptualization
framework, allowing the inclusion of multilingual data as a way to promote knowledge sharing and
enhance conceptualization and support a multilingual ontology specification.
In another presentation Frieda Steurs and Hendrik J. Kockaert present us TermWise, a large project
dealing with legal terminology and phraseology for the Belgian public services, i.e. the translation
office of the ministry of justice, a project which aims at developing an advanced tool including
expert knowledge in the algorithms that extract specialized language from textual data (legal
documents) and whose outcome is a knowledge database including Dutch/French equivalents for
legal concepts, enriched with the phraseology related to the terms under discussion.
Finally, Deborah Grbac, Luca Losito, Andrea Sada and Paolo Sirito report on the preliminary
results of a pilot project currently ongoing at UCSC Central Library, where they propose to adapt to
subject librarians, employed in large and multilingual Academic Institutions, the model used by
translators working within European Union Institutions. The authors are using User Experience
(UX) Analysis in order to provide subject librarians with a visual support, by means of “ontology
tables” depicting conceptual linking and connections of words with concepts presented according to
their semantic and linguistic meaning.
The organizers hope that the selection of papers presented here will be of interest to a broad audience, and will be a starting point for further discussion and cooperation
Microtheories for SDI - Accounting for diversity of local conceptualisations at a global level
Dissertation submitted in partial fulfillment of the requirements for the Degree of Master of Science in Geospatial Technologies.The categorization and conceptualization of geographic features is fundamental to cartography,
geographic information retrieval, routing applications, spatial decision support
and data sharing in general. However, there is no standard conceptualization of
the world. Humans conceptualize features based on numerous factors including cultural
background, knowledge, motivation and particularly space and time. Thus, geographic
features are prone to multiple, context-dependent conceptualizations reflecting local
conditions. This creates semantic heterogeneity and undermines interoperability. Standardization
of a shared definition is often employed to overcome semantic heterogeneity.
However, this approach loses important local diversity in feature conceptualizations and
may result in feature definitions which are too broad or too specific. This work proposes
the use of microtheories in Spatial Data Infrastructures, such as INSPIRE, to account
for diversity of local conceptualizations while maintaining interoperability at a global
level. It introduces a novel method of structuring microtheories based on space and
time, represented by administrative boundaries, to reflect variations in feature conceptualization.
A bottom-up approach, based on non-standard inference, is used to create
an appropriate global-level feature definition from the local definitions. Conceptualizations
of rivers, forests and estuaries throughout Europe are used to demonstrate how
the approach can improve the INSPIRE data model and ease its adoption by European
member states
Linking and Validating Nordic and Baltic Wordnets - A Multilingual Action in META-NORD
This project report describes a multilingual wordnet initiative embarked in the META-NORD project and concerned with the validation and pilot linking between Nordic and Baltic wordnets. The builders of these wordnets have applied very different compilation strategies: The Danish, Icelandic and Swedish wordnets are being developed via monolingual dictionaries and corpora and subsequently linked to Princeton WordNet. In contrast, the Finnish and Norwegian wordnets are applying the expand method by translating from Princeton WordNet and the Danish wordnet, DanNet, respectively. The Estonian wordnet was built as part of the EuroWordNet project and by translating the base concepts from English as a first basis for monolingual extension. The aim of the multilingual action is to test the perspective of a multilingual linking of the Nordic and Baltic wordnets and via this (pilot) linking to perform a tentative comparison and validation of the wordnets along the measures of taxonomical structure, coverage, granularity and completeness.Peer reviewe
Adjective attribution
"This book is the first typological study of adjective attribution marking. Its focus lies on Northern Eurasia, although it covers many more languages and presents an ontology of morphosyntactic categories relevant to noun phrase structure in general. Beside treating synchronic data, the study contributes to historical linguistics by reconstructing the origin of new types specifically in the language contact area between the Indo-European and Uralic families.
Adjective attribution
This book is the first typological study of adjective attribution marking. Its focus lies on Northern Eurasia, although it covers many more languages and presents an ontology of morphosyntactic categories relevant to noun phrase structure in general. Beside treating synchronic data, the study contributes to historical linguistics by reconstructing the origin of new types specifically in the language contact area between the Indo-European and Uralic families
Adjective attribution
This book is the first typological study of adjective attribution marking. Its focus lies on Northern Eurasia, although it covers many more languages and presents an ontology of morphosyntactic categories relevant to noun phrase structure in general. Beside treating synchronic data, the study contributes to historical linguistics by reconstructing the origin of new types specifically in the language contact area between the Indo-European and Uralic families
Adjective attribution
This book is the first typological study of adjective attribution marking. Its focus lies on Northern Eurasia, although it covers many more languages and presents an ontology of morphosyntactic categories relevant to noun phrase structure in general. Beside treating synchronic data, the study contributes to historical linguistics by reconstructing the origin of new types specifically in the language contact area between the Indo-European and Uralic families
Adjective attribution
This book is the first typological study of adjective attribution marking. Its focus lies on Northern Eurasia, although it covers many more languages and presents an ontology of morphosyntactic categories relevant to noun phrase structure in general. Beside treating synchronic data, the study contributes to historical linguistics by reconstructing the origin of new types specifically in the language contact area between the Indo-European and Uralic families
- …