772 research outputs found
From Paper to Digital Trail: Collections on the Semantic Web
Historical research on World War II and the impact of large-scale violence largely depends on the availability of source materials: diaries, newspapers, eyewitness accounts, archival documents, photographs and videos, etc. Currently, these resources are held by a large number of memory institutions, often in analogue formats. For scholars, it can be challenging to find out which collections are relevant for their research and also what information can be found in these collections. In this article it is argued that Semantic Web technologies, together with new digital tooling to automatically open up collections and interlink their contents, have the potential to revolutionize future access and use. By making the contents of collections machine-readable and enriching them with links to reference data, a shift can be made from a "web of documents" to a "web of data." By publishing all contents as linked open data, domain experts in research infrastructures (RIs) and thematic aggregators (TAs) are enabled to add their own "thematic" layers to the data, thus empowering themselves and others to explore the data in new, more sophisticated ways. Since we are only at the start of this development, the author advocates a close cooperation between archives, libraries, and museums (ALMs) and domain experts
MEDQUAL: Improving Medical Web Search over Time with Dynamic Credibility Heuristics
Performing a search on the World Wide Web (WWW) and traversing the
resulting links is an adventure in which one encounters both credible
and incredible web pages. Search engines, such as Google, rely on
macroscopic Web topology patterns and even highly ranked 'authoritative'
web sites may be a mixture of informed and uninformed opinions. Without
credibility heuristics to guide the user in a maze of facts, assertions,
and inferences, the Web remains an ineffective knowledge delivery
platform. This report presents the design and implementation of a
modular extension to the popular Google search engine, MEDQUAL, which
provisions both URL and content-based heuristic credibility rules to
reorder raw Google rankings in the medical domain. MEDQUAL, a software
system written in Java, starts with a bootstrap configuration file which
loads in basic heuristics in XML format. It then provides a subscription
mechanism so users can join birds-of-a-feather specialty groups, for
example Pediatrics, in order to load specialized heuristics as well. The
platform features a coordination mechanism whereby information seekers
can effectively become secondary authors, contributing by consensus vote
additional credibility heuristics. MEDQUAL uses standard XML namespace
conventions to divide opinion groups so that competing groups can be
supported simultaneously. The net effect is a merger of basic and
supplied heuristics so that the system continues to adapt and improve
itself over time to changing web content, changing opinions, and new
opinion groups. The key goal of leveraging the intelligence of a
large-scale and diffuse WWW user community is met and we conclude by
discussing our plans to develop MEDQUAL further and evaluate it
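
The reordering idea described in this abstract can be illustrated with a minimal sketch. It is not MEDQUAL's actual implementation (which is written in Java and loads its heuristics from XML); the class names, weights, group labels, and example URLs below are all hypothetical, and a plain additive score is assumed purely for illustration.

```python
from dataclasses import dataclass
from typing import Callable
from urllib.parse import urlparse


@dataclass(frozen=True)
class Result:
    url: str
    snippet: str


@dataclass(frozen=True)
class Rule:
    group: str      # hypothetical opinion-group label (stands in for an XML namespace)
    weight: float   # positive raises credibility, negative lowers it (assumed scheme)
    matches: Callable[[Result], bool]


def url_suffix_rule(group: str, suffix: str, weight: float) -> Rule:
    """URL-based heuristic: fires when the host name ends with `suffix`."""
    def matches(result: Result) -> bool:
        host = urlparse(result.url).hostname or ""
        return host.endswith(suffix)
    return Rule(group, weight, matches)


def content_rule(group: str, phrase: str, weight: float) -> Rule:
    """Content-based heuristic: fires when `phrase` occurs in the snippet."""
    return Rule(group, weight, lambda r: phrase.lower() in r.snippet.lower())


def rerank(results, rules, subscribed):
    """Reorder raw engine results by the summed weight of matching rules.

    Only rules from subscribed groups are applied. sorted() is stable,
    so results with equal scores keep their original engine order.
    """
    active = [rule for rule in rules if rule.group in subscribed]

    def score(result: Result) -> float:
        return sum(rule.weight for rule in active if rule.matches(result))

    return sorted(results, key=score, reverse=True)


# Illustrative data only; none of these URLs or weights come from the report.
results = [
    Result("http://miracle-cures.example.com/flu", "Buy now! Miracle cure for flu"),
    Result("http://health.example.gov/flu", "Flu symptoms and vaccination guidance"),
    Result("http://kids.example.org/flu", "Peer-reviewed pediatric flu dosing guidance"),
]
rules = [
    url_suffix_rule("base", ".gov", 2.0),
    content_rule("base", "miracle cure", -3.0),
    content_rule("pediatrics", "pediatric", 3.0),
]

base_order = [r.url for r in rerank(results, rules, {"base"})]
peds_order = [r.url for r in rerank(results, rules, {"base", "pediatrics"})]
```

Subscribing to the hypothetical "pediatrics" group changes the ordering without touching the base rules, which mirrors the merger of basic and group-supplied heuristics that the abstract describes.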
Implications and effectiveness of information management while restructuring an organisation
This research paper undertakes to outline factors of Information Management that organisations should consider when attempting a restructuring process. Conversely, valuable information and knowledge are often mislaid, overlooked or discarded to the eventual detriment of an organisation during the processes of downsizing, rightsizing, restructuring, reorganisation, reengineering, transformation or change. In the global economy, organisations are always striving to keep ahead of competition and ultimately to improve their net profit. Information is at the very core of any organisation; it adds value, structure and power to an organisation. Information Management assists with the locating, storing and use of corporate information. A means of controlling and structuring corporate information is via the use of the Information Management elements. A case study using an organisation that is in the process of change and transformation was performed. Potential loss of knowledge and information was identified and examined. The organisation’s implementation of an information portal was highlighted as an effective way to minimise the loss of knowledge and information during the period of change. Professor A.S.A. du Toi
Professional Search in Pharmaceutical Research
In the mid-90s, visiting libraries – as a means of retrieving the latest literature – was still a common necessity among professionals. Nowadays, professionals simply access information by ‘googling’. Indeed, the name of the Web search engine market leader “Google” became a synonym for searching and retrieving information. Despite the increased popularity of search as a method for retrieving relevant information, at the workplace search engines still do not deliver satisfying results to professionals.
Search engines, for instance, ignore that the relevance of answers (the satisfaction of a searcher’s needs) depends not only on the query (the information request) and the document corpus, but also on the working context (the user’s personal needs, education, etc.). In effect, an answer which might be appropriate to one user might not be appropriate to another user, even though the query and the document corpus are the same for both. Personalization services addressing the context are therefore becoming more and more popular and are an active field of research.
This is only one of several challenges encountered in ‘professional search’: How can the working context of the searcher be incorporated in the ranking process? How can unstructured free-text documents be enriched with semantic information so that the information need can be expressed precisely at query time? How and to what extent can a company’s knowledge be exploited for search purposes? How should data from distributed sources be made accessible through a single entry point?
This thesis is devoted to ‘professional search’, i.e. search at the workplace, especially in industrial research and development. We contribute by compiling and developing several approaches for facing the challenges mentioned above. The approaches are implemented into the prototype YASA (Your Adaptive Search Agent) which provides meta-search, adaptive ranking of search results, guided navigation, and which uses domain knowledge to drive the search processes. YASA is deployed in the pharmaceutical research department of Roche in Penzberg – a major pharmaceutical company – in which the applied methods were empirically evaluated.
Being confronted with mostly unstructured free-text documents and having barely any explicit metadata at hand, we faced a serious challenge. Incorporating semantics (i.e. formal knowledge representation) into the search process can only be as good as the underlying data. Nonetheless, we are able to demonstrate that this issue can be largely compensated for by incorporating automatic metadata extraction techniques. The metadata we were able to extract automatically was not perfectly accurate, nor did the ontology we applied contain considerably “rich semantics”. Even so, our results show that the little semantics incorporated into the search process already suffices to achieve a significant improvement in search and retrieval.
We thus contribute to the research field of context-based search by incorporating the working context into the search process – an area which has not yet been well studied
Knowledge search for new product development: a multi-agent based methodology
Manufacturers are the leaders in developing new products to drive productivity. Higher productivity means more products based on the same materials, energy, labour, and capital. New product development plays a critical role in the success of manufacturing firms. Activities in the product development process are dependent on the knowledge of new product development team members. Increasingly, many enterprises consider effective knowledge search to be a source of competitive advantage.
This research presents an exploratory case study conducted at an aircraft manufacturer. This investigation uncovered six empirically derived and theoretically informed problems in enterprise knowledge search. They have been articulated as (i) the effectual web bandwidth limits search speed; (ii) less relevant search results based on word-frequency recognition models of search engines; (iii) unusable techniques for enterprise search; (iv) rigorous security, reliability, and company policy; (v) poor search performance on unstructured enterprise knowledge; (vi) the lack of tacit knowledge sharing. Existing search methodologies have focused on internet search, rather than providing effective search for the enterprise.
The aim of this research is to assist manufacturing enterprises in meeting industrial requirements by providing a methodology and system that can improve information and knowledge search performance in the new product development process. Based on the exploratory case findings, a knowledge search methodology and system has been developed. Agent technology is used to fulfil the requirements of enterprise search. Some initial tests were conducted to better understand implementation issues and future deployment of the methodology and system in practice
Forgotten as data – remembered through information. Social memory institutions in the digital age: the case of the Europeana Initiative
The study of social memory has emerged as a rich field of research closely linked
to cultural artefacts, communication media and institutions as carriers of a past
that transcends the horizon of the individual’s lifetime. Within this domain of
research, the dissertation focuses on memory institutions (libraries, archives,
museums) and the shifts they are undergoing as the outcome of digitization and
the diffusion of online media. Very little is currently known about the impact that
digitality and computation may have on social memory institutions, specifically,
and social memory, more generally – an area of study that would benefit from
but, so far, has been mostly overlooked by information systems research.
The dissertation finds its point of departure in the conceptualization of
information as an event that occurs through the interaction between an observer
and the observed – an event that cannot be stored as information but merely as
data. In this context, memory is conceived as an operation that filters, thus
forgets, the singular details of an information event by making it comparable to
other events according to abstract classification criteria. Against this backdrop,
memory institutions are institutions of forgetting as they select, order and
preserve a canon of cultural heritage artefacts.
Supported by evidence from a case study on the Europeana initiative (a
digitization project of European libraries, archives and museums), the
dissertation reveals a fundamental shift in the field of memory institutions. The
case study demonstrates the disintegration of 1) the cultural heritage artefact, 2)
its standard modes of description and 3) the catalogue as such into a steadily
accruing assemblage of data and metadata. Dismembered into bits and bytes,
cultural heritage needs to be re-membered through the emulation of recognizable
cultural heritage artefacts and momentary renditions of order. In other words,
memory institutions forget as binary-based data and remember through
computational information
Semantic discovery and reuse of business process patterns
Patterns currently play an important role in modern information systems (IS) development, but their use has mainly been restricted to the design and implementation phases of the development lifecycle. Given the increasing significance of business modelling in IS development, patterns have the potential to provide a viable solution for promoting reusability of recurrent generalized models in the very early stages of development. As a statement of research-in-progress, this paper focuses on business process patterns and proposes an initial methodological framework for the discovery and reuse of business process patterns within the IS development lifecycle. The framework borrows ideas from the domain engineering literature and proposes the use of semantics to drive both the discovery of patterns and their reuse
Process-Oriented Information Logistics: Aligning Process Information with Business Processes
During the last decade, research in the field of business process management (BPM) has focused on the design, modeling, execution, monitoring, and optimization of business processes. What has been neglected, however, is the provision of knowledge workers and decision makers with needed information when performing knowledge-intensive business processes such as product engineering, customer support, or strategic management. Today, knowledge workers and decision makers are confronted with a massive load of data, making it difficult for them to discover the information relevant for performing their tasks. Particularly challenging in this context is the alignment of process-related information (process information for short), such as e-mails, office files, forms, checklists, guidelines, and best practices, with business processes and their tasks. In practice, process information is not only stored in large, distributed and heterogeneous sources, but usually managed separately from business processes. For example, shared drives, databases, enterprise portals, and enterprise information systems are used to store process information. In turn, business processes are managed using advanced process management technology. As a consequence, process information and business processes often need to be manually linked; i.e., process information is hard-wired to business processes, e.g., in enterprise portals associating specific process information with process tasks. This approach often fails due to high maintenance efforts and missing support for the individual demands of knowledge workers and decision makers. In response to this problem, this thesis introduces process-oriented information logistics (POIL) as a new paradigm for delivering the right process information, in the right format and quality, at the right place and the right point in time, to the right people. In particular, POIL allows for the process-oriented, context-aware (i.e., personalized) delivery of process information to process participants. The goal is to no longer manually hard-wire process information to business processes, but to automatically identify and deliver relevant process information to knowledge workers and decision makers. The core component of POIL is a semantic information network (SIN), which comprises homogeneous information objects (e.g., e-mails, office files, guidelines), process objects (e.g., tasks, events, roles), and relationships between them. In particular, a SIN allows discovering objects linked with each other in different ways, e.g., objects addressing the same topic or needed when performing a particular process task. The SIN not only enables an integrated formal representation of process information and business processes, but also allows determining the relevance of process information for a given work context based on novel techniques and algorithms. Note that this becomes crucial in order to achieve the aforementioned overall goal of this thesis