772 research outputs found

    From Paper to Digital Trail: Collections on the Semantic Web

    Historical research on World War II and the impact of large-scale violence largely depends on the availability of source materials: diaries, newspapers, eyewitness accounts, archival documents, photographs and videos, etc. Currently, these resources are held by a large number of memory institutions, often in analogue formats. For scholars, it can be challenging to find out which collections are relevant for their research and also what information can be found in these collections. In this article it is argued that Semantic Web technologies, together with new digital tooling to automatically open up collections and interlink their contents, have the potential to revolutionize future access and use. By making the contents of collections machine-readable and enriching them with links to reference data, a shift can be made from a "web of documents" to a "web of data." By publishing all contents as linked open data, domain experts in research infrastructures (RIs) and thematic aggregators (TAs) are enabled to add their own "thematic" layers to the data, thus empowering themselves and others to explore the data in new, more sophisticated ways. Since we are only at the start of this development, the author advocates a close cooperation between archives, libraries, and museums (ALMs) and domain experts

    MEDQUAL: Improving Medical Web Search over Time with Dynamic Credibility Heuristics

    Performing a search on the World Wide Web (WWW) and traversing the resulting links is an adventure in which one encounters both credible and incredible web pages. Search engines, such as Google, rely on macroscopic Web topology patterns, and even highly ranked 'authoritative' web sites may be a mixture of informed and uninformed opinions. Without credibility heuristics to guide the user in a maze of facts, assertions, and inferences, the Web remains an ineffective knowledge delivery platform. This report presents the design and implementation of MEDQUAL, a modular extension to the popular Google search engine that applies both URL-based and content-based heuristic credibility rules to reorder raw Google rankings in the medical domain. MEDQUAL, a software system written in Java, starts with a bootstrap configuration file which loads basic heuristics in XML format. It then provides a subscription mechanism so users can join birds-of-a-feather specialty groups, for example Pediatrics, in order to load specialized heuristics as well. The platform features a coordination mechanism whereby information seekers can effectively become secondary authors, contributing additional credibility heuristics by consensus vote. MEDQUAL uses standard XML namespace conventions to divide opinion groups so that competing groups can be supported simultaneously. The net effect is a merger of basic and supplied heuristics so that the system continues to adapt and improve itself over time to changing web content, changing opinions, and new opinion groups. The key goal of leveraging the intelligence of a large-scale and diffuse WWW user community is met, and we conclude by discussing our plans to develop MEDQUAL further and evaluate it.
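The reordering idea described in this abstract can be illustrated with a minimal sketch. It is not MEDQUAL's implementation (which the abstract says is written in Java and loads its heuristics from XML); the `Result` type, the rule list, and every weight below are hypothetical and chosen only to show how URL-based and content-based rules could jointly reorder a raw ranking.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class Result:
    url: str
    snippet: str
    rank: int  # original position from the search engine, 1 = top


# Hypothetical rules: (kind, pattern, score adjustment). Not taken from MEDQUAL.
RULES = [
    ("url", ".gov", 2.0),
    ("url", ".edu", 1.5),
    ("content", "peer-reviewed", 1.0),
    ("content", "miracle cure", -3.0),
]


def credibility(result, rules):
    """Sum the adjustments of every rule that matches the result."""
    host = urlparse(result.url).hostname or ""
    text = result.snippet.lower()
    score = 0.0
    for kind, pattern, weight in rules:
        if kind == "url" and host.endswith(pattern):
            score += weight
        elif kind == "content" and pattern in text:
            score += weight
    return score


def rerank(results, rules):
    """Order by credibility (highest first); the original rank breaks ties."""
    return sorted(results, key=lambda r: (-credibility(r, rules), r.rank))
```

Keeping the rules as plain data rather than code mirrors the abstract's point that heuristics are loaded, merged, and extended over time; a group-specific rule set could simply be concatenated with the basic one before calling `rerank`.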

    Implications and effectiveness of information management while restructuring an organisation

    This research paper undertakes to outline factors of Information Management that organisations should consider when attempting a restructuring process. Conversely, valuable information and knowledge are often mislaid, overlooked or discarded to the eventual detriment of an organisation during the processes of downsizing, rightsizing, restructuring, reorganisation, reengineering, transformation or change. In the global economy, organisations are always striving to keep ahead of the competition and ultimately to improve their net profit. Information is at the very core of any organisation; it adds value, structure and power to an organisation. Information Management assists with the locating, storing and use of corporate information. One means of controlling and structuring corporate information is the use of the Information Management elements. A case study was performed on an organisation that is in the process of change and transformation. Potential loss of knowledge and information was identified and examined. The organisation’s implementation of an information portal was highlighted as an effective way to minimise the loss of knowledge and information during the period of change. Professor A.S.A. du Toi

    Professional Search in Pharmaceutical Research

    In the mid-90s, visiting libraries – as a means of retrieving the latest literature – was still a common necessity among professionals. Nowadays, professionals simply access information by ‘googling’. Indeed, the name of the Web search engine market leader “Google” became a synonym for searching and retrieving information. Despite the increased popularity of search as a method for retrieving relevant information, search engines still do not deliver satisfying results to professionals at the workplace. Search engines, for instance, ignore that the relevance of answers (the satisfaction of a searcher’s needs) depends not only on the query (the information request) and the document corpus, but also on the working context (the user’s personal needs, education, etc.). In effect, an answer which is appropriate for one user might not be appropriate for another, even though the query and the document corpus are the same for both. Personalization services addressing the context are therefore becoming more and more popular and are an active field of research. This is only one of several challenges encountered in ‘professional search’: How can the working context of the searcher be incorporated in the ranking process; how can unstructured free-text documents be enriched with semantic information so that the information need can be expressed precisely at query time; how and to what extent can a company’s knowledge be exploited for search purposes; how should data from distributed sources be made accessible through a single entry point? This thesis is devoted to ‘professional search’, i.e. search at the workplace, especially in industrial research and development. We contribute by compiling and developing several approaches for facing the challenges mentioned above. 
The approaches are implemented in the prototype YASA (Your Adaptive Search Agent), which provides meta-search, adaptive ranking of search results, and guided navigation, and which uses domain knowledge to drive the search processes. YASA is deployed in the pharmaceutical research department of Roche in Penzberg – a major pharmaceutical company – in which the applied methods were empirically evaluated. Being confronted with mostly unstructured free-text documents and having barely any explicit metadata at hand, we faced a serious challenge. Incorporating semantics (i.e. formal knowledge representation) into the search process can only be as good as the underlying data. Nonetheless, we are able to demonstrate that this issue can be largely compensated for by incorporating automatic metadata extraction techniques. The metadata we were able to extract automatically was not perfectly accurate, nor did the ontology we applied contain considerably “rich semantics”. Nonetheless, our results show that even the little semantics incorporated into the search process suffices to achieve a significant improvement in search and retrieval. We thus contribute to the research field of context-based search by incorporating the working context into the search process – an area which so far has not been well studied.

    Forgotten as data – remembered through information. Social memory institutions in the digital age: the case of the Europeana Initiative

    The study of social memory has emerged as a rich field of research closely linked to cultural artefacts, communication media and institutions as carriers of a past that transcends the horizon of the individual’s lifetime. Within this domain of research, the dissertation focuses on memory institutions (libraries, archives, museums) and the shifts they are undergoing as the outcome of digitization and the diffusion of online media. Very little is currently known about the impact that digitality and computation may have on social memory institutions, specifically, and social memory, more generally – an area of study that would benefit from but, so far, has been mostly overlooked by information systems research. The dissertation finds its point of departure in the conceptualization of information as an event that occurs through the interaction between an observer and the observed – an event that cannot be stored as information but merely as data. In this context, memory is conceived as an operation that filters, thus forgets, the singular details of an information event by making it comparable to other events according to abstract classification criteria. Against this backdrop, memory institutions are institutions of forgetting as they select, order and preserve a canon of cultural heritage artefacts. Supported by evidence from a case study on the Europeana initiative (a digitization project of European libraries, archives and museums), the dissertation reveals a fundamental shift in the field of memory institutions. The case study demonstrates the disintegration of 1) the cultural heritage artefact, 2) its standard modes of description and 3) the catalogue as such into a steadily accruing assemblage of data and metadata. Dismembered into bits and bytes, cultural heritage needs to be re-membered through the emulation of recognizable cultural heritage artefacts and momentary renditions of order. 
In other words, memory institutions forget as binary-based data and remember through computational information

    Semantic discovery and reuse of business process patterns

    Patterns currently play an important role in modern information systems (IS) development and their use has mainly been restricted to the design and implementation phases of the development lifecycle. Given the increasing significance of business modelling in IS development, patterns have the potential of providing a viable solution for promoting reusability of recurrent generalized models in the very early stages of development. As a statement of research-in-progress this paper focuses on business process patterns and proposes an initial methodological framework for the discovery and reuse of business process patterns within the IS development lifecycle. The framework borrows ideas from the domain engineering literature and proposes the use of semantics to drive both the discovery of patterns as well as their reuse

    Process-Oriented Information Logistics: Aligning Process Information with Business Processes

    During the last decade, research in the field of business process management (BPM) has focused on the design, modeling, execution, monitoring, and optimization of business processes. What has been neglected, however, is the provision of knowledge workers and decision makers with needed information when performing knowledge-intensive business processes such as product engineering, customer support, or strategic management. Today, knowledge workers and decision makers are confronted with a massive load of data, making it difficult for them to discover the information relevant for performing their tasks. Particularly challenging in this context is the alignment of process-related information (process information for short), such as e-mails, office files, forms, checklists, guidelines, and best practices, with business processes and their tasks. In practice, process information is not only stored in large, distributed and heterogeneous sources, but usually managed separately from business processes. For example, shared drives, databases, enterprise portals, and enterprise information systems are used to store process information. In turn, business processes are managed using advanced process management technology. As a consequence, process information and business processes often need to be manually linked; i.e., process information is hard-wired to business processes, e.g., in enterprise portals associating specific process information with process tasks. This approach often fails due to high maintenance efforts and missing support for the individual demands of knowledge workers and decision makers. In response to this problem, this thesis introduces process-oriented information logistics (POIL) as a new paradigm for delivering the right process information, in the right format and quality, at the right place and the right point in time, to the right people. 
In particular, POIL allows for the process-oriented, context-aware (i.e., personalized) delivery of process information to process participants. The goal is to no longer manually hard-wire process information to business processes, but to automatically identify and deliver relevant process information to knowledge workers and decision makers. The core component of POIL is a semantic information network (SIN), which comprises homogeneous information objects (e.g., e-mails, office files, guidelines), process objects (e.g., tasks, events, roles), and relationships between them. In particular, a SIN allows discovering objects linked with each other in different ways, e.g., objects addressing the same topic or needed when performing a particular process task. The SIN not only enables an integrated formal representation of process information and business processes, but also allows determining the relevance of process information for a given work context based on novel techniques and algorithms. Note that this becomes crucial in order to achieve the aforementioned overall goal of this thesis.
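As a rough illustration of the SIN structure described in this abstract, the sketch below models information objects, process objects, and labelled relationships as a small undirected graph. It is an assumption-laden toy, not the thesis's design: the class name, the two object kinds, the relation labels, and the lookup method are all hypothetical, and the relevance techniques the abstract mentions are not reproduced here.

```python
from collections import defaultdict


class SemanticInformationNetwork:
    """Toy graph of information objects and process objects (hypothetical API)."""

    def __init__(self):
        self.kind = {}                 # object name -> "information" or "process"
        self.edges = defaultdict(set)  # object name -> {(relation, neighbour)}

    def add_object(self, name, kind):
        self.kind[name] = kind

    def relate(self, a, b, relation):
        # Relationships are stored in both directions so either end can be queried.
        self.edges[a].add((relation, b))
        self.edges[b].add((relation, a))

    def information_for(self, task):
        """Return the information objects directly linked to a process task."""
        return sorted(
            name for _, name in self.edges[task]
            if self.kind.get(name) == "information"
        )


sin = SemanticInformationNetwork()
sin.add_object("review design", "process")
sin.add_object("design guideline", "information")
sin.add_object("review checklist", "information")
sin.relate("design guideline", "review design", "needed_for")
sin.relate("review checklist", "review design", "needed_for")
sin.relate("design guideline", "review checklist", "same_topic")
print(sin.information_for("review design"))
```

Storing both object kinds in one graph reflects the abstract's point that process information and business processes share a single integrated representation, so a task can be used directly as the starting point for finding linked information.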

    Semantic Web methods for knowledge management [online]
