230,854 research outputs found

    The Relational Vector-space Model

    Get PDF
    This paper addresses the classification of linked entities. We introduce a relational vector (VS) model (in analogy to the VS model used in information retrieval) that abstracts the linked structure, representing entities by vectors of weights. Given labeled data as background knowledge training data, classification procedures can be defined for this model, including a straightforward, "direct" model using weighted adjacency vectors. Using a large set of tasks from the domain of company affiliation identification, we demonstrate that such classification procedures can be effective. We then examine the method in more detail, showing that as expected the classification performance correlates with the- relational auto correlation of the data set. We then turn the tables and use the relational VS scores as a way to analyze/visualize the relational autocorrelation present in a complex linked structure. The main contribution of the paper 1s to introduce the relational VS model as a potentially useful addition to the toolkit for relational data mining. It could provide useful constructed features for domains with low to moderate relational autocorrelation; it may be effective by itself for domains with high levels of relational autocorrelation, and it provides a useful abstraction for analyzing the properties of linked data.Information Systems Working Papers Serie

    "Natural relations" : a note on X'-structure

    Get PDF
    With the rise of minimalism, many concepts related to the geometrical relations of phrase structure held fast to in earlier approaches have been reconsidered. This article deals with distinguishing (relational and technical) properties of specifiers and adjuncts in a Bare Phrase Structure framework (X'-Theory). I extend specific aspects of X-structure relevant to the discussion of specifiers vs. adjuncts. I argue that unique specifiers can be derived from the system and that adjunction, possibly multiple, results from Direct Merge only. The final product is a series of relationships in line with recent thoughts and minimalist premises, but formally more similar to earlier conceptions of the X'-schema. I address conceptual, empirical and theoretical arguments against multiple specifiers and related issues next, that is beyond the predictions immediately following from the tripartitional view of clause structure proposed in Grohmann (2000). After laying out my motivations to critically consider the issue, I present a set of data that casts serious doubt over the justifications offered to replace Agr with v as the accusative casemarker. Having conceptual and empirical back-up, I then tackle the theoretical validity of specifiers, and ways to distinguish unique specifiers from (multiple) adjuncts. I introduce a version of Bare Phrase Structure that does so, yet keeps the spirit of defining structural identification over relational rather than categorial properties

    Interactive Process Identification and Selection from SAP ERP

    Full text link
    SAP ERP is one of the most popular information systems supporting various organizational processes, e.g., O2C and P2P. However, the amount of processes and data contained in SAP ERP is enormous. Thus, the identification of the processes that are contained in a specific SAP instance, and the creation of a list of related tables is a significant challenge. Eventually, one needs to extract an event log for process mining purposes from SAP ERP. This demo paper shows the tool Interactive SAP Explorer that tackles the process identification and selection problem by encoding the relational structure of SAP ERP in a labeled property graph. Our approach allows asking complex process-related queries along with advanced representations of the relational structure

    PropBase QueryLayer: a single portal to UK physical property databases

    Get PDF
    Until recently, the delivery of geological information for industry and public was achieved by geological mapping. Now pervasively available computers mean that 3D geological models can deliver realistic representations of the geometric location of geological units, represented as shells or volumes. The next phase of this process is to populate these with physical properties data that describe subsurface heterogeneity and its associated uncertainty. Achieving this requires capture and serving of physical, hydrological and other property information from diverse sources to populate these models. The British Geological Survey (BGS) holds large volumes of subsurface property data, derived both from their own research data collection and also other, often commercially derived data sources. This can be voxelated to incorporate this data into the models to demonstrate property variation within the subsurface geometry. All property data held by BGS has for many years been stored in relational databases to ensure their long-term continuity. However these have, by necessity, complex structures; each database contains positional reference data and model information, and also metadata such as sample identification information and attributes that define the source and processing. Whilst this is critical to assessing these analyses, it also hugely complicates the understanding of variability of the property under assessment and requires multiple queries to study related datasets making extracting physical properties from these databases difficult. Therefore the PropBase Query Layer has been created to allow simplified aggregation and extraction of all related data and its presentation of complex data in simple, mostly denormalized, tables which combine information from multiple databases into a single system. The structure from each relational database is denormalized in a generalised structure, so that each dataset can be viewed together in a common format using a simple interface. Data are re-engineered to facilitate easy loading. The query layer structure comprises tables, procedures, functions, triggers, views and materialised views. The structure contains a main table PRB_DATA which contains all of the data with the following attribution: ā€¢ a unique identifier ā€¢ the data source ā€¢ the unique identifier from the parent database for traceability ā€¢ the 3D location ā€¢ the property type ā€¢ the property value ā€¢ the units ā€¢ necessary qualifiers ā€¢ precision information and an audit trail Data sources, property type and units are constrained by dictionaries, a key component of the structure which defines what properties and inheritance hierarchies are to be coded and also guides the process as to what and how these are extracted from the structure. Data types served by the Query Layer include site investigation derived geotechnical data, hydrogeology datasets, regional geochemistry, geophysical logs as well as lithological and borehole metadata. The size and complexity of the data sets with multiple parent structures requires a technically robust approach to keep the layer synchronised. This is achieved through Oracle procedures written in PL/SQL containing the logic required to carry out the data manipulation (inserts, updates, deletes) to keep the layer synchronised with the underlying databases either as regular scheduled jobs (weekly, monthly etc.) or invoked on demand. The PropBase Query Layerā€™s implementation has enabled rapid data discovery, visualisation and interpretation of geological data with greater ease, simplifying the parameterisation of 3D model volumes and facilitating the study of intra-unit heterogeneity

    Structure motivator: a tool for exploring small three-dimensional elements in proteins

    Get PDF
    <br>Background: Protein structures incorporate characteristic three-dimensional elements defined by some or all of hydrogen bonding, dihedral angles and amino acid sequence. The software application, Structure Motivator, allows interactive exploration and analysis of such elements, and their resolution into sub-classes.</br> <br>Results: Structure Motivator is a standalone application with an embedded relational database of proteins that, as a starting point, can furnish the user with a palette of unclassified small peptides or a choice of pre-classified structural motifs. Alternatively the application accepts files of data generated externally. After loading, the structural elements are displayed as two-dimensional plots of dihedral angles (Ļ†/Ļˆ, Ļ†/Ļ‡1 or in combination) for each residue, with visualization options to allow the conformation or amino acid composition at one residue to be viewed in the context of that at other residues. Interactive selections may then be made and structural subsets saved to file for further sub-classification or external analysis. The application has been applied both to classical motifs, such as the Ī²-turn, and ā€˜non-motifā€™ structural elements, such as specific segments of helices.</br> <br>Conclusions: Structure Motivator allows structural biologists, whether or not they possess computational skills, to subject small structural elements in proteins to rapid interactive analysis that would otherwise require complex programming or database queries. Within a broad group of structural motifs, it facilitates the identification and separation of sub-classes with distinct stereochemical properties.</br&gt

    Medical ontology for treatment of clinical data from children and youth

    Get PDF
    The use of information technologies in the field of biomedical data management has grown considerably and is today one of the main fields of use of these technologies. There are several advantages arising either to an individualā€™s health or to public health, particularly because access to clinical data become available anywhere access via the Internet or individual health card. This card will contain personal data accessible from a terminal card reader, identical to the citizen card. This work focuses on the development of an ontology of universal data structure so that the information is accessible and organized in the same way, regardless of the system that use them. In this context there is the need to incorporate security mechanisms, the respect of ethical principles underlying the management and maintenance of clinical data, ensuring maximum confidentiality. To develop the proposed ontology, for the treatment of clinical data of children and youth is used as reference bulletin health in Portugal. Using this structure, it follows the clear and unambiguous identification of the fields required for registration of clinical information, standardized in a relational model. To ensure the confidentiality of data, identification of the individual is only the number of national health system and are not recorded on the card personal data such as name, address or contact forms

    The Role Of Industry Structure On Customer Value In Robotic Surgery

    Get PDF
    Spending on robot surgery is expected to increase by $17 billion in the next 6 years. This new surgical treatment has challenged hospitals with higher costs and varying performance. Healthcare executives struggle balancing the adoption of medical innovations with managing healthcare costs. This dilemma can be further complicated by industry structures relative to capital-intensive medical innovations. This research explores the interaction between industry structure and customer value. Specifically, how can hospitals apply an understanding of supplier industry structure and customer value to improve the value of a robotic surgery program (RSP)? This industry study represents an exhaustive longitudinal review of over 15 years of public data relative to robotic surgery, across three distinct time periods. Within the research, industry structure is evaluated using Porterā€™s 5-forces model. A framework based upon contributions from Grƶnroos as well as Menon, Homburg, and Beutin is introduced to assess customer value based upon clinical, financial and strategic (CFS) value. The implications of periodic industry structure on customer value were examined to identify opportunities for hospital executives to increase RSP customer value. There were several empirical and theoretical findings from this research. First, in the face of increasing industry structure the identification of favorable forces may create opportunities to increase RSP value. Secondarily, exploring customer value through the lens of core, add-on, relational and transactional benefits in the sub-context of CFS value aids in the identification of market power influences on customer value. The implications of the absence of high levels of relational and transactional benefits without high levels of core and add-on benefits may influence avenues of pursuit in improving RSP value overall. The research also suggests that clinical and strategic value was present despite varying degrees of industry structure. Finally, this study represents an empirical joint analysis of industry structure and customer value in robotic surgery. Some proponents may find the introduction of an integrative model for measuring customer value in robotic surgery, applicable to other capital-intensive medical innovations or disruptive technologies at large

    A Product Line Systems Engineering Process for Variability Identification and Reduction

    Full text link
    Software Product Line Engineering has attracted attention in the last two decades due to its promising capabilities to reduce costs and time to market through reuse of requirements and components. In practice, developing system level product lines in a large-scale company is not an easy task as there may be thousands of variants and multiple disciplines involved. The manual reuse of legacy system models at domain engineering to build reusable system libraries and configurations of variants to derive target products can be infeasible. To tackle this challenge, a Product Line Systems Engineering process is proposed. Specifically, the process extends research in the System Orthogonal Variability Model to support hierarchical variability modeling with formal definitions; utilizes Systems Engineering concepts and legacy system models to build the hierarchy for the variability model and to identify essential relations between variants; and finally, analyzes the identified relations to reduce the number of variation points. The process, which is automated by computational algorithms, is demonstrated through an illustrative example on generalized Rolls-Royce aircraft engine control systems. To evaluate the effectiveness of the process in the reduction of variation points, it is further applied to case studies in different engineering domains at different levels of complexity. Subject to system model availability, reduction of 14% to 40% in the number of variation points are demonstrated in the case studies.Comment: 12 pages, 6 figures, 2 tables; submitted to the IEEE Systems Journal on 3rd June 201
    • ā€¦
    corecore