295,664 research outputs found

    Knowledge Discovery for Decision Support in Law

    Get PDF
    The Split Up project applies knowledge discovery techniques (KDD) to legal domains. Theories of jurisprudence underpin a classification scheme that is used to identify tasks suited to KDD. Theoretical perspectives also guide the selection of cases appropriate for a KDD exercise. Further, jurisprudence underpins strategies for dealing with contradictory data. Argumentation theory is instrumental for representing domain expertise so that the KDD process can be constrained. Specifically, a variant of the argumentation structure proposed by Toulmin is used to decompose tasks into independent sub-tasks in the data transformation phase. This enables a complex KDD exercise to be decomposed into numerous simpler exercises that each require less data and have fewer instances of missing values. The use of the structure also facilitates the development of information systems that integrate multiple reasoning mechanisms such as first order logic, neural networks or fuzzy inferences and provides a convenient structure for the generation of explanations. The viability of this approach was tested with the development of a system that predicts property split outcomes in cases litigated in the Family Court of Australia. The system has been evaluated using a mix of strategies that derive from a framework proposed by Reich

    Fast Cell Discovery in mm-wave 5G Networks with Context Information

    Full text link
    The exploitation of mm-wave bands is one of the key-enabler for 5G mobile radio networks. However, the introduction of mm-wave technologies in cellular networks is not straightforward due to harsh propagation conditions that limit the mm-wave access availability. Mm-wave technologies require high-gain antenna systems to compensate for high path loss and limited power. As a consequence, directional transmissions must be used for cell discovery and synchronization processes: this can lead to a non-negligible access delay caused by the exploration of the cell area with multiple transmissions along different directions. The integration of mm-wave technologies and conventional wireless access networks with the objective of speeding up the cell search process requires new 5G network architectural solutions. Such architectures introduce a functional split between C-plane and U-plane, thereby guaranteeing the availability of a reliable signaling channel through conventional wireless technologies that provides the opportunity to collect useful context information from the network edge. In this article, we leverage the context information related to user positions to improve the directional cell discovery process. We investigate fundamental trade-offs of this process and the effects of the context information accuracy on the overall system performance. We also cope with obstacle obstructions in the cell area and propose an approach based on a geo-located context database where information gathered over time is stored to guide future searches. Analytic models and numerical results are provided to validate proposed strategies.Comment: 14 pages, submitted to IEEE Transaction on Mobile Computin

    Context Information for Fast Cell Discovery in mm-wave 5G Networks

    Full text link
    The exploitation of the mm-wave bands is one of the most promising solutions for 5G mobile radio networks. However, the use of mm-wave technologies in cellular networks is not straightforward due to mm-wave harsh propagation conditions that limit access availability. In order to overcome this obstacle, hybrid network architectures are being considered where mm-wave small cells can exploit an overlay coverage layer based on legacy technology. The additional mm-wave layer can also take advantage of a functional split between control and user plane, that allows to delegate most of the signaling functions to legacy base stations and to gather context information from users for resource optimization. However, mm-wave technology requires high gain antenna systems to compensate for high path loss and limited power, e.g., through the use of multiple antennas for high directivity. Directional transmissions must be also used for the cell discovery and synchronization process, and this can lead to a non-negligible delay due to the need to scan the cell area with multiple transmissions at different directions. In this paper, we propose to exploit the context information related to user position, provided by the separated control plane, to improve the cell discovery procedure and minimize delay. We investigate the fundamental trade-offs of the cell discovery process with directional antennas and the effects of the context information accuracy on its performance. Numerical results are provided to validate our observations.Comment: 6 pages, 8 figures, in Proceedings of European Wireless 201

    Analyze Large Multidimensional Datasets Using Algebraic Topology

    Get PDF
    This paper presents an efficient algorithm to extract knowledge from high-dimensionality, high- complexity datasets using algebraic topology, namely simplicial complexes. Based on concept of isomorphism of relations, our method turn a relational table into a geometric object (a simplicial complex is a polyhedron). So, conceptually association rule searching is turned into a geometric traversal problem. By leveraging on the core concepts behind Simplicial Complex, we use a new technique (in computer science) that improves the performance over existing methods and uses far less memory. It was designed and developed with a strong emphasis on scalability, reliability, and extensibility. This paper also investigate the possibility of Hadoop integration and the challenges that come with the framework

    Visual Data Mining

    Get PDF
    Occlusion is one of the major problems for interactive visual knowledge discovery and data mining in the process of finding patterns in multidimensional data.This project proposes a hybrid method that combines visual and analytical means to deal with occlusion in visual knowledge discovery called as GLC-S which uses visualization of n-D data in 2D in a set of Shifted Paired Coordinates (SPC). A set of Shifted Paired Coordinates for n-D data consists of n/2 pairs of common Cartesian coordinates that are shifted relative to each other to avoid their overlap. Each n-D point A is represented as a directed graph A* in SPC, where each node is the 2D projection of A in a respective pair of the Cartesian coordinates. The proposed GLC-S method significantly decrease cognitive load for analysis of n-D data and simplify pattern discovery in n-D data. The GLC-S method iteratively splits n-D data into non-overlapping clusters (hyper-rectangles) around local centers and visualizes only data within these clusters at each iteration. The requirements for these clusters are to contain cases of only one class and be the largest cluster with this property in SPC visualization. Such sequential splitting allows: (1) avoiding occlusion, (2) finding visually local classification patterns, rules, and (3) combine local sub-rules to a global rule that classifies all given data of two or more classes. The computational experiment with Wisconsin Breast Cancer data(9-D), User Knowledge Modeling data(6-D), and Letter Recognition data(17-D) from UCI Machine Learning Repository confirm this capability. At each iteration, these data have been split into training (70%) and validation (30%) data. It required 3 iterations in Wisconsin Breast Cancer data, 4 iterations in User Knowledge Modeling and 5 iterations in Letter Recognition data and respectively 3, 4, 5 local sub-rules that covered over 95% of all n-D data points with 100% accuracy at both training and validation experiments. After each iteration, the data that were used in this iteration are removed and remaining data are used in the next iteration. This removal process helps to decrease occlusion too. The GLC-S algorithm refuses to classify remaining cases that are not covered by these rules, i.e.,., do not belong to found hyper-rectangles. The interactive visualization process in SPC allows adjusting the sides of the hyper-rectangles to maximize the size of the hyper-rectangle without its overlap with the hyper-rectangles of the opposing classes. The GLC-S method splits data using the fixed split of n coordinates to pairs. This hybrid visual and analytical approach avoids throwing all data of several classes into a visualization plot that typically ends up in a messy highly occluded picture that hides useful patterns. This approach allows revealing these hidden patterns. The visualization process in SPC is reversible (lossless). i.e.,., all n-D information is visualized in 2D and can be restored from 2D visualization for each n-D case. This hybrid visual analytics method allowed classifying n-D data in a way that can be communicated to the user’s in the understandable and visual form

    Automatically attaching web pages to an ontology

    Get PDF
    This paper describes a proposed system for automatically attaching material from the world wide web to concepts in an ontology. The motivation for this research stems from the Diogene project, which requires the project's own databases of learning objects to be augmented with additional resources from the web. Two main approaches to this problem are being taken: one using ontology mapping, and another based on the conventional text search facilities of the web, covered in this paper. By generating queries based on the concepts in the ontology, the aim is to retrieve material from the web, and then filter it to ensure its proper correspondence with a concept. The Diogene system will be briefly outlined, before the query-generation system is described. A small pilot experiment, designed to provide some initial results and insight into the problem, is then presented
    • …
    corecore