1,731 research outputs found

    Establishing a New State-of-the-Art for French Named Entity Recognition

    Get PDF
    The French TreeBank developed at the University Paris 7 is the main source of morphosyntactic and syntactic annotations for French. However, it does not include explicit information related to named entities, which are among the most useful information for several natural language processing tasks and applications. Moreover, no large-scale French corpus with named entity annotations contain referential information, which complement the type and the span of each mention with an indication of the entity it refers to. We have manually annotated the French TreeBank with such information, after an automatic pre-annotation step. We sketch the underlying annotation guidelines and we provide a few figures about the resulting annotations

    Information Delivery Systems:An Exploration of Web Pull and Push Technologies

    Get PDF
    The Web is alive with news stories, pictures, music, and videos. How will organizations, managers, and other users find out what content is available, then locate it, analyze it, and make it meaningful? In this tutorial, we identify and classify eight types of information delivery systems (IDS) that we refer to as alpha, beta, gamma and delta and push technologies. For pull technologies we explain surfing the Web , search engines, spiders and bots, personal agents, and finally evolutionary agents. For push technologies we explain Webcasting, channels and subscriptions, and data mining methods for determining preferences and filtering topics. We also examine the role of the evolutionary agents in push technologies. Throughout the paper, we provide examples of current pull and push technologies in each of the categories for pull and push. We include both personal and corporate applications. We then examine the managerial and social implications of higher-level IDS and suggest what is in store for users of information delivery systems in the future

    Towards a framework for knowledge discovery: an architecture for distributed inductive databases

    Get PDF
    We discuss how data mining, patternbases and databases can be integrated into inductive databases, which make data mining an inductive query process. We propose a software architecture for such inductive databases, and extend this architecture to support the clustering of inductive databases and to make them suitable for data mining on the grid.Applications in Artificial Intelligence - Knowledge DiscoveryRed de Universidades con Carreras en Informática (RedUNCI

    Organizing XML data in a wireless broadcast system by exploiting structural similarities

    Get PDF
    Wireless data broadcast is an efficient way of delivering data of common interest to a large population of mobile devices within a proximate area, such as smart cities, battle fields, etc. In this work, we focus ourselves on studying the data placement problem of periodic XML data broadcast in mobile and wireless environments. This is an important issue, particularly when XML becomes prevalent in today’s ubiquitous and mobile computing devices and applications. Taking advantage of the structured characteristics of XML data, effective broadcast programs can be generated based on the XML data on the server only. An XML data broadcast system is developed and a theoretical analysis on the XML data placement on a wireless channel is also presented, which forms the basis of the novel data placement algorithm in this work. The proposed algorithm is validated through a set of experiments. The results show that the proposed algorithm can effectively place XML data on air and significantly improve the overall access efficiency
    corecore