1,731 research outputs found
Establishing a New State-of-the-Art for French Named Entity Recognition
The French TreeBank developed at the University Paris 7 is the main source of
morphosyntactic and syntactic annotations for French. However, it does not
include explicit information related to named entities, which are among the
most useful information for several natural language processing tasks and
applications. Moreover, no large-scale French corpus with named entity
annotations contain referential information, which complement the type and the
span of each mention with an indication of the entity it refers to. We have
manually annotated the French TreeBank with such information, after an
automatic pre-annotation step. We sketch the underlying annotation guidelines
and we provide a few figures about the resulting annotations
Information Delivery Systems:An Exploration of Web Pull and Push Technologies
The Web is alive with news stories, pictures, music, and videos. How will organizations, managers, and other users find out what content is available, then locate it, analyze it, and make it meaningful? In this tutorial, we identify and classify eight types of information delivery systems (IDS) that we refer to as alpha, beta, gamma and delta and push technologies. For pull technologies we explain surfing the Web , search engines, spiders and bots, personal agents, and finally evolutionary agents. For push technologies we explain Webcasting, channels and subscriptions, and data mining methods for determining preferences and filtering topics. We also examine the role of the evolutionary agents in push technologies. Throughout the paper, we provide examples of current pull and push technologies in each of the categories for pull and push. We include both personal and corporate applications. We then examine the managerial and social implications of higher-level IDS and suggest what is in store for users of information delivery systems in the future
Towards a framework for knowledge discovery: an architecture for distributed inductive databases
We discuss how data mining, patternbases and databases can be integrated into inductive databases, which make data mining an inductive query process. We propose a software architecture for such inductive databases, and extend this architecture to support the clustering of inductive databases and to make them suitable for data mining on the grid.Applications in Artificial Intelligence - Knowledge DiscoveryRed de Universidades con Carreras en Informática (RedUNCI
Organizing XML data in a wireless broadcast system by exploiting structural similarities
Wireless data broadcast is an efficient way of delivering data of common interest to a large population of mobile devices within a proximate area, such as smart cities, battle fields, etc. In this work, we focus ourselves on studying the data placement problem of periodic XML data broadcast in mobile and wireless environments. This is an important issue, particularly when XML becomes prevalent in today’s ubiquitous and mobile computing devices and applications. Taking advantage of the structured characteristics of XML data, effective broadcast programs can be generated based on the XML data on the server only. An XML data broadcast system is developed and a theoretical analysis on the XML data placement on a wireless channel is also presented, which forms the basis of the novel data placement algorithm in this work. The proposed algorithm is validated through a set of experiments. The results show that the proposed algorithm can effectively place XML data on air and significantly improve the overall access efficiency
- …