Search CORE

1,731 research outputs found

Establishing a New State-of-the-Art for French Named Entity Recognition

Author: Dupont Yoann
Muller Benjamin
Romary Laurent
Sagot Benoît
Suárez Pedro Javier Ortiz
Publication venue
Publication date: 11/05/2020
Field of study

The French TreeBank developed at the University Paris 7 is the main source of morphosyntactic and syntactic annotations for French. However, it does not include explicit information related to named entities, which are among the most useful information for several natural language processing tasks and applications. Moreover, no large-scale French corpus with named entity annotations contain referential information, which complement the type and the span of each mention with an indication of the entity it refers to. We have manually annotated the French TreeBank with such information, after an automatic pre-annotation step. We sketch the underlying annotation guidelines and we provide a few figures about the resulting annotations

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Information Delivery Systems:An Exploration of Web Pull and Push Technologies

Author: Kendall Julie E.
Kendall Kenneth E.
Publication venue: AIS Electronic Library (AISeL)
Publication date: 26/04/1999
Field of study

The Web is alive with news stories, pictures, music, and videos. How will organizations, managers, and other users find out what content is available, then locate it, analyze it, and make it meaningful? In this tutorial, we identify and classify eight types of information delivery systems (IDS) that we refer to as alpha, beta, gamma and delta and push technologies. For pull technologies we explain surfing the Web , search engines, spiders and bots, personal agents, and finally evolutionary agents. For push technologies we explain Webcasting, channels and subscriptions, and data mining methods for determining preferences and filtering topics. We also examine the role of the evolutionary agents in push technologies. Throughout the paper, we provide examples of current pull and push technologies in each of the categories for pull and push. We include both personal and corporate applications. We then examine the managerial and social implications of higher-level IDS and suggest what is in store for users of information delivery systems in the future

AIS Electronic Library (AISeL)

Towards a framework for knowledge discovery: an architecture for distributed inductive databases

Author: Bruin Jeroen S. de
Kok Joost
Publication venue
Publication date: 01/08/2006
Field of study

We discuss how data mining, patternbases and databases can be integrated into inductive databases, which make data mining an inductive query process. We propose a software architecture for such inductive databases, and extend this architecture to support the clustering of inductive databases and to make them suitable for data mining on the grid.Applications in Artificial Intelligence - Knowledge DiscoveryRed de Universidades con Carreras en Informática (RedUNCI

Organizing XML data in a wireless broadcast system by exploiting structural similarities

Author: C-S Park
Hua Wang
J Chen
JP Park
LR Dice
MA Viredaz
Nickolas J. G. Falkner
P Ganesan
Quan Z. Sheng
RQ Shaddad
SH Kang
T Imielinski
W Lian
W Sun
X Jianliang
Y Diao
YD Chung
YD Chung
Yongrui Qin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/08/2017
Field of study

Wireless data broadcast is an efficient way of delivering data of common interest to a large population of mobile devices within a proximate area, such as smart cities, battle fields, etc. In this work, we focus ourselves on studying the data placement problem of periodic XML data broadcast in mobile and wireless environments. This is an important issue, particularly when XML becomes prevalent in today’s ubiquitous and mobile computing devices and applications. Taking advantage of the structured characteristics of XML data, effective broadcast programs can be generated based on the XML data on the server only. An XML data broadcast system is developed and a theoretical analysis on the XML data placement on a wireless channel is also presented, which forms the basis of the novel data placement algorithm in this work. The proposed algorithm is validated through a set of experiments. The results show that the proposed algorithm can effectively place XML data on air and significantly improve the overall access efficiency

University of Huddersfield Repository

Huddersfield Research Portal