3,099 research outputs found

    Handling uncertainty in information extraction

    Get PDF
    This position paper proposes an interactive approach for developing information extractors based on the ontology definition process with knowledge about possible (in)correctness of annotations. We discuss the problem of managing and manipulating probabilistic dependencies

    Neogeography: The Challenge of Channelling Large and Ill-Behaved Data Streams

    Get PDF
    Neogeography is the combination of user generated data and experiences with mapping technologies. In this article we present a research project to extract valuable structured information with a geographic component from unstructured user generated text in wikis, forums, or SMSes. The extracted information should be integrated together to form a collective knowledge about certain domain. This structured information can be used further to help users from the same domain who want to get information using simple question answering system. The project intends to help workers communities in developing countries to share their knowledge, providing a simple and cheap way to contribute and get benefit using the available communication technology

    Named Entity Extraction and Disambiguation: The Reinforcement Effect.

    Get PDF
    Named entity extraction and disambiguation have received much attention in recent years. Typical fields addressing these topics are information retrieval, natural language processing, and semantic web. Although these topics are highly dependent, almost no existing works examine this dependency. It is the aim of this paper to examine the dependency and show how one affects the other, and vice versa. We conducted experiments with a set of descriptions of holiday homes with the aim to extract and disambiguate toponyms as a representative example of named entities. We experimented with three approaches for disambiguation with the purpose to infer the country of the holiday home. We examined how the effectiveness of extraction influences the effectiveness of disambiguation, and reciprocally, how filtering out ambiguous names (an activity that depends on the disambiguation process) improves the effectiveness of extraction. Since this, in turn, may improve the effectiveness of disambiguation again, it shows that extraction and disambiguation may reinforce each other.\u

    Information Extraction, Data Integration, and Uncertain Data Management: The State of The Art

    Get PDF
    Information Extraction, data Integration, and uncertain data management are different areas of research that got vast focus in the last two decades. Many researches tackled those areas of research individually. However, information extraction systems should have integrated with data integration methods to make use of the extracted information. Handling uncertainty in extraction and integration process is an important issue to enhance the quality of the data in such integrated systems. This article presents the state of the art of the mentioned areas of research and shows the common grounds and how to integrate information extraction and data integration under uncertainty management cover

    Unsupervised improvement of named entity extraction in short informal context using disambiguation clues

    Get PDF
    Short context messages (like tweets and SMS’s) are a potentially rich source of continuously and instantly updated information. Shortness and informality of such messages are challenges for Natural Language Processing tasks. Most efforts done in this direction rely on machine learning techniques which are expensive in terms of data collection and training. In this paper we present an unsupervised Semantic Web-driven approach to improve the extraction process by using clues from the disambiguation process. For extraction we used a simple Knowledge-Base matching technique combined with a clustering-based approach for disambiguation. Experimental results on a self-collected set of tweets (as an example of short context messages) show improvement in extraction results when using unsupervised feedback from the disambiguation process

    Large quantum gravity effects: Cylindrical waves in four dimensions

    Get PDF
    Linearly polarized cylindrical waves in four-dimensional vacuum gravity are mathematically equivalent to rotationally symmetric gravity coupled to a Maxwell (or Klein-Gordon) field in three dimensions. The quantization of this latter system was performed by Ashtekar and Pierri in a recent work. Employing that quantization, we obtain here a complete quantum theory which describes the four-dimensional geometry of the Einstein-Rosen waves. In particular, we construct regularized operators to represent the metric. It is shown that the results achieved by Ashtekar about the existence of important quantum gravity effects in the Einstein-Maxwell system at large distances from the symmetry axis continue to be valid from a four-dimensional point of view. The only significant difference is that, in order to admit an approximate classical description in the asymptotic region, states that are coherent in the Maxwell field need not contain a large number of photons anymore. We also analyze the metric fluctuations on the symmetry axis and argue that they are generally relevant for all of the coherent states.Comment: Version accepted for publication in Int. J. Mod. Phys.

    Concept Extraction Challenge: University of Twente at #MSM2013

    Get PDF
    Twitter messages are a potentially rich source of continuously and instantly updated information. Shortness and informality of such messages are challenges for Natural Language Processing tasks. In this paper we present a hybrid approach for Named Entity Extraction (NEE) and Classification (NEC) for tweets. The system uses the power of the Conditional Random Fields (CRF) and the Support Vector Machines (SVM) in a hybrid way to achieve better results. For named entity type classification we used AIDA \cite{YosefHBSW11} disambiguation system to disambiguate the extracted named entities and hence find their type

    Preposed Topic Specification in Berber: An Innovation Induced by Contact with Arabic

    Get PDF
    This article deals with preposed topic specification in Berber and demonstrates how this pragmatic phenomenon was engendered by contact with Arabic by means of two grammaticalisation processes: replica grammaticalisation (Heine and Kuteva 2003), which led to the Type-1 topic specifier, whose borrowed matter has undergone light or heavy processing, and (ordinary) contact-induced grammaticalisation (Heine and Kuteva 2003), which led to the Type-2 topic specifier, whose matter was provided by Berber itself by means of system-internal developments. Furthermore, the article accounts for the functional parameter of contrast as being the probable trigger of the whole innovation process and hence corroborates Matras’ hypothesis (1998) regarding contrast as a motivating factor for borrowing

    Determining the Neutrino Mass Hierarchy and CP Violation in NOvA with a Second Off-Axis Detector

    Get PDF
    We consider a Super-NOvA-like experimental configuration based on the use of two detectors in a long-baseline experiment as NOvA. We take the far detector as in the present NOvA proposal and add a second detector at a shorter baseline. The location of the second off-axis detector is chosen such that the ratio L/E is the same for both detectors, being L the baseline and E the neutrino energy. We consider liquid argon and water-Cherenkov techniques for the second off-axis detector and study, for different experimental setups, the detector mass required for the determination of the neutrino mass hierarchy, for different values of theta13. We also study the capabilities of such an experimental setup for determining CP violation in the neutrino sector. Our results show that by adding a second off-axis detector a remarkable enhancement on the capabilities of the current NOvA experiment could be achieved.Comment: 20 p

    Negation in Berber: variation, evolution, and typology

    Get PDF
    International audienceDouble and triple negation marking is an ancient and deep-rooted feature that is attested in almost the entire Berber-speaking area (North Africa and diaspora), regardless of the type of negators in use; i.e. discontinuous markers (preverbal and postverbal negators) and dedicated negative verb stem alternations. In this article, we deal with the main stages that have led to the present Berber negation patterns and we argue, from a typological viewpoint, that certain morphophonetic mechanisms are to be regarded as a hitherto overlooked source for new negators. Moreover, we present a number of motivations that account for the hypothesis that, in Berber, those languages with both a preverbal and a postverbal negator belong to a diachronic stage prior to the attested languages with a preverbal negator only. Consequently, the study demonstrates that the Jespersen Cycle is back to the beginning in certain Berber languages. In doing so, we also show that Berber is to be regarded as a substrate in the development of double negation in North African Arabic. In addition, the study accounts for the asymmetric nature of Berber negation, although some new developments towards more symmetrical negation configurations are also attested
    • 

    corecore