Search CORE

12,499 research outputs found

Ontology Driven Web Extraction from Semi-structured and Unstructured Data for B2B Market Analysis

Author: Darlington John
Imtiaz Hazzaz
Zuo Landong
Publication venue
Publication date: 01/09/2009
Field of study

The Market Blended Insight project1 has the objective of improving the UK business to business marketing performance using the semantic web technologies. In this project, we are implementing an ontology driven web extraction and translation framework to supplement our backend triple store of UK companies, people and geographical information. It deals with both the semi-structured data and the unstructured text on the web, to annotate and then translate the extracted data according to the backend schema

Southampton (e-Prints Soton)

Analysis of ecosystem services provision in the Colombian Amazon using participatory research and mapping techniques

Author: Cruz-Garcia Gisella S.
Honzák Miroslav
Palacios Erwin
Poppy Guy M.
Pérez-Miñana Elena
Ramirez-Gomez Sara O.I.
Schreckenberg Kate
Torres-Vitolas Carlos A.
Verweij Pita A.
Willcock Simon
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

Over the last two decades indigenous peoples in the lower Caquetá River basin in Colombia have experienced detrimental changes in the provision of important ecosystem services in ways that have significant implications for the maintenance of their traditional livelihoods. To assess these changes we conducted eight participatory mapping activities and convened 22 focus group discussions. We focused the analysis on two types of change: (1) changes in the location of ecosystem services provisioning areas and (2) changes in the stock of ecosystem services. The focal ecosystem services include services such as provision of food, raw materials and medicinal resources. Results from the study show that in the past two decades the demand for food and raw materials has intensified and, as a result, locations of provisioning areas and the stocks of ecosystem services have changed. We found anecdotal evidence that these changes correlate well with socio-economic factors such as greater need for income generation, change in livelihood practices and consumption patterns. We discuss the use of participatory mapping techniques in the context of marginalized and data-poor regions. We also show how this kind of information can strengthen existing ecosystem-based management strategies used by indigenous peoples in the Colombian Amazon

Southampton (e-Prints Soton)

Elsevier - Publisher Connector

CGSpace

King's Research Portal

Utrecht University Repository

Empirical Methodology for Crowdsourcing Ground Truth

Author: Aroyo Lora
Dumitrache Anca
Inel Oana
Ortiz Carlos
Sips Robert-Jan
Timmermans Benjamin
Welty Chris
Publication venue
Publication date: 24/09/2018
Field of study

The process of gathering ground truth data through human annotation is a major bottleneck in the use of information extraction methods for populating the Semantic Web. Crowdsourcing-based approaches are gaining popularity in the attempt to solve the issues related to volume of data and lack of annotators. Typically these practices use inter-annotator agreement as a measure of quality. However, in many domains, such as event detection, there is ambiguity in the data, as well as a multitude of perspectives of the information examples. We present an empirically derived methodology for efficiently gathering of ground truth data in a diverse set of use cases covering a variety of domains and annotation tasks. Central to our approach is the use of CrowdTruth metrics that capture inter-annotator disagreement. We show that measuring disagreement is essential for acquiring a high quality ground truth. We achieve this by comparing the quality of the data aggregated with CrowdTruth metrics with majority vote, over a set of diverse crowdsourcing tasks: Medical Relation Extraction, Twitter Event Identification, News Event Extraction and Sound Interpretation. We also show that an increased number of crowd workers leads to growth and stabilization in the quality of annotations, going against the usual practice of employing a small number of annotators.Comment: in publication at the Semantic Web Journa

arXiv.org e-Print Archive

Portable extraction of partially structured facts from the web

Author: Jones Gareth J.F.
Kelly Liadh
Salway Andrew
Skadiņa Inguna
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

A novel fact extraction task is defined to fill a gap between current information retrieval and information extraction technologies. It is shown that it is possible to extract useful partially structured facts about different kinds of entities in a broad domain, i.e. all kinds of places depicted in tourist images. Importantly the approach does not rely on existing linguistic resources (gazetteers, taggers, parsers, etc.) and it ported easily and cheaply between two very different languages (English and Latvian). Previous fact extraction from the web has focused on the extraction of structured data, e.g. (Building-LocatedIn-Town). In contrast we extract richer and more interesting facts, such as a fact explaining why a building was built. Enough structure is maintained to facilitate subsequent processing of the information. For example, this partial structure enables straightforward template-based text generation. We report positive results for the correctness and interest of English and Latvian facts and for the utility of the extracted facts in enhancing image captions

CiteSeerX

Crossref

Irish Universities

DCU Online Research Access Service

Security After 9/11: Strategy Choices and Budget Tradeoffs

Author
Publication venue: World Security Institute
Publication date: 01/01/2003
Field of study

The White House issued a National Security Strategy document in 2002 that stated the nation's new foreign policy and national security policy goals. Are the choices it identifies the right choices, and how best should resources be allocated to reach those or alternative goals? This briefing book includes data and analysis of these topics by analysts from several research organizations, intended to help expand and deepen public debate on these issues

IssueLab

Enhancing Undergraduate AI Courses through Machine Learning Projects

Author: Coleman Susan
Markov Zdravko
Neller Todd W.
Russell Ingrid
Publication venue: The Cupola: Scholarship at Gettysburg College
Publication date: 19/10/2005
Field of study

It is generally recognized that an undergraduate introductory Artificial Intelligence course is challenging to teach. This is, in part, due to the diverse and seemingly disconnected core topics that are typically covered. The paper presents work funded by the National Science Foundation to address this problem and to enhance the student learning experience in the course. Our work involves the development of an adaptable framework for the presentation of core AI topics through a unifying theme of machine learning. A suite of hands-on semester-long projects are developed, each involving the design and implementation of a learning system that enhances a commonly-deployed application. The projects use machine learning as a unifying theme to tie together the core AI topics. In this paper, we will first provide an overview of our model and the projects being developed and will then present in some detail our experiences with one of the projects – Web User Profiling which we have used in our AI class

Gettysburg College