Search CORE

120,376 research outputs found

FOSTER D2.1 - Technical protocol for rich metadata categorization and content classification

Author: Davidson Joy
Jones Sarah
Kuchma Iryna
Orth Astrid
Proudman Vanessa
Publication venue: FOSTER Project
Publication date: 10/07/2014
Field of study

FOSTER aims to set in place sustainable mechanisms for EU researchers to FOSTER OPEN SCIENCE in their daily workflow, supporting researchers optimizing their research visibility and impact and the adoption of EU open access policies in line with the EU objectives on Responsible Research & Innovation. More specifically, the FOSTER objectives are to: • Support different stakeholders, especially young researchers, in adopting open access in the context of the European Research Area (ERA) and in complying with the open access policies and rules of participation set out for Horizon 2020; • Integrate open access principles and practice in the current research workflow by targeting the young researcher training environment; • Strengthen the institutional training capacity to foster compliance with the open access policies of the ERA and Horizon 2020 (beyond the FOSTER project); • Facilitate the adoption, reinforcement and implementation of open access policies from other European funders, in line with the EC’s recommendation, in partnership with PASTEUR4OA project. As stated in the project Description of Work (DoW) these objectives will be pursued and achieved through the combination of 3 main activities: content identification, repacking and creation; creation of the FOSTER Portal; delivery of training. The core activity of the Task T2.1 will be to define a basic quality control protocol for content, and map available content by target group, and content type in parallel with WP3 Task 3.1. Training materials include the full range of classical (structured presentation slides) and multi-media content (short videos, interactive e-books, ) that clearly and succinctly frames a problem and offers a working solution, in support of the learning objectives of each target group, and the range of learning options to be used in WP4 (elearning, blended learning, self-learning). The map of existing content metadata will be delivered to WP3 for best choice of system requirements for continuous and sustainable content aggregation, enhancement and content delivery via “Tasks 3.2 e-Learning Portal” and “Task 3.4 Content Upload”. The resulting content compilation will be tailored to each Target Group and delivered to WP4

Enlighten

Thumbs up? Sentiment Classification using Machine Learning Techniques

Author: Lee Lillian
Pang Bo
Vaithyanathan Shivakumar
Publication venue
Publication date: 01/01/2002
Field of study

We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, we find that standard machine learning techniques definitively outperform human-produced baselines. However, the three machine learning methods we employed (Naive Bayes, maximum entropy classification, and support vector machines) do not perform as well on sentiment classification as on traditional topic-based categorization. We conclude by examining factors that make the sentiment classification problem more challenging.Comment: To appear in EMNLP-200

arXiv.org e-Print Archive

CiteSeerX

KACST Arabic Text Classification Project: Overview and Preliminary Results

Author: Al-Rajeh A.
Alharbi S.
Almuhareb A.
Althubaity A.
Khorsheed M.
Publication venue
Publication date: 01/01/2008
Field of study

Electronically formatted Arabic free-texts can be found in abundance these days on the World Wide Web, often linked to commercial enterprises and/or government organizations. Vast tracts of knowledge and relations lie hidden within these texts, knowledge that can be exploited once the correct intelligent tools have been identified and applied. For example, text mining may help with text classification and categorization. Text classification aims to automatically assign text to a predefined category based on identifiable linguistic features. Such a process has different useful applications including, but not restricted to, E-Mail spam detection, web pages content filtering, and automatic message routing. In this paper an overview of King Abdulaziz City for Science and Technology (KACST) Arabic Text Classification Project will be illustrated along with some preliminary results. This project will contribute to the better understanding and elaboration of Arabic text classification techniques

Southampton (e-Prints Soton)

Boundaries of Semantic Distraction: Dominance and Lexicality Act at Retrieval

Author: A Buchner
A Buchner
A Buchner
A Hantsch
AD Baddeley
AM Mood
AP Smith
C Hulme
C Maidhof
CB Mervis
CB Neely
CJP Oswald
CN Cofer
CP Beaman
D Broadbent
DJ Burns
DM Jones
DM Jones
DM Jones
DM Jones
DM Jones
Dylan M. Jones
E Sundstrom
E Tulving
EM Elliott
F Frankel
FBR Parmentier
G Underwood
G Underwood
G Underwood
GH Bower
GW Evans
I Neath
J Saint-Aubin
JE Marsh
JE Marsh
JE Marsh
JE Marsh
JE Marsh
JE Marsh
JE Marsh
JH Neely
JH Neely
John E. Marsh
JP Overschelde Van
L Tan
M Allen
M Miozzo
M Moscovitch
MC Anderson
MD Murphy
MW Mulligan
N Cowan
NE Wetherick
Nick Perham
NJ Slamecka
O Neumann
P Sörqvist
P Sörqvist
P Sörqvist
PA Tun
Patrik Sörqvist
R Bell
R Bell
R Bell
RC Martin
RL Hudson
RR Hunt
RW Hughes
RW Hughes
RW Hughes
RW Hughes
S Hygge
S Tremblay
SD Gronlund
SE Gathercole
SM Sheffert
T Witterseh
TJ Shuell
TJ Shuell
WA Bousfield
WA Bousfield
WA Wallis
WJ Macken
Y Zhang
YB Sirotin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2014
Field of study

Three experiments investigated memory for semantic information with the goal of determining boundary conditions for the manifestation of semantic auditory distraction. Irrelevant speech disrupted the free recall of semantic category-exemplars to an equal degree regardless of whether the speech coincided with presentation or test phases of the task (Experiment 1) and occurred regardless of whether it comprised random words or coherent sentences (Experiment 2). The effects of background speech were greater when the irrelevant speech was semantically related to the to-be-remembered material, but only when the irrelevant words were high in output dominance (Experiment 3). The implications of these findings in relation to the processing of task material and the processing of background speech is discussed

CLoK

Crossref

Online Research @ Cardiff