Search CORE

182 research outputs found

The effects of the English Baccalaureate

Author: Greevy Helen
Knox Anastasia
Nunney Fay
Pye Julia
Publication venue: Department for Education (DFE)
Publication date: 01/01/2012
Field of study

Automatic text categorisation of racist webpages

Author: Greevy Edel
Publication venue: Dublin City University. School of Computing
Publication date: 01/01/2004
Field of study

Automatic Text Categorisation (TC) involves the assignment of one or more predefined categories to text documents in order that they can be effectively managed. In this thesis we examine the possibility of applying automatic text categorisation to the problem of categorising texts (web pages) based on whether or not they are racist. TC has proven successful for topic-based problems such as news story categorisation. However, the problem of detecting racism is dissimilar to topic-based problems in that lexical items present in racist documents can also appear in anti-racist documents or indeed potentially any document. The mere presence of a potentially racist term does not necessarily mean the document is racist. The difficulty is finding what discerns racist documents from non-racist. We use a machine learning method called Support Vector Machines (SVM) to automatically learn features of racism in order to be capable of making a decision about the target class of unseen documents. We examine various representations within an SVM so as to identify the most effective method for handling this problem. Our work shows that it is possible to develop automatic categorisation of web pages, based on these approache

Irish Universities

DCU Online Research Access Service

Second-generation p-values: improved rigor, reproducibility, & transparency in statistical analyses

Author: Blume Jeffrey D.
Dupont William D.
Greevy Robert A.
McGowan Lucy DAgostino
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 04/11/2017
Field of study

Verifying that a statistically significant result is scientifically meaningful is not only good scientific practice, it is a natural way to control the Type I error rate. Here we introduce a novel extension of the p-value - a second-generation p-value - that formally accounts for scientific relevance and leverages this natural Type I Error control. The approach relies on a pre-specified interval null hypothesis that represents the collection of effect sizes that are scientifically uninteresting or are practically null. The second-generation p-value is the proportion of data-supported hypotheses that are also null hypotheses. As such, second-generation p-values indicate when the data are compatible with null hypotheses, or with alternative hypotheses, or when the data are inconclusive. Moreover, second-generation p-values provide a proper scientific adjustment for multiple comparisons and reduce false discovery rates. This is an advance for environments rich in data, where traditional p-value adjustments are needlessly punitive. Second-generation p-values promote transparency, rigor and reproducibility of scientific results by a priori specifying which candidate hypotheses are practically meaningful and by providing a more reliable statistical summary of when the data are compatible with alternative or null hypotheses.Comment: 29 pages, 29 page Supplemen

arXiv.org e-Print Archive

Directory of Open Access Journals

FigShare

Fit for purpose? : the view of the higher education sector, teachers and employers on the suitability of A levels

Author: Boal Naomi
Donaldson Rory
Ginnis Steven
Greevy Helen
Higton John
Noble James
Pope Sarah
Publication venue: Office of Qualifications and Examinations Regulation
Publication date: 01/01/2012
Field of study

Digital Education Resource Archive

Revised: The effects of the English Baccalaureate

Author: Greevy Helen
Knox Anastasia
Nunney Fay
Pye Julia
Publication venue: Department for Education
Publication date: 01/01/2013
Field of study

Digital Education Resource Archive

Classifying racist texts using a support vector machine

Author: Greevy Edel
Smeaton Alan F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2004
Field of study

In this poster we present an overview of the techniques we used to develop and evaluate a text categorisation system to automatically classify racist texts. Detecting racism is difficult because the presence of indicator words is insufficient to indicate racist texts, unlike some other text classification tasks. Support Vector Machines (SVM) are used to automatically categorise web pages based on whether or not they are racist. Different interpretations of what constitutes a term are taken, and in this poster we look at three representations of a web page within an SVM -- bag-of-words, bigrams and part-of-speech tags

Crossref

Irish Universities

DCU Online Research Access Service