Now the wars are over: The past, present and future of Scottish battlefields

Author: C Duffy
D Smurthwaite
DD Scott
G Foard
G MacDonald Fraser
GS Maxwell
Iain Banks
J Cooper
J Fraser
K Durham
L Alcock
O Lelong
P Harrington
P Salway
S Reid
T Pollard
Tony Pollard
W Hutton
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/05/2010

Battlefield archaeology has provided a new way of appreciating historic battlefields. This paper provides a summary of the long history of warfare and conflict in Scotland, which has given rise to a large number of battlefield sites. Recent moves to highlight the archaeological importance of these sites, in the form of Historic Scotland’s Battlefields Inventory, are discussed, along with some of the problems associated with the preservation and management of these important cultural sites.

Crossref

Enlighten

Alliances, assemblages, and affects: Three moments of building collective working-class literacies

Author: Harding J
Parks S
Pauszek J
Pollard Nicholas
Publication venue: National Council of Teachers of English
Publication date: 01/09/2018

© 2018 by the National Council of Teachers of English. All rights reserved. This article explores how assemblage and affect theories can enable research into the formation of a collective working-class identity, inclusive of written, print, publication, and organizational literacies, through the origins of the Federation of Worker Writer and Community Publishers, an organization that expanded its collectivity as new heritages, ethnicities, and immigrant identities altered the organization’s membership and “class” identity.

Sheffield Hallam University Research Archive

Baby-Step Giant-Step Algorithms for the Symmetric Group

Author: Babai L.
Greene D. H.
McCurley K. S.
Pollard J. M.
Rosenbaum D.
Shanks D.
Teske E.
Publication date: 11/12/2016

We study discrete logarithms in the setting of group actions. Suppose that $G$ is a group that acts on a set $S$. When $r, s \in S$, a solution $g \in G$ to $r^g = s$ can be thought of as a kind of logarithm. In this paper, we study the case where $G = S_n$, and develop analogs to the Shanks baby-step / giant-step procedure for ordinary discrete logarithms. Specifically, we compute two sets $A, B \subseteq S_n$ such that every permutation of $S_n$ can be written as a product $ab$ of elements $a \in A$ and $b \in B$. Our deterministic procedure is optimal up to constant factors, in the sense that $A$ and $B$ can be computed in optimal asymptotic complexity, and $|A|$ and $|B|$ are within a small constant factor of $\sqrt{n!}$ in size. We also analyze randomized "collision" algorithms for the same problem.
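For orientation, the ordinary Shanks baby-step / giant-step procedure that this abstract takes as its starting point can be sketched as below. This is the classical version for discrete logarithms modulo a prime, not the paper's construction for $S_n$; the function name `bsgs` and the choice of the multiplicative group modulo `p` are assumptions made for illustration.

```python
from math import isqrt


def bsgs(g, h, p):
    """Return x with pow(g, x, p) == h % p, or None if no such x exists.

    Illustrative assumptions: p is prime and g is not divisible by p,
    so g is invertible modulo p. Requires Python 3.8+ for pow(g, -m, p).
    """
    m = isqrt(p - 1) + 1  # step size, roughly the square root of the group order

    # Baby steps: remember the smallest j in [0, m) for each value g^j mod p.
    table = {}
    value = 1
    for j in range(m):
        table.setdefault(value, j)
        value = value * g % p

    # Giant steps: compare h * (g^-m)^i against the table for i in [0, m).
    factor = pow(g, -m, p)
    gamma = h % p
    for i in range(m):
        if gamma in table:
            return i * m + table[gamma]
        gamma = gamma * factor % p
    return None
```

Both the table and the giant-step loop have about $\sqrt{p}$ entries, which mirrors the abstract's goal of two sets $A$ and $B$ whose sizes are each close to $\sqrt{n!}$.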

arXiv.org e-Print Archive

Crossref

Supervised Distance Matrices: Theory and Applications to Genomics

Author: Pollard Katherine S.
van der Laan Mark J.
Publication venue: Collection of Biostatistics Research Archive
Publication date: 16/06/2008

We propose a new approach to studying the relationship between a very high-dimensional random variable and an outcome. Our method is based on a novel concept, the supervised distance matrix, which quantifies pairwise similarity between variables based on their association with the outcome. A supervised distance matrix is derived in two stages. The first stage involves a transformation based on a particular model for association. In particular, one might regress the outcome on each variable and then use the residuals or the influence curve from each regression as a data transformation. In the second stage, a choice of distance measure is used to compute all pairwise distances between variables in this transformed data. When the outcome is right-censored, we show that the supervised distance matrix can be consistently estimated using inverse probability of censoring weighted (IPCW) estimators based on the mean and covariance of the transformed data. The proposed methodology is illustrated with examples of gene expression data analysis with a survival outcome. This approach is widely applicable in genomics and other fields where high-dimensional data is collected on each subject.
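The two-stage construction described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes an uncensored outcome, a simple linear regression of the outcome on each variable as the association model, residuals as the transformation, and Euclidean distance in the second stage. The function names are hypothetical.

```python
from math import sqrt


def regression_residuals(x, y):
    """Residuals from a simple least-squares regression of y on one variable x.

    Assumes x is not constant, so the slope is well defined.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


def supervised_distance_matrix(variables, y):
    """Pairwise Euclidean distances between variables after the residual transform.

    `variables` is a list of columns, one list of n observations per variable.
    """
    # Stage 1: transform each variable through its association with the outcome.
    transformed = [regression_residuals(x, y) for x in variables]
    # Stage 2: apply a distance measure to every pair of transformed variables.
    p = len(transformed)
    return [
        [
            sqrt(sum((a - b) ** 2 for a, b in zip(transformed[j], transformed[k])))
            for k in range(p)
        ]
        for j in range(p)
    ]
```

Two variables that relate to the outcome in the same way end up close together under this transform, which is the sense in which the distance is "supervised". The right-censored case in the abstract would need the IPCW estimators, which this sketch does not attempt.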

Collection Of Biostatistics Research Archive

Resampling-based Multiple Testing: Asymptotic Control of Type I Error and Applications to Gene Expression Data

Author: Pollard Katherine S.
van der Laan Mark J.
Publication venue: Collection of Biostatistics Research Archive
Publication date: 24/06/2003

We define a general statistical framework for multiple hypothesis testing and show that the correct null distribution for the test statistics is obtained by projecting the true distribution of the test statistics onto the space of mean zero distributions. For common choices of test statistics (based on an asymptotically linear parameter estimator), this distribution is asymptotically multivariate normal with mean zero and the covariance of the vector influence curve for the parameter estimator. This test statistic null distribution can be estimated by applying the non-parametric or parametric bootstrap to correctly centered test statistics. We prove that this bootstrap estimated null distribution provides asymptotic control of most type I error rates. We show that obtaining a test statistic null distribution from a data null distribution (e.g. by projecting the data generating distribution onto the space of all distributions satisfying the complete null) only provides the correct test statistic null distribution if the covariance of the vector influence curve is the same under the data null distribution as under the true data distribution. This condition is a weak version of the subset pivotality condition. We show that our multiple testing methodology controlling type I error is equivalent to constructing an error-specific confidence region for the true parameter and checking if it contains the hypothesized value. We also study the two-sample problem and show that the permutation distribution produces an asymptotically correct null distribution if (i) the sample sizes are equal or (ii) the populations have the same covariance structure. We include a discussion of the application of multiple testing to gene expression data, where the dimension typically far exceeds the sample size. An analysis of a cancer gene expression data set illustrates the methodology.
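The idea of bootstrapping centered test statistics can be sketched as below. This is a simplified illustration under stated assumptions, not the authors' procedure: the parameter is a vector of means, the statistic is sqrt(n) times the difference between the bootstrap mean and the observed mean, and a single common cutoff is taken from the distribution of the maximum absolute statistic (one possible way to target the family-wise error rate). All function names are hypothetical.

```python
import random
from math import sqrt


def column_means(rows):
    """Mean of each column for a list of equal-length rows."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def bootstrap_max_null(rows, n_boot=500, seed=0):
    """Sorted bootstrap draws of max_j |sqrt(n) * (boot_mean_j - observed_mean_j)|.

    Centering at the observed means gives the bootstrap statistics mean zero,
    which is the role of the "correctly centered" statistics in the abstract.
    """
    rng = random.Random(seed)
    n = len(rows)
    observed = column_means(rows)
    maxima = []
    for _ in range(n_boot):
        # Non-parametric bootstrap: resample whole subjects (rows) with replacement.
        sample = [rows[rng.randrange(n)] for _ in range(n)]
        boot = column_means(sample)
        maxima.append(max(abs(sqrt(n) * (b - o)) for b, o in zip(boot, observed)))
    return sorted(maxima)


def common_cutoff(sorted_maxima, alpha=0.05):
    """Empirical (1 - alpha) quantile of the bootstrap maxima."""
    index = min(len(sorted_maxima) - 1, int((1 - alpha) * len(sorted_maxima)))
    return sorted_maxima[index]
```

A hypothesis j would then be rejected when the absolute value of sqrt(n) times the observed mean exceeds the cutoff. Resampling whole rows keeps the dependence between the statistics intact, which a data null distribution may not do.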

Collection Of Biostatistics Research Archive

Statistical Inference for Simultaneous Clustering of Gene Expression Data

Author: Pollard Katherine S.
van der Laan Mark J.
Publication venue: Collection of Biostatistics Research Archive
Publication date: 01/07/2001

Current methods for analysis of gene expression data are mostly based on clustering and classification of either genes or samples. We offer support for the idea that more complex patterns can be identified in the data if genes and samples are considered simultaneously. We formalize the approach and propose a statistical framework for two-way clustering. A simultaneous clustering parameter is defined as a function of the true data generating distribution, and an estimate is obtained by applying this function to the empirical distribution. We illustrate that a wide range of clustering procedures, including generalized hierarchical methods, can be defined as parameters which are compositions of individual mappings for clustering patients and genes. This framework allows one to assess classical properties of clustering methods, such as consistency, and to formally study statistical inference regarding the clustering parameter. We present results of simulations designed to assess the asymptotic validity of different bootstrap methods for estimating the distributions of estimated simultaneous clustering parameters. The method is illustrated on a publicly available data set.

Collection Of Biostatistics Research Archive

From treebank resources to LFG F-structures

Author: A Cahill
A Frank
C Pollard
E Charniak
G Leech
J Bresnan
J van Genabith
L Sadler
RM Kaplan
S Abney
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2003

We present two methods for automatically annotating treebank resources with functional structures. Both methods define systematic patterns of correspondence between partial PS configurations and functional structures. These are applied to PS rules extracted from treebanks, or directly to constraint set encodings of treebank PS trees.

University of Essex Research Repository

Crossref

DCU Online Research Access Service

PhylOTU: a high-throughput procedure quantifies microbial community diversity and resolves novel taxa from metagenomic data.

Author: Eisen Jonathan A
Green Jessica L
Kembel Steven W
Ladau Joshua
O'Dwyer James P
Pollard Katherine S
Riesenfeld Samantha J
Sharpton Thomas J
Publication venue: eScholarship, University of California
Publication date: 01/01/2011

Microbial diversity is typically characterized by clustering ribosomal RNA (SSU-rRNA) sequences into operational taxonomic units (OTUs). Targeted sequencing of environmental SSU-rRNA markers via PCR may fail to detect OTUs due to biases in priming and amplification. Analysis of shotgun sequenced environmental DNA, known as metagenomics, avoids amplification bias but generates fragmentary, non-overlapping sequence reads that cannot be clustered by existing OTU-finding methods. To circumvent these limitations, we developed PhylOTU, a computational workflow that identifies OTUs from metagenomic SSU-rRNA sequence data through the use of phylogenetic principles and probabilistic sequence profiles. Using simulated metagenomic data, we quantified the accuracy with which PhylOTU clusters reads into OTUs. Comparisons of PCR and shotgun sequenced SSU-rRNA markers derived from the global open ocean revealed that while PCR libraries identify more OTUs per sequenced residue, metagenomic libraries recover a greater taxonomic diversity of OTUs. In addition, we discover novel species, genera and families in the metagenomic libraries, including OTUs from phyla missed by analysis of PCR sequences. Taken together, these results suggest that PhylOTU enables characterization of part of the biosphere currently hidden from PCR-based surveys of diversity.

Directory of Open Access Journals

PubMed Central

eScholarship - University of California