
    Multiple‐systems analysis for the quantification of modern slavery: classical and Bayesian approaches

    Multiple systems estimation is a key approach for quantifying hidden populations such as the number of victims of modern slavery. The UK Government published an estimate of 10,000 to 13,000 victims, constructed by the present author, as part of the strategy leading to the Modern Slavery Act 2015. This estimate was obtained by a stepwise multiple systems method based on six lists. Further investigation shows that a small proportion of the possible models give rather different answers, and that other model fitting approaches may choose one of these. Three data sets collected in the field of modern slavery, together with a data set about the death toll in the Kosovo conflict, are used to investigate the stability and robustness of various multiple systems estimation methods. The crucial aspect is the way that interactions between lists are modelled, because these can substantially affect the results. Model selection and Bayesian approaches are considered in detail, in particular to assess their stability and robustness when applied to real modern slavery data. A new Markov Chain Monte Carlo Bayesian approach is developed; overall, this gives robust and stable results, at least for the examples considered. The software and datasets are freely and publicly available to facilitate wider implementation and further research.
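
    The core idea behind such estimates can be illustrated with a log-linear model fitted to the overlap counts between lists. The sketch below is not the author's released software: the list names and counts are made up, only three lists are used rather than six, and the simple independence model shown is exactly the kind of choice (list interactions included or excluded) that the abstract warns can change the answer.

```python
# Minimal log-linear multiple systems estimation sketch (illustrative data only).
import numpy as np
import pandas as pd
import statsmodels.api as sm
import statsmodels.formula.api as smf

# Hypothetical overlap counts for three lists A, B, C: each row is a
# capture history (1 = appears on that list) with its observed count.
data = pd.DataFrame({
    "A":     [1, 0, 0, 1, 1, 0, 1],
    "B":     [0, 1, 0, 1, 0, 1, 1],
    "C":     [0, 0, 1, 0, 1, 1, 1],
    "count": [120, 95, 60, 18, 12, 9, 3],
})

# Main-effects (independence) Poisson log-linear model; interaction terms
# such as A:B could be added, which is where model selection matters.
fit = smf.glm("count ~ A + B + C", data=data,
              family=sm.families.Poisson()).fit()

# Predict the unobserved cell (0, 0, 0): people appearing on no list.
newcell = pd.DataFrame({"A": [0], "B": [0], "C": [0]})
unseen = np.asarray(fit.predict(newcell))[0]
print(f"estimated unseen: {unseen:.0f}, "
      f"total population: {data['count'].sum() + unseen:.0f}")
```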

    Sharing Social Network Data: Differentially Private Estimation of Exponential-Family Random Graph Models

    Motivated by a real-life problem of sharing social network data that contain sensitive personal information, we propose a novel approach to release and analyze synthetic graphs in order to protect the privacy of individual relationships captured by the social network while maintaining the validity of statistical results. A case study using a version of the Enron e-mail corpus dataset demonstrates the application and usefulness of the proposed techniques in solving the challenging problem of maintaining privacy and supporting open access to network data, both to ensure reproducibility of existing studies and to allow new scientific insights to be discovered by analyzing such data. We use a simple yet effective randomized response mechanism to generate synthetic networks under ε-edge differential privacy, and then use likelihood-based inference for missing data and Markov chain Monte Carlo techniques to fit exponential-family random graph models to the generated synthetic networks. Comment: Updated, 39 pages
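
    The randomized response step can be sketched in a few lines: each potential edge is flipped independently with probability 1/(1 + e^ε), which satisfies ε-edge differential privacy for an undirected simple graph. This is a generic illustration of that mechanism, not the authors' code, and it omits the second stage described in the abstract (missing-data likelihood and MCMC fitting of the ERGM to the noisy graph).

```python
# Generic randomized-response edge perturbation (illustrative, not the paper's code).
import numpy as np

def randomized_response_graph(adj: np.ndarray, epsilon: float,
                              rng: np.random.Generator) -> np.ndarray:
    """Flip each potential edge independently with probability
    1 / (1 + exp(epsilon)); this satisfies epsilon-edge differential
    privacy for an undirected, simple graph."""
    n = adj.shape[0]
    flip_prob = 1.0 / (1.0 + np.exp(epsilon))
    # Decide flips on the upper triangle only, then mirror for symmetry.
    flips = np.triu(rng.random((n, n)) < flip_prob, k=1)
    noisy = adj.copy()
    noisy[flips] = 1 - noisy[flips]
    noisy = np.triu(noisy, k=1)
    return noisy + noisy.T

rng = np.random.default_rng(0)
adj = rng.integers(0, 2, size=(6, 6))   # toy graph
adj = np.triu(adj, 1)
adj = adj + adj.T
synthetic = randomized_response_graph(adj, epsilon=1.0, rng=rng)
```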

    Statistical Inference in a Directed Network Model with Covariates

    Networks are often characterized by node heterogeneity, whereby nodes exhibit different degrees of interaction, and by link homophily, whereby nodes sharing common features tend to associate with each other. In this paper, we propose a new directed network model that captures the former via node-specific parametrization and the latter by incorporating covariates. In particular, this model quantifies the extent of heterogeneity in terms of the outgoingness and incomingness of each node by different parameters, thus allowing the number of heterogeneity parameters to be twice the number of nodes. We study the maximum likelihood estimation of the model and establish the uniform consistency and asymptotic normality of the resulting estimators. Numerical studies demonstrate our theoretical findings and a data analysis confirms the usefulness of our model. Comment: 29 pages, minor revision
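
    One common parametrization consistent with this description (an assumption here, not necessarily the exact form used in the paper) models each directed edge i → j as an independent Bernoulli variable whose log-odds add an out-parameter for i, an in-parameter for j, and a covariate term. The sketch below simply evaluates that log-likelihood on toy data; the paper's contribution is the asymptotic theory for the maximum likelihood estimator, which is not reproduced here.

```python
# Log-likelihood of a covariate-adjusted directed degree-heterogeneity model (sketch).
import numpy as np

def log_likelihood(adj, alpha, beta, gamma, covariates):
    """Edge i -> j occurs independently with probability
    sigmoid(alpha_i + beta_j + gamma . z_ij), where alpha captures
    'outgoingness', beta 'incomingness', and z_ij edge covariates."""
    n = adj.shape[0]
    # Linear predictor for every ordered pair (i, j).
    eta = alpha[:, None] + beta[None, :] + covariates @ gamma
    p = 1.0 / (1.0 + np.exp(-eta))
    mask = ~np.eye(n, dtype=bool)            # exclude self-loops
    ll = adj * np.log(p) + (1 - adj) * np.log(1 - p)
    return ll[mask].sum()

rng = np.random.default_rng(1)
n, k = 5, 2
adj = rng.integers(0, 2, size=(n, n))
np.fill_diagonal(adj, 0)
z = rng.normal(size=(n, n, k))               # toy covariates z_ij
print(log_likelihood(adj, np.zeros(n), np.zeros(n), np.zeros(k), z))
```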

    Gravitational and electromagnetic fields of a charged tachyon

    An axially symmetric exact solution of the Einstein-Maxwell equations is obtained and is interpreted to give the gravitational and electromagnetic fields of a charged tachyon. Switching off the charge parameter yields the solution for the uncharged tachyon which was earlier obtained by Vaidya. The null surfaces for the charged tachyon are discussed. Comment: 8 pages, LaTeX, to appear in Pramana - J. Physics

    Model Selection for Degree-corrected Block Models

    The proliferation of models for networks raises challenging problems of model selection: the data are sparse and globally dependent, and models are typically high-dimensional and have large numbers of latent variables. Together, these issues mean that the usual model-selection criteria do not work properly for networks. We illustrate these challenges, and show one way to resolve them, by considering the key network-analysis problem of dividing a graph into communities or blocks of nodes with homogeneous patterns of links to the rest of the network. The standard tool for doing this is the stochastic block model, under which the probability of a link between two nodes is a function solely of the blocks to which they belong. This imposes a homogeneous degree distribution within each block; this can be unrealistic, so degree-corrected block models add a parameter for each node, modulating its overall degree. The choice between ordinary and degree-corrected block models matters because they make very different inferences about communities. We present the first principled and tractable approach to model selection between standard and degree-corrected block models, based on new large-graph asymptotics for the distribution of log-likelihood ratios under the stochastic block model, finding substantial departures from classical results for sparse graphs. We also develop linear-time approximations for log-likelihoods under both the stochastic block model and the degree-corrected model, using belief propagation. Applications to simulated and real networks show excellent agreement with our approximations. Our results thus both solve the practical problem of deciding on degree correction, and point to a general approach to model selection in network analysis.
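
    The raw ingredient of the comparison is the profile log-likelihood of each model for a given block assignment. The sketch below follows the standard Poisson formulation of Karrer and Newman (2011) for ordinary and degree-corrected block models; the paper's actual contributions (the null distribution of the log-likelihood ratio on sparse graphs and the belief-propagation approximations) are not reproduced here.

```python
# Profile log-likelihoods of the plain vs. degree-corrected block model (sketch).
import numpy as np

def profile_loglik(adj, labels, degree_corrected):
    """Poisson profile log-likelihood (up to model-independent constants)
    for a fixed block assignment, in the Karrer-Newman formulation."""
    blocks = np.unique(labels)
    # m[r, s]: sum of adjacency entries between blocks r and s
    # (within-block edges are counted twice, as in the usual convention).
    m = np.zeros((len(blocks), len(blocks)))
    for a, r in enumerate(blocks):
        for b, s in enumerate(blocks):
            m[a, b] = adj[np.ix_(labels == r, labels == s)].sum()
    if degree_corrected:
        factor = np.array([adj[labels == r].sum() for r in blocks])   # degree sums
    else:
        factor = np.array([(labels == r).sum() for r in blocks])      # block sizes
    denom = np.outer(factor, factor)
    with np.errstate(divide="ignore", invalid="ignore"):
        terms = np.where(m > 0, m * np.log(m / denom), 0.0)
    return terms.sum()

rng = np.random.default_rng(2)
n = 20
adj = rng.integers(0, 2, size=(n, n))
adj = np.triu(adj, 1)
adj = adj + adj.T
labels = np.repeat([0, 1], n // 2)
llr = profile_loglik(adj, labels, True) - profile_loglik(adj, labels, False)
print("log-likelihood ratio (degree-corrected vs. plain):", llr)
```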

    R. A. Fisher, design theory, and the Indian connection

    Design Theory, a branch of mathematics, was born out of the experimental statistics research of the population geneticist R. A. Fisher and of Indian mathematical statisticians in the 1930s. The field combines elements of combinatorics, finite projective geometries, Latin squares, and a variety of further mathematical structures, brought together in surprising ways. This essay presents these structures and ideas, as well as how the field came together, which is in itself an interesting story. Comment: 11 pages, 3 figures

    The harvest plot: A method for synthesising evidence about the differential effects of interventions

    Background: One attraction of meta-analysis is the forest plot, a compact overview of the essential data included in a systematic review and the overall 'result'. However, meta-analysis is not always suitable for synthesising evidence about the effects of interventions which may influence the wider determinants of health. As part of a systematic review of the effects of population-level tobacco control interventions on social inequalities in smoking, we designed a novel approach to synthesis intended to bring aspects of the graphical directness of a forest plot to bear on the problem of synthesising evidence from a complex and diverse group of studies.
    Methods: We coded the included studies (n = 85) on two methodological dimensions (suitability of study design and quality of execution) and extracted data on effects stratified by up to six different dimensions of inequality (income, occupation, education, gender, race or ethnicity, and age), distinguishing between 'hard' (behavioural) and 'intermediate' (process or attitudinal) outcomes. Adopting a hypothesis-testing approach, we then assessed which of three competing hypotheses (positive social gradient, negative social gradient, or no gradient) was best supported by each study for each dimension of inequality.
    Results: We plotted the results on a matrix ('harvest plot') for each category of intervention, weighting studies by the methodological criteria and distributing them between the competing hypotheses. These matrices formed part of the analytical process and helped to encapsulate the output, for example by drawing attention to the finding that increasing the price of tobacco products may be more effective in discouraging smoking among people with lower incomes and in lower occupational groups.
    Conclusion: The harvest plot is a novel and useful method for synthesising evidence about the differential effects of population-level interventions. It contributes to the challenge of making best use of all available evidence by incorporating all relevant data. The visual display assists both the process of synthesis and the assimilation of the findings. The method is suitable for adaptation to a variety of questions in evidence synthesis and may be particularly useful for systematic reviews addressing the broader type of research question which may be most relevant to policymakers.
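
    As a rough illustration of the plotting side of the method (with entirely made-up study weights, not data from the review), one row of a harvest plot can be drawn as a set of panels, one per competing hypothesis, in which each supporting study appears as a bar whose height reflects its methodological weight:

```python
# Illustrative harvest-plot row (fictional studies and weights).
import matplotlib.pyplot as plt

hypotheses = ["Positive gradient", "No gradient", "Negative gradient"]
# Methodological weights (e.g. 1-3) of the hypothetical studies supporting each hypothesis.
studies = {
    "Positive gradient": [3, 2],
    "No gradient": [2, 1, 1],
    "Negative gradient": [3],
}

fig, axes = plt.subplots(1, 3, sharey=True, figsize=(8, 2.5))
for ax, hyp in zip(axes, hypotheses):
    weights = studies[hyp]
    ax.bar(range(len(weights)), weights, width=0.6, color="grey")
    ax.set_title(hyp, fontsize=9)
    ax.set_xticks([])
    ax.set_ylim(0, 3)
axes[0].set_ylabel("Methodological weight")
fig.suptitle("Harvest plot row for one dimension of inequality (illustrative)")
plt.tight_layout()
plt.show()
```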