377 research outputs found
A review of statistical methods in the analysis of data arising from observer reliability studies (Part II) *
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/75734/1/j.1467-9574.1975.tb00259.x.pd
DNA methylation-based age prediction and telomere length in white blood cells and cumulus cells of infertile women with normal or poor response to ovarian stimulation.
An algorithm assessing the methylation levels of 353 informative CpG sites in the human genome permits accurate prediction of the chronologic age of a subject. Interestingly, when there is discrepancy between the predicted age and chronologic age (age acceleration or AgeAccel ), patients are at risk for morbidity and mortality. Identification of infertile patients at risk for accelerated reproductive senescence may permit preventative action. This study aimed to assess the accuracy of the epigenetic clock concept in reproductive age women undergoing fertility treatment by applying the age prediction algorithm in peripheral (white blood cells [WBCs]) and follicular somatic cells (cumulus cells [CCs]), and to identify whether women with premature reproductive aging (diminished ovarian reserve) were at risk of AgeAccel in their age prediction. Results indicated that the epigenetic algorithm accurately predicts age when applied to WBCs but not to CCs. The age prediction of CCs was substantially younger than chronologic age regardless of the patient\u27s age or response to stimulation. In addition, telomeres of CCs were significantly longer than that of WBCs. Our findings suggest that CCs do not demonstrate changes in methylome-predicted age or telomere-length in association with increasing female age or ovarian response to stimulation
Gold Standard Online Debates Summaries and First Experiments Towards Automatic Summarization of Online Debate Data
Usage of online textual media is steadily increasing. Daily, more and more
news stories, blog posts and scientific articles are added to the online
volumes. These are all freely accessible and have been employed extensively in
multiple research areas, e.g. automatic text summarization, information
retrieval, information extraction, etc. Meanwhile, online debate forums have
recently become popular, but have remained largely unexplored. For this reason,
there are no sufficient resources of annotated debate data available for
conducting research in this genre. In this paper, we collected and annotated
debate data for an automatic summarization task. Similar to extractive gold
standard summary generation our data contains sentences worthy to include into
a summary. Five human annotators performed this task. Inter-annotator
agreement, based on semantic similarity, is 36% for Cohen's kappa and 48% for
Krippendorff's alpha. Moreover, we also implement an extractive summarization
system for online debates and discuss prominent features for the task of
summarizing online debate data automatically.Comment: accepted and presented at the CICLING 2017 - 18th International
Conference on Intelligent Text Processing and Computational Linguistic
A comparative evaluation of two roadside brake testing procedures+
A field survey was conducted to evaluate two procedures designed to measure the effectiveness of motor vehicle braking systems. Selected failure rates and agreement measures were computed using a recently developed unified approach to the analysis of multivariate categorical data. It was found that the procedures agree only weakly, and that the agreement varied with certain pass/fail criteria. On the basis of conditional arguments, a moving-stopping test was found to be more stringent than a wheel removal inspection.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/22847/1/0000408.pd
A computer program for the generalized chi-square analysis of categorical data using weighted least squares (GENCAT)
GENCAT is a computer program which implements an extremely general methodology for the analysis of multivariate categorical data. This approach essentially involves the construction of test statistics for hypotheses involving functions of the observed proportions which are directed at the relationships under investigation and the estimation of corresponding model parameters via weighted least squares computations. Any compounded function of the observed proportions which can be formulated as a sequence of the following transformations of the data vector -- linear, logarithmic, exponential, or the addition of a vector of constants -- can be analyzed within this general framework. This algorithm produces minimum modified chi-square statistics which are obtained by partitioning the sums of squares as in ANOVA. The input data can be either: (a) frequencies from a multidimensional contingency table; (b) a vector of functions with its estimated covariance matrix; and (c) raw data in the form of integer-valued variables associated with each subject. The input format is completely flexible for the data as well as for the matrices.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/21627/1/0000006.pd
Natural Wormholes as Gravitational Lenses
Visser has suggested traversable 3-dimensional wormholes that could plausibly
form naturally during Big Bang inflation. A wormhole mouth embedded in high
mass density might accrete mass, giving the other mouth a net *negative* mass
of unusual gravitational properties. The lensing of such a gravitationally
negative anomalous compact halo object (GNACHO) will enhance background stars
with a time profile that is observable and qualitatively different from that
recently observed for massive compact halo objects (MACHOs) of positive mass.
We recommend that MACHO search data be analyzed for GNACHOs.Comment: 4 pages; plus 4 figures; ReV_TeX 3.0; DOE/ER/40537-001/NPL94-07-01
Whose Tweets are Surveilled for the Police: An Audit of Social-Media Monitoring Tool via Log Files
Social media monitoring by law enforcement is becoming commonplace, but
little is known about what software packages for it do. Through public records
requests, we obtained log files from the Corvallis (Oregon) Police Department's
use of social media monitoring software called DigitalStakeout. These log files
include the results of proprietary searches by DigitalStakeout that were
running over a period of 13 months and include 7240 social media posts. In this
paper, we focus on the Tweets logged in this data and consider the racial and
ethnic identity (through manual coding) of the users that are therein flagged
by DigitalStakeout. We observe differences in the demographics of the users
whose Tweets are flagged by DigitalStakeout compared to the demographics of the
Twitter users in the region, however, our sample size is too small to determine
significance. Further, the demographics of the Twitter users in the region do
not seem to reflect that of the residents of the region, with an apparent
higher representation of Black and Hispanic people. We also reconstruct the
keywords related to a Narcotics report set up by DigitalStakeout for the
Corvallis Police Department and find that these keywords flag Tweets unrelated
to narcotics or flag Tweets related to marijuana, a drug that is legal for
recreational use in Oregon. Almost all of the keywords have a common meaning
unrelated to narcotics (e.g.\ broken, snow, hop, high) that call into question
the utility that such a keyword based search could have to law enforcement.Comment: 21 Pages, 2 figures. To to be Published in FAT* 2020 Proceeding
- …