371 research outputs found

    A review of statistical methods in the analysis of data arising from observer reliability studies (Part II) *

    Full text link
    Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/75734/1/j.1467-9574.1975.tb00259.x.pd

    DNA methylation-based age prediction and telomere length in white blood cells and cumulus cells of infertile women with normal or poor response to ovarian stimulation.

    Get PDF
    An algorithm assessing the methylation levels of 353 informative CpG sites in the human genome permits accurate prediction of the chronologic age of a subject. Interestingly, when there is discrepancy between the predicted age and chronologic age (age acceleration or AgeAccel ), patients are at risk for morbidity and mortality. Identification of infertile patients at risk for accelerated reproductive senescence may permit preventative action. This study aimed to assess the accuracy of the epigenetic clock concept in reproductive age women undergoing fertility treatment by applying the age prediction algorithm in peripheral (white blood cells [WBCs]) and follicular somatic cells (cumulus cells [CCs]), and to identify whether women with premature reproductive aging (diminished ovarian reserve) were at risk of AgeAccel in their age prediction. Results indicated that the epigenetic algorithm accurately predicts age when applied to WBCs but not to CCs. The age prediction of CCs was substantially younger than chronologic age regardless of the patient\u27s age or response to stimulation. In addition, telomeres of CCs were significantly longer than that of WBCs. Our findings suggest that CCs do not demonstrate changes in methylome-predicted age or telomere-length in association with increasing female age or ovarian response to stimulation

    Gold Standard Online Debates Summaries and First Experiments Towards Automatic Summarization of Online Debate Data

    Full text link
    Usage of online textual media is steadily increasing. Daily, more and more news stories, blog posts and scientific articles are added to the online volumes. These are all freely accessible and have been employed extensively in multiple research areas, e.g. automatic text summarization, information retrieval, information extraction, etc. Meanwhile, online debate forums have recently become popular, but have remained largely unexplored. For this reason, there are no sufficient resources of annotated debate data available for conducting research in this genre. In this paper, we collected and annotated debate data for an automatic summarization task. Similar to extractive gold standard summary generation our data contains sentences worthy to include into a summary. Five human annotators performed this task. Inter-annotator agreement, based on semantic similarity, is 36% for Cohen's kappa and 48% for Krippendorff's alpha. Moreover, we also implement an extractive summarization system for online debates and discuss prominent features for the task of summarizing online debate data automatically.Comment: accepted and presented at the CICLING 2017 - 18th International Conference on Intelligent Text Processing and Computational Linguistic

    A comparative evaluation of two roadside brake testing procedures+

    Full text link
    A field survey was conducted to evaluate two procedures designed to measure the effectiveness of motor vehicle braking systems. Selected failure rates and agreement measures were computed using a recently developed unified approach to the analysis of multivariate categorical data. It was found that the procedures agree only weakly, and that the agreement varied with certain pass/fail criteria. On the basis of conditional arguments, a moving-stopping test was found to be more stringent than a wheel removal inspection.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/22847/1/0000408.pd

    A computer program for the generalized chi-square analysis of categorical data using weighted least squares (GENCAT)

    Full text link
    GENCAT is a computer program which implements an extremely general methodology for the analysis of multivariate categorical data. This approach essentially involves the construction of test statistics for hypotheses involving functions of the observed proportions which are directed at the relationships under investigation and the estimation of corresponding model parameters via weighted least squares computations. Any compounded function of the observed proportions which can be formulated as a sequence of the following transformations of the data vector -- linear, logarithmic, exponential, or the addition of a vector of constants -- can be analyzed within this general framework. This algorithm produces minimum modified chi-square statistics which are obtained by partitioning the sums of squares as in ANOVA. The input data can be either: (a) frequencies from a multidimensional contingency table; (b) a vector of functions with its estimated covariance matrix; and (c) raw data in the form of integer-valued variables associated with each subject. The input format is completely flexible for the data as well as for the matrices.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/21627/1/0000006.pd

    Natural Wormholes as Gravitational Lenses

    Get PDF
    Visser has suggested traversable 3-dimensional wormholes that could plausibly form naturally during Big Bang inflation. A wormhole mouth embedded in high mass density might accrete mass, giving the other mouth a net *negative* mass of unusual gravitational properties. The lensing of such a gravitationally negative anomalous compact halo object (GNACHO) will enhance background stars with a time profile that is observable and qualitatively different from that recently observed for massive compact halo objects (MACHOs) of positive mass. We recommend that MACHO search data be analyzed for GNACHOs.Comment: 4 pages; plus 4 figures; ReV_TeX 3.0; DOE/ER/40537-001/NPL94-07-01

    Whose Tweets are Surveilled for the Police: An Audit of Social-Media Monitoring Tool via Log Files

    Full text link
    Social media monitoring by law enforcement is becoming commonplace, but little is known about what software packages for it do. Through public records requests, we obtained log files from the Corvallis (Oregon) Police Department's use of social media monitoring software called DigitalStakeout. These log files include the results of proprietary searches by DigitalStakeout that were running over a period of 13 months and include 7240 social media posts. In this paper, we focus on the Tweets logged in this data and consider the racial and ethnic identity (through manual coding) of the users that are therein flagged by DigitalStakeout. We observe differences in the demographics of the users whose Tweets are flagged by DigitalStakeout compared to the demographics of the Twitter users in the region, however, our sample size is too small to determine significance. Further, the demographics of the Twitter users in the region do not seem to reflect that of the residents of the region, with an apparent higher representation of Black and Hispanic people. We also reconstruct the keywords related to a Narcotics report set up by DigitalStakeout for the Corvallis Police Department and find that these keywords flag Tweets unrelated to narcotics or flag Tweets related to marijuana, a drug that is legal for recreational use in Oregon. Almost all of the keywords have a common meaning unrelated to narcotics (e.g.\ broken, snow, hop, high) that call into question the utility that such a keyword based search could have to law enforcement.Comment: 21 Pages, 2 figures. To to be Published in FAT* 2020 Proceeding
    • …
    corecore