250 research outputs found

    Quantitative Analysis of Genealogy Using Digitised Family Trees

    Full text link
    Driven by the popularity of television shows such as Who Do You Think You Are? many millions of users have uploaded their family tree to web projects such as WikiTree. Analysis of this corpus enables us to investigate genealogy computationally. The study of heritage in the social sciences has led to an increased understanding of ancestry and descent but such efforts are hampered by difficult to access data. Genealogical research is typically a tedious process involving trawling through sources such as birth and death certificates, wills, letters and land deeds. Decades of research have developed and examined hypotheses on population sex ratios, marriage trends, fertility, lifespan, and the frequency of twins and triplets. These can now be tested on vast datasets containing many billions of entries using machine learning tools. Here we survey the use of genealogy data mining using family trees dating back centuries and featuring profiles on nearly 7 million individuals based in over 160 countries. These data are not typically created by trained genealogists and so we verify them with reference to third party censuses. We present results on a range of aspects of population dynamics. Our approach extends the boundaries of genealogy inquiry to precise measurement of underlying human phenomena

    Using data science to understand the film industry’s gender gap

    Get PDF
    Data science can offer answers to a wide range of social science questions. Here we turn attention to the portrayal of women in movies, an industry that has a significant influence on society, impacting such aspects of life as self-esteem and career choice. To this end, we fused data from the online movie database IMDb with a dataset of movie dialogue subtitles to create the largest available corpus of movie social networks (15,540 networks). Analyzing this data, we investigated gender bias in on-screen female characters over the past century. We find a trend of improvement in all aspects of women's roles in movies, including a constant rise in the centrality of female characters. There has also been an increase in the number of movies that pass the well-known Bechdel test, a popular-albeit flawed-measure of women in fiction. Here we propose a new and better alternative to this test for evaluating female roles in movies. Our study introduces fresh data, an open-code framework, and novel techniques that present new opportunities in the research and analysis of movies

    Behavioral clusters and coronary heart disease risk

    Get PDF
    The purpose of the present study was to empirically identify individuals who differed in their patterns of components derived from the structured interview (SI), and to evaluate whether individuals characterized by the different patterns varied in terms of their risk for coronary heart disease (CHD). The present study represents a reanalysis of data from the Western Collaborative Group Study in which components of Type A were individually related to risk for CHD. Subgroups of individuals who differed in the patterns of their component scores were identified by means of cluster analytic techniques and were found to vary in their risk of CHD. As expected, a pattern of characteristics in which hostility was salient was found to be predictive of CHD. Moreover, another pattern of characteristics that appears to reflect pressured, controlling, socially dominant behavior in which hostility was not salient also was found to be predictive of CHD. Further, two patterns of characteristics were identified that were unrelated to CHD risk. Finally, two patterns of characteristics were identified that were related to reduced risk of CHD. Overall, these results suggest that future research should investigate variables in addition to hostility in regard to risk for and protection from CHD

    The DREAM complex promotes gene body H2A.Z for target repression.

    Get PDF
    The DREAM (DP, Retinoblastoma [Rb]-like, E2F, and MuvB) complex controls cellular quiescence by repressing cell cycle genes, but its mechanism of action is poorly understood. Here we show that Caenorhabditis elegans DREAM targets have an unusual pattern of high gene body HTZ-1/H2A.Z. In mutants of lin-35, the sole p130/Rb-like gene in C. elegans, DREAM targets have reduced gene body HTZ-1/H2A.Z and increased expression. Consistent with a repressive role for gene body H2A.Z, many DREAM targets are up-regulated in htz-1/H2A.Z mutants. Our results indicate that the DREAM complex facilitates high gene body HTZ-1/H2A.Z, which plays a role in target gene repression.We are grateful to D. Fay for providing the 5× outcrossed lin-35 strain, and Robert Horvitz for antibodies. I.L., M.A.C., P.S., A.A., and J.A. were supported by Wellcome Trust Senior Research Fellowships to J.A. (054523 and 101863). J.A. also acknowledges support by core funding from the Wellcome Trust and Cancer Research UK. J.M.G. and S.S. were supported by National Institutes of Health (NIH) R01 grant GM34059. Part of this work was supported by NIH National Human Genome Research Institute (NHGRI) grant U01 HG004270 to the modENCODE consortium headed by J.D. Lieb.This is the final version of the article. It first appeared from CSH Press via http://dx.doi.org/10.1101/gad.255810.11

    The valuation of clean spread options: linking electricity, emissions and fuels

    Get PDF
    The purpose of the paper is to present a new pricing method for clean spread options, and to illustrate its main features on a set of numerical examples produced by a dedicated computer code. The novelty of the approach is embedded in the use of a structural model as opposed to reduced-form models which fail to capture properly the fundamental dependencies between the economic factors entering the production process

    The RIVUR Voiding Cystourethrogram Pilot Study: Experience with Radiologic Reading Concordance

    Get PDF
    Published cohorts of children with vesicoureteral reflux placed on antibiotic prophylaxis differ in baseline characteristics and methodology. These data have been combined in meta-analyses to derive treatment recommendations. We analyzed these cohorts in an attempt to understand the disparate outcomes reported
    corecore