    Bridging the demand and the offer in data science

    During the last several years, we have observed an exponential increase in the demand for Data Scientists in the job market. As a result, a number of trainings, courses, books, and university educational programs (at undergraduate, graduate, and postgraduate levels) have been labeled as “Big data” or “Data Science”; the fil‐rouge of each of them is the aim of training people with the right competencies and skills to satisfy the needs of the business sector. In this paper, we report on some of the exercises done in analyzing the current Data Science education offer and matching it with the needs of the job market to propose a scalable matching service, i.e., COmpetencies ClassificatiOn (E‐CO‐2), based on Data Science techniques. The E‐CO‐2 service can help to extract relevant information from Data Science–related documents (course descriptions, job ads, blogs, or papers), which enables the comparison of the demand and offer in the field of Data Science Education and HR management, ultimately helping to establish the profession of Data Scientist.

    The role of mutation rate variation and genetic diversity in the architecture of human disease

    Background: We have investigated the role that the mutation rate and the structure of genetic variation at a locus play in determining whether a gene is involved in disease. We predict that the mutation rate and its genetic diversity should be higher in genes associated with disease, unless all genes that could cause disease have already been identified. Results: Consistent with our predictions, we find that genes associated with Mendelian and complex disease are substantially longer than non-disease genes. However, we find that both Mendelian and complex disease genes are found in regions of the genome with relatively low mutation rates, as inferred from intron divergence between humans and chimpanzees, and they are predicted to have similar rates of non-synonymous mutation as other genes. Finally, we find that disease genes are in regions of significantly elevated genetic diversity, even when variation in the rate of mutation is controlled for. The effect is nevertheless small. Conclusions: Our results suggest that gene length contributes to whether a gene is associated with disease. However, the mutation rate and the genetic architecture of the locus appear to play only a minor role in determining whether a gene is associated with disease.

    Haptoglobin Phenotype, Preeclampsia Risk and the Efficacy of Vitamin C and E Supplementation to Prevent Preeclampsia in a Racially Diverse Population

    Haptoglobin's (Hp) antioxidant and pro-angiogenic properties differ between the 1-1, 2-1, and 2-2 phenotypes. Hp phenotype affects cardiovascular disease risk and treatment response to antioxidant vitamins in some non-pregnant populations. We previously demonstrated that preeclampsia risk was doubled in white Hp 2-1 women, compared to Hp 1-1 women. Our objectives were to determine whether we could reproduce this finding in a larger cohort, and to determine whether Hp phenotype influences lack of efficacy of antioxidant vitamins in preventing preeclampsia and serious complications of pregnancy-associated hypertension (PAH). This is a secondary analysis of a randomized controlled trial in which 10,154 low-risk women received daily vitamin C and E, or placebo, from 9-16 weeks gestation until delivery. Hp phenotype was determined in the study prediction cohort (n = 2,393) and a case-control cohort (703 cases, 1,406 controls). The primary outcome was severe PAH, or mild or severe PAH with elevated liver enzymes, elevated serum creatinine, thrombocytopenia, eclampsia, fetal growth restriction, medically indicated preterm birth or perinatal death. Preeclampsia was a secondary outcome. Odds ratios were estimated by logistic regression. Sampling weights were used to reduce bias from an overrepresentation of women with preeclampsia or the primary outcome. There was no relationship between Hp phenotype and the primary outcome or preeclampsia in Hispanic, white/other or black women. Vitamin supplementation did not reduce the risk of the primary outcome or preeclampsia in women of any phenotype. Supplementation increased preeclampsia risk (odds ratio 3.30; 95% confidence interval 1.61-6.82, p<0.01) in Hispanic Hp 2-2 women. Hp phenotype does not influence preeclampsia risk, or identify a subset of women who may benefit from vitamin C and E supplementation to prevent preeclampsia

    The Achievement of a Decentralized Water Management Through Stakeholder Participation: An Example from the Drôme River Catchment Area in France (1981–2008)

    Different water Acts (e.g., the European Water Framework Directive) and stakeholders involved in aquatic affairs have promoted integrated river basin management (IRBM) over recent decades. However, few studies have provided feedback on these policies. The aim of the current article is to fill this gap by exploring how local newspapers reflect the implementation of a broad public participation within a catchment of France known for its innovation with regard to this domain. The media coverage of a water management strategy in the Drôme watershed from 1981 to 2008 was investigated using a content analysis and a geographic information system (GIS). We sought to determine what public participation and decentralized decision-making can be in practice. The results showed that this policy was integrated because of its social perspective, the high number of involved stakeholders, the willingness to handle water issues, and the local scale suitable for participation. We emphasized the prominence of the watershed scale guaranteed by the local water authority. This area was also characterized by compromise, arrangements, and power dynamics on a fine scale. We examined the most politically engaged writings regarding water management, which topics each group emphasized, and how the groups agreed and disagreed on issues based on their values and context. The temporal pattern of participation implementation was progressive but worked by fits and starts.

    Identifying Selected Regions from Heterozygosity and Divergence Using a Light-Coverage Genomic Dataset from Two Human Populations

    When a selective sweep occurs in the chromosomal region around a target gene in two populations that have recently separated, it produces three dramatic genomic consequences: 1) decreased multi-locus heterozygosity in the region; 2) elevated or diminished genetic divergence (FST) of multiple polymorphic variants adjacent to the selected locus between the divergent populations, due to the alternative fixation of alleles; and 3) a consequent regional increase in the variance of FST (S2FST) for the same clustered variants, due to the increased alternative fixation of alleles in the loci surrounding the selection target. In the first part of our study, to search for potential targets of directional selection, we developed and validated a resampling-based computational approach; we then scanned an array of 31 different-sized moving windows of SNP variants (5–65 SNPs) across the human genome in a set of European and African American population samples with 183,997 SNP loci after correcting for the recombination rate variation. The analysis revealed 180 regions of recent selection with very strong evidence in either population or both. In the second part of our study, we compared the newly discovered putative regions to those sites previously postulated in the literature, using methods based on inspecting patterns of linkage disequilibrium, population divergence and other methodologies. The newly found regions were cross-validated with those found in nine other studies that have searched for selection signals. Our study was replicated especially well in those regions confirmed by three or more studies. These validated regions were independently verified, using a combination of different methods and different databases in other studies, and should include fewer false positives. 
    The main strength of our analysis method compared to others is that it does not require dense genotyping and therefore can be used with data from population-based genome SNP scans from smaller studies of humans or other species.

    Mystery Solved: The Identification of the Two Missing Romanov Children Using DNA Analysis

    One of the greatest mysteries for most of the twentieth century was the fate of the Romanov family, the last Russian monarchy. Following the abdication of Tsar Nicholas II, he and his wife, Alexandra, and their five children were eventually exiled to the city of Yekaterinburg. The family, along with four loyal members of their staff, was held captive by members of the Ural Soviet. According to historical reports, in the early morning hours of July 17, 1918, the entire family along with four loyal members of their staff was executed by a firing squad. After a failed attempt to dispose of the remains in an abandoned mine shaft, the bodies were transported to an open field only a few kilometers from the mine shaft. Nine members of the group were buried in one mass grave while two of the children were buried in a separate grave. Despite the official discovery of the larger mass grave in 1991 and subsequent DNA testing to confirm the identities of the Tsar, the Tsarina, and three of their daughters, doubt persisted that these remains were in fact those of the Romanov family. In the summer of 2007, a group of amateur archeologists discovered a collection of remains from the second grave approximately 70 meters from the larger grave. We report forensic DNA testing on the remains discovered in 2007 using mitochondrial DNA (mtDNA), autosomal STR, and Y-STR testing. Combined with additional DNA testing of material from the 1991 grave, we have virtually irrefutable evidence that the two individuals recovered from the 2007 grave are the two missing children of the Romanov family: the Tsarevich Alexei and one of his sisters.

    A Potential Role for Bat Tail Membranes in Flight Control

    Wind tunnel tests conducted on a model based on the long-eared bat Plecotus auritus indicated that the positioning of the tail membrane (uropatagium) can significantly influence flight control. Adjusting tail position by increasing the angle of the legs ventrally relative to the body has a two-fold effect: increasing leg-induced wing camber (i.e., locally increased camber of the inner wing surface) and increasing the angle of attack of the tail membrane. We also used our model to examine the effects of flying with and without a tail membrane. For the bat model with a tail membrane, increasing leg angle increased the lift, drag and pitching moment (nose-down) produced. However, removing the tail membrane significantly reduced the change in pitching moment with increasing leg angle, but it had no significant effect on the level of lift produced. The drag on the model also significantly increased with the removal of the tail membrane. The tail membrane, therefore, is potentially important for controlling the level of pitching moment produced by bats and an aid to flight control, specifically improving agility and manoeuvrability. Although the tail of bats is different from that of birds, in that it is only divided from the wings by the legs, it may nonetheless, in addition to its prey-capturing function, fulfil a similar role in aiding flight control.

    A high resolution genome-wide scan for significant selective sweeps: an application to pooled sequence data in laying chickens

    In most studies aimed at localizing footprints of past selection, outliers at tails of the empirical distribution of a given test statistic are assumed to reflect locus-specific selective forces. Significance cutoffs are subjectively determined, rather than being related to a clear set of hypotheses. Here, we define an empirical p-value for the summary statistic by means of a permutation method that uses the observed SNP structure in the real data. To illustrate the methodology, we applied our approach to a panel of 2.9 million autosomal SNPs identified from re-sequencing a pool of 15 individuals from a brown egg layer line. We scanned the genome for local reductions in heterozygosity, suggestive of selective sweeps. We also employed a modified sliding window approach that accounts for gaps in the sequence and increases scanning resolution by moving the overlapping windows by steps of one SNP only, and we suggest calling this a "creeping window" strategy. The approach confirmed selective sweeps in the region of previously described candidate genes, i.e. TSHR, PRL, PRLHR, INSR, LEPR, IGF1, and NRAMP1, when used as positive controls. The genome scan revealed 82 distinct regions with strong evidence of selection (genome-wide p-value<0.001), including genes known to be associated with eggshell structure and immune system such as CALB1 and GAL cluster, respectively. A substantial proportion of signals was found in gene-poor regions, including the most extreme signal on chromosome 1. The observation of multiple signals in a highly selected layer line of chicken is consistent with the hypothesis that egg production is a complex trait controlled by many genes.
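
    The window scan and permutation-based p-value described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the window statistic (mean per-SNP heterozygosity), and the null model (shuffling heterozygosity values across SNP positions) are assumptions chosen for illustration, and the handling of sequence gaps mentioned in the abstract is left out.

```python
import random
from bisect import bisect_right


def creeping_window_means(values, window):
    """Mean of each run of `window` consecutive SNPs, advancing one SNP per step."""
    if window <= 0 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    total = sum(values[:window])
    means = [total / window]
    for i in range(window, len(values)):
        # Slide by one SNP: add the entering value, drop the leaving one.
        total += values[i] - values[i - window]
        means.append(total / window)
    return means


def empirical_p_values(values, window, n_perm=200, seed=0):
    """Empirical p-value per window under an assumed shuffling null.

    For each observed window, the p-value is the share of permuted windows
    whose mean is at least as low, with a +1 correction so it is never zero.
    """
    rng = random.Random(seed)
    observed = creeping_window_means(values, window)
    shuffled = list(values)
    null = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # hypothetical null: positions carry no signal
        null.extend(creeping_window_means(shuffled, window))
    null.sort()
    return [(bisect_right(null, obs) + 1) / (len(null) + 1) for obs in observed]


if __name__ == "__main__":
    # Toy data: a stretch of five SNPs with zero heterozygosity.
    het = [0.5] * 20 + [0.0] * 5 + [0.5] * 20
    p = empirical_p_values(het, window=5)
    print(min(p), p.index(min(p)))
```

    In this toy input, the window starting at index 20 covers the five zero-heterozygosity SNPs and therefore receives the smallest p-value. A real scan would need a null model that preserves the SNP structure of the data, as the abstract states.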

    Why Do Dolphins Carry Sponges?

    Tool use is rare in wild animals, but of widespread interest because of its relationship to animal cognition, social learning and culture. Despite such attention, quantifying the costs and benefits of tool use has been difficult, largely because if tool use occurs, all population members typically exhibit the behavior. In Shark Bay, Australia, only a subset of the bottlenose dolphin population uses marine sponges as tools, providing an opportunity to assess both proximate and ultimate costs and benefits and document patterns of transmission. We compared sponge-carrying (sponger) females to non-sponge-carrying (non-sponger) females and show that spongers were more solitary, spent more time in deep water channel habitats, dived for longer durations, and devoted more time to foraging than non-spongers; and, even with these potential proximate costs, calving success of sponger females was not significantly different from non-spongers. We also show a clear female-bias in the ontogeny of sponging. With a solitary lifestyle, specialization, and high foraging demands, spongers used tools more than any non-human animal. We suggest that the ecological, social, and developmental mechanisms involved likely (1) help explain the high intrapopulation variation in female behaviour, (2) indicate tradeoffs (e.g., time allocation) between ecological and social factors and, (3) constrain the spread of this innovation to primarily vertical transmission

    Epistasis between COMT and MTHFR in Maternal-Fetal Dyads Increases Risk for Preeclampsia

    Preeclampsia is a leading cause of perinatal morbidity and mortality. This disorder is thought to be multifactorial in origin, with multiple genes, environmental and social factors, contributing to disease. One proposed mechanism is placental hypoxia-driven imbalances in angiogenic and anti-angiogenic factors, causing endothelial cell dysfunction. Catechol-O-methyltransferase (Comt)-deficient pregnant mice have a preeclampsia phenotype that is reversed by exogenous 2-methoxyestradiol (2-ME), an estrogen metabolite generated by COMT. 2-ME inhibits Hypoxia Inducible Factor 1α, a transcription factor mediating hypoxic responses. COMT has been shown to interact with methylenetetrahydrofolate reductase (MTHFR), which modulates the availability of S-adenosylmethionine (SAM), a COMT cofactor. Variations in MTHFR have been associated with preeclampsia. By accounting for allelic variation in both genes, the role of COMT has been clarified. COMT allelic variation is linked to enzyme activity and four single nucleotide polymorphisms (SNPs) (rs6269, rs4633, rs4680, and rs4818) form haplotypes that characterize COMT activity. We tested for association between COMT haplotypes and the MTHFR 677 C→T polymorphism and preeclampsia risk in 1103 Chilean maternal-fetal dyads. The maternal ACCG COMT haplotype was associated with reduced risk for preeclampsia (P = 0.004), and that risk increased linearly from low to high activity haplotypes (P = 0.003). In fetal samples, we found that the fetal ATCA COMT haplotype and the fetal MTHFR minor “T” allele interact to increase preeclampsia risk (p = 0.022). We found a higher than expected number of patients with preeclampsia with both the fetal risk alleles alone (P = 0.052) and the fetal risk alleles in combination with a maternal balancing allele (P<0.001). This non-random distribution was not observed in controls (P = 0.341 and P = 0.219, respectively). 
    Our findings demonstrate a role for both maternal and fetal COMT in preeclampsia and highlight the importance of including allelic variation in MTHFR.