166 research outputs found

    Towards certain fixes with editing rules and master data

    Get PDF
    A variety of integrity constraints have been studied for data cleaning. While these constraints can detect the presence of errors, they fall short of guiding us to correct the errors. Indeed, data repairing based on these constraints may notfind certain fixes that are absolutely correct, and worse, may introduce new errors when repairing the data. We propose a method for finding certain fixes, based on master data, a notion of certain regions, and a class of editing rules. A certain region is a set of attributes that are assured correct by the users. Given a certain region and master data, editing rules tell us what attributes to fix and how to update them. We show how the method can be used in data monitoring and enrichment. We develop techniques for reasoning about editing rules, to decide whether they lead to a unique fix and whether they are able to fix all the attributes in a tuple, relative to master data and a certain region. We also provide an algorithm to identify minimal certain regions, such that a certain fix is warranted by editing rules and master data as long as one of the regions is correct. We experimentally verify the effectiveness and scalability of the algorithm

    FGF receptor genes and breast cancer susceptibility: results from the Breast Cancer Association Consortium

    Get PDF
    Background:Breast cancer is one of the most common malignancies in women. Genome-wide association studies have identified FGFR2 as a breast cancer susceptibility gene. Common variation in other fibroblast growth factor (FGF) receptors might also modify risk. We tested this hypothesis by studying genotyped single-nucleotide polymorphisms (SNPs) and imputed SNPs in FGFR1, FGFR3, FGFR4 and FGFRL1 in the Breast Cancer Association Consortium. Methods:Data were combined from 49 studies, including 53 835 cases and 50 156 controls, of which 89 050 (46 450 cases and 42 600 controls) were of European ancestry, 12 893 (6269 cases and 6624 controls) of Asian and 2048 (1116 cases and 932 controls) of African ancestry. Associations with risk of breast cancer, overall and by disease sub-type, were assessed using unconditional logistic regression. Results:Little evidence of association with breast cancer risk was observed for SNPs in the FGF receptor genes. The strongest evidence in European women was for rs743682 in FGFR3; the estimated per-allele odds ratio was 1.05 (95 confidence interval=1.02-1.09, P=0.0020), which is substantially lower than that observed for SNPs in FGFR2. Conclusion:Our results suggest that common variants in the other FGF receptors are not associated with risk of breast cancer to the degree observed for FGFR2. © 2014 Cancer Research UK

    Landscape-scale drivers of pollinator communities may depend on land-use configuration

    Get PDF
    Research into pollinators in managed landscapes has recently combined approaches of pollination ecology and landscape ecology, because key stressors are likely to interact across wide areas. While laboratory and field experiments are valuable for furthering understanding, studies are required to investigate the interacting drivers of pollinator health and diversity across a broader range of landscapes and a wider array of taxa. Here, we use a network of 96 study landscapes in six topographically diverse regions of Britain, to test the combined importance of honeybee density, insecticide loadings, floral resource availability and habitat diversity to pollinator communities. We also explore the interactions between these drivers and the cover and proximity of semi-natural habitat. We found that among our four drivers, only honeybee density was positively related to wild pollinator abundance and diversity, and the positive association between abundance and floral resources depended on insecticide loadings and habitat diversity. By contrast, our exploratory models including habitat composition metrics revealed a complex suite of interactive effects. These results demonstrate that improving pollinator community composition and health is unlikely to be achieved with general resource enhancements only. Rather, local land-use context should be considered in fine-tuning pollinator management and conservation

    The XMM Cluster Survey: Exploring scaling relations and completeness of the Dark Energy Survey Year 3 redMaPPer cluster catalogue

    Get PDF
    We cross-match and compare characteristics of galaxy clusters identified in observations from two sky surveys using two completely different techniques. One sample is optically selected from the analysis of three years of Dark Energy Survey observations using the redMaPPer cluster detection algorithm. The second is X-ray selected from XMM observations analysed by the XMM Cluster Survey. The samples comprise a total area of 57.4 deg2^2, bounded by the area of 4 contiguous XMM survey regions that overlap the DES footprint. We find that the X-ray selected sample is fully matched with entries in the redMaPPer catalogue, above λ>\lambda>20 and within 0.1<z<< z <0.9. Conversely, only 38\% of the redMaPPer catalogue is matched to an X-ray extended source. Next, using 120 optically clusters and 184 X-ray selected clusters, we investigate the form of the X-ray luminosity-temperature (LXTXL_{X}-T_{X}), luminosity-richness (LXλL_{X}-\lambda) and temperature-richness (TXλT_{X}-\lambda) scaling relations. We find that the fitted forms of the LXTXL_{X}-T_{X} relations are consistent between the two selection methods and also with other studies in the literature. However, we find tentative evidence for a steepening of the slope of the relation for low richness systems in the X-ray selected sample. When considering the scaling of richness with X-ray properties, we again find consistency in the relations (i.e., LXλL_{X}-\lambda and TXλT_{X}-\lambda) between the optical and X-ray selected samples. This is contrary to previous similar works that find a significant increase in the scatter of the luminosity scaling relation for X-ray selected samples compared to optically selected samples.Comment: Accepted for publication to MNRA

    A new flowering time gene on wheat chromosome 3B characterization and genetic mapping

    Get PDF
    Genes that alter disease risk only in combination with certain environmental exposures may not be detected in genetic association analysis. By using methods accounting for gene-environment (G x E) interaction, we aimed to identify novel genetic loci associated with breast cancer risk. Up to 34,475 cases and 34,786 controls of European ancestry from up to 23 studies in the Breast Cancer Association Consortium were included. Overall, 71,527 single nucleotide polymorphisms (SNPs), enriched for association with breast cancer, were tested for interaction with 10 environmental risk factors using three recently proposed hybrid methods and a joint test of association and interaction. Analyses were adjusted for age, study, population stratification, and confounding factors as applicable. Three SNPs in two independent loci showed statistically significant association: SNPs rs10483028 and rs2242714 in perfect linkage disequilibrium on chromosome 21 and rs12197388 in ARID1B on chromosome 6. While rs12197388 was identified using the joint test with parity and with age at menarche (P-values = 3 x 10(-07)), the variants on chromosome 21 q22.12, which showed interaction with adult body mass index (BMI) in 8,891 postmenopausal women, were identified by all methods applied. SNP rs10483028 was associated with breast cancer in women with a BMI below 25 kg/m(2) (OR = 1.26, 95% CI 1.15-1.38) but not in women with a BMI of 30 kg/m(2) or higher (OR = 0.89, 95% CI 0.72-1.11, P for interaction = 3.2 x 10(-05)). Our findings confirm comparable power of the recent methods for detecting G x E interaction and the utility of using G x E interaction analyses to identify new susceptibility loci

    Meta-analysis of genome-wide association studies identifies common susceptibility polymorphisms for colorectal and endometrial cancer near SH2B3 and TSHZ1

    Get PDF
    High-risk mutations in several genes predispose to both colorectal cancer (CRC) and endometrial cancer (EC). We therefore hypothesised that some lower-risk genetic variants might also predispose to both CRC and EC. Using CRC and EC genome-wide association series, totalling 13,265 cancer cases and 40,245 controls, we found that the protective allele [G] at one previously-identified CRC polymorphism, rs2736100 near TERT, was associated with EC risk (odds ratio (OR) = 1.08, P = 0.000167); this polymorphism influences the risk of several other cancers. A further CRC polymorphism near TERC also showed evidence of association with EC (OR = 0.92; P = 0.03). Overall, however, there was no good evidence that the set of CRC polymorphisms was associated with EC risk, and neither of two previously-reported EC polymorphisms was associated with CRC risk. A combined analysis revealed one genome-wide significant polymorphism, rs3184504, on chromosome 12q24 (OR = 1.10, P = 7.23 × 10−9) with shared effects on CRC and EC risk. This polymorphism, a missense variant in the gene SH2B3, is also associated with haematological and autoimmune disorders, suggesting that it influences cancer risk through the immune response. Another polymorphism, rs12970291 near gene TSHZ1, was associated with both CRC and EC (OR = 1.26, P = 4.82 × 10−8), with the alleles showing opposite effects on the risks of the two cancers

    The Atacama Cosmology Telescope: A Catalog of >4000 Sunyaev–Zel’dovich Galaxy Clusters

    Get PDF
    We present a catalog of 4195 optically confirmed Sunyaev–Zel'dovich (SZ) selected galaxy clusters detected with signal-to-noise ratio >4 in 13,211 deg2 of sky surveyed by the Atacama Cosmology Telescope (ACT). Cluster candidates were selected by applying a multifrequency matched filter to 98 and 150 GHz maps constructed from ACT observations obtained from 2008 to 2018 and confirmed using deep, wide-area optical surveys. The clusters span the redshift range 0.04 1 clusters, and a total of 868 systems are new discoveries. Assuming an SZ signal versus mass-scaling relation calibrated from X-ray observations, the sample has a 90% completeness mass limit of M500c > 3.8 × 1014 M⊙, evaluated at z = 0.5, for clusters detected at signal-to-noise ratio >5 in maps filtered at an angular scale of 2farcm4. The survey has a large overlap with deep optical weak-lensing surveys that are being used to calibrate the SZ signal mass-scaling relation, such as the Dark Energy Survey (4566 deg2), the Hyper Suprime-Cam Subaru Strategic Program (469 deg2), and the Kilo Degree Survey (825 deg2). We highlight some noteworthy objects in the sample, including potentially projected systems, clusters with strong lensing features, clusters with active central galaxies or star formation, and systems of multiple clusters that may be physically associated. The cluster catalog will be a useful resource for future cosmological analyses and studying the evolution of the intracluster medium and galaxies in massive clusters over the past 10 Gyr

    Refined histopathological predictors of BRCA1 and BRCA2 mutation status: A large-scale analysis of breast cancer characteristics from the BCAC, CIMBA, and ENIGMA consortia

    Get PDF
    Introduction: The distribution of histopathological features of invasive breast tumors in BRCA1 or BRCA2 germline mutation carriers differs from that of individuals with no known mutation. Histopathological features thus have utility for mutation prediction, including statistical modeling to assess pathogenicity of BRCA1 or BRCA2 variants of uncertain clinical significance. We analyzed large pathology datasets accrued by the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA) and the Breast Cancer Association Consortium (BCAC) to reassess histopathological predictors of BRCA1 and BRCA2 mutation status, and provide robust likelihood ratio (LR) estimates for statistical modeling. Methods: Selection criteria for study/center inclusion were estrogen receptor (ER) status or grade data available for invasive breast cancer diagnosed younger than 70 years. The dataset included 4,477 BRCA1 mutation carriers, 2,565 BRCA2 mutation carriers, and 47,565 BCAC breast cancer cases. Country-stratified estimates of the
    corecore