1,083 research outputs found

    A Carotene Precursor: Its Proposed Structure and Place in Biosynthetic Sequence

    Full text link

    SNPpy - Database Management for SNP Data from Genome Wide Association Studies

    Get PDF
    Background: We describe SNPpy, a hybrid script database system using the Python SQLAlchemy library coupled with the PostgreSQL database to manage genotype data from Genome-Wide Association Studies (GWAS). This system makes it possible to merge study data with HapMap data and merge across studies for meta-analyses, including data filtering based on the values of phenotype and Single-Nucleotide Polymorphism (SNP) data. SNPpy and its dependencies are open source software. Results: The current version of SNPpy offers utility functions to import genotype and annotation data from two commercial platforms. We use these to import data from two GWAS studies and the HapMap Project. We then export these individual datasets to standard data format files that can be imported into statistical software for downstream analyses. Conclusions: By leveraging the power of relational databases, SNPpy offers integrated management and manipulation of genotype and phenotype data from GWAS studies. The analysis of these studies requires merging across GWAS datasets as well as patient and marker selection. To this end, SNPpy enables the user to filter the data and output the results as standardized GWAS file formats. It does low level and flexible data validation, including validation of patient data. SNPpy is

    How the extinction of extragalactic background light affects surface photometry of galaxies, groups and clusters

    Full text link
    The faint regions of galaxies, groups and clusters hold important clues about how these objects formed, and surface photometry at optical and near-infrared wavelengths represents a powerful tool for studying such structures. Here, we identify a hitherto unrecognized problem with this technique, related to how the night sky flux is typically measured and subtracted from astronomical images. While most of the sky flux comes from regions between the observer and the target object, a small fraction - the extragalactic background light (EBL) - comes from behind. We argue that since this part of the sky flux can be subject to extinction by dust present in the galaxy/group/cluster studied, standard reduction procedures may lead to a systematic oversubtraction of the EBL. Even very small amounts of extinction can lead to spurious features in radial surface surface brightness profiles and colour maps of extended objects. We assess the likely impact of this effect on a number of topics in extragalactic astronomy where very deep surface photometry is currently attempted, including studies of stellar halos, starburst host galaxies, disc truncations and diffuse intragroup/intracluster light. We argue that EBL extinction may provide at least a partial explanation for the anomalously red colours reported for the halos of disc galaxies and the hosts of local starburst galaxies. EBL extinction effects also mimic truncations in discs with unusually high dust opacities, but are unlikely to be the cause of such features in general. Failure to account for EBL extinction can also give rise to a non-negligible underestimate of intragroup and intracluster light at the faintest surface brightness levels currently probed. (Abridged)Comment: 15 pages, 10 figures, accepted for publication in MNRA

    Brain activation during cognitive planning in twins discordant or concordant for obsessive–compulsive symptoms

    Get PDF
    Neuroimaging studies have indicated abnormalities in cortico-striatal-thalamo-cortical circuits in patients with obsessive–compulsive disorder compared with controls. However, there are inconsistencies between studies regarding the exact set of brain structures involved and the direction of anatomical and functional changes. These inconsistencies may reflect the differential impact of environmental and genetic risk factors for obsessive–compulsive disorder on different parts of the brain. To distinguish between functional brain changes underlying environmentally and genetically mediated obsessive–compulsive disorder, we compared task performance and brain activation during a Tower of London planning paradigm in monozygotic twins discordant (n = 38) or concordant (n = 100) for obsessive–compulsive symptoms. Twins who score high on obsessive–compulsive symptoms can be considered at high risk for obsessive–compulsive disorder. We found that subjects at high risk for obsessive–compulsive disorder did not differ from the low-risk subjects behaviourally, but we obtained evidence that the high-risk subjects differed from the low-risk subjects in the patterns of brain activation accompanying task execution. These regions can be separated into those that were affected by mainly environmental risk (dorsolateral prefrontal cortex and lingual cortex), genetic risk (frontopolar cortex, inferior frontal cortex, globus pallidus and caudate nucleus) and regions affected by both environmental and genetic risk factors (cingulate cortex, premotor cortex and parts of the parietal cortex). Our results suggest that neurobiological changes related to obsessive–compulsive symptoms induced by environmental factors involve primarily the dorsolateral prefrontal cortex, whereas neurobiological changes induced by genetic factors involve orbitofrontal–basal ganglia structures. Regions showing similar changes in high-risk twins from discordant and concordant pairs may be part of compensatory networks that keep planning performance intact, in spite of cortico-striatal-thalamo-cortical deficits

    Hundreds of variants clustered in genomic loci and biological pathways affect human height

    Get PDF
    Most common human traits and diseases have a polygenic pattern of inheritance: DNA sequence variants at many genetic loci influence the phenotype. Genome-wide association (GWA) studies have identified more than 600 variants associated with human traits, but these typically explain small fractions of phenotypic variation, raising questions about the use of further studies. Here, using 183,727 individuals, we show that hundreds of genetic variants, in at least 180 loci, influence adult height, a highly heritable and classic polygenic trait. The large number of loci reveals patterns with important implications for genetic studies of common human diseases and traits. First, the 180 loci are not random, but instead are enriched for genes that are connected in biological pathways (P = 0.016) and that underlie skeletal growth defects (P < 0.001). Second, the likely causal gene is often located near the most strongly associated variant: in 13 of 21 loci containing a known skeletal growth gene, that gene was closest to the associated variant. Third, at least 19 loci have multiple independently associated variants, suggesting that allelic heterogeneity is a frequent feature of polygenic traits, that comprehensive explorations of already-discovered loci should discover additional variants and that an appreciable fraction of associated loci may have been identified. Fourth, associated variants are enriched for likely functional effects on genes, being over-represented among variants that alter amino-acid structure of proteins and expression levels of nearby genes. Our data explain approximately 10% of the phenotypic variation in height, and we estimate that unidentified common variants of similar effect sizes would increase this figure to approximately 16% of phenotypic variation (approximately 20% of heritable variation). Although additional approaches are needed to dissect the genetic architecture of polygenic human traits fully, our findings indicate that GWA studies can identify large numbers of loci that implicate biologically relevant genes and pathways.

    The SysteMHC Atlas project.

    Get PDF
    Mass spectrometry (MS)-based immunopeptidomics investigates the repertoire of peptides presented at the cell surface by major histocompatibility complex (MHC) molecules. The broad clinical relevance of MHC-associated peptides, e.g. in precision medicine, provides a strong rationale for the large-scale generation of immunopeptidomic datasets and recent developments in MS-based peptide analysis technologies now support the generation of the required data. Importantly, the availability of diverse immunopeptidomic datasets has resulted in an increasing need to standardize, store and exchange this type of data to enable better collaborations among researchers, to advance the field more efficiently and to establish quality measures required for the meaningful comparison of datasets. Here we present the SysteMHC Atlas (https://systemhcatlas.org), a public database that aims at collecting, organizing, sharing, visualizing and exploring immunopeptidomic data generated by MS. The Atlas includes raw mass spectrometer output files collected from several laboratories around the globe, a catalog of context-specific datasets of MHC class I and class II peptides, standardized MHC allele-specific peptide spectral libraries consisting of consensus spectra calculated from repeat measurements of the same peptide sequence, and links to other proteomics and immunology databases. The SysteMHC Atlas project was created and will be further expanded using a uniform and open computational pipeline that controls the quality of peptide identifications and peptide annotations. Thus, the SysteMHC Atlas disseminates quality controlled immunopeptidomic information to the public domain and serves as a community resource toward the generation of a high-quality comprehensive map of the human immunopeptidome and the support of consistent measurement of immunopeptidomic sample cohorts

    The relationship between fertility and lifespan in humans

    Get PDF
    Evolutionary theories of aging predict a trade-off between fertility and lifespan, where increased lifespan comes at the cost of reduced fertility. Support for this prediction has been obtained from various sources. However, which genes underlie this relationship is unknown. To assess it, we first analyzed the association of fertility with age at menarche and menopause, and with mortality in 3,575 married female participants of the Rotterdam Study. In addition, we conducted a candidate gene study where 1,664 single nucleotide polymorphisms (SNPs) in 25 candidate genes were analyzed in relation to number of children as a measure of fertility. SNPs that associated with fertility were analyzed for association with mortality. We observed no associations between fertility and age at menarche (p = 0.38) and menopause (p = 0.07). In contrast, fertility was associated with mortality. Women with two to three children had significantly lower mortality (hazard ratio (HR), 0.82; 95% confidence interval (95% CI), 0.69–0.97) compared to women with no children. No such benefit was observed for women with four or more children, who had a similar mortality risk (HR, 0.93; 95% CI, 0.76–1.13) as women with no children. The analysis of candidate genes revealed four genes that influence fertility after correction for multiple testing: CGB/LHB gene cluster (p = 0.0036), FSHR (p = 0.023), FST (p = 0.023), and INHBA (p = 0.021). However, none of the independent SNPs in these genes predicted mortality. In conclusion, women who bear two to three children live longer than those who bear none or many children, but this relationship was not mediated by the candidate genes analyzed in this study

    A 32 kb Critical Region Excluding Y402H in CFH Mediates Risk for Age-Related Macular Degeneration

    Get PDF
    Complement factor H shows very strong association with Age-related Macular Degeneration (AMD), and recent data suggest that multiple causal variants are associated with disease. To refine the location of the disease associated variants, we characterized in detail the structural variation at CFH and its paralogs, including two copy number polymorphisms (CNP), CNP147 and CNP148, and several rare deletions and duplications. Examination of 34 AMD-enriched extended families (N = 293) and AMD cases (White N = 4210 Indian = 134; Malay = 140) and controls (White N = 3229; Indian = 117; Malay = 2390) demonstrated that deletion CNP148 was protective against AMD, independent of SNPs at CFH. Regression analysis of seven common haplotypes showed three haplotypes, H1, H6 and H7, as conferring risk for AMD development. Being the most common haplotype H1 confers the greatest risk by increasing the odds of AMD by 2.75-fold (95% CI = [2.51, 3.01]; p = 8.31×10−109); Caucasian (H6) and Indian-specific (H7) recombinant haplotypes increase the odds of AMD by 1.85-fold (p = 3.52×10−9) and by 15.57-fold (P = 0.007), respectively. We identified a 32-kb region downstream of Y402H (rs1061170), shared by all three risk haplotypes, suggesting that this region may be critical for AMD development. Further analysis showed that two SNPs within the 32 kb block, rs1329428 and rs203687, optimally explain disease association. rs1329428 resides in 20 kb unique sequence block, but rs203687 resides in a 12 kb block that is 89% similar to a noncoding region contained in ΔCNP148. We conclude that causal variation in this region potentially encompasses both regulatory effects at single markers and copy number

    Large-scale analyses of common and rare variants identify 12 new loci associated with atrial fibrillation

    Get PDF
    Atrial fibrillation affects more than 33 million people worldwide and increases the risk of stroke, heart failure, and death. Fourteen genetic loci have been associated with atrial fibrillation in European and Asian ancestry groups. To further define the genetic basis of atrial fibrillation, we performed large-scale, trans-ancestry meta-analyses of common and rare variant association studies. The genome-wide association studies (GWAS) included 17,931 individuals with atrial fibrillation and 115,142 referents; the exome-wide association studies (ExWAS) and rare variant association studies (RVAS) involved 22,346 cases and 132,086 referents. We identified 12 new genetic loci that exceeded genome-wide significance, implicating genes involved in cardiac electrical and structural remodeling. Our results nearly double the number of known genetic loci for atrial fibrillation, provide insights into the molecular basis of atrial fibrillation, and may facilitate the identification of new potential targets for drug discovery
    corecore