22 research outputs found
Kankakee Sands Ornate Box Turtle (Terrapene ornata) Population and Ecosystem Assessment
ID: 8982; Wildlife Preservation Fund Grant IDNR RC03L21W and Nature Conservancy Grant:
Nature Conserv 1131626100 Final Report issued December 4, 2003INHS Technical Report prepared for Illinois Department of Natural Resources and Nature
Conservanc
Amphibian and Reptile Survey of Sangamon River Corridor Restoration Project Final Report
Submitted in fulfillment of requirements of WPF Grant # 03-029WINHS Technical Report submitted to Illinois Department of Natural Resources, Wildlife
Preservation FundLimitedINHS Staff asked that this be restricted because report contains sensitive information
Identifying and mitigating batch effects in whole genome sequencing data
Abstract Background Large sample sets of whole genome sequencing with deep coverage are being generated, however assembling datasets from different sources inevitably introduces batch effects. These batch effects are not well understood and can be due to changes in the sequencing protocol or bioinformatics tools used to process the data. No systematic algorithms or heuristics exist to detect and filter batch effects or remove associations impacted by batch effects in whole genome sequencing data. Results We describe key quality metrics, provide a freely available software package to compute them, and demonstrate that identification of batch effects is aided by principal components analysis of these metrics. To mitigate batch effects, we developed new site-specific filters that identified and removed variants that falsely associated with the phenotype due to batch effect. These include filtering based on: a haplotype based genotype correction, a differential genotype quality test, and removing sites with missing genotype rate greater than 30% after setting genotypes with quality scores less than 20 to missing. This method removed 96.1% of unconfirmed genome-wide significant SNP associations and 97.6% of unconfirmed genome-wide significant indel associations. We performed analyses to demonstrate that: 1) These filters impacted variants known to be disease associated as 2 out of 16 confirmed associations in an AMD candidate SNP analysis were filtered, representing a reduction in power of 12.5%, 2) In the absence of batch effects, these filters removed only a small proportion of variants across the genome (type I error rate of 3%), and 3) in an independent dataset, the method removed 90.2% of unconfirmed genome-wide SNP associations and 89.8% of unconfirmed genome-wide indel associations. Conclusions Researchers currently do not have effective tools to identify and mitigate batch effects in whole genome sequencing data. We developed and validated methods and filters to address this deficiency
Cybersecurity in a Large-Scale Research Facility—One Institution’s Approach
A cybersecurity approach for a large-scale user facility is presented—utilizing the National High Magnetic Field Laboratory (NHMFL) at Florida State University (FSU) as an example. The NHMFL provides access to the highest magnetic fields for scientific research teams from a range of disciplines. The unique challenges of cybersecurity at a widely accessible user facility are showcased, and relevant cybersecurity frameworks for the complex needs of a user facility with industrial-style equipment and hazards are discussed, along with the approach for risk identification and management, which determine cybersecurity requirements and priorities. Essential differences between information technology and research technology are identified, along with unique requirements and constraints. The need to plan for the introduction of new technology and manage legacy technologies with long usage lifecycles is identified in the context of implementing cybersecurity controls rooted in pragmatic decisions to avoid hindering research activities while enabling secure practices, which includes FAIR (findable, accessible, interoperable, and reusable) and open data management principles. The NHMFL’s approach to FAIR data management is presented. Critical success factors include obtaining resources to implement and maintain necessary security protocols, interdisciplinary and diverse skill sets, phased implementation, and shared allocation of NHMFL and FSU responsibilities
Additional file 3: of Identifying and mitigating batch effects in whole genome sequencing data
Sample level summary statistics and annotations calculated by genotypeeval. (CSV 102Â kb
Influence of genetic copy number variants of the human GLUT3 glucose transporter gene SLC2A3 on protein expression, glycolysis and rheumatoid arthritis risk: A genetic replication study
Objectives: The gene encoding glucose transporter 3 (GLUT3, SLC2A3) is present in the human population at variable copy number. An overt disease phenotype of SLC2A3 copy number variants has not been reported; however, deletion of SLC2A3 has been previously reported to protect carriers from rheumatoid arthritis, implicating GLUT3 as a therapeutic target in rheumatoid arthritis. Here we aim to perform functional analysis of GLUT3 copy number variants in immune cells, and test the reported protective association of the GLUT3 copy number variants for rheumatoid arthritis in a genetic replication study. Methods: Cells from genotyped healthy controls were analyzed for SLC2A3/GLUT3 expression and glycolysis capacity. We genotyped the SLC2A3 copy number variant in four independent cohorts of rheumatoid arthritis and controls and one cohort of multiple sclerosis and controls. Results: Heterozygous deletion of SLC2A3 correlates directly with expression levels of GLUT3 and influences glycolysis rates in the human immune system. The frequency of the SLC2A3 copy number variant is not different between rheumatoid arthritis, multiple sclerosis and control groups. Conclusions: Despite a robust SLC2A3 gene copy number dependent phenotype, our study of large groups of rheumatoid arthritis cases and controls provides no evidence for rheumatoid arthritis disease protection in deletion carriers. These data emphasize the importance of well powered replication studies to confirm or refute genetic associations, particularly for relatively rare variants. Keywords: GLUT3, SLC2A3, Glycolysis, Deletion, Rheumatoid arthritis, Copy number variant, Glucose transpor
Recommended from our members
Analysis of protein-altering variants in telomerase genes and their association with MUC5B common variant status in patients with idiopathic pulmonary fibrosis: a candidate gene sequencing study
BackgroundIdiopathic pulmonary fibrosis (IPF) risk has a strong genetic component. Studies have implicated variations at several loci, including TERT, surfactant genes, and a single nucleotide polymorphism at chr11p15 (rs35705950) in the intergenic region between TOLLIP and MUC5B. Patients with IPF who have risk alleles at rs35705950 have longer survival from the time of IPF diagnosis than do patients homozygous for the non-risk allele, whereas patients with shorter telomeres have shorter survival times. We aimed to assess whether rare protein-altering variants in genes regulating telomere length are enriched in patients with IPF homozygous for the non-risk alleles at rs35705950.MethodsBetween Nov 1, 2014, and Nov 1, 2016, we assessed blood samples from patients aged 40 years or older and of European ancestry with sporadic IPF from three international phase 3 clinical trials (INSPIRE, CAPACITY, ASCEND), one phase 2 study (RIFF), and US-based observational studies (Vanderbilt Clinical Interstitial Lung Disease Registry and the UCSF Interstitial Lung Disease Clinic registry cohorts) at the Broad Institute (Cambridge, MA, USA) and Human Longevity (San Diego, CA, USA). We also assessed blood samples from non-IPF controls in several clinical trials. We did whole-genome sequencing to assess telomere length and identify rare protein-altering variants, stratified by rs35705950 genotype. We also assessed rare functional variation in TERT exons and compared telomere length and disease progression across genotypes.FindingsWe assessed samples from 1510 patients with IPF and 1874 non-IPF controls. 30 (3%) of 1046 patients with an rs35705950 risk allele had a rare protein-altering variant in TERT compared with 34 (7%) of 464 non-risk allele carriers (odds ratio 0·40 [95% CI 0·24-0·66], p=0·00039). Subsequent analyses identified enrichment of rare protein-altering variants in PARN and RTEL1, and rare variation in TERC in patients with IPF compared with controls. We expanded our study population to provide a more accurate estimation of rare variant frequency in these four loci, and to calculate telomere length. The proportion of patients with at least one rare variant in TERT, PARN, TERC, or RTEL1 was higher in patients with IPF than in controls (149 [9%] of 1739 patients vs 205 [2%] of 8645 controls, p=2·44 × 10-8). Patients with IPF who had a variant in any of the four identified telomerase component genes had telomeres that were 3·69-16·10% shorter than patients without a variant in any of the four genes and had an earlier mean age of disease onset than patients without one or more variants (65·1 years [SD 7·8] vs 67·1 years [7·9], p=0·004). In the placebo arms of clinical trials, shorter telomeres were significantly associated with faster disease progression (1·7% predicted forced vital capacity per kb per year, p=0·002). Pirfenidone had treatment benefit regardless of telomere length (p=4·24 × 10-8 for telomere length lower than the median, p=0·0044 for telomere length greater than the median).InterpretationRare protein-altering variants in TERT, PARN, TERC, and RTEL1 are enriched in patients with IPF compared with controls, and, in the case of TERT, particularly in individuals without a risk allele at the rs35705950 locus. This suggests that multiple genetic factors contribute to sporadic IPF, which might implicate distinct mechanisms of pathogenesis and disease progression.FundingGenentech, National Institutes of Health, Francis Family Foundation, Pulmonary Fibrosis Foundation, Nina Ireland Program for Lung Health, US Department of Veterans Affairs
A rare mutation in UNC5C predisposes to late-onset Alzheimer's disease and increases neuronal cell death
We have identified a rare coding mutation, T835M (rs137875858), in the UNC5C netrin receptor gene that segregated with disease in an autosomal dominant pattern in two families enriched for late-onset Alzheimer's disease and that was associated with disease across four large case-control cohorts (odds ratio = 2.15, Pmeta = 0.0095). T835M alters a conserved residue in the hinge region of UNC5C, and in vitro studies demonstrate that this mutation leads to increased cell death in human HEK293T cells and in rodent neurons. Furthermore, neurons expressing T835M UNC5C are more susceptible to cell death from multiple neurotoxic stimuli, including β-amyloid (Aβ), glutamate and staurosporine. On the basis of these data and the enriched hippocampal expression of UNC5C in the adult nervous system, we propose that one possible mechanism in which T835M UNC5C contributes to the risk of Alzheimer's disease is by increasing susceptibility to neuronal cell death, particularly in vulnerable regions of the Alzheimer's disease brain