22 research outputs found
Genes differentially expressed at 1 day, 6 weeks, and 6 months of age in aortas of spontaneously atherosclerotic White Carneau pigeons
Genetics is reported to be the primary causative factor for individuals diagnosed with atherosclerosis, in the absence of known risk factors. The development of atherosclerosis in White Carneau (WC) pigeons is of genetic origin, making it an excellent model to study genetic factors.
Representational Difference Analysis (RDA) was used to determine genes differentially upregulated between three ages, at the celiac bifurcation of the aorta in WC pigeons. Genes responsible for spontaneous initiation of atherosclerosis were hypothesized as being differentially expressed at 1 day, while those differentially expressed at 6 weeks and 6 months were related to progression.
Multiple candidate genes were upregulated at 1 day, although they were not definitively assigned to initiation. Genes upregulated at 6 weeks reflected increases in protein synthesis, loss of cellular integrity, and changes in muscle contraction. By 6 months, increases in lipid metabolism and changes in energy metabolism from oxidative phosphorylation to glycolysis were apparent
Refined annotation and assembly of the Tetrahymena thermophila genome sequence through EST analysis, comparative genomic hybridization, and targeted gap closure
<p>Abstract</p> <p>Background</p> <p><it>Tetrahymena thermophila</it>, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal, which to achieve would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of <it>Tetrahymena</it>'s coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing.</p> <p>Results</p> <p>We addressed the problem of MIC contamination using comparative genomic hybridization with purified MIC and MAC DNA probes against a whole genome oligonucleotide microarray, allowing the identification of 763 genome scaffolds likely to contain MIC-limited DNA sequences. We also employed standard genome closure methods to essentially finish over 60% of the MAC genome. For the improvement of annotation, we have sequenced and analyzed over 60,000 verified EST reads from a variety of cellular growth and development conditions. Using this EST evidence, a combination of automated and manual reannotation efforts led to updates that affect 16% of the current protein-coding gene models. By comparing EST abundance, many genes showing apparent differential expression between these conditions were identified. Rare instances of alternative splicing and uses of the non-standard amino acid selenocysteine were also identified.</p> <p>Conclusion</p> <p>We report here significant progress in genome closure and reannotation of <it>Tetrahymena thermophila</it>. Our experience to date suggests that complete closure of the MAC genome is attainable. Using the new EST evidence, automated and manual curation has resulted in substantial improvements to the over 24,000 gene models, which will be valuable to researchers studying this model organism as well as for comparative genomics purposes.</p
Macronuclear Genome Sequence of the Ciliate Tetrahymena thermophila, a Model Eukaryote
The ciliate Tetrahymena thermophila is a model organism for molecular and cellular biology. Like other ciliates, this species has separate germline and soma functions that are embodied by distinct nuclei within a single cell. The germline-like micronucleus (MIC) has its genome held in reserve for sexual reproduction. The soma-like macronucleus (MAC), which possesses a genome processed from that of the MIC, is the center of gene expression and does not directly contribute DNA to sexual progeny. We report here the shotgun sequencing, assembly, and analysis of the MAC genome of T. thermophila, which is approximately 104 Mb in length and composed of approximately 225 chromosomes. Overall, the gene set is robust, with more than 27,000 predicted protein-coding genes, 15,000 of which have strong matches to genes in other organisms. The functional diversity encoded by these genes is substantial and reflects the complexity of processes required for a free-living, predatory, single-celled organism. This is highlighted by the abundance of lineage-specific duplications of genes with predicted roles in sensing and responding to environmental conditions (e.g., kinases), using diverse resources (e.g., proteases and transporters), and generating structural complexity (e.g., kinesins and dyneins). In contrast to the other lineages of alveolates (apicomplexans and dinoflagellates), no compelling evidence could be found for plastid-derived genes in the genome. UGA, the only T. thermophila stop codon, is used in some genes to encode selenocysteine, thus making this organism the first known with the potential to translate all 64 codons in nuclear genes into amino acids. We present genomic evidence supporting the hypothesis that the excision of DNA from the MIC to generate the MAC specifically targets foreign DNA as a form of genome self-defense. The combination of the genome sequence, the functional diversity encoded therein, and the presence of some pathways missing from other model organisms makes T. thermophila an ideal model for functional genomic studies to address biological, biomedical, and biotechnological questions of fundamental importance
Discovery of common and rare genetic risk variants for colorectal cancer.
To further dissect the genetic architecture of colorectal cancer (CRC), we performed whole-genome sequencing of 1,439 cases and 720 controls, imputed discovered sequence variants and Haplotype Reference Consortium panel variants into genome-wide association study data, and tested for association in 34,869 cases and 29,051 controls. Findings were followed up in an additional 23,262 cases and 38,296 controls. We discovered a strongly protective 0.3% frequency variant signal at CHD1. In a combined meta-analysis of 125,478 individuals, we identified 40 new independent signals at P < 5 × 10-8, bringing the number of known independent signals for CRC to ~100. New signals implicate lower-frequency variants, Krüppel-like factors, Hedgehog signaling, Hippo-YAP signaling, long noncoding RNAs and somatic drivers, and support a role for immune function. Heritability analyses suggest that CRC risk is highly polygenic, and larger, more comprehensive studies enabling rare variant analysis will improve understanding of biology underlying this risk and influence personalized screening strategies and drug development.Goncalo R Abecasis has received compensation from 23andMe and Helix. He is currently an employee of Regeneron Pharmaceuticals. Heather Hampel performs collaborative research with Ambry Genetics, InVitae Genetics, and Myriad Genetic Laboratories, Inc., is on the scientific advisory board for InVitae Genetics and Genome Medical, and has stock in Genome Medical. Rachel Pearlman has participated in collaborative funded research with Myriad Genetics Laboratories and Invitae Genetics but has no financial competitive interest
Recommended from our members
Refined annotation and assembly of the Tetrahymena thermophila genome sequence through EST analysis, comparative genomic hybridization, and targeted gap closure.
BackgroundTetrahymena thermophila, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal, which to achieve would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of Tetrahymena's coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing.ResultsWe addressed the problem of MIC contamination using comparative genomic hybridization with purified MIC and MAC DNA probes against a whole genome oligonucleotide microarray, allowing the identification of 763 genome scaffolds likely to contain MIC-limited DNA sequences. We also employed standard genome closure methods to essentially finish over 60% of the MAC genome. For the improvement of annotation, we have sequenced and analyzed over 60,000 verified EST reads from a variety of cellular growth and development conditions. Using this EST evidence, a combination of automated and manual reannotation efforts led to updates that affect 16% of the current protein-coding gene models. By comparing EST abundance, many genes showing apparent differential expression between these conditions were identified. Rare instances of alternative splicing and uses of the non-standard amino acid selenocysteine were also identified.ConclusionWe report here significant progress in genome closure and reannotation of Tetrahymena thermophila. Our experience to date suggests that complete closure of the MAC genome is attainable. Using the new EST evidence, automated and manual curation has resulted in substantial improvements to the over 24,000 gene models, which will be valuable to researchers studying this model organism as well as for comparative genomics purposes
Clinical and biological landscape of constitutional mismatch-repair deficiency syndrome: an International Replication Repair Deficiency Consortium cohort study
Background: Constitutional mismatch repair deficiency (CMMRD) syndrome is a rare and aggressive cancer predisposition syndrome. Because a scarcity of data on this condition contributes to management challenges and poor outcomes, we aimed to describe the clinical spectrum, cancer biology, and impact of genetics on patient survival in CMMRD. Methods: In this cohort study, we collected cross-sectional and longitudinal data on all patients with CMMRD, with no age limits, registered with the International Replication Repair Deficiency Consortium (IRRDC) across more than 50 countries. Clinical data were extracted from the IRRDC database, medical records, and physician-completed case record forms. The primary objective was to describe the clinical features, cancer spectrum, and biology of the condition. Secondary objectives included estimations of cancer incidence and of the impact of the specific mismatch-repair gene and genotype on cancer onset and survival, including after cancer surveillance and immunotherapy interventions. Findings: We analysed data from 201 patients (103 males, 98 females) enrolled between June 5, 2007 and Sept 9, 2022. Median age at diagnosis of CMMRD or a related cancer was 8·9 years (IQR 5·9-12·6), and median follow-up from diagnosis was 7·2 years (3·6-14·8). Endogamy among minorities and closed communities contributed to high homozygosity within countries with low consanguinity. Frequent dermatological manifestations (117 [93%] of 126 patients with complete data) led to a clinical overlap with neurofibromatosis type 1 (35 [28%] of 126). 339 cancers were reported in 194 (97%) of 201 patients. The cumulative cancer incidence by age 18 years was 90% (95% CI 80-99). Median time between cancer diagnoses for patients with more than one cancer was 1·9 years (IQR 0·8-3·9). Neoplasms developed in 15 organs and included early-onset adult cancers. CNS tumours were the most frequent (173 [51%] cancers), followed by gastrointestinal (75 [22%]), haematological (61 [18%]), and other cancer types (30 [9%]). Patients with CNS tumours had the poorest overall survival rates (39% [95% CI 30-52] at 10 years from diagnosis; log-rank p<0·0001 across four cancer types), followed by those with haematological cancers (67% [55-82]), gastrointestinal cancers (89% [81-97]), and other solid tumours (96% [88-100]). All cancers showed high mutation and microsatellite indel burdens, and pathognomonic mutational signatures. MLH1 or MSH2 variants caused earlier cancer onset than PMS2 or MSH6 variants, and inferior survival (overall survival at age 15 years 63% [95% CI 55-73] for PMS2, 49% [35-68] for MSH6, 19% [6-66] for MLH1, and 0% for MSH2; p<0·0001). Frameshift or truncating variants within the same gene caused earlier cancers and inferior outcomes compared with missense variants (p<0·0001). The greater deleterious effects of MLH1 and MSH2 variants as compared with PMS2 and MSH6 variants persisted despite overall improvements in survival after surveillance or immune checkpoint inhibitor interventions. Interpretation: The very high cancer burden and unique genomic landscape of CMMRD highlight the benefit of comprehensive assays in timely diagnosis and precision approaches toward surveillance and immunotherapy. These data will guide the clinical management of children and patients who survive into adulthood with CMMRD. Funding: The Canadian Institutes for Health Research, Stand Up to Cancer, Children's Oncology Group National Cancer Institute Community Oncology Research Program, Canadian Cancer Society, Brain Canada, The V Foundation for Cancer Research, BioCanRx, Harry and Agnieszka Hall, Meagan's Walk, BRAINchild Canada, The LivWise Foundation, St Baldrick Foundation, Hold'em for Life, and Garron Family Cancer Center