12 research outputs found

    The transcriptomic blueprint of molt in rooster using various tissues from Ginkkoridak (Korean long-tailed chicken)

    Get PDF
    Background Annual molt is a critical stage in the life cycle of birds. Although the most extensively documented aspects of molt are the renewing of plumage and the remodeling of the reproductive tract in laying hens, in chicken, molt deeply affects various tissues and physiological functions. However, with exception of the reproductive tract, the effect of molt on gene expression across the tissues known to be affected by molt has to date never been investigated. The present study aimed to decipher the transcriptomic effects of molt in Ginkkoridak, a Korean long-tailed chicken. Messenger RNA data available across 24 types of tissue samples (9 males) and a combination of mRNA and miRNA data on 10 males and 10 females blood were used. Results The impact of molt on gene expression and gene transcript usage appeared to vary substantially across tissues types in terms of histological entities or physiological functions particularly related to nervous system. Blood was the tissue most affected by molt in terms of differentially expressed genes in both sexes, closely followed by meninges, bone marrow and heart. The effect of molt in blood appeared to differ between males and females, with a more than fivefold difference in the number of down-regulated genes between both sexes. The blueprint of molt in roosters appeared to be specific to tissues or group of tissues, with relatively few genes replicating extensively across tissues, excepted for the spliceosome genes (U1, U4) and the ribosomal proteins (RPL21, RPL23). By integrating miRNA and mRNA data, when chickens molt, potential roles of miRNA were discovered such as regulation of neurogenesis, regulation of immunity and development of various organs. Furthermore, reliable candidate biomarkers of molt were found, which are related to cell dynamics, nervous system or immunity, processes or functions that have been shown to be extensively modulated in response to molt. Conclusions Our results provide a comprehensive description at the scale of the whole organism deciphering the effects of molt on the transcriptome in chicken. Also, the conclusion of this study can be used as a valuable resource in transcriptome analyses of chicken in the future and provide new insights related to molt.This work was funded by Population genomics of Korean long-tailed fowl the Program for Agriculture Science and Technology Development (Project No. PJ0133402) of the Rural Development Administration (RDA). The funding body had a role in sample collection

    False gene and chromosome losses in genome assemblies caused by GC content variation and repeats.

    Get PDF
    BACKGROUND: Many short-read genome assemblies have been found to be incomplete and contain mis-assemblies. The Vertebrate Genomes Project has been producing new reference genome assemblies with an emphasis on being as complete and error-free as possible, which requires utilizing long reads, long-range scaffolding data, new assembly algorithms, and manual curation. A more thorough evaluation of the recent references relative to prior assemblies can provide a detailed overview of the types and magnitude of improvements. RESULTS: Here we evaluate new vertebrate genome references relative to the previous assemblies for the same species and, in two cases, the same individuals, including a mammal (platypus), two birds (zebra finch, Anna's hummingbird), and a fish (climbing perch). We find that up to 11% of genomic sequence is entirely missing in the previous assemblies. In the Vertebrate Genomes Project zebra finch assembly, we identify eight new GC- and repeat-rich micro-chromosomes with high gene density. The impact of missing sequences is biased towards GC-rich 5'-proximal promoters and 5' exon regions of protein-coding genes and long non-coding RNAs. Between 26 and 60% of genes include structural or sequence errors that could lead to misunderstanding of their function when using the previous genome assemblies. CONCLUSIONS: Our findings reveal novel regulatory landscapes and protein coding sequences that have been greatly underestimated in previous assemblies and are now present in the Vertebrate Genomes Project reference genomes

    The Korean undiagnosed diseases program: lessons from a one-year pilot project

    Get PDF
    Abstract Background The Korean Undiagnosed Diseases Program (KUDP) was launched in January 2017 as a one-year pilot project to address the increasing global interest in patients with undiagnosed rare diseases. The purpose of this paper is to summarize the project results and emphasize the unmet research needs among patients with undiagnosed rare diseases in Korea. Results Patient enrollment, assessment, and diagnostic processes were determined by the KUDP clinical expert consortium. Patients followed a diagnostic workflow after being categorized into one of four groups: I) insufficient clinical information or lack of standard diagnostic processes; II) undiagnosed due to low disease awareness; III) clinically diagnosed but unconfirmed genetically due to genetic heterogeneities; or IV) unknown disease due to complex, atypical clinical presentations. After excluding two patients from group I, 97 patients were enrolled, including 10 in group II, 67 in group III, and 20 in group IV. Most of them (92 of 97, 94.8%) were pediatric patients (< 18 years old) and 59 (60.8%) were male. The primary symptoms for 80 patients (82.5%) were neurologic. During the one-year pilot study, 72 patients completed a diagnostic assessment including clinical and molecular genetic analyses; some patients also underwent pathological or biochemical analysis. Twenty-eight of these patients (28/72, 38.9%) achieved molecular genetic diagnosis. Thirteen patients were diagnosed based on traditional tests, including biochemical assay, single or targeted genetic analysis, and chromosomal microarray. We performed whole exome sequencing on 52 patients, among whom 15 (28.8%, 15/52) reached a final diagnosis. One new disorder was identified via international collaboration. Conclusions Using an efficient clinical diagnostic workflow, this KUDP pilot study resulted in a fair diagnostic success rate, improving the potential for additional diagnoses and new scientific discovery of complex and rare diseases. KUDP also satisfied unmet needs for rare diseases with multisystem involvement, highlighting the value of emerging genomic technologies for further research into rare and still-undiagnosed conditions

    Scalable, accessible, and reproducible reference genome assembly and evaluation in Galaxy

    No full text
    Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals)

    Towards complete and error-free genome assemblies of all vertebrate species

    No full text
    High-quality and complete reference genome assemblies are fundamental for the application of genomics to biology, disease, and biodiversity conservation. However, such assemblies are available for only a few non-microbial species1,2,3,4. To address this issue, the international Genome 10K (G10K) consortium5,6 has worked over a five-year period to evaluate and develop cost-effective methods for assembling highly accurate and nearly complete reference genomes. Here we present lessons learned from generating assemblies for 16 species that represent six major vertebrate lineages. We confirm that long-read sequencing technologies are essential for maximizing genome quality, and that unresolved complex repeats and haplotype heterozygosity are major sources of assembly error when not handled correctly. Our assemblies correct substantial errors, add missing sequence in some of the best historical reference genomes, and reveal biological discoveries. These include the identification of many false gene duplications, increases in gene sizes, chromosome rearrangements that are specific to lineages, a repeated independent chromosome breakpoint in bat genomes, and a canonical GC-rich pattern in protein-coding genes and their regulatory regions. Adopting these lessons, we have embarked on the Vertebrate Genomes Project (VGP), an international effort to generate high-quality, complete reference genomes for all of the roughly 70,000 extant vertebrate species and to help to enable a new era of discovery across the life sciences
    corecore