43 research outputs found

    Transcription-associated mutation promotes RNA complexity in highly expressed genes - a major new source of selectable variation

    Alternatively spliced transcript isoforms are thought to play a critical role in functional diversity. However, the mechanism generating the enormous diversity of spliced transcript isoforms remains unknown, and its biological significance remains unclear. We analyzed transcriptomes in saker falcons, chickens, and mice to show that alternative splicing occurs more frequently, yielding more isoforms, in highly expressed genes. We focused on hemoglobin genes in the falcon, the most abundantly expressed genes in blood, finding that alternative splicing produces 10-fold more isoforms than expected from the number of splice junctions in the genome. These isoforms were produced mainly by alternative use of de novo splice sites generated by transcription-associated mutation (TAM), not by the RNA editing mechanism normally invoked. We found that high expression of globin genes increases mutation frequencies during transcription, especially on nontranscribed DNA strands. After DNA replication, transcribed strands inherit these somatic mutations, creating de novo splice sites, and generating multiple distinct isoforms in the cell clone. Bisulfite sequencing revealed that DNA methylation may counteract this process by suppressing TAM, suggesting DNA methylation can spatially regulate RNA complexity. RNA profiling showed that falcons living on the high Qinghai–Tibetan Plateau possess greater global gene expression levels and higher diversity of mean to high abundance isoforms (reads per kilobase per million mapped reads ≥18) than their low-altitude counterparts, and we speculate that this may enhance their oxygen transport capacity under low-oxygen environments. Thus, TAM-induced RNA diversity may be physiologically significant, providing an alternative strategy in lifestyle evolution
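
    The abundance cutoff quoted in the abstract above (reads per kilobase per million mapped reads ≥18) can be illustrated with a minimal sketch. It uses the standard RPKM definition; the function names, the isoform names and the read counts are hypothetical and are not taken from the paper, whose actual pipeline is not described here.

```python
def rpkm(read_count, transcript_length_bp, total_mapped_reads):
    """Reads per kilobase of transcript per million mapped reads.

    RPKM = reads * 1e9 / (length in bp * total mapped reads)
    """
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total mapped reads must be positive")
    return read_count * 1e9 / (transcript_length_bp * total_mapped_reads)


def abundant_isoforms(counts, lengths, total_mapped_reads, threshold=18.0):
    """Return the sorted names of isoforms whose RPKM meets the threshold.

    `counts` and `lengths` are dicts keyed by isoform name (hypothetical
    input layout, chosen for illustration only).
    """
    return sorted(
        name
        for name, count in counts.items()
        if rpkm(count, lengths[name], total_mapped_reads) >= threshold
    )


if __name__ == "__main__":
    # Made-up example values, not data from the study.
    counts = {"iso_a": 180, "iso_b": 90, "iso_c": 4000}
    lengths = {"iso_a": 1000, "iso_b": 1000, "iso_c": 2000}
    print(abundant_isoforms(counts, lengths, total_mapped_reads=10_000_000))
```

    With these made-up numbers, an isoform of 1,000 bp needs at least 180 reads out of 10 million mapped reads to reach the cutoff of 18.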

    Population transcriptomes reveal synergistic responses of DNA polymorphism and RNA expression to extreme environments on the Qinghai-Tibetan Plateau in a predatory bird

    Low oxygen and temperature pose key physiological challenges for endotherms living on the Qinghai–Tibetan Plateau (QTP). Molecular adaptations to high‐altitude living have been detected in the genomes of Tibetans, their domesticated animals and a few wild species, but the contribution of transcriptional variation to altitudinal adaptation remains to be determined. Here we studied a top QTP predator, the saker falcon, and analysed how the transcriptome has become modified to cope with the stresses of hypoxia and hypothermia. Using a hierarchical design to study saker populations inhabiting grassland, steppe/desert and highland across Eurasia, we found that the QTP population is already distinct despite having colonized the Plateau <2000 years ago. Selection signals are limited at the cDNA level, but of only seventeen genes identified, three function in hypoxia and four in immune response. Our results show a significant role for RNA transcription: 50% of upregulated transcription factors were related to hypoxia responses, differentiated modules were significantly enriched for oxygen transport, and importantly, divergent EPAS1 functional variants with a refined co‐expression network were identified. Conservative gene expression and relaxed immune gene variation may further reflect adaptation to hypothermia. Our results exemplify synergistic responses between DNA polymorphism and RNA expression diversity in coping with common stresses, underpinning the successful rapid colonization of a top predator onto the QTP. Importantly, molecular mechanisms underpinning highland adaptation involve relatively few genes, but are nonetheless more complex than previously thought and involve fine‐tuned transcriptional responses and genomic adaptation

    Genomic analysis of the domestication and post-Spanish conquest evolution of the llama and alpaca

    Background: Despite their regional economic importance and being increasingly reared globally, the origins and evolution of the llama and alpaca remain poorly understood. Here we report reference genomes for the llama, and for the guanaco and vicuña (their putative wild progenitors), compare these with the published alpaca genome, and resequence seven individuals of all four species to better understand domestication and introgression between the llama and alpaca. Results: Phylogenomic analysis confirms that the llama was domesticated from the guanaco and the alpaca from the vicuña. Introgression was much higher in the alpaca genome (36%) than the llama (5%) and could be dated close to the time of the Spanish conquest, approximately 500 years ago. Introgression patterns are at their most variable on the X-chromosome of the alpaca, featuring 53 genes known to have deleterious X-linked phenotypes in humans. Strong genome-wide introgression signatures include olfactory receptor complexes into both species, hypertension resistance into alpaca, and fleece/fiber traits into llama. Genomic signatures of domestication in the llama include male reproductive traits, while in alpaca feature fleece characteristics, olfaction-related and hypoxia adaptation traits. Expression analysis of the introgressed region that is syntenic to human HSA4q21, a gene cluster previously associated with hypertension in humans under hypoxic conditions, shows a previously undocumented role for PRDM8 downregulation as a potential transcriptional regulation mechanism, analogous to that previously reported at high altitude for hypoxia-inducible factor 1α. Conclusions: The unprecedented introgression signatures within both domestic camelid genomes may reflect post-conquest changes in agriculture and the breakdown of traditional management practices

    Baiji genomes reveal low genetic variability and new insights into secondary aquatic adaptations

    The baiji, or Yangtze River dolphin (Lipotes vexillifer), is a flagship species for the conservation of aquatic animals and ecosystems in the Yangtze River of China; however, this species has now been recognized as functionally extinct. Here we report a high-quality draft genome and three re-sequenced genomes of L. vexillifer using Illumina short-read sequencing technology. Comparative genomic analyses reveal that cetaceans have a slow molecular clock and molecular adaptations to their aquatic lifestyle. We also find a significantly lower number of heterozygous single nucleotide polymorphisms in the baiji compared to all other mammalian genomes reported thus far. A reconstruction of the demographic history of the baiji indicates that a bottleneck occurred near the end of the last deglaciation, a time coinciding with a rapid decrease in temperature and the rise of eustatic sea level

    Arctic introgression and chromatin regulation facilitated rapid Qinghai-Tibet Plateau colonization by an avian predator

    The Qinghai-Tibet Plateau (QTP) possesses a climate as cold as that of the Arctic, and also presents uniquely low oxygen concentrations and intense ultraviolet (UV) radiation. QTP animals have adapted to these extreme conditions, but whether they obtained genetic variations from the Arctic during cold adaptation, and how genomic mutations in non-coding regions regulate gene expression under hypoxia and intense UV exposure, remain largely unknown. Here, we assemble a high-quality saker falcon genome and resequence populations across Eurasia. We identify female-biased hybridization with Arctic gyrfalcons in the last glacial maximum that endowed eastern sakers with alleles conveying larger body size and changes in fat metabolism, predisposing their QTP cold adaptation. We discover that QTP hypoxia and UV adaptations mainly involve independent changes in non-coding genomic variants. Our study highlights key roles of gene flow from Arctic relatives during QTP hypothermia adaptation, and cis-regulatory elements during hypoxic response and UV protection

    Peregrine and saker falcon genome sequences provide insights into evolution of a predatory lifestyle

    As top predators, falcons possess unique morphological, physiological and behavioral adaptations that allow them to be successful hunters: for example, the peregrine is renowned as the world's fastest animal. To examine the evolutionary basis of predatory adaptations, we sequenced the genomes of both the peregrine (Falco peregrinus) and saker falcon (Falco cherrug), and we present parallel, genome-wide evidence for evolutionary innovation and selection for a predatory lifestyle. The genomes, assembled using Illumina deep sequencing with greater than 100-fold coverage, are both approximately 1.2 Gb in length, with transcriptome-assisted prediction of approximately 16,200 genes for both species. Analysis of 8,424 orthologs in both falcons, chicken, zebra finch and turkey identified consistent evidence for genome-wide rapid evolution in these raptors. SNP-based inference showed contrasting recent demographic trajectories for the two falcons, and gene-based analysis highlighted falcon-specific evolutionary novelties for beak development and olfaction and specifically for homeostasis-related genes in the arid environment–adapted saker

    SCD: A Stacked Carton Dataset for Detection and Segmentation

    Carton detection is an important technique in the automatic logistics system and can be applied to many applications such as the stacking and unstacking of cartons and the unloading of cartons in the containers. However, there is no public large-scale carton dataset for the research community to train and evaluate the carton detection models up to now, which hinders the development of carton detection. In this article, we present a large-scale carton dataset named Stacked Carton Dataset (SCD) with the goal of advancing the state-of-the-art in carton detection. Images were collected from the Internet and several warehouses, and objects were labeled for precise localization using instance mask annotation. There were a total of 250,000 instance masks from 16,136 images. Naturally, a suite of benchmarks was established with several popular detectors and instance segmentation models. In addition, we designed a carton detector based on RetinaNet by embedding our proposed Offset Prediction between the Classification and Localization module (OPCL) and the Boundary Guided Supervision module (BGS). OPCL alleviates the imbalance problem between classification and localization quality, which boosts AP by 3.1∼4.7% on SCD at the model level, while BGS guides the detector to pay more attention to the boundary information of cartons and decouple repeated carton textures at the task level. To demonstrate the generalization of OPCL for other datasets, we conducted extensive experiments on MS COCO and PASCAL VOC. The improvements in AP on MS COCO and PASCAL VOC were 1.8∼2.2% and 3.4∼4.3%, respectively