Search CORE

224 research outputs found

Phylogeny and evolutionary history of glycogen synthase kinase 3/SHAGGY-like kinase genes in land plants

Author: André S Chanderbali
Douglas E Soltis
Gane Ka-Shu Wong
Pamela S Soltis
Xinshuai Qi
Publication venue: Springer Nature
Publication date: 01/01/2013
Field of study

Background: GSK3 (glycogen synthase kinase 3) genes encode signal transduction proteins with roles in a variety of biological processes in eukaryotes. In contrast to the low copy numbers observed in animals, GSK3 genes have expanded into a multi-gene family in land plants (embryophytes), and have also evolved functions in diverse plant specific processes, including floral development in angiosperms. However, despite previous efforts, the phylogeny of land plant GSK3 genes is currently unclear. Here, we analyze genes from a representative sample of phylogenetically pivotal taxa, including basal angiosperms, gymnosperms, and monilophytes, to reconstruct the evolutionary history and functional diversification of the GSK3 gene family in land plants. Results: Maximum Likelihood phylogenetic analyses resolve a gene tree with four major gene duplication events that coincide with the emergence of novel land plant clades. The single GSK3 gene inherited from the ancestor of land plants was first duplicated along the ancestral branch to extant vascular plants, and three subsequent duplications produced three GSK3 loci in the ancestor of euphyllophytes, four in the ancestor of seed plants, and at least five in the ancestor of angiosperms. A single gene in the Amborella trichopoda genome may be the sole survivor of a sixth GSK3 locus that originated in the ancestor of extant angiosperms. Homologs of two Arabidopsis GSK3 genes with genetically confirmed roles in floral development, AtSK11 and AtSK12, exhibit floral preferential expression in several basal angiosperms, suggesting evolutionary conservation of their floral functions. Members of other gene lineages appear to have independently evolved roles in plant reproductive tissues in individual taxa. Conclusions: Our phylogenetic analyses provide the most detailed reconstruction of GSK3 gene evolution in land plants to date and offer new insights into the origins, relationships, and functions of family members. Notably, the diversity of this “green” branch of the gene family has increased in concert with the increasing morphological and physiological complexity of land plant life forms. Expression data for seed plants indicate that the functions of GSK3 genes have also diversified during evolutionary time

Crossref

Springer - Publisher Connector

ZENODO

Dryad Digital Repository (Duke University)

Electronic Archiving System

A cross-species alignment tool (CAT)

Author: Guan Liang
Guo Yiran
Li Heng
Liu Tao
Wang Jun
Wong Gane Ka-Shu
Zheng Wei-Mou
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background The main two sorts of automatic gene annotation frameworks are <it>ab initio </it>and alignment-based, the latter splitting into two sub-groups. The first group is used for intra-species alignments, among which are successful ones with high specificity and speed. The other group contains more sensitive methods which are usually applied in aligning inter-species sequences. Results Here we present a new algorithm called <it>CAT </it>(for Cross-species Alignment Tool). It is designed to align mRNA sequences to mammalian-sized genomes. <it>CAT </it>is implemented using C scripts and is freely available on the web at <url>http://xat.sourceforge.net/</url>. Conclusions Examined from different angles, <it>CAT </it>outperforms other extant alignment tools. Tested against all available mouse-human and zebrafish-human orthologs, we demonstrate that <it>CAT </it>combines the specificity and speed of the best intra-species algorithms, like <it>BLAT </it>and <it>sim4</it>, with the sensitivity of the best inter-species tools, like <it>GeneWise</it>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Southern Denmark Research Output

Gene conversion in the rice genome

Author: Clark Terry
Li Ruiqiang
Vang Søren
Wang Jun
Wong Gane Ka-Shu
Xu Shuqing
Zheng Hongkun
Zheng Xiaoguang
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Gene conversion causes a non-reciprocal transfer of genetic information between similar sequences. Gene conversion can both homogenize genes and recruit point mutations thereby shaping the evolution of multigene families. In the rice genome, the large number of duplicated genes increases opportunities for gene conversion. Results To characterize gene conversion in rice, we have defined 626 multigene families in which 377 gene conversions were detected using the GENECONV program. Over 60% of the conversions we detected were between chromosomes. We found that the inter-chromosomal conversions distributed between chromosome 1 and 5, 2 and 6, and 3 and 5 are more frequent than genome average (Z-test, P < 0.05). The frequencies of gene conversion on the same chromosome decreased with the physical distance between gene conversion partners. Ka/Ks analysis indicates that gene conversion is not tightly linked to natural selection in the rice genome. To assess the contribution of segmental duplication on gene conversion statistics, we determined locations of conversion partners with respect to inter-chromosomal segment duplication. The number of conversions associated with segmentation is less than ten percent. Pseudogenes in the rice genome with low similarity to <it>Arabidopsis </it>genes showed greater likelihood for gene conversion than those with high similarity to <it>Arabidopsis </it>genes. Functional annotations suggest that at least 14 multigene families related to disease or bacteria resistance were involved in conversion events. Conclusion The evolution of gene families in the rice genome may have been accelerated by conversion with pseudogenes. Our analysis suggests a possible role for gene conversion in the evolution of pathogen-response genes.</p

Repository for Publications and Research Data

Springer - Publisher Connector

Directory of Open Access Journals

University of Queensland eSpace

Gene conversion in the rice genome

Author: Clark Terry
Li Ruiqiang
Vang Søren
Wang Jun
Wong Gane Ka-Shu
Xu Shuqing
Zheng Hongkun
Zheng Xiaoguang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/01/2014
Field of study

Background: Gene conversion causes a non-reciprocal transfer of genetic information between similar sequences. Gene conversion can both homogenize genes and recruit point mutations thereby shaping the evolution of multigene families. In the rice genome, the large number of duplicated genes increases opportunities for gene conversion. Results: To characterize gene conversion in rice, we have defined 626 multigene families in which 377 gene conversions were detected using the GENECONV program. Over 60% of the conversions we detected were between chromosomes. We found that the inter-chromosomal conversions distributed between chromosome 1 and 5, 2 and 6, and 3 and 5 are more frequent than genome average (Z-test, P < 0.05). The frequencies of gene conversion on the same chromosome decreased with the physical distance between gene conversion partners. Ka/Ks analysis indicates that gene conversion is not tightly linked to natural selection in the rice genome. To assess the contribution of segmental duplication on gene conversion statistics, we determined locations of conversion partners with respect to inter-chromosomal segment duplication. The number of conversions associated with segmentation is less than ten percent. Pseudogenes in the rice genome with low similarity to Arabidopsis genes showed greater likelihood for gene conversion than those with high similarity to Arabidopsis genes. Functional annotations suggest that at least 14 multigene families related to disease or bacteria resistance were involved in conversion events. Conclusion: The evolution of gene families in the rice genome may have been accelerated by conversion with pseudogenes. Our analysis suggests a possible role for gene conversion in the evolution of pathogen-response genes

KU ScholarWorks

Identification and characterization of insect-specific proteins by genome data analysis

Author: Clark Terry
Kang Le
Shi Junjie
Wang Hongsheng
Wang Jun
Wang Wen
Wang Xiaoling
Wong Gane Ka-Shu
Zhang Guojie
Zheng Hongkun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/01/2014
Field of study

Background: Insects constitute the vast majority of known species with their importance including biodiversity, agricultural, and human health concerns. It is likely that the successful adaptation of the Insecta clade depends on specific components in its proteome that give rise to specialized features. However, proteome determination is an intensive undertaking. Here we present results from a computational method that uses genome analysis to characterize insect and eukaryote proteomes as an approximation complementary to experimental approaches. Results: Homologs in common to Drosophila melanogaster, Anopheles gambiae, Bombyx mori, Tribolium castaneum, and Apis mellifera were compared to the complete genomes of three non-insect eukaryotes (opisthokonts) Homo sapiens, Caenorhabditis elegans and Saccharomyces cerevisiae. This operation yielded 154 groups of orthologous proteins in Drosophila to be insect-specific homologs; 466 groups were determined to be common to eukaryotes (represented by three opisthokonts). ESTs from the hemimetabolous insect Locust migratoria were also considered in order to approximate their corresponding genes in the insect-specific homologs. Stress and stimulus response proteins were found to constitute a higher fraction in the insect-specific homologs than in the homologs common to eukaryotes. Conclusion: The significant representation of stress response and stimulus response proteins in proteins determined to be insect-specific, along with specific cuticle and pheromone/odorant binding proteins, suggest that communication and adaptation to environments may distinguish insect evolution relative to other eukaryotes. The tendency for low Ka/Ks ratios in the insect-specific protein set suggests purifying selection pressure. The generally larger number of paralogs in the insect-specific proteins may indicate adaptation to environment changes. Instances in our insect-specific protein set have been arrived at through experiments reported in the literature, supporting the accuracy of our approach

KU ScholarWorks

ReAS: Recovery of Ancestral Sequences for Transposable Elements from the Unassembled Reads of a Whole Genome Shotgun

Author: Chen Ye
Gane Ka-Shu Wong
Huanming Yang
Jia Ye
Jian Wang
Jing Wang
Jun Wang
Jun Yu
Rice Chromosome 10 Sequencing Consortium
Ruiqiang Li
Songgang Li
Susan Baxter
Yujun Han
Publication venue: Public Library of Science
Publication date: 01/01/2005
Field of study

We describe an algorithm, ReAS, to recover ancestral sequences for transposable elements (TEs) from the unassembled reads of a whole genome shotgun. The main assumptions are that these TEs must exist at high copy numbers across the genome and must not be so old that they are no longer recognizable in comparison to their ancestral sequences. Tested on the japonica rice genome, ReAS was able to reconstruct all of the high copy sequences in the Repbase repository of known TEs, and increase the effectiveness of RepeatMasker in identifying TEs from genome sequences

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

RNA-Seq based phylogeny recapitulates previous phylogeny of the genus Flaveria (Asteraceae) with some modifications.

Author: Covshoff Sarah
Gowik Udo
Hibberd Julian M
Kelly Steve
Lu Haorong
Lyu Ming-Ju Amy
Mallmann Julia
Sage Rowan F
Stata Matt
Wei Xiaofeng
Westhoff Peter
Wong Gane Ka-Shu
Zhu Xin-Guang
Publication venue: BMC Evol Biol
Publication date: 18/06/2015
Field of study

BACKGROUND: The genus Flaveria has been extensively used as a model to study the evolution of C4 photosynthesis as it contains C3 and C4 species as well as a number of species that exhibit intermediate types of photosynthesis. The current phylogenetic tree of the genus Flaveria contains 21 of the 23 known Flaveria species and has been previously constructed using a combination of morphological data and three non-coding DNA sequences (nuclear encoded ETS, ITS and chloroplast encoded trnL-F). RESULTS: Here we developed a new strategy to update the phylogenetic tree of 16 Flaveria species based on RNA-Seq data. The updated phylogeny is largely congruent with the previously published tree but with some modifications. We propose that the data collection method provided in this study can be used as a generic method for phylogenetic tree reconstruction if the target species has no genomic information. We also showed that a "F. pringlei" genotype recently used in a number of labs may be a hybrid between F. pringlei (C3) and F. angustifolia (C3-C4). CONCLUSIONS: We propose that the new strategy of obtaining phylogenetic sequences outlined in this study can be used to construct robust trees in a larger number of taxa. The updated Flaveria phylogenetic tree also supports a hypothesis of stepwise and parallel evolution of C4 photosynthesis in the Flavaria clade

Crossref

PubMed Central

Apollo (Cambridge)

TreeFam: a curated database of phylogenetic trees of animal gene families

Author: Bolund Lars
Coghlan Avril
Coin Lachlan James
Dehal Paramvir
Durbin Richard
Hériché Jean-Karim
Li Heng
Li Ruiqiang
Liu Tao
Osmotherly Lara
Ruan Jue
Wang Jun
Wong Gane Ka-Shu
Zhang Zhang
Zheng Weimou
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

TreeFam is a database of phylogenetic trees of gene families found in animals. It aims to develop a curated resource that presents the accurate evolutionary history of all animal gene families, as well as reliable ortholog and paralog assignments. Curated families are being added progressively, based on seed alignments and trees in a similar fashion to Pfam. Release 1.1 of TreeFam contains curated trees for 690 families and automatically generated trees for another 11 646 families. These represent over 128 000 genes from nine fully sequenced animal genomes and over 45 000 other animal proteins from UniProt; ∼40–85% of proteins encoded in the fully sequenced animal genomes are included in TreeFam. TreeFam is freely available at and

Crossref

PubMed Central

University of Southern Denmark Research Output

University of Melbourne Institutional Repository

University of Queensland eSpace

ChickVD: a sequence variation database for the chicken genome

Author: Burt David W.
Chen Jie
Cong Lijuan
Dai Mingtao
Fang Lin
He Ximiao
Hu Yafeng
Li Shengting
Li Songgang
Liu Bin
Ruan Jue
Wang Jian
Wang Jing
Wang Jun
Wong Gane Ka-Shu
Yang Huanming
Ye Chen
Yu Jun
Zhang Yong
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

Working in parallel with the efforts to sequence the chicken (Gallus gallus) genome, the Beijing Genomics Institute led an international team of scientists from China, USA, UK, Sweden, The Netherlands and Germany to map extensive DNA sequence variation throughout the chicken genome by sampling DNA from domestic breeds. Using the Red Jungle Fowl genome sequence as a reference, we identified 3.1 million non-redundant DNA sequence variants. To facilitate the application of our data to avian genetics and to provide a foundation for functional and evolutionary studies, we created the ‘Chicken Variation Database’ (ChickVD). A graphical MapView shows variants mapped onto the chicken genome in the context of gene annotations and other features, including genetic markers, trait loci, cDNAs, chicken orthologs of human disease genes and raw sequence traces. ChickVD also stores information on quantitative trait loci using data from collaborating institutions and public resources. Our data can be queried by search engine and homology-based BLAST searches. ChickVD is publicly accessible at http://chicken.genomics.org.cn

Crossref

PubMed Central

Edinburgh Research Explorer

University of Southern Denmark Research Output

University of Queensland eSpace

BGI-RIS: An integrated information resource and comparative analysis workbench for rice genomics

Author: Chen Ye
Dai Mingtao
Fu Jian
He Ximiao
Huang Xiaobing
Jiao Yongzhi
Li Songgang
Ni Peixiang
Ren Xiaoyu
Wang Jian
Wang Jing
Wang Jun
Wei Shulin
Wong Gane Ka-Shu
Yang Huanming
Yu Jun
Zhang Jianguo
Zhang Yong
Zhao Hongyu
Zhao Wenming
Publication venue
Publication date: 01/01/2004
Field of study

Rice is a major food staple for the world's population and serves as a model species in cereal genome research. The Beijing Genomics Institute (BGI) has long been devoting itself to sequencing, information analysis and biological research of the rice and other crop genomes. In order to facilitate the application of the rice genomic information and to provide a foundation for functional and evolutionary studies of other important cereal crops, we implemented our Rice Information System (BGI-RIS), the most up-to-date integrated information resource as well as a workbench for comparative genomic analysis. In addition to comprehensive data from Oryza sativa L. ssp. indica sequenced by BGI, BGI-RIS also hosts carefully curated genome information from Oryza sativa L. ssp. japonica and EST sequences available from other cereal crops. In this resource, sequence contigs of indica (93-11) have been further assembled into Mbp-sized scaffolds and anchored onto the rice chromosomes referenced to physical/genetic markers, cDNAs and BAC-end sequences. We have annotated the rice genomes for gene content, repetitive elements, gene duplications (tandem and segmental) and single nucleotide polymorphisms between rice subspecies. Designed as a basic platform, BGI-RIS presents the sequenced genomes and related information in systematic and graphical ways for the convenience of in-depth comparative studie

CiteSeerX

PubMed Central

University of Southern Denmark Research Output