13 research outputs found

    Gene and repetitive sequence annotation in the Triticeae

    Full text link
    The Triticeae tribe contains some of the world’s most important agricultural crops (wheat, barley and rye) and is perhaps, one of the most challenging for genome annotation because Triticeae genomes are primarily composed of repetitive sequences. Further complicating the challenge is the polyploidy found in wheat and particularly in the hexaploid bread wheat genome. Genomic sequence data are available for the Triticeae in the form of large collections of Expressed Sequence Tags (>1.5 million) and an increasing number of bacterial artificial chromosome clone sequences. Given that high repetitive sequence content in the Triticeae confounds annotation of protein-coding genes, repetitive sequences have been identified, annotated, and collated into public databases. Protein coding genes in the Triticeae are structurally annotated using a combination of ab initio gene finders and experimental evidence. Functional annotation of protein coding genes involves assessment of sequence similarity to known proteins, expression evidence, and the presence of domain and motifs. Annotation methods and tools for Triticeae genomic sequences have been adapted from existing plant genome annotation projects and were designed to allow for flexibility of single sequence annotation while allowing a whole community annotation effort to be developed. With the availability of an increasing number of annotated grass genomes, comparative genomics can be exploited to accelerate and enhance the quality of Triticeae sequences annotation. This chapter provides a brief overview of the Triticeae genomes features that are challenging for genome annotation and describes the resources and methods available for sequence annotation with a particular emphasis on problems caused by the repetitive fraction of these genomes

    Physical molecular maps of wheat chromosomes

    No full text
    In bread wheat, a set of 527 simple sequence repeats (SSRs) were tried on 164 deletion lines, leading to a successful mapping of 270 SSRs on 313 loci covering all 21 chromosomes. A maximum of 119 loci (38%) were located on B subgenome, and a minimum of 90 loci (29%) mapped on D subgenome. Similarly, homoeologous group 7 carried a maximum of 61 loci (19%), and group 4 carried a minimum of 22 loci (7%). Of the cited 270 SSRs, 39 had multiple loci, but only eight of these detected homoeologous loci. Linear order of loci in physical maps largely corresponded with those in the genetic maps. Apparently, distances between each of only 26 pairs of loci significantly differed from the corresponding distances on genetic maps. Some loci, which were genetically mapped close to the centromere, were physically located distally, while other loci that were mapped distally in the genetic maps were located in the proximal bins in the physical maps. This suggested that although the linear order of the loci was largely conserved, variation does exist between genetic and physical distances
    corecore