    Diminishing Return for Increased Mappability with Longer Sequencing Reads: Implications of the k-mer Distributions in the Human Genome

    The amount of non-unique sequence (non-singletons) in a genome directly affects the difficulty of read alignment to a reference assembly for high-throughput sequencing data. Although a greater length increases the chance of reads being uniquely mapped to the reference genome, a quantitative analysis of the influence of read lengths on mappability has been lacking. To address this question, we evaluate the k-mer distribution of the human reference genome. The k-mer frequency is determined for k ranging from 20 to 1000 basepairs. We use the proportion of non-singleton k-mers to evaluate the mappability of reads for a corresponding read length. We observe that the proportion of non-singletons decreases slowly with increasing k, and can be fitted by piecewise power-law functions with different exponents at different k ranges. A faster decay at smaller values of k indicates more limited gains for read lengths > 200 basepairs. The frequency distributions of k-mers exhibit long tails in a power-law-like trend, and rank frequency plots exhibit a concave Zipf's curve. The locations of the most frequent 1000-mers comprise 172 kilobase-ranged regions, including four large stretches on chromosomes 1 and X, containing genes with biomedical implications. Even a read length of 1000 would be insufficient to reliably sequence these specific regions. Comment: 5 figures

    Selection, tinkering and emergence in complex networks: crossing the land of tinkering

    Complex biological networks have very different origins than technologic ones. The latter involve extensive design and, as engineered structures, include a high level of optimization. The former involve (in principle) contingency and structural constraints, with new structures being incorporated through tinkering with previously evolved modules or units. However, the observation of the topological features of different biological nets suggests that nature can have a limited repertoire of "attractors" that essentially optimize communication under some basic constraints of cost and architecture or that allow the biological nets to reach a high degree of homeostasis. Conversely, the topological features exhibited by some technology graphs indicate that tinkering and internal constraints play a key role, in spite of the "designed" nature of these structures. Previous scenarios suggested to explain the overall trends of evolution are re-analyzed in light of topological patterns. Peer Reviewed. Postprint (author's final draft)

    Genome-wide transcriptomics analysis identifies sox7 and sox18 as specifically regulated by gata4 in cardiomyogenesis

    This work was supported by the British Heart Foundation (BHF Project Grant no. PG/13/23/30080 to B.A.A. and S.H.), the Biotechnology and Biological Sciences Research Council (BB/M001695/1 to S.H.) and the University of Aberdeen (for A.T.L.). Acknowledgements: We are grateful to Ms Yvonne Turnbull and Ms Kate Watt for technical assistance and lab management. We would like to thank Professor Cedric Blanpain and Dr Xionghui Li from Université Libre de Bruxelles for providing training in ES cell manipulation and Mesp1/Gata4 cell lines. We are grateful to Professor Todd Evans from Weill Cornell Medical College for generously providing iGata ES cell lines. We would also like to thank Professor Aaron Zorn and Scott Rankin for providing the Xsox18 plasmid. Peer reviewed. Publisher PDF

    MicroR159 regulation of most conserved targets in Arabidopsis has negligible phenotypic effects

    BACKGROUND: A current challenge of microRNA (miRNA) research is the identification of biologically relevant miRNA:target gene relationships. In plants, high miRNA:target gene complementarity has enabled accurate target predictions, and slicing of target mRNAs has facilitated target validation through rapid amplification of 5' cDNA ends (5'-RACE) analysis. Together, these approaches have identified more than 20 targets potentially regulated by the deeply conserved miR159 family in Arabidopsis, including eight MYB genes with highly conserved miR159 target sites. However, genetic analysis has revealed that the functional specificity of the major family members, miR159a and miR159b, is limited to only two targets, MYB33 and MYB65. Here, we examine the functional role of miR159 regulation for the other potential MYB target genes. RESULTS: For these target genes, functional analysis failed to identify miR159 regulation that resulted in any major phenotypic impact, either at the morphological or molecular level. This appears to be mainly due to the quiescent nature of the remaining family member, MIR159c. Although its expression overlaps in a temporal and spatial cell-specific manner with a subset of these targets in anthers, the abundance of miR159c is extremely low and concomitantly a mir159c mutant displays no anther defects. Examination of potential miR159c targets with conserved miR159 binding sites found that neither their spatial nor their temporal expression domains appeared to be miR159 regulated, despite the detection of miR159-guided cleavage products by 5'-RACE. Moreover, expression of a miR159-resistant target (mMYB101) resulted predominantly in plants that are indistinguishable from wild type. Plants that displayed altered morphological phenotypes were found to be ectopically expressing the mMYB101 transgene, and hence were misrepresentative of the in vivo functional role of miR159.
    CONCLUSIONS: This study presents a novel explanation for a paradox common to plant and animal miRNA systems, where among many potential miRNA-target relationships usually only a few appear physiologically relevant. The identification of a quiescent miR159c:target gene regulatory module in anthers provides a likely rationale for the presence of conserved miR159 binding sites in many targets for which miR159 regulation has no obvious functional role. Remnants from the demise of such modules may lead to an overestimation of miRNA regulatory complexity when investigated using bioinformatic, 5'-RACE or transgenic approaches. RSA was funded by an ANU postgraduate scholarship and by a CSIRO Emerging Science Initiative. JL is the recipient of an ANU international student postgraduate scholarship. This research was supported by an Australian Research Council grant DP0773270

    Maternal Expression Relaxes Constraint on Innovation of the Anterior Determinant, bicoid

    The origin of evolutionary novelty is believed to involve both positive selection and relaxed developmental constraint. In flies, the redesign of anterior patterning during embryogenesis is a major developmental innovation and the rapidly evolving Hox gene, bicoid (bcd), plays a critical role. We report evidence for relaxation of selective constraint acting on bicoid as a result of its maternal pattern of gene expression. Evolutionary theory predicts 2-fold greater sequence diversity for maternal effect genes than for zygotically expressed genes, because natural selection is only half as effective acting on autosomal genes expressed in one sex as it is on genes expressed in both sexes. We sample an individual from ten populations of Drosophila melanogaster and nine populations of D. simulans for polymorphism in the tandem gene duplicates bcd, which is maternally expressed, and zerknüllt (zen), which is zygotically expressed. In both species, we find the ratio of bcd to zen nucleotide diversity to be two or more in the coding regions but one in the noncoding regions, providing the first quantitative support for the theoretical prediction of relaxed selective constraint on maternal-effect genes resulting from sex-limited expression. Our results suggest that the accelerated rate of evolution observed for bcd is owing, at least partly, to variation generated by relaxed selective constraint

    Temporal tracking of mineralization and transcriptional developments of shell formation during the early life history of pearl oyster Pinctada maxima

    Molluscan larval ontogeny is a highly conserved process comprising three principal developmental stages. A characteristic unique to each of these stages is shell design, termed prodissoconch I, prodissoconch II and dissoconch. These shells vary in morphology, mineralogy and microstructure. The discrete temporal transitions in shell biomineralization between these larval stages are utilized in this study to investigate transcriptional involvement in several distinct biomineralization events. Scanning electron microscopy and X-ray diffraction analysis of P. maxima larvae and juveniles collected throughout post-embryonic ontogenesis document the mineralogy and microstructure of each shelled stage and establish a timeline for transitions in biomineralization. P. maxima larval samples most representative of these biomineralization distinctions and transitions were analyzed for differential gene expression on the microarray platform PmaxArray 1.0. A number of transcripts are reported as differentially expressed in correlation with the mineralization events of P. maxima larval ontogeny. Some of those isolated are known shell matrix genes while others are novel; these are discussed in relation to potential shell formation roles. This interdisciplinary investigation has linked the shell developments of P. maxima larval ontogeny with corresponding gene expression profiles, furthering the elucidation of shell biomineralization

    Transcriptional Regulation: a Genomic Overview

    The availability of the Arabidopsis thaliana genome sequence allows a comprehensive analysis of transcriptional regulation in plants using novel genomic approaches and methodologies. Such a genomic view of transcription first necessitates the compilation of lists of elements. Transcription factors are the most numerous of the different types of proteins involved in transcription in eukaryotes, and the Arabidopsis genome codes for more than 1,500 of them, or approximately 6% of its total number of genes. A genome-wide comparison of transcription factors across the three eukaryotic kingdoms reveals the evolutionary generation of diversity in the components of the regulatory machinery of transcription. However, as illustrated by Arabidopsis, transcription in plants follows similar basic principles and logic to those in animals and fungi. A global view and understanding of transcription at a cellular and organismal level requires the characterization of the Arabidopsis transcriptome and promoterome, as well as of the interactome, the localizome, and the phenome of the proteins involved in transcription

    Evolutionary processes from the perspective of flowering time diversity.

    Although it is well appreciated that genetic studies of flowering time regulation have led to fundamental advances in the fields of molecular and developmental biology, the ways in which genetic studies of flowering time diversity have enriched the field of evolutionary biology have received less attention despite often being equally profound. Because flowering time is a complex, environmentally responsive trait that has critical impacts on plant fitness, crop yield, and reproductive isolation, research into the genetic architecture and molecular basis of its evolution continues to yield novel insights into our understanding of domestication, adaptation, and speciation. For instance, recent studies of flowering time variation have reconstructed how, when, and where polygenic evolution of phenotypic plasticity proceeded from standing variation and de novo mutations; shown how antagonistic pleiotropy and temporally varying selection maintain polymorphisms in natural populations; and provided important case studies of how assortative mating can evolve and facilitate speciation with gene flow. In addition, functional studies have built detailed regulatory networks for this trait in diverse taxa, leading to new knowledge about how and why developmental pathways are rewired and elaborated through evolutionary time