90 research outputs found
Genome comparison using Gene Ontology (GO) with statistical testing
BACKGROUND: Automated comparison of complete sets of genes encoded in two genomes can provide insight on the genetic basis of differences in biological traits between species. Gene ontology (GO) is used as a common vocabulary to annotate genes for comparison. Current approaches calculate the fold of unweighted or weighted differences between two species at the high-level GO functional categories. However, to ensure the reliability of the differences detected, it is important to evaluate their statistical significance. It is also useful to search for differences at all levels of GO. RESULTS: We propose a statistical approach to find reliable differences between the complete sets of genes encoded in two genomes at all levels of GO. The genes are first assigned GO terms from BLAST searches against genes with known GO assignments, and for each GO term the abundance of genes in the two genomes is compared using a chi-squared test followed by false discovery rate (FDR) correction. We applied this method to find statistically significant differences between two cyanobacteria, Synechocystis sp. PCC6803 and Anabaena sp. PCC7120. We then studied how the set of identified differences vary when different BLAST cutoffs are used. We also studied how the results vary when only subsets of the genes were used in the comparison of human vs. mouse and that of Saccharomyces cerevisiae vs. Schizosaccharomyces pombe. CONCLUSION: There is a surprising lack of statistical approaches for comparing complete genomes at all levels of GO. With the rapid increase of the number of sequenced genomes, we hope that the approach we proposed and tested can make valuable contribution to comparative genomics
Correlation between sequence conservation and structural thermodynamics of microRNA precursors from human, mouse, and chicken genomes
<p>Abstract</p> <p>Background</p> <p>Previous studies have shown that microRNA precursors (pre-miRNAs) have considerably more stable secondary structures than other native RNAs (tRNA, rRNA, and mRNA) and artificial RNA sequences. However, pre-miRNAs with ultra stable secondary structures have not been investigated. It is not known if there is a tendency in pre-miRNA sequences towards or against ultra stable structures? Furthermore, the relationship between the structural thermodynamic stability of pre-miRNA and their evolution remains unclear.</p> <p>Results</p> <p>We investigated the correlation between pre-miRNA sequence conservation and structural stability as measured by adjusted minimum folding free energies in pre-miRNAs isolated from human, mouse, and chicken. The analysis revealed that conserved and non-conserved pre-miRNA sequences had structures with similar average stabilities. However, the relatively ultra stable and unstable pre-miRNAs were more likely to be non-conserved than pre-miRNAs with moderate stability. Non-conserved pre-miRNAs had more G+C than A+U nucleotides, while conserved pre-miRNAs contained more A+U nucleotides. Notably, the U content of conserved pre-miRNAs was especially higher than that of non-conserved pre-miRNAs. Further investigations showed that conserved and non-conserved pre-miRNAs exhibited different structural element features, even though they had comparable levels of stability.</p> <p>Conclusions</p> <p>We proposed that there is a correlation between structural thermodynamic stability and sequence conservation for pre-miRNAs from human, mouse, and chicken genomes. Our analyses suggested that pre-miRNAs with relatively ultra stable or unstable structures were less favoured by natural selection than those with moderately stable structures. Comparison of nucleotide compositions between non-conserved and conserved pre-miRNAs indicated the importance of U nucleotides in the pre-miRNA evolutionary process. Several characteristic structural elements were also detected in conserved pre-miRNAs.</p
Hybrid phase-change Lattice Boltzmann simulation of vapor condensation on vertical subcooled walls
Saturated vapor condensation on homogenous and heterogeneous subcooled walls is presented in this study by adopting a hybrid phase-change multiple-relaxation-time Lattice Boltzmann model. The effects of wall wettability on the condensation process, including droplets’ growth, coalescence and falling, and the influence of vapor flow to condensation are investigated. The results demonstrate that the heat fluxes around the triple-phase contact lines are higher than that in other cold areas in homogeneous subcooled walls, which actually indicates the fact that filmwise condensation is preventing the continuous condensation process. Furthermore, the dropwise condensation can be formed more easily on the heterogeneous surface with a mixed surface wettability. At last, the dynamic process of condensation of continuous vapor flow is also investigated by considering the homogenous and heterogeneous subcooled surfaces. The results show that the heterogeneous surface with mixed wettability doesn’t contribute to the formation, growth of droplets, when compared to the homogeneous surface. It is expected that this study can bring more attentions to simulate condensation using multiphase LBM for complex geometries in heat transfer community
Tiling microarray analysis of rice chromosome 10 to identify the transcriptome and relate its expression to chromosomal architecture
BACKGROUND: Sequencing and annotation of the genome of rice (Oryza sativa) have generated gene models in numbers that top all other fully sequenced species, with many lacking recognizable sequence homology to known genes. Experimental evaluation of these gene models and identification of new models will facilitate rice genome annotation and the application of this knowledge to other more complex cereal genomes. RESULTS: We report here an analysis of the chromosome 10 transcriptome of the two major rice subspecies, japonica and indica, using oligonucleotide tiling microarrays. This analysis detected expression of approximately three-quarters of the gene models without previous experimental evidence in both subspecies. Cloning and sequence analysis of the previously unsupported models suggests that the predicted gene structure of nearly half of those models needs improvement. Coupled with comparative gene model mapping, the tiling microarray analysis identified 549 new models for the japonica chromosome, representing an 18% increase in the annotated protein-coding capacity. Furthermore, an asymmetric distribution of genome elements along the chromosome was found that coincides with the cytological definition of the heterochromatin and euchromatin domains. The heterochromatin domain appears to associate with distinct chromosome level transcriptional activities under normal and stress conditions. CONCLUSION: These results demonstrated the utility of genome tiling microarray in evaluating annotated rice gene models and in identifying novel transcriptional units. The tiling microarray sanalysis further revealed a chromosome-wide transcription pattern that suggests a role for transposable element-enriched heterochromatin in shaping global transcription in response to environmental changes in rice
ReAS: Recovery of Ancestral Sequences for Transposable Elements from the Unassembled Reads of a Whole Genome Shotgun
We describe an algorithm, ReAS, to recover ancestral sequences for transposable elements (TEs) from the unassembled reads of a whole genome shotgun. The main assumptions are that these TEs must exist at high copy numbers across the genome and must not be so old that they are no longer recognizable in comparison to their ancestral sequences. Tested on the japonica rice genome, ReAS was able to reconstruct all of the high copy sequences in the Repbase repository of known TEs, and increase the effectiveness of RepeatMasker in identifying TEs from genome sequences
Identification of genes regulated by Wnt/β-catenin pathway and involved in apoptosis via microarray analysis
BACKGROUND: Wnt/β-catenin pathway has critical roles in development and oncogenesis. Although significant progress has been made in understanding the downstream signaling cascade of this pathway, little is known regarding Wnt/β-catenin pathway modification of the cellular apoptosis. METHODS: To identify potential genes regulated by Wnt/β-catenin pathway and involved in apoptosis, we used a stably integrated, inducible RNA interference (RNAi) vector to specific inhibit the expression and the transcriptional activity of β-catenin in HeLa cells. Meanwhile, we designed an oligonucleotide microarray covering 1384 apoptosis-related genes. Using oligonucleotide microarrays, a series of differential expression of genes was identified and further confirmed by RT-PCR. RESULTS: Stably integrated inducible RNAi vector could effectively suppress β-catenin expression and the transcriptional activity of β-catenin/TCF. Meanwhile, depletion of β-catenin in this manner made the cells more sensitive to apoptosis. 130 genes involved in some important cell-apoptotic pathways, such as PTEN-PI3K-AKT pathway, NF-κB pathway and p53 pathway, showed significant alteration in their expression level after the knockdown of β-catenin. CONCLUSION: Coupling RNAi knockdown with microarray and RT-PCR analyses proves to be a versatile strategy for identifying genes regulated by Wnt/β-catenin pathway and for a better understanding the role of this pathway in apoptosis. Some of the identified β-catenin/TCF directed or indirected target genes may represent excellent targets to limit tumor growth
The R Protein of SARS-CoV: Analyses of Structure and Function Based on Four Complete Genome Sequences of Isolates BJ01-BJ04
The R (replicase) protein is the uniquely defined non-structural protein (NSP) responsible for RNA replication, mutation rate or fidelity, regulation of transcription in coronaviruses and many other ssRNA viruses. Based on our complete genome sequences of four isolates (BJ01-BJ04) of SARS-CoV from Beijing, China, we analyzed the structure and predicted functions of the R protein in comparison with 13 other isolates of SARS-CoV and 6 other coronaviruses. The entire ORF (open-reading frame) encodes for two major enzyme activities, RNA-dependent RNA polymerase (RdRp) and proteinase activities. The R polyprotein undergoes a complex proteolytic process to produce 15 function-related peptides. A hydrophobic domain (HOD) and a hydrophilic domain (HID) are newly identified within NSP1. The substitution rate of the R protein is close to the average of the SARS-CoV genome. The functional domains in all NSPs of the R protein give different phylogenetic results that suggest their different mutation rate under selective pressure. Eleven highly conserved regions in RdRp and twelve cleavage sites by 3CLP (chymotrypsin-like protein) have been identified as potential drug targets. Findings suggest that it is possible to obtain information about the phylogeny of SARS-CoV, as well as potential tools for drug design, genotyping and diagnostics of SARS
ChickVD: a sequence variation database for the chicken genome
Working in parallel with the efforts to sequence the chicken (Gallus gallus) genome, the Beijing Genomics Institute led an international team of scientists from China, USA, UK, Sweden, The Netherlands and Germany to map extensive DNA sequence variation throughout the chicken genome by sampling DNA from domestic breeds. Using the Red Jungle Fowl genome sequence as a reference, we identified 3.1 million non-redundant DNA sequence variants. To facilitate the application of our data to avian genetics and to provide a foundation for functional and evolutionary studies, we created the ‘Chicken Variation Database’ (ChickVD). A graphical MapView shows variants mapped onto the chicken genome in the context of gene annotations and other features, including genetic markers, trait loci, cDNAs, chicken orthologs of human disease genes and raw sequence traces. ChickVD also stores information on quantitative trait loci using data from collaborating institutions and public resources. Our data can be queried by search engine and homology-based BLAST searches. ChickVD is publicly accessible at http://chicken.genomics.org.cn
Arabidopsis Hormone Database: a comprehensive genetic and phenotypic information database for plant hormone research in Arabidopsis
Plant hormones are small organic molecules that influence almost every aspect of plant growth and development. Genetic and molecular studies have revealed a large number of genes that are involved in responses to numerous plant hormones, including auxin, gibberellin, cytokinin, abscisic acid, ethylene, jasmonic acid, salicylic acid, and brassinosteroid. Here, we develop an Arabidopsis hormone database, which aims to provide a systematic and comprehensive view of genes participating in plant hormonal regulation, as well as morphological phenotypes controlled by plant hormones. Based on data from mutant studies, transgenic analysis and gene ontology (GO) annotation, we have identified a total of 1026 genes in the Arabidopsis genome that participate in plant hormone functions. Meanwhile, a phenotype ontology is developed to precisely describe myriad hormone-regulated morphological processes with standardized vocabularies. A web interface (http://ahd.cbi.pku.edu.cn) would allow users to quickly get access to information about these hormone-related genes, including sequences, functional category, mutant information, phenotypic description, microarray data and linked publications. Several applications of this database in studying plant hormonal regulation and hormone cross-talk will be presented and discussed
Fabrication of Porous Scaffolds with a Controllable Microstructure and Mechanical Properties by Porogen Fusion Technique
Macroporous scaffolds with controllable pore structure and mechanical properties were fabricated by a porogen fusion technique. Biodegradable material poly (d, l-lactide) (PDLLA) was used as the scaffold matrix. The effects of porogen size, PDLLA concentration and hydroxyapatite (HA) content on the scaffold morphology, porosity and mechanical properties were investigated. High porosity (90% and above) and highly interconnected structures were easily obtained and the pore size could be adjusted by varying the porogen size. With the increasing porogen size and PDLLA concentration, the porosity of scaffolds decreases, while its mechanical properties increase. The introduction of HA greatly increases the impact on pore structure, mechanical properties and water absorption ability of scaffolds, while it has comparatively little influence on its porosity under low HA contents. These results show that by adjusting processing parameters, scaffolds could afford a controllable pore size, exhibit suitable pore structure and high porosity, as well as good mechanical properties, and may serve as an excellent substrate for bone tissue engineering
- …