Search CORE

Reliability and applications of statistical methods based on oligonucleotide frequencies in bacterial and archaeal genomes

Author: Bohlin Jon
Skjerve Eystein
Ussery David W
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background The increasing number of sequenced prokaryotic genomes contains a wealth of genomic data that needs to be effectively analysed. A set of statistical tools exists for such analysis, but their strengths and weaknesses have not been fully explored. The statistical methods we are concerned with here are mainly used to examine similarities between archaeal and bacterial DNA from different genomes. These methods compare observed genomic frequencies of fixed-sized oligonucleotides with expected values, which can be determined by genomic nucleotide content, smaller oligonucleotide frequencies, or be based on specific statistical distributions. Advantages with these statistical methods include measurements of phylogenetic relationship with relatively small pieces of DNA sampled from almost anywhere within genomes, detection of foreign/conserved DNA, and homology searches. Our aim was to explore the reliability and best suited applications for some popular methods, which include relative oligonucleotide frequencies (ROF), di- to hexanucleotide zero'th order Markov methods (ZOM) and 2.order Markov chain Method (MCM). Tests were performed on distant homology searches with large DNA sequences, detection of foreign/conserved DNA, and plasmid-host similarity comparisons. Additionally, the reliability of the methods was tested by comparing both real and random genomic DNA. Results Our findings show that the optimal method is context dependent. ROFs were best suited for distant homology searches, whilst the hexanucleotide ZOM and MCM measures were more reliable measures in terms of phylogeny. The dinucleotide ZOM method produced high correlation values when used to compare real genomes to an artificially constructed random genome with similar %GC, and should therefore be used with care. The tetranucleotide ZOM measure was a good measure to detect horizontally transferred regions, and when used to compare the phylogenetic relationships between plasmids and hosts, significant correlation <it>(R</it>2 = <it>0.4) </it>was found with genomic GC content and intra-chromosomal homogeneity. Conclusion The statistical methods examined are fast, easy to implement, and powerful for a number of different applications involving genomic sequence comparisons. However, none of the measures examined were superior in all tests, and therefore the choice of the statistical method should depend on the task at hand.</p

Online Research Database In Technology

Analysis of genomic signatures in prokaryotes using multinomial regression and hierarchical clustering

Author: Bohlin Jon
Skjerve Eystein
Ussery David W
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Springer - Publisher Connector

Online Research Database In Technology

Investigations of Oligonucleotide Usage Variance Within and Between Prokaryotes

Author: David W. Ussery
Eystein Skjerve
Gary Stormo
Jon Bohlin
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Oligonucleotide usage in archaeal and bacterial genomes can be linked to a number of properties, including codon usage (trinucleotides), DNA base-stacking energy (dinucleotides), and DNA structural conformation (di- to tetranucleotides). We wanted to assess the statistical information potential of different DNA ‘word-sizes’ and explore how oligonucleotide frequencies differ in coding and non-coding regions. In addition, we used oligonucleotide frequencies to investigate DNA composition and how DNA sequence patterns change within and between prokaryotic organisms. Among the results found was that prokaryotic chromosomes can be described by hexanucleotide frequencies, suggesting that prokaryotic DNA is predominantly short range correlated, i.e., information in prokaryotic genomes is encoded in short oligonucleotides. Oligonucleotide usage varied more within AT-rich and host-associated genomes than in GC-rich and free-living genomes, and this variation was mainly located in non-coding regions. Bias (selectional pressure) in tetranucleotide usage correlated with GC content, and coding regions were more biased than non-coding regions. Non-coding regions were also found to be approximately 5.5% more AT-rich than coding regions, on average, in the 402 chromosomes examined. Pronounced DNA compositional differences were found both within and between AT-rich and GC-rich genomes. GC-rich genomes were more similar and biased in terms of tetranucleotide usage in non-coding regions than AT-rich genomes. The differences found between AT-rich and GC-rich genomes may possibly be attributed to lifestyle, since tetranucleotide usage within host-associated bacteria was, on average, more dissimilar and less biased than free-living archaea and bacteria

Public Library of Science (PLOS)

Online Research Database In Technology

The effect of vaccine-associated cross-stitch vertebrae pathology on growth of farmed Atlantic salmon

Author: Bæverfjord Grete
Negaard Paul
Peña Patricio
Skjerve Eystein
Thorarinsson Ragnar
Publication venue: 'Wiley'
Publication date: 01/01/2023
Field of study

publishedVersio

Brage NMBU

NOFIMA Repository

Q fever abortions in ruminants and associated on-farm risk factors in northern Cyprus

Author: Cantas Hasan
Muwonge Adrian
Sareyyupoglu Baris
Skjerve Eystein
Yardimci Hakan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Springer - Publisher Connector

Edinburgh Research Explorer

Non-Tuberculous Mycobacteria in Uganda: A Problem or Not?

Author: Adrian Muwonge
Ashemeire Patience
Clovice Kankya
Demelash Biffa
Eystein Skjerve
James Oloya
Publication venue: 'IntechOpen'
Publication date: 26/10/2011
Field of study

IntechOpen

Characterization of some Brucella species from Zimbabwe by biochemical profiling and AMOS-PCR

Author: Bhebhe Evison
Djønne Berit
Matope Gift
Muma John Bwalya
Skjerve Eystein
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

This is an Open Access article distributed under the terms of the Creative Commons Attribution Licens

CiteSeerX

Springer - Publisher Connector

Biblioteca Digital de Castilla y León

La crisis castellana : medios que deben adoptarse para conjurarla

Author: Bergström Annika
Boge Gudrun Seeberg
Dimopoulou Maria
Moldal Elena Regine
Skjerve Eystein
Publication venue: Hospicio Provincial
Publication date: 01/01/1884
Field of study

Abstract Background A retrospective case–control study was conducted to estimate breed predisposition for common orthopaedic conditions in 12 popular dog breeds in Norway and Sweden. Orthopaedic conditions investigated were elbow dysplasia (ED); cranial cruciate ligament disease (CCLD); medial patellar luxation (MPL); and fractures of the radius and ulna. Dogs surgically treated for the conditions above at the Swedish and Norwegian University Animal Hospitals between the years 2011 and 2015 were compared with a geographically adjusted control group calculated from the national ID-registries. Logistic regression analyses (stratified for clinic and combined) were used to calculate odds ratios (OR) and 95% confidence intervals. Mixed breed dogs were used as reference. Results Breeds found at-risk for ED were the Labrador retriever (OR = 5.73), the Rottweiler (OR = 5.63), the German shepherd dog (OR = 3.31) and the Staffordshire bull terrier (OR = 3.08). The Chihuahua was the only breed where an increased risk for MPL (OR = 2.80) was identified. While the Rottweiler was the only breed predisposed for CCLD (OR = 3.96), the results were conflicting for the Labrador retriever (OR = 0.44 in Sweden, 2.85 in Norway); the overall risk was identical to mixed-breed dogs. Conclusions Most results are in concordance with earlier studies. However, an increased risk of CCLD was not identified for the Labrador retriever, the Staffordshire bull terrier was found to have an increased risk of ED and some country-specific differences were noted. These results highlight the importance of utilising large caseloads and appropriate control groups when breed susceptibility is reported

Brage NMBU