Search CORE

12 research outputs found

Hierarchical structure of cascade of primary and secondary periodicities in Fourier power spectrum of alphoid higher order repeats

Author: A Arneodo
A Arneodo
A Puente de la
A Som
A Weiss
AK Brodzik
AL Jorgensen
AM Lynn
AR Fuentes
B Borštnik
B Haubold
BD Silverman
BR Kim
C Lee
C Tyler-Smith
C Yin
CA Chatzidimitriou-Dreismann
CA Chatzidimitriou-Dreismann
CC Yin
CK Peng
CK Peng
D Anastassiou
D Holste
D Kotlar
D Larhammar
D Sharma
DC Benson
DD Mauresan
DG Arques
E Coward
E Coward
E Pizzi
EA Cleever
EN Trifonov
EN Trifonov
EPC Rocha
EV Korotkov
EV Korotkov
G Bernardi
G Dodin
GI Kutuzova
H Herzel
H Herzel
H Herzel
HE Stanley
HE Stanley
I Dunham
IA Alexandrov
Ivan Basar
J Felsenstein
J Gao
J Jin
J Widom
JH Jackson
JM Gutierez
JS Waye
JS Waye
JW Fickett
JW Fickett
KHA Cho
L Du
L Manuelidis
LQ Zhou
LY Romanova
M Rosandić
M Rosandić
M Sousa Vieira de
Marija Rosandić
Matko Glunčić
MK Rudd
MQ Zhang
MY Azbel
N Bouayanaya
N Nagai
Nenad Pavin
Nils Paar
P Bernaola-Galvan
P Bernaola-Galvan
PE Warburton
PG Pop
PP Vaidyanathan
PV O'Neil
R Gupta
R Hall
R Ramakrishna
R Wevrick
R Wevrick
R Zhang
RF Voss
S Guharay
S Karlin
S Nee
S Tiwari
SA Aghili
SV Buldyrev
SV Buldyrev
T Haaf
TR Gregory
TT Tran
V Afreixo
V Paar
V Paar
V Paar
VA Emanuele
Vladimir Paar
VP Turutina
VR Chechetkin
VR Chechetkin
VR Chechetkin
VR Chechetkin
VR Chechetkin
VV Lobzin
VV Pradbu
W Lee
W Li
W Li
W Li
YX Tian
Z-G Yu
Z-G Yu
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Identification of approximate tandem repeats is an important task of broad significance and still remains a challenging problem of computational genomics. Often there is no single best approach to periodicity detection and a combination of different methods may improve the prediction accuracy. Discrete Fourier transform (DFT) has been extensively used to study primary periodicities in DNA sequences. Here we investigate the application of DFT method to identify and study alphoid higher order repeats. Results We used method based on DFT with mapping of symbolic into numerical sequence to identify and study alphoid higher order repeats (HOR). For HORs the power spectrum shows equidistant frequency pattern, with characteristic two-level hierarchical organization as signature of HOR. Our case study was the 16 mer HOR tandem in AC017075.8 from human chromosome 7. Very long array of equidistant peaks at multiple frequencies (more than a thousand higher harmonics) is based on fundamental frequency of 16 mer HOR. Pronounced subset of equidistant peaks is based on multiples of the fundamental HOR frequency (multiplication factor <it>n </it>for <it>n</it>mer) and higher harmonics. In general, <it>n</it>mer HOR-pattern contains equidistant secondary periodicity peaks, having a pronounced subset of equidistant primary periodicity peaks. This hierarchical pattern as signature for HOR detection is robust with respect to monomer insertions and deletions, random sequence insertions etc. For a monomeric alphoid sequence only primary periodicity peaks are present. The 1/<it>f</it><it>β </it>– noise and periodicity three pattern are missing from power spectra in alphoid regions, in accordance with expectations. Conclusion DFT provides a robust detection method for higher order periodicity. Easily recognizable HOR power spectrum is characterized by hierarchical two-level equidistant pattern: higher harmonics of the fundamental HOR-frequency (secondary periodicity) and a subset of pronounced peaks corresponding to constituent monomers (primary periodicity). The number of lower frequency peaks (secondary periodicity) below the frequency of the first primary periodicity peak reveals the size of <it>n</it>mer HOR, i.e., the number <it>n </it>of monomers contained in consensus HOR.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

A model-independent approach to infer hierarchical codon substitution dynamics

Author: A Jiménez-Sánchez
AS Novozhilov
C Kosiol
C Kosiol
CR Woese
CR Woese
DG Hwang
DS Riddle
DT Jones
E Trifonov
EN Trifonov
EN Trifonov
EN Trifonov
FH Crick
GH Gonnet
HA Simon
JEM Hornos
JG Kemeny
JR Jungck
JTF Wong
M Di Giulio
M Meilă
MA Jiménez-Montano
MA Jiménez-Montaño
Martin Nilsson Jacobi
MN Jacobi
MO Dayhoff
MS Johnson
MW Nirenberg
O Görnerup
O R
Olof Görnerup
R Marquez
S Itzkovitz
S Whelan
SD Copley
T Bollenbach
T Wilhelm
TD Wu
V Karasev
VR Chechetkin
W Taylor
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Codon substitution constitutes a fundamental process in molecular biology that has been studied extensively. However, prior studies rely on various assumptions, e.g. regarding the relevance of specific biochemical properties, or on conservation criteria for defining substitution groups. Ideally, one would instead like to analyze the substitution process in terms of raw dynamics, independently of underlying system specifics. In this paper we propose a method for doing this by identifying groups of codons and amino acids such that these groups imply closed dynamics. The approach relies on recently developed spectral and agglomerative techniques for identifying hierarchical organization in dynamical systems. Results We have applied the techniques on an empirically derived Markov model of the codon substitution process that is provided in the literature. Without system specific knowledge of the substitution process, the techniques manage to "blindly" identify multiple levels of dynamics; from amino acid substitutions (via the standard genetic code) to higher order dynamics on the level of amino acid groups. We hypothesize that the acquired groups reflect earlier versions of the genetic code. Conclusions The results demonstrate the applicability of the techniques. Due to their generality, we believe that they can be used to coarse grain and identify hierarchical organization in a broad range of other biological systems and processes, such as protein interaction networks, genetic regulatory networks and food webs.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Chalmers Research

Chalmers Publication Library

Inclusion of the fitness sharing technique in an evolutionary algorithm to analyze the fitness landscape of the genetic code adaptability

Author: AS Novozhilov
B Sareni
CR Woese
CT Zhu
D Haig
DE Goldberg
EV Koonin
F Crick
HP Yockey
J Holland
J Santos
J Santos
José Santos
JT Wong
LL de Oliveira
LL de Oliveira
M Di Giulio
M Di Giulio
M Di Giulio
M Di Giulio
N Torabi
P BlaŻej
PG Higgs
R Marquez
RD Knight
RD Knight
RD Knight
SJ Freeland
SJ Freeland
SJ Freeland
SJ Freeland
VR Chechetkin
Ángel Monteagudo
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

An analysis of single amino acid repeats as use case for application specific background models

Author: C Notredame
David P Kreil
DP Depledge
DP Kreil
E Birney
E Delot
EL Sonnhammer
EM Marcotte
G Gouridis
G Nuel
G Reinert
H Gerber
H Nielsen
H Nielsen
IB Kuznetsov
J Thompson
J Wootton
J Xie
JD Bendtsen
JM Hancock
JW Fondon
L Brown
L Zhang
M Hoebeke
M Mar Alba
M Thomas-Chollier
M Tipping
M Tipping
MA Huntley
O Weiss
OB Ptitsyn
P Siwach
P Siwach
Paweł P Łabaj
Peter Sykacek
PP Łabaj
R Lopez
R Lyne
RI Sadreyev
RS Hegde
S Caburet
S Hands
S Henikoff
S Karlin
S Karlin
SF Altschul
SF Altschul
SF Altschul
T Koestler
VJ Promponas
VR Chechetkin
VS Pande
WR Pearson
Y Kashi
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Background Sequence analysis aims to identify biologically relevant signals against a backdrop of functionally meaningless variation. Increasingly, it is recognized that the quality of the background model directly affects the performance of analyses. State-of-the-art approaches rely on classical sequence models that are adapted to the studied dataset. Although performing well in the analysis of globular protein domains, these models break down in regions of stronger compositional bias or low complexity. While these regions are typically filtered, there is increasing anecdotal evidence of functional roles. This motivates an exploration of more complex sequence models and application-specific approaches for the investigation of biased regions. Results Traditional Markov-chains and application-specific regression models are compared using the example of predicting runs of single amino acids, a particularly simple class of biased regions. Cross-fold validation experiments reveal that the alternative regression models capture the multi-variate trends well, despite their low dimensionality and in contrast even to higher-order Markov-predictors. We show how the significance of unusual observations can be computed for such empirical models. The power of a dedicated model in the detection of biologically interesting signals is then demonstrated in an analysis identifying the unexpected enrichment of contiguous leucine-repeats in signal-peptides. Considering different reference sets, we show how the question examined actually defines what constitutes the 'background'. Results can thus be highly sensitive to the choice of appropriate model training sets. Conversely, the choice of reference data determines the questions that can be investigated in an analysis. Conclusions Using a specific case of studying biased regions as an example, we have demonstrated that the construction of application-specific background models is both necessary and feasible in a challenging sequence analysis situation

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Publikationsserver der Universitätsbibliothek Bodenkultur Wien

Warwick Research Archives Portal Repository

Nonequilibrium Phase Transitions

Author: A Nitzan
A.S. Mikhailov
A.S. Mikhailov
AS Mikhailov
AS Mikhailov
AS Mikhailov
AS Mikhailov
AS Mikhailov
D Kondepudi
D Kondepudi
D Walgraef
F Moss
G Broggi
G Nicolis
H Haken
H Zeghlache
HK Janssen
HK Janssen
L Arnold
LD Landau
LL Morozov
MA Shishkova
NR Lebovitz
P Mandel
R Haberman
S Grossmann
S-K Ma
VR Chechetkin
VR Chechetkin
W Horsthemke
W. Horsthemke
YB Zeldovich
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1996
Field of study

Crossref

Computational methods of identification of pseudogenes based on functionality: entropy and GC content.

Author: A Heger
A Pavlicek
B Lewin
BR Morton
C Collet
C Dutta
C Ortutay
C Yin
CG Bell
D Holste
D Lachaise
D Lachaise
D Tillo
D Zheng
D Zheng
DA Filatov
DC Shields
DJ Obbard
Drosophila 12 genomes consortium
DT Sullivan
EC Rouchka
EN Moriyama
EN Moriyama
EN Trifonov
ES Balakirev
ES Balakirev
ES Balakirev
ES Balakirev
ES Balakirev
ES Balakirev
ES Balakirev
ES Balakirev
ES Balakirev
ES Balakirev
F Alvarez-Valin
F Lemeunier
F Wright
GI Kravatskaya
GN Yenikolopov
H Hildebrand
H Sakai
H Wu
I Menashe
I Molineris
J Brosius
J Harrow
JD Thompson
JE Karro
JG Oakeshott
JG Oakeshott
JG Oakeshott
JL Lage da
JM Bischof
JP Brady
K Lin
K Tamura
K Vetsigian
KR Sakharkar
L Coin
L Duret
L Duret
LM King
M Bulmer
M Leveugle
M-L Cariou
MJ Baren van
MJ Healy
MM Dumancic
N Echols
NL Johnson
P Librado
PD Currie
PD East
PM Harrison
PM Harrison
PM Harrison
R Raghavan
RC Pink
RC Richmond
RC Richmond
RJ Epstein
RM Kliman
RS Illingworth
S Mann
S Ramos-Onsins
S Tiwari
S Vicario
S-M Chen
SM Procé de
T Gojobori
V Solovyev
VR Chechetkin
VR Chechetkin
VR Chechetkin
VR Chechetkin
VR Chechetkin
VV Lobzin
W Li
W-H Li
W-Y Ko
WT Starmer
Y Yang
Y-Z Wen
Z Zhang
Z Zhang
Z Zhang
Publication venue: eScholarship, University of California
Publication date: 01/01/2014
Field of study

Spectral entropy and GC content analyses reveal comprehensive structural features of DNA sequences. To illustrate the significance of these features, we analyze the β-esterase gene cluster, including the Est-6 gene and the ψEst-6 putative pseudogene, in seven species of the Drosophila melanogaster subgroup. The spectral entropies show distinctly lower structural ordering for ψEst-6 than for Est-6 in all species studied. However, entropy accumulation is not a completely random process for either gene and it shows to be nucleotide dependent. Furthermore, GC content in synonymous positions is uniformly higher in Est-6 than in ψEst-6, in agreement with the reduced GC content generally observed in pseudogenes and nonfunctional sequences. The observed differences in entropy and GC content reflect an evolutionary shift associated with the process of pseudogenization and subsequent functional divergence of ψEst-6 and Est-6 after the duplication event. The data obtained show the relevance and significance of entropy and GC content analyses for pseudogene identification and for the comparative study of gene-pseudogene evolution

Crossref

eScholarship - University of California