Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL-Rennes 1

Fast and robust multiple sequence alignment with phylogeny-aware gap placement

Author: A Biegert
A Löytynoja
A Viterbi
Adam M Szalkowski
AM Altenhoff
AM Szalkowski
B Paten
C Dessimoz
C Grasso
C Lee
D Robinson
DA Dalquen
G Gonnet
GH Gonnet
GW Stuart
J Felsenstein
JD Thompson
JL Thorne
JM Sauder
K Katoh
M Anisimova
M Kimura
O Gascuel
O Gotoh
R Durbin
RC Edgar
S Pascarella
S Whelan
SA Benner
SB Needleman
Publication venue: Springer Science and Business Media LLC

Fast MCMC sampling for hidden markov models to determine copy number variations

Author: A Krogh
A Schliep
A Viterbi
AB Olshen
Alexander Schliep
AM Snijders
CM Bishop
D Pelleg
D Pinto
F Picard
H Willenbrock
J Fridlyand
J Fritsch
K Wang
L Rabiner
LE Baum
M Bredel
Md Pavel Mahmud
P Wang
PHC Eilers
Q McNemar
R Andersson
R Durbin
R Tibshirani
RJD Leeuw
S Chib
S Geman
S Guha
S Morganella
S Mozes
S Salvador
S Scott
S Srivastava
SP Shah
SR Eddy
T Harada
W Gilks
WR Lai
Y Nannya
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Hidden Markov Models (HMMs) are often used for analyzing Comparative Genomic Hybridization (CGH) data to identify chromosomal aberrations or copy number variations by segmenting observation sequences. For efficiency reasons, the parameters of an HMM are often estimated with maximum likelihood and a segmentation is obtained with the Viterbi algorithm. This introduces considerable uncertainty in the segmentation, which can be avoided with Bayesian approaches that integrate out parameters using Markov Chain Monte Carlo (MCMC) sampling. While the advantages of Bayesian approaches have been clearly demonstrated, likelihood-based approaches are still preferred in practice for their lower running times; datasets coming from high-density arrays and next-generation sequencing amplify these problems. Results: We propose an approximate sampling technique, inspired by compression of discrete sequences in HMM computations and by kd-trees to leverage spatial relations between data points in typical data sets, to speed up the MCMC sampling. Conclusions: We test our approximate sampling method on simulated and biological ArrayCGH datasets and on high-density SNP arrays, and demonstrate speed-ups of 10 to 60 and of 90, respectively, while achieving results competitive with state-of-the-art Bayesian approaches. Availability: An implementation of our method will be made available as part of the open source GHMM library from http://ghmm.org.

Springer - Publisher Connector

Public Library of Science (PLOS)

Modeling the Evolution of Regulatory Elements by Simultaneous Detection and Alignment with Phylogenetic Pair HMMs

Author: A Loytynoja
A Siepel
A Viterbi
AL Halpern
AM Moses
AP Boyle
B Langmead
D Stanojevic
DA Pollard
DL Gumucio
DS Hirschberg
G Wray
GP Wagner
I Holmes
J Felsenstein
J Hawkins
JC Bryne
JD Thompson
JL Thorne
K Wong
MS Halfon
MZ Ludwig
N Saitou
PR Ray
R Durbin
R Satija
R Siddharthan
RC Edgar
RK Bradley
RW Lusk
Uwe Ohler
W Huang
WH Majoros
William H. Majoros
WJ Kent
WJL Quesne
Wyeth W. Wasserman
X He
Publication venue: Public Library of Science
Publication date: 01/01/2010

The computational detection of regulatory elements in DNA is a difficult but important problem impacting our progress in understanding the complex nature of eukaryotic gene regulation. Attempts to utilize cross-species conservation for this task have been hampered both by evolutionary changes of functional sites and by the poor performance of general-purpose alignment programs when applied to non-coding sequence. We describe a new and flexible framework for modeling binding site evolution in multiple related genomes, based on phylogenetic pair hidden Markov models which explicitly model the gain and loss of binding sites along a phylogeny. We demonstrate the value of this framework for both the alignment of regulatory regions and the inference of precise binding-site locations within those regions. As the underlying formalism is a stochastic, generative model, it can also be used to simulate the evolution of regulatory elements. Our implementation is scalable in terms of numbers of species and sequence lengths and can produce alignments and binding-site predictions with accuracy rivaling or exceeding current systems that specialize in only alignment or only binding-site prediction. We demonstrate the validity and power of various model components on extensive simulations of realistic sequence data and apply a specific model to study Drosophila enhancers in as many as ten related genomes and in the presence of gain and loss of binding sites. Different models and modeling assumptions can be easily specified, thus providing an invaluable tool for the exploration of biological hypotheses that can drive improvements in our understanding of the mechanisms and evolution of gene regulation.

CiteSeerX

Public Library of Science (PLOS)

DukeSpace (Duke Univ.)

MDC Repository

Computational Analysis of Whole-Genome Differential Allelic Expression Data in Human

Author: A Baross
A Gimelbrant
A Siepel
A Viterbi
AM Khalil
AP Dempster
B Ge
Bing Ge
C Li
C Yau
D Serre
DJ Verlaan
Dmitry Pokholok
E Birney
E Venkatraman
H Bengtsson
J Marioni
James R. Wagner
K Wang
KA Frazer
KD Pruitt
Kevin L. Gunderson
KPV Pant
KS Pollard
L Carrel
L Rabiner
L Wu
LE Baum
Mathieu Blanchette
MV Rockman
O Rueda
P Fearnhead
S Browning
S Campino
S Colella
SH Lo
SP Shah
SR Eddy
T Mitchell
T Pastinen
Tomi Pastinen
VG Cheung
W Cookson
W Kent
WJ Kent
Wyeth W. Wasserman
Y Nannya
Publication venue: Public Library of Science
Publication date: 08/07/2010

Allelic imbalance (AI) is a phenomenon where the two alleles of a given gene are expressed at different levels in a given cell, either because of epigenetic inactivation of one of the two alleles, or because of genetic variation in regulatory regions. Recently, Bing et al. have described the use of genotyping arrays to assay AI at a high resolution (∼750,000 SNPs across the autosomes). In this paper, we investigate computational approaches to analyze these data and identify genomic regions with AI in an unbiased and robust statistical manner. We propose two families of approaches: (i) a statistical approach based on z-score computations, and (ii) a family of machine learning approaches based on Hidden Markov Models. Each method is evaluated using previously published experimental data sets as well as with permutation testing. When applied to whole genome data from 53 HapMap samples, our approaches reveal that allelic imbalance is widespread (most expressed genes show evidence of AI in at least one of our 53 samples) and that most AI regions in a given individual are also found in at least a few other individuals. While many AI regions identified in the genome correspond to known protein-coding transcripts, others overlap with recently discovered long non-coding RNAs. We also observe that genomic regions with AI not only include complete transcripts with consistent differential expression levels, but also more complex patterns of allelic expression such as alternative promoters and alternative 3′ ends. The approaches developed not only shed light on the incidence and mechanisms of allelic expression, but will also help towards mapping the genetic causes of allelic expression and identifying cases where this variation may be linked to diseases.

Global Chromatin Domain Organization of the Drosophila Genome

Author: A Orian
AJ Viterbi
AM Boutanaev
AT Beckenbach
AV Pindyurin
B Tolhuis
B van Steensel
Bas van Steensel
BE Bernstein
C Moorman
CA Russo
D Bianchi-Frias
D Duboule
D Sproul
E de Wit
EC Pym
Elzo de Wit
F Greil
FN Hamada
Frauke Greil
G Yi
H Caron
H Pickersgill
Harmen J. Bussemaker
JM Lee
K Tamura
LD Hurst
M Ashburner
M Kmita
MA Miller
MC Mahajan
N Dillon
N Negre
PB Talbert
PJ Roy
PT Spellman
R Versteeg
RA Fisher
S Cai
S Dorus
S Richards
SP Choksi
ST Kosak
T Straub
TC James
Ulrich Braunschweig
V Gupta
V Orlando
V Stolc
W de Laat
W Fischle
Wolf Reik
Y Mito
Publication venue: Public Library of Science
Publication date: 01/01/2008

In eukaryotes, neighboring genes can be packaged together in specific chromatin structures that ensure their coordinated expression. Examples of such multi-gene chromatin domains are well-documented, but a global view of the chromatin organization of eukaryotic genomes is lacking. To systematically identify multi-gene chromatin domains, we constructed a compendium of genome-scale binding maps for a broad panel of chromatin-associated proteins in Drosophila melanogaster. Next, we computationally analyzed this compendium for evidence of multi-gene chromatin domains using a novel statistical segmentation algorithm. We find that at least 50% of all fly genes are organized into chromatin domains, which often consist of dozens of genes. The domains are characterized by various known and novel combinations of chromatin proteins. The genes in many of the domains are coregulated during development and tend to have similar biological functions. Furthermore, during evolution fewer chromosomal rearrangements occur inside chromatin domains than outside domains. Our results indicate that a substantial portion of the Drosophila genome is packaged into functionally coherent, multi-gene chromatin domains. This has broad mechanistic implications for gene regulation and genome evolution.

CiteSeerX

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Network dimensioning and base station on/off switching strategies for sustainable deployments in remote areas

Author: A Bousia
A Rendón
Adrián Agustín
AM Viterbi
Antonio Pascual-Iserte
DE Goldberg
DP Bertsekas
E Aarts
E Oh
FA Haight
H Holma
J del Olmo
J Laiho
J Lorincz
J Rubio
Jaume del-Olmo
Javier Rubio
JM Kelif
Josep Vidal
K Sipila
L Kleinrock
M Abramowitz
M Chiani
M Meo
NB Mehta
Olga Muñoz
SM Kay
W Choi
W Guo
Y-PE Wang
YS Soh
YW Chung
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

This paper provides a methodology for the dimensioning of the access network in remote rural areas, considering the progressive introduction of cellular services in these regions. A 3G small cell (SC) network with one or several carriers deployed at the SC, fed with solar panels and connected to a backhaul with limited capacity, is considered for the analysis. Because the backhaul may be nonexistent or very expensive (e.g., satellite-based backhaul), the network design pursues the minimization of the required backhaul bandwidth. The required backhaul bandwidth and the required energy units (i.e., the size of the solar panels and the required number of batteries) are then obtained as an output of the dimensioning analysis. Both the backhaul minimization objective and the constraints associated with each of the carriers (low maximum radiated power and low number of users connected simultaneously) require a novel methodology compared to the classical dimensioning techniques. We also develop a procedure for switching carriers on and off in order to minimize the energy consumption without affecting the quality of service (QoS) perceived by the users. This technique reduces the required size of the energy units, which directly translates into a cost reduction. In the development of this on/off switching strategy, we first assume perfect knowledge of the traffic profile and later develop a robust Bayesian approach to account for possible modeling errors in the traffic profile information.

UPCommons. Portal del coneixement obert de la UPC

A hidden Markov model for decoding and the analysis of replay in spike trains

Author: A Johnson
A Kong
A Peyrache
AG Siapas
AJ Viterbi
AK Lee
AM Wikenheiser
AS Gupta
BE Pfeiffer
C Pavlides
D Ji
DJ Foster
DR Euston
EN Brown
G Buzsáki
G Dragoi
G Girardeau
G Schwarz
GR Sutherland
HS Kudrimoti
J Csicsvari
J O’Keefe
J O’Neill
JS Liu
K Diba
K Louie
K Zhang
L Buhry
M Mölle
MA Wilson
Marc Box
Matt W. Jones
MF Carr
MP Karlsson
MW Jones
N Chopin
Nick Whiteley
R Naud
RE Kass
SL Scott
SW Linderman
TJ Davidson
TT Wong
V Ego-Stengel
V Rao
WE Skaggs
YL Qin
Z Chen
Z Nádasdy
Publication venue: Springer Science and Business Media LLC

Capacity of a Simple Stable Protocol for Short Message Service Over a CDMA Network

Author: AM Viterbi
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/1994

Performance Analysis of a Cellular Slotted CDMA System with Imperfect Power Control over a Rayleigh Fading Channel

Author: A Sampath
AM Viterbi
CA Wijffels
F Adachi
J Evans
JS Lee
LF Fenton
ME Crovella
MG Jansen
O Salient
PE Omiyi
R Padovani
T Ojanperä
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/1999