Search CORE

14 research outputs found

Meta-Alignment with Crumble and Prune: Partitioning very large alignment problems for performance and parallelization

Author: A Siepel
A Siepel
AS Schwartz
B Paten
B Paten
B Rhead
Benedict Paten
C Lee
CN Dewey
David Haussler
DF Feng
G Myers
I Lumb
J Ma
JE Stajich
JS Pedersen
K Katoh
K Katoh
K Kryukov
K Liu
K Reinert
KM Roskin
Krishna M Roskin
M Blanchette
M Hasegawa
M Waterman
N Bray
P Di Tommaso
RC Edgar
RK Bradley
S Griffiths-Jones
S Schwartz
T Kim
U Tönges
W Gentzsch
WJ Kent
WJ Kent
Z Yang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Continuing research into the global multiple sequence alignment problem has resulted in more sophisticated and principled alignment methods. Unfortunately these new algorithms often require large amounts of time and memory to run, making it nearly impossible to run these algorithms on large datasets. As a solution, we present two general methods, Crumble and Prune, for breaking a phylogenetic alignment problem into smaller, more tractable sub-problems. We call Crumble and Prune <it>meta-alignment </it>methods because they use existing alignment algorithms and can be used with many current alignment programs. Crumble breaks long alignment problems into shorter sub-problems. Prune divides the phylogenetic tree into a collection of smaller trees to reduce the number of sequences in each alignment problem. These methods are orthogonal: they can be applied together to provide better scaling in terms of sequence length and in sequence depth. Both methods partition the problem such that many of the sub-problems can be solved independently. The results are then combined to form a solution to the full alignment problem. Results Crumble and Prune each provide a significant performance improvement with little loss of accuracy. In some cases, a gain in accuracy was observed. Crumble and Prune were tested on real and simulated data. Furthermore, we have implemented a system called Job-tree that allows hierarchical sub-problems to be solved in parallel on a compute cluster, significantly shortening the run-time. Conclusions These methods enabled us to solve gigabase alignment problems. These methods could enable a new generation of biologically realistic alignment algorithms to be applied to real world, large scale alignment problems.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Identification and Classification of Conserved RNA Secondary Structures in the Human Genome

Author: Adam Siepel
Angrand PO Apiou F, Stewart AF, Dutrillaux B, Losson R, et al.
Aparicio S Chapman J, Stupka E, Putnam N, Chia JM, et al.
Bentwich I Avniel A, Karov Y, Aharonov R, Gilad S, et al.
Berezikov E Guryev V, van de Belt J, Wienholds E, Plasterk RH, et al.
Berry MJ Banu L, Chen YY, Mandel SJ, Kieffer JD, et al.
Blanchette M Kent WJ, Riemer C, Elnitski L, Smit AF, et al.
Bompfünewerer AF Flamm C, Fried C, Fritzsch G, Hofacker IL, et al.
Brudno M Do CB, Cooper GM, Kim MF, Davydov E, et al.
Chimpanzee Sequencing and Analysis Consortium
David Haussler
Eric S Lander
Gibbs RA Weinstock GM, Metzker ML, Muzny DM, Sodergren EJ, et al.
Gill Bejerano
Gregory RI Yan KP, Amuthan G, Chendrimada T, Doratotaj B, et al.
Griffiths-Jones S Moxon S, Marshall M, Khanna A, Eddy SR, et al.
Higuchi M Maas S, Single FN, Hartner J, Rozov A, et al.
Hillier LW Miller W, Birney E, Warren W, Hardison RC, et al.
Howard MT Aggarwal G, Anderson CB, Khatri S, Flanigan KM, et al.
International Human Genome Sequencing Consortium
Jakob Skou Pedersen
Jim Kent
Kate Rosenbloom
Kent WJ Sugnet CW, Furey TS, Roskin KM, Pringle TH, et al.
Kerstin Lindblad-Toh
Kryukov GV Castellano S, Novoselov SV, Lobanov AV, Zehtab O, et al.
Lagos-Quintana M Rauhut R, Yalcin A, Meyer J, Lendeckel W, et al.
Lim LP Lau NC, Weinstein EG, Abdelhakim A, Yekta S, et al.
Matsufuji S Matsufuji T, Miyazaki Y, Murakami Y, Atkins JF, et al.
Pahl PM Hodges YK, Meltesen L, Perryman MB, Horwitz KB, et al.
Richard Durbin
Schwartz S Kent WJ, Smit A, Zhang Z, Baertsch R, et al.
Siepel A Bejerano G, Pedersen JS, Hinrichs AS, Hou M, et al.
Waterston RH Lindblad-Toh K, Birney E, Rogers J, Abril JF, et al.
Webb Miller
Xie X Lu J, Kulbokas EJ, Golub TR, Mootha V, et al.
Publication venue: Public Library of Science
Publication date: 01/01/2005
Field of study

The discoveries of microRNAs and riboswitches, among others, have shown functional RNAs to be biologically more important and genomically more prevalent than previously anticipated. We have developed a general comparative genomics method based on phylogenetic stochastic context-free grammars for identifying functional RNAs encoded in the human genome and used it to survey an eight-way genome-wide alignment of the human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish genomes for deeply conserved functional RNAs. At a loose threshold for acceptance, this search resulted in a set of 48,479 candidate RNA structures. This screen finds a large number of known functional RNAs, including 195 miRNAs, 62 histone 3′UTR stem loops, and various types of known genetic recoding elements. Among the highest-scoring new predictions are 169 new miRNA candidates, as well as new candidate selenocysteine insertion sites, RNA editing hairpins, RNAs involved in transcript auto regulation, and many folds that form singletons or small functional RNA families of completely unknown function. While the rate of false positives in the overall set is difficult to estimate and is likely to be substantial, the results nevertheless provide evidence for many new human functional RNAs and present specific predictions to facilitate their further characterization

Public Library of Science (PLOS)

Crossref

Cold Spring Harbor Laboratory Institutional Repository

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

Copenhagen University Research Information System

eScholarship - University of California

A user's guide to the Encyclopedia of DNA elements (ENCODE)

Author: Abdelhamid RF
Absher DM
Abyzov A
Aken B
Alioto T
Altshuler RC
Amrhein H
Antonarakis SE
Auerbach RK
Balasubramanian S
Bansal A
Barber GP
Battenhouse A
Batut P
Batzoglou S
Beal K
Bell I
Bell K
Bernstein BE
Bhardwaj N
Bhinge AA
Bickel PJ
Bignell A
Bild N
Birney E
Blahnik KR
Boley N
Borel C
Bowling KM
Boychenko V
Boyle AP
Brazma A
Brent M
Brown JB
Brown RH
Buske OJ
Canfield T
Cao AR
Carninci P
Cayting P
Chakrabortty S
Charos A
Chen X
Cheng C
Chittur S
Chrast J
Cline MS
Collins PJ
Coyne MJ
Crawford GE
Davis CA
Dekker J
Derrien T
DeSalvo G
Despacio-Reyes G
Diekhans M
Dillon LAL
Dilocker JA
Djebali S
Dobin A
Dong X
Doyle F
Drenkow J
Dreszer TR
Du J
Dumais E
Dumais J
Dunham I
Durham T
Ebersol AK
Elnitski L
Epstein CB
Ernst J
Euskirchen G
Farnham PJ
Feingold EA
Fejes K
Fisher K
Fleming JD
Frankish A
Frietze S
Frum T
Fujita PA
Furey TS
Gao H
Gerstein M
Gertz J
Gibson T
Giddings MC
Gingeras TR
Giresi PG
Giste E
Good PJ
Gordon A
Graison EAY
Grasfeder LL
Green ED
Grossman RL
Grubert F
Guigo R
Gunawardena H
Habegger L
Hannon G
Hardison RC
Hariharan M
Harris RS
Harrow J
Harte R
Haugen E
Haussler D
Hayashizaki Y
Herrero J
Hoffman MM
Howald C
Huang H
Hubbard T
Humbert R
Hunt T
Issner R
Iyengar S
Iyer VR
Jain P
Jameel N
Jee J
Jha S
Johnson A
Johnson EM
Kapranov P
Karmakar S
Karolchik D
Kasowski M
Kaul R
Kay M
Keefe D
Kellis M
Kent WJ
Khatun J
Kheradpour P
Khurana E
Kim SKC
King B
Kingswood C
Kirilusha A
Knowles DG
Kokocinski F
Ku M
Kuhn RM
Kundaje A
Kutyavin T
Lacroute P
Lagarde J
Lajoie BR
Lam H
Lamarre N
Landt SG
Lassmann T
Learned K
Lee B-K
Lee K
Leng J
Li Q
Lian J
Libbrecht M
Lieb JD
Lin M
Lin MF
Lin W
Lindahl M
Liu Z
Lochovsky L
London D
Lotakis D
Lowdon RF
Lu Z
Lukk M
Luscombe NM
Maier C
Malladi VS
Margulies EH
Marinov G
Mariotti M
McCue K
McDaniell RM
Merkel A
Meyer LR
Mikkelsen TS
Miller W
Miotto B
Monahan H
Moqtaderi Z
Mortazavi A
Mukherjee G
Muratet MA
Myers RM
Navas PA
Neph S
Neri J
Nesmith AS
Newberry KM
Newburger P
Nguyen ED
Noble WS
O'Geen H
Parker SCJ
Parker SL
Partridge EC
Patacsil D
Paten B
Pauli F
Penalva LO
Pepke S
Poh WT
Preall J
Pusey B
Raha D
Raney BJ
Rauch R
Reddy TE
Reed B
Reymond A
Reynolds A
Rhead B
Ribeca P
Risk B
Roach V
Roberts K
Robilotto R
Rodriguez JM
Rosenbloom KR
Roskin K
Rozowsky J
Ruan X
Ruan Y
Rynes E
Sabo PJ
Sammeth M
Sanchez ME
Sandstrom R
Sanyal A
Saunders G
Sboner A
Schlesinger F
Searle S
Shafer T
Shahab A
Sheffer HH
Sheffield NC
Sherlock G
Shestak C
Shi M
Shibata Y
Shoresh N
Showers KA
Sidow A
Slifer T
Sloan CA
Snyder M
Sobral D
Song G
Song L
Sotirova V
Sprouse RO
Stamatoyannopoulos J
Stamatoyannopoulos JA
Struhl K
Suh B
Swing VK
Takahashi H
Tanzer A
Tenenbaum SA
Thibeault K
Thurman RE
Tilgner H
Tress M
Trinklein ND
Trout D
Truong T
Tullius TD
Ucla C
Valencia A
Vales T
van Baren MJ
Vaquerizas JM
Varley KE
Victorsen A
Vielmetter J
Vong S
Waite LL
Wang H
Wang J
Wang L
Wang T
Ward LD
Weaver M
Wei C-L
Weissman SM
Weng Z
White KP
Whitfield TW
Wilder SP
Williams B
Winter D
Wold B
Wu L
Xi H
Xu X
Xu Y
Yan K-K
Yang X
Yip KY
Yu Y
Zaleski C
Zhang X
Zhang Z
Zweig AS
Publication venue
Publication date: 19/04/2011
Field of study

The mission of the Encyclopedia of DNA Elements (ENCODE) Project is to enable the scientific and medical communities to interpret the human genome sequence and apply it to understand human biology and improve health. The ENCODE Consortium is integrating multiple technologies and approaches in a collective effort to discover and define the functional elements encoded in the human genome, including genes, transcripts, and transcriptional regulatory regions, together with their attendant chromatin states and DNA methylation patterns. In the process, standards to ensure high-quality data have been implemented, and novel algorithms have been developed to facilitate analysis. Data and derived results are made available through a freely accessible database. Here we provide an overview of the project and the resources it is generating and illustrate the application of ENCODE data to interpret the human genome

UCL Discovery

Patterns of insertions and their covariation with substitutions in the rat, mouse and human genomes

Author: CHIAROMONTE FRANCESCA
Hardison RC
Haussler D
Miller W
Roskin KM
Schwartz S
Smit AF
Yang S
Publication venue
Publication date: 01/01/2004
Field of study

Archivio della ricerca della Scuola Superiore Sant'Anna

Aberrant B cell repertoire selection associated with HIV neutralizing antibody breadth

Author: Bonsignori M
Borrow P
Boyd SD
Fire AZ
Haynes BF
Hoh RA
Hwang KK
Jackson KJL
Joshi SA
Lee JY
Liao HX
Moody MA
Pedroza-Pacheco I
Roskin KM
Publication venue: 'Nature Research Society'
Publication date: 01/01/2020
Field of study

A goal of HIV vaccine development is to elicit antibodies with neutralizing breadth. Broadly neutralizing antibodies (bNAbs) to HIV often have unusual sequences with long heavy-chain complementarity-determining region loops, high somatic mutation rates and polyreactivity. A subset of HIV-infected individuals develops such antibodies, but it is unclear whether this reflects systematic differences in their antibody repertoires or is a consequence of rare stochastic events involving individual clones. We sequenced antibody heavy-chain repertoires in a large cohort of HIV-infected individuals with bNAb responses or no neutralization breadth and uninfected controls, identifying consistent features of bNAb repertoires, encompassing thousands of B cell clones per individual, with correlated T cell phenotypes. These repertoire features were not observed during chronic cytomegalovirus infection in an independent cohort. Our data indicate that the development of numerous B cell lineages with antibody features associated with autoreactivity may be a key aspect in the development of HIV neutralizing antibody breadth

Oxford University Research Archive

Co-variation in frequencies of substitution, deletion, transposition and recombination during eutherian evolution

Author: CHIAROMONTE FRANCESCA
Diekhans M
Elnitski L
Furey TS
Goldman N
Hardison RC
Haussler D.
Kent WJ
Kolbe D
Li J
Miller W
O’Connor M
Roskin KM
Schwartz S
Smit A
Weber R
Whelan S
Yang S
Publication venue
Publication date: 01/01/2003
Field of study

Archivio della ricerca della Scuola Superiore Sant'Anna

HIV-1 envelope gp41 antibodies can originate from terminal ileum B cells that share cross-reactivity with commensal bacteria.

Author: Alam SM
Boyd SD
Foulger A
Haynes BF
Hwang KK
Jackson KJ
Jaeger FH
Jeffries TL
Kepler TB
Lambson B
Liao HX
Lloyd KE
Lockwood B
Marshall DJ
Moody MA
Morris L
Parks R
Roskin KM
Scearce R
Soderberg K
Stolarchuk C
Tomaras GD
Trama AM
Vandergrift N
Whitesides JF
Wiehe K
Publication venue
Publication date: 13/08/2014
Field of study

Monoclonal antibodies derived from blood plasma cells of acute HIV-1-infected individuals are predominantly targeted to the HIV Env gp41 and cross-reactive with commensal bacteria. To understand this phenomenon, we examined anti-HIV responses in ileum B cells using recombinant antibody technology and probed their relationship to commensal bacteria. The dominant ileum B cell response was to Env gp41. Remarkably, a majority (82%) of the ileum anti-gp41 antibodies cross-reacted with commensal bacteria, and of those, 43% showed non-HIV-1 antigen polyreactivity. Pyrosequencing revealed shared HIV-1 antibody clonal lineages between ileum and blood. Mutated immunoglobulin G antibodies cross-reactive with both Env gp41 and microbiota could also be isolated from the ileum of HIV-1 uninfected individuals. Thus, the gp41 commensal bacterial antigen cross-reactive antibodies originate in the intestine, and the gp41 Env response in HIV-1 infection can be derived from a preinfection memory B cell pool triggered by commensal bacteria that cross-react with Env

Elsevier - Publisher Connector

DukeSpace

Antibody lineages with evidence of somatic hypermutation persisting for >4 years in a South African subject with broad neutralizing activity

Author: AM Trama
AZ Fire
BF Haynes
C Tsao
ES Gray
G Kelsoe
GD Tomaras
H Liao
J Lee
JA Eudailey
JD Amos
K Seo
KE Lloyd
KJ Jackson
KM Roskin
L Morris
LC Armand
M Bonsignori
M Moody
MS Drinker
R Hoh
R Parks
S Wang
SD Boyd
T Pham
TB Kepler
TC Gurley
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Co-evolution of a broadly neutralizing HIV-1 antibody and founder virus.

Author: Alam SM
Boyd SD
Cai F
Chen S
Chen Y
Cohen M
Du X
Fire AZ
Gao F
Gnanakaran S
Hahn BH
Haynes BF
Hraber P
Joyce MG
Kamanga G
Kelsoe G
Kepler TB
Korber BT
Kwong PD
Liao HX
Lloyd KE
Louder MK
Lynch R
Mascola JR
Montefiori DC
Moquin S
Mullikin JC
NISC Comparative Sequencing Program
Parks R
Roskin KM
Scearce RM
Schramm CA
Shapiro L
Shaw GM
Soderberg KA
Srivatsan S
Tran LM
Wiehe K
Xia SM
Yang G
Zhang B
Zhang Z
Zheng A
Zhou T
Zhu J
Publication venue
Publication date: 25/04/2013
Field of study

Current human immunodeficiency virus-1 (HIV-1) vaccines elicit strain-specific neutralizing antibodies. However, cross-reactive neutralizing antibodies arise in approximately 20% of HIV-1-infected individuals, and details of their generation could provide a blueprint for effective vaccination. Here we report the isolation, evolution and structure of a broadly neutralizing antibody from an African donor followed from the time of infection. The mature antibody, CH103, neutralized approximately 55% of HIV-1 isolates, and its co-crystal structure with the HIV-1 envelope protein gp120 revealed a new loop-based mechanism of CD4-binding-site recognition. Virus and antibody gene sequencing revealed concomitant virus evolution and antibody maturation. Notably, the unmutated common ancestor of the CH103 lineage avidly bound the transmitted/founder HIV-1 envelope glycoprotein, and evolution of antibody neutralization breadth was preceded by extensive viral diversification in and near the CH103 epitope. These data determine the viral and antibody evolution leading to induction of a lineage of HIV-1 broadly neutralizing antibodies, and provide insights into strategies to elicit similar antibodies by vaccination

DukeSpace

ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach

Author: A Wilm
A Wilm
AR Subramanian
BP Blackburne
C Notredame
C Notredame
C Notredame
CB Do
CB Do
CP Ponting
DF Feng
DG Higgins
Dimitrios P Lyras
Dirk Metzler
DJ Russell
F Chiaromonte
F Sievers
GJ Barton
GPS Raghava
H Carroll
J Kececioglu
JD Thompson
JD Thompson
JD Thompson
JM Sauder
K Katoh
KM Roskin
L Wang
M Cline
M Murata
MA Larkin
MP Berger
O Gotoh
O Gotoh
P Bonizzoni
P Hogeweg
PP Gardner
PP Gardner
R Durbin
RC Edgar
RC Edgar
RC Edgar
RC Edgar
SB Needleman
SME Sahraeian
T Lassmann
W Just
X Ye
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref