
34 research outputs found

X. couchianus and X. hellerii genome models provide genomic variation insight among Xiphophorus species

Author: Agarwala Richa
Boswell Mikki
Boswell William
Chalopin Domitille
Garcia Tzintzuni
Minx Patrick
Postlethwait John H
Schartl Manfred
Shen Yingjia
Shiryev Sergey A
Volff Jean-Nicolas
Walter Ronald B
Warren Wesley C
Publication venue: Digital Commons@Becker
Publication date: 01/01/2016

4 inter-chromosomal rearrangement events between X. hellerii and X. maculatus. (XLSX 40 kb)

Crossref

Springer - Publisher Connector

Digital Commons@Becker

PubMed Central

Online-Publikations-Server der Universität Würzburg

The Francis Crick Institute

Single haplotype assembly of the human genome from a hydatidiform mole

Author: Agarwala Richa
Church Deanna M.
Eichler Evan E.
Fulton Robert S.
Graves-Lindsay Tina A.
Huddleston John
Meltz Steinberg Karyn
Morgulis Aleksandr
Schneider Valerie A.
Shiryev Sergey A.
Surti Urvashi
Warren Wesley C.
Wilson Richard K.
Publication venue: Digital Commons@Becker
Publication date: 01/01/2014

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content shows this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicates misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly.

Crossref

Digital Commons@Becker

PubMed Central

Database indexing for production MegaBLAST searches

Author: Alejandro A. Schäffer
Aleksandr Morgulis
Altschul
Cao
George Coulouris
Gertz
Giladi
Jiang
Kent
Kim
Lee
Morgulis
Ning
Rasmussen
Richa Agarwala
Shiryev
Stokes
Thomas L. Madden
Williams
Yan Raytselis
Zhang
Publication venue: Oxford University Press

Motivation: The BLAST software package for sequence comparison speeds up homology search by preprocessing a query sequence into a lookup table. Numerous research studies have suggested that preprocessing the database instead would give better performance. However, production usage of sequence comparison methods that preprocess the database has been limited to programs such as BLAT and SSAHA that are designed to find matches when query and database subsequences are highly similar.

Crossref

PubMed Central

Intrinsic Structural Disorder Confers Cellular Viability on Oncogenic Fusion Proteins

Author: A Andreeva
A Goga
A Porollo
B Boeckmann
C Chothia
C Lavau
D Bischof
D Ekman
DA Benson
E Roccato
EC Collins
FJ Novo
Hedi Hegyi
HH Mak
HJ Dyson
I Ortiz de Mendibil
J Janin
J Schlessinger
JD Rowley
JJ Ward
KP Ng
L Iakoucheva
László Buday
M Sickmeier
M Soda
P Flicek
P Tompa
PA Futreal
PD Aplan
Peter Tompa
PJ Fleming
PR Romero
RK Slany
Roland Dunbrack
S Pan
SA Shiryev
SC Harrison
SG Peisajovich
SJ Furney
SJ Sammut
T Jung
TH Rabbitts
V Lacronique
VN Uversky
X Zhao
Y Cheng
Y Laabi
Y Minezaki
Y Wang
YL Choi
Z Dosztanyi
Publication venue: Public Library of Science
Publication date: 01/10/2009

Chromosomal translocations, which often generate chimeric proteins by fusing segments of two distinct genes, represent the single major genetic aberration leading to cancer. We suggest that the unifying theme of these events is a high level of intrinsic structural disorder, enabling fusion proteins to evade cellular surveillance mechanisms that eliminate misfolded proteins. Predictions in 406 translocation-related human proteins show that they are significantly enriched in disorder (43.3% vs. 20.7% in all human proteins), they have fewer Pfam domains, and their translocation breakpoints tend to avoid domain splitting. The vicinity of the breakpoint is significantly more disordered than the rest of these already highly disordered fusion proteins. In the unlikely event of domain splitting in fusion it usually spares much of the domain or splits at locations where the newly exposed hydrophobic surface area approximates that of an intact domain. The mechanisms of action of fusion proteins suggest that in most cases their structural disorder is also essential to the acquired oncogenic function, enabling the long-range structural communication of remote binding and/or catalytic elements. In this respect, there are three major mechanisms that contribute to generating an oncogenic signal: (i) a phosphorylation site and a tyrosine-kinase domain are fused, and structural disorder of the intervening region enables intramolecular phosphorylation (e.g., BCR-ABL); (ii) a dimerisation domain fuses with a tyrosine kinase domain and disorder enables the two subunits within the homodimer to engage in permanent intermolecular phosphorylations (e.g., TFG-ALK); (iii) the fusion of a DNA-binding element to a transactivator domain results in an aberrant transcription factor that causes severe misregulation of transcription (e.g., EWS-ATF). Our findings also suggest novel strategies of intervention against the ensuing neoplastic transformations.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Genome-wide-analyses of Listeria monocytogenes from food-processing plants reveals clonal diversity and dates the emergence of persisting sequence types

Author: Akhter
Althaus
Ben Embarek
Carpentier
Chiara
Ciolacu
Cossart
Costerton
den Bakker
Drummond
Ebner
Fagerlund
Ferreira
Ford
Fox
Freitag
Glaser
Hain
Hall
Hein
Holch
Kuenne
Kvistholm Jensen
Kwong
Larsen
Le
Lee
Leekitcharoenphon
Lewis
Li
Lopez-Alonso
Lunter
Malley
Martin
Maury
Morganti
Moura
Müller
Nelson
Orsi
Ortiz
Pal
Ragon
Rambaut
Ramirez
Roche
Romling
Ryan
Schmitz-Esser
Seemann
Shiryev
Stamatakis
Stasiewicz
Stessl
Tamura
Valderrama
Vogel
Wang
Wulff
Xu
Zankari
Zerbino
Zhou
Publication venue: Wiley
Publication date: 01/01/2017

Crossref

Copenhagen University Research Information System

Online Research Database In Technology

Improved BLAST searches using longer words for protein seeding

Author: A. A. Schaffer
Altschul
Cameron
Chandonia
Edgar
Gertz
Gribskov
Henikoff
J. S. Papadopoulos
R. Agarwala
Robinson
S. A. Shiryev
Schaffer
Publication venue: Oxford University Press (OUP)

Crossref

Finding Candida auris in public metagenomic repositories

Author: Aleksandr Morgulis
Anastasia P Litvintseva
D Joseph Sexton
Elijah Lowe
John Phan
Jorge E Mario-Vasquez
Matthew Blumberg
Nancy A Chow
Richa Agarwala
Rory Welsh
Rytis Slatkevičius
Sergey Shiryev
Ujwal R Bagal
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2024

Candida auris is a newly emerged multidrug-resistant fungus capable of causing invasive infections with high mortality. Despite intense efforts to understand how this pathogen rapidly emerged and spread worldwide, its environmental reservoirs are poorly understood. Here, we present a collaborative effort between the U.S. Centers for Disease Control and Prevention, the National Center for Biotechnology Information, and GridRepublic (a volunteer computing platform) to identify C. auris sequences in publicly available metagenomic datasets. We developed the MetaNISH pipeline that uses SRPRISM to align sequences to a set of reference genomes and computes a score for each reference genome. We used MetaNISH to scan ~300,000 SRA metagenomic runs from 2010 onwards and identified five datasets containing C. auris reads. Finally, GridRepublic has implemented a prospective C. auris molecular monitoring system using MetaNISH and volunteer computing.

Directory of Open Access Journals

Bioproject metadata for samples with WGS data at SRA with C. auris positive hits.

Author: Aleksandr Morgulis (17817189)
Anastasia P. Litvintseva (8955608)
D. Joseph Sexton (17817192)
Elijah Lowe (4256932)
John Phan (6911606)
Jorge E. Mario-Vasquez (17817186)
Matthew Blumberg (17817198)
Nancy A. Chow (3907564)
Richa Agarwala (263961)
Rory Welsh (137881)
Rytis Slatkevičius (17817195)
Sergey Shiryev (3456506)
Ujwal R. Bagal (14047065)
Publication date: 19/01/2024

Bioproject metadata for samples with WGS data at SRA with C. auris positive hits.

The Francis Crick Institute

Additional file 1: Table S1. of X. couchianus and X. hellerii genome models provide genomic variation insight among Xiphophorus species

Author: Domitille Chalopin (668693)
Jean-Nicolas Volff (75406)
John Postlethwait (841459)
Manfred Schartl (75405)
Mikki Boswell (3456503)
Patrick Minx (2206)
Richa Agarwala (263961)
Ronald Walter (3456509)
Sergey Shiryev (3456506)
Tzintzuni Garcia (3344117)
Wesley Warren (225641)
William Boswell (3456500)
Yingjia Shen (228855)
Publication date: 14/12/2016

24 inter-chromosomal rearrangement events between X. couchianus and X. maculatus. (XLSX 49 kb)

The Francis Crick Institute