Search CORE

613 research outputs found

Faster algorithms for 1-mappability of a sequence

Author: A Amir
G Manzini
J Fischer
M Crochemore
MA Bender
ML Fredman
ML Metzker
NA Fonseca
SV Thankachan
T Derrien
U Manber
Publication venue
Publication date: 11/05/2017
Field of study

In the k-mappability problem, we are given a string x of length n and integers m and k, and we are asked to count, for each length-m factor y of x, the number of other factors of length m of x that are at Hamming distance at most k from y. We focus here on the version of the problem where k = 1. The fastest known algorithm for k = 1 requires time O(mn log n/ log log n) and space O(n). We present two algorithms that require worst-case time O(mn) and O(n log^2 n), respectively, and space O(n), thus greatly improving the state of the art. Moreover, we present an algorithm that requires average-case time and space O(n) for integer alphabets if m = {\Omega}(log n/ log {\sigma}), where {\sigma} is the alphabet size

arXiv.org e-Print Archive

Crossref

Small RNA analysis in Sindbis virus infected human HEK293 cells

Author: A Chakrabarti
A Mortazavi
A Saumet
Andras Donaszi-Ivanov
BC Ho
BR Cullen
BR tenOever
CL Campbell
CM Cirimotich
CW Burke
E Gottwein
EG Strauss
EM Morazzani
ER Mardis
EY Choy
F Ma
F Weber
G Gatto
G Szittya
I Mohorianu
I Mohorianu
IP Greene
Irina Mohorianu
J Fang
JI Henke
JK Ahluwalia
JR Abend
JY Leung
K Prufer
KJ Ishii
KL McKnight
KM Myles
KW Witwer
L Du
M Hariharan
MB Stocks
MC Saleh
ML Metzker
MP Gantier
N Vodovar
P Parameswaran
Penny P. Powell
PV Maillard
RL Pilcher
RL Skalsky
RP Kincaid
RW Williams
S Griffiths-Jones
S Koyama
S Moxon
SW Ding
T Kawai
Tamas Dalmay
V Stollar
W Hou
WB Klimstra
X Lei
Y Kim
Y Li
YQ Wu
Zach N. Adelman
ZN Adelman
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 31/12/2013
Field of study

In contrast to the defence mechanism of RNA interference (RNAi) in plants and invertebrates, its role in the innate response to virus infection of mammals is a matter of debate. Since RNAi has a well-established role in controlling infection of the alphavirus Sindbis virus (SINV) in insects, we have used this virus to investigate the role of RNAi in SINV infection of human cells

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

University of East Anglia digital repository

FigShare

Deep sequencing of virus-infected cells reveals HIV-encoded small RNAs

Author: A. van Kampen
Abbink
B. Berkhout
Bennasser
Berkhout
Berkhout
Boss
Cavanagh
Das
de Vries
F. Baas
Gaudray
Ghildiyal
Haasnoot
Haasnoot
J. Haasnoot
Kasschau
Klase
Klase
Klase
Klasens
Landry
Larocca
M. Willemsen
Meister
Metzker
Michael
Morris
N. C. T. Schopman
Parameswaran
Pfeffer
Qi
Sagare
Schopman
Schopman
Schopman
T. Bradley
Umbach
Verhoef
Voinnet
Watanabe
Waterhouse
Wilkins
Y. P. Liu
Yang
Zhou
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Small virus-derived interfering RNAs (viRNAs) play an important role in antiviral defence in plants, insects and nematodes by triggering the RNA interference (RNAi) pathway. The role of RNAi as an antiviral defence mechanism in mammalian cells has been obscure due to the lack of viRNA detection. Although viRNAs from different mammalian viruses have recently been identified, their functions and possible impact on viral replication remain unknown. To identify viRNAs derived from HIV-1, we used the extremely sensitive SOLiDTM 3 Plus System to analyse viRNA accumulation in HIV-1-infected T lymphocytes. We detected numerous small RNAs that correspond to the HIV-1 RNA genome. The majority of these sequences have a positive polarity (98.1%) and could be derived from miRNAs encoded by structured segments of the HIV-1 RNA genome (vmiRNAs). A small portion of the viRNAs is of negative polarity and most of them are encoded within the 3′-UTR, which may represent viral siRNAs (vsiRNAs). The identified vsiRNAs can potently repress HIV-1 production, whereas suppression of the vsiRNAs by antagomirs stimulate virus production. These results suggest that HIV-1 triggers the production of vsiRNAs and vmiRNAs to modulate cellular and/or viral gene expression

Crossref

PubMed Central

Next-generation sequencing of common osteogenesis imperfecta-related genes in clinical practice

Author: A Von Bubnoff
AM Barnes
AV Persikov
D Baldridge
D Basel
DO Sillence
F Rauch
F Sanger
FH Glorieux
FH Glorieux
G Sule
HL Rehm
J Körkkö
J Wang
JC Marini
JM Rothberg
K Misof
KJ Jepsen
LM Ward
M Zaidi
ML Metzker
R Morello
T Cundy
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/07/2018
Field of study

Next generation sequencing (NGS) is a rapidly developing area in genetics. Utilizing this technology in the management of disorders with complex genetic background and not recurrent mutation hot spots can be extremely useful. In this study, we applied NGS, namely semiconductor sequencing to determine the most significant osteogenesis imperfecta-related genetic variants in the clinical practice. We selected genes coding collagen type I alpha-1 and-2 (COL1A1, COL1A2) which are responsible for more than 90% of all cases. CRTAP and LEPRE1/P3H1 genes involved in the background of the recessive forms with relatively high frequency (type VII and VIII) represent less than 10% of the disease. In our six patients (1-41 years), we identified 23 different variants. We found a total of 14 single nucleotide variants (SNV) in COL1A1 and COL1A2, 5 in CRTAP and 4 in LEPRE1. Two novel and two already well-established pathogenic SNVs have been identified. Among the newly recognized mutations, one results in an amino acid change and one of them is a stop codon. We have shown that a new full-scale cost-effective NGS method can be developed and utilized to supplement diagnostic process of osteogenesis imperfecta with molecular genetic data in clinical practice

Crossref

Semmelweis Repository

Midgut microbiota of the malaria mosquito vector Anopheles gambiae and Interactions with plasmodium falciparum Infection

Author: A Chao
A Rani
AK Benson
Alexandra Marie
AM Briones
AM Mendes
Anne Boissière
AV Santos
B Chouaia
C Costantini
C Dale
C Damiani
C Fanello
C Gouveia
C Harris
C Wondji
CA Lowenberger
CB Pumpuni
CB Pumpuni
CD Marsden
CJF Ter Braak
CJF Ter Braak
CM Cirimotich
Dipankar Bachar
E Crotti
E Crotti
E Zaura
Elena A. Levashina
F Armougom
F Simard
FH Collins
G Avgustin
G Elango
G Favia
G Reid
Hamid R. Shahbazkia
II Ivanov
Isabelle Morlais
J Jadin
J Okasen
J Qin
J Rodrigues
JF Petrosino
JM Lindh
K Zouache
KD Vernick
Kenneth D. Vernick
KM Maslowski
KS Hayes
KW Mah
L Gonzalez-Ceron
Luc Abate
LV Hooper
LV Hooper
M Farenhorst
M Kane
Majoline T. Tchioffo
ME Bruno
MK Lawniczak
ML Metzker
ML Sogin
MM Riehle
MO Ndiath
MS Beier
N Buchon
N Windbichler
O Terenius
P Kampfer
Parfait H. Awono-Ambene
PJ Turnbaugh
R Andreotti
R Christen
R Fabre
R Kindt
R Miyata
RE Ley
RE Ley
Richard Christen
RJ Dillon
RJ Dillon
RM Moll
S Compant
S Meister
SA Blandin
Sandrine E. Nsango
SC Straif
SK Kuss
SM Geib
T Stoeck
TL Turner
U Hentschel
VG Martinson
Y Dong
Y Kikuchi
Y Kikuchi
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

The susceptibility of Anopheles mosquitoes to Plasmodium infections relies on complex interactions between the insect vector and the malaria parasite. A number of studies have shown that the mosquito innate immune responses play an important role in controlling the malaria infection and that the strength of parasite clearance is under genetic control, but little is known about the influence of environmental factors on the transmission success. We present here evidence that the composition of the vector gut microbiota is one of the major components that determine the outcome of mosquito infections. A. gambiae mosquitoes collected in natural breeding sites from Cameroon were experimentally challenged with a wild P. falciparum isolate, and their gut bacterial content was submitted for pyrosequencing analysis. The meta-taxogenomic approach revealed a broader richness of the midgut bacterial flora than previously described. Unexpectedly, the majority of bacterial species were found in only a small proportion of mosquitoes, and only 20 genera were shared by 80% of individuals. We show that observed differences in gut bacterial flora of adult mosquitoes is a result of breeding in distinct sites, suggesting that the native aquatic source where larvae were grown determines the composition of the midgut microbiota. Importantly, the abundance of Enterobacteriaceae in the mosquito midgut correlates significantly with the Plasmodium infection status. This striking relationship highlights the role of natural gut environment in parasite transmission. Deciphering microbe-pathogen interactions offers new perspectives to control disease transmission.Institut de Recherche pour le Developpement (IRD); French Agence Nationale pour la Recherche [ANR-11-BSV7-009-01]; European Community [242095, 223601]info:eu-repo/semantics/publishedVersio

Public Library of Science (PLOS)

Crossref

HAL-UNICE

Directory of Open Access Journals

PubMed Central

HAL-IRD

Red de Bibliotecas Virtuales de Ciencias Sociales de América Latina y El Caribe

Sapientia

Horizon / Pleins textes

Hal-Diderot

DNA Damage in Plant Herbarium Tissue

Author: A Untergasser
AJ Hansen
AJ Hansen
Argelia Cuenca
AW Briggs
AW Briggs
B Shapiro
BA Rowan
Carles Lalueza-Fox
CD Millar
D Blankenberg
D Blankenberg
DP Bebber
DR Smith
Freek T. Bakker
Gitte Petersen
J Goecks
James E. Richardson
JJ Doyle
JJ Doyle
L Drábková
LN Jobba
M Hofreiter
M Srinivansan
M Stiller
Martijn Staats
ML Metzker
MM Pyle
MTP Gilbert
MW Chase
OF Cubero
Ole Seberg
P Boesch
P Brotherton
P Heyn
P Sebastian
PF McCabe
PM Hollingsworth
Ria Vrielink-van Ginkel
S Proost
S Pääbo
S Pääbo
S Telle
SA Harris
SS Gill
T Lindahl
T Lindahl
T Lindahl
T Lindahl
T Roldán-Arjona
TJ Reape
V Savolainen
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Dried plant herbarium specimens are potentially a valuable source of DNA. Efforts to obtain genetic information from this source are often hindered by an inability to obtain amplifiable DNA as herbarium DNA is typically highly degraded. DNA post-mortem damage may not only reduce the number of amplifiable template molecules, but may also lead to the generation of erroneous sequence information. A qualitative and quantitative assessment of DNA post-mortem damage is essential to determine the accuracy of molecular data from herbarium specimens. In this study we present an assessment of DNA damage as miscoding lesions in herbarium specimens using 454-sequencing of amplicons derived from plastid, mitochondrial, and nuclear DNA. In addition, we assess DNA degradation as a result of strand breaks and other types of polymerase non-bypassable damage by quantitative real-time PCR. Comparing four pairs of fresh and herbarium specimens of the same individuals we quantitatively assess post-mortem DNA damage, directly after specimen preparation, as well as after long-term herbarium storage. After specimen preparation we estimate the proportion of gene copy numbers of plastid, mitochondrial, and nuclear DNA to be 2.4–3.8% of fresh control DNA and 1.0–1.3% after long-term herbarium storage, indicating that nearly all DNA damage occurs on specimen preparation. In addition, there is no evidence of preferential degradation of organelle versus nuclear genomes. Increased levels of C→T/G→A transitions were observed in old herbarium plastid DNA, representing 21.8% of observed miscoding lesions. We interpret this type of post-mortem DNA damage-derived modification to have arisen from the hydrolytic deamination of cytosine during long-term herbarium storage. Our results suggest that reliable sequence data can be obtained from herbarium specimens

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Copenhagen University Research Information System

edocUR

Investigation into the annotation of protocol sequencing steps in the sequence read archive

Author: A Brazma
A Brazma
A Seguin-Orlando
ER Mardis
ER Mardis
F Meacham
I Kozarewa
J Housby
J Orlowski
JA Sikorsky
JC Dohm
JH Eastberg
JR Miller
KD Hansen
M Allhoff
MA Quail
MG Ross
ML Metzker
MS Cheung
N Kamps-Hughes
P Keohavong
R Edgar
R Leinonen
S Spitaleri
SG Acinas
SL Schwartz
T Nakazato
X Jiao
YC Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

BACKGROUND: The workflow for the production of high-throughput sequencing data from nucleic acid samples is complex. There are a series of protocol steps to be followed in the preparation of samples for next-generation sequencing. The quantification of bias in a number of protocol steps, namely DNA fractionation, blunting, phosphorylation, adapter ligation and library enrichment, remains to be determined. RESULTS: We examined the experimental metadata of the public repository Sequence Read Archive (SRA) in order to ascertain the level of annotation of important sequencing steps in submissions to the database. Using SQL relational database queries (using the SRAdb SQLite database generated by the Bioconductor consortium) to search for keywords commonly occurring in key preparatory protocol steps partitioned over studies, we found that 7.10%, 5.84% and 7.57% of all records (fragmentation, ligation and enrichment, respectively), had at least one keyword corresponding to one of the three protocol steps. Only 4.06% of all records, partitioned over studies, had keywords for all three steps in the protocol (5.58% of all SRA records). CONCLUSIONS: The current level of annotation in the SRA inhibits systematic studies of bias due to these protocol steps. Downstream from this, meta-analyses and comparative studies based on these data will have a source of bias that cannot be quantified at present

Crossref

Springer - Publisher Connector

Royal Holloway - Pure

PubMed Central

Spiral - Imperial College Digital Repository

Measuring, in solution, multiple-fluorophore labeling by combining Fluorescence Correlation Spectroscopy and photobleaching

Author: Balannik V.
Berlier J. E.
Chen L.-J.
Chen Y.
Chen Y.
Cuppoletti A. C.
Das S. K.
Deschenes L. A.
Elson E. L.
Füreder-Kitzmüller E.
Gregor I.
Hesse J.
Huang B.
Huang Z.
Kask P.
Kendall M. G.
Koo E. H.
Lakowicz J. R.
Luchowski R.
Luchowski R.
Margaritis T.
Messina T. C.
Metzker M. L.
Moerner W. E.
Mutch S. A.
Müller J. D.
Nguyen V. T.
Petrasek Z.
Saffarian S.
Sanborn M. E.
Sarkar P.
Shaner N. C.
Singh D.
Tinland B.
Ulbrich M. H.
Wagenknecht H.-A.
Wang Y. P.
Widengren J.
Wu B.
’t Hoen P.A. C.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/01/2010
Field of study

Determining the number of fluorescent entities that are coupled to a given molecule (DNA, protein, etc.) is a key point of numerous biological studies, especially those based on a single molecule approach. Reliable methods are important, in this context, not only to characterize the labeling process, but also to quantify interactions, for instance within molecular complexes. We combined Fluorescence Correlation Spectroscopy (FCS) and photobleaching experiments to measure the effective number of molecules and the molecular brightness as a function of the total fluorescence count rate on solutions of cDNA (containing a few percent of C bases labeled with Alexa Fluor 647). Here, photobleaching is used as a control parameter to vary the experimental outputs (brightness and number of molecules). Assuming a Poissonian distribution of the number of fluorescent labels per cDNA, the FCS-photobleaching data could be easily fit to yield the mean number of fluorescent labels per cDNA strand (@ 2). This number could not be determined solely on the basis of the cDNA brightness, because of both the statistical distribution of the number of fluorescent labels and their unknown brightness when incorporated in cDNA. The statistical distribution of the number of fluorophores labeling cDNA was confirmed by analyzing the photon count distribution (with the cumulant method), which showed clearly that the brightness of cDNA strands varies from one molecule to the other.Comment: 38 pages (avec les figures

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

HAL Descartes

HAL-CEA

RNASeqBrowser: A genome browser for simultaneous visualization of raw strand specific RNAseq reads and UCSC genome browser custom tracks

Author: A McKenna
Atul Sajjanhar
Chenwei Wang
Colleen C Nelson
D Karolchik
David L A Wood
DC Koboldt
Gregor Tevz
H Li
H Li
H Thorvaldsdottir
I Milne
IL Hofacker
J Lai
J Severin
Jiyuan An
John Lai
JT Robinson
JW Nicol
M Fiume
M Fiume
MA DePristo
Melanie L Lehman
ML Metzker
PA Fujita
T Abeel
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers

Author: A Baross
AB Olshen
AJ Bass
AJ Holland
B Nilsson
BA Weir
Barbara Hill
BM Bolstad
BS Taylor
C Greenman
C Li
C Li
Craig H Mermel
D Chiang
D Etemadmoghadam
D Hanahan
DY Chiang
E Pleasance
ED Pleasance
ES Venkatraman
F Sanchez-Garcia
G Schwarz
Gad Getz
GR Bignell
HS Dahlback
LM Merlo
M Guttman
M Metzker
Matthew L Meyerson
MR Stratton
Network CGAR
NT Leach
P Hupé
PA Northcott
PJ Stephens
PJ Stephens
R Beroukhim
R Beroukhim
R Firestein
R McLendon
Rameen Beroukhim
SA McCarroll
SJ Diskin
SP Shah
Steven E Schumacher
T Santarius
T Sjoblom
WM Lin
Y Benjamini
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

We describe methods with enhanced power and specificity to identify genes targeted by somatic copy-number alterations (SCNAs) that drive cancer growth. By separating SCNA profiles into underlying arm-level and focal alterations, we improve the estimation of background rates for each category. We additionally describe a probabilistic method for defining the boundaries of selected-for SCNA regions with user-defined confidence. Here we detail this revised computational approach, GISTIC2.0, and validate its performance in real and simulated datasets

DSpace@MIT

Crossref

Springer - Publisher Connector

PubMed Central