370 research outputs found

    Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets

    Current genome-wide association studies (GWAS) use commercial genotyping microarrays that can assay over a million single nucleotide polymorphisms (SNPs). The number of SNPs is further boosted by advanced statistical genotype-imputation algorithms and large SNP databases for reference human populations. Testing such a huge number of SNPs must be taken into account when interpreting statistical significance in genome-wide studies, but this is complicated by the non-independence of SNPs due to linkage disequilibrium (LD). Several previous groups have proposed using the effective number of independent markers (Me) to adjust for multiple testing, but current methods of calculating Me are limited in accuracy or computational speed. Here, we report a faster and more robust method to calculate Me. Applying this efficient method [implemented in a free software tool named Genetic type 1 error calculator (GEC)], we systematically examined Me, and the corresponding p-value thresholds required to control the genome-wide type 1 error rate at 0.05, for 13 Illumina or Affymetrix genotyping arrays, as well as for the HapMap Project and 1000 Genomes Project datasets, which are widely used as reference panels in genotype imputation. Our results suggest a p-value threshold of ~10⁻⁷ as the criterion for genome-wide significance for early commercial genotyping arrays, but slightly more stringent thresholds of ~5 × 10⁻⁸ for current or merged commercial genotyping arrays, ~10⁻⁸ for all common SNPs in the 1000 Genomes Project dataset, and ~5 × 10⁻⁸ for the common SNPs within genes only.
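The idea of an effective number of tests can be illustrated with a small sketch. The eigenvalue estimator below follows the spirit of earlier Me proposals (each eigenvalue of the SNP correlation matrix contributes one full test plus a fractional remainder); it is not the GEC algorithm itself, and `effective_tests` and the toy genotype matrix are illustrative, not part of the tool:

```python
import numpy as np

def effective_tests(genotypes):
    """Estimate the effective number of independent tests (Me) from a
    genotype matrix (individuals x SNPs) via the eigenvalues of the SNP
    LD correlation matrix. A sketch of the general Me idea, not GEC."""
    r = np.corrcoef(genotypes, rowvar=False)      # SNP x SNP correlation
    eigvals = np.clip(np.linalg.eigvalsh(r), 0, None)
    eigvals = np.round(eigvals, 8)                # guard against FP jitter
    # each eigenvalue >= 1 counts as one full test; the fractional
    # remainder contributes partially
    return sum(int(lam >= 1) + (lam - np.floor(lam)) for lam in eigvals)

# perfectly correlated SNPs collapse toward a single effective test
rng = np.random.default_rng(0)
snp = rng.integers(0, 3, size=200).astype(float)
g = np.column_stack([snp, snp, snp])              # three identical SNPs
me = effective_tests(g)
alpha = 0.05 / me   # Bonferroni-style genome-wide threshold using Me
```

With ~10⁶ effective tests the same division yields the familiar ~5 × 10⁻⁸ threshold.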

    ConDeTri - A Content Dependent Read Trimmer for Illumina Data

    During the last few years, DNA and RNA sequencing have come to play an increasingly important role in biological and medical applications, owing to the greater volume of data yielded by new sequencing machines and the enormous decrease in sequencing costs. Illumina/Solexa sequencing in particular has had a growing impact on data gathering from model and non-model organisms. However, accurate and easy-to-use tools for quality filtering have not yet been established. We present ConDeTri, a method for content-dependent read trimming of next-generation sequencing data that uses the quality score of each individual base. The main focus of the method is to remove sequencing errors from reads so that sequencing reads can be standardized. Another aspect is to incorporate read trimming into next-generation sequencing data processing and analysis pipelines. It can process single-end and paired-end sequence data of arbitrary length, is independent of sequencing coverage, and requires no user interaction. ConDeTri is able to trim and remove reads with low quality scores, saving computational time and memory during de novo assemblies. Low-coverage and large-genome sequencing projects will especially benefit from read trimming. The method can easily be incorporated into preprocessing and analysis pipelines for Illumina data.
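The basic shape of per-base quality trimming can be sketched as follows. This is a minimal 3'-end trimmer, not ConDeTri's actual content-dependent criterion (which also weighs runs of high-quality bases against interspersed low-quality ones); the function name and cutoffs are illustrative:

```python
def trim_read(seq, quals, q_cut=25, min_len=30):
    """Trim low-quality bases from the 3' end of a read using per-base
    Phred scores; return None if the read becomes too short to keep."""
    end = len(seq)
    while end > 0 and quals[end - 1] < q_cut:
        end -= 1
    if end < min_len:
        return None                # discard the read entirely
    return seq[:end], quals[:end]

# the last three bases fall below the cutoff and are trimmed away
seq   = "ACGTACGTACGT" * 3         # 36 bp read
quals = [38] * 33 + [10, 8, 5]
trimmed = trim_read(seq, quals)
```

A paired-end wrapper would additionally drop the mate (or route it to a singleton file) when one read of a pair is discarded.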

    Investigation into the annotation of protocol sequencing steps in the sequence read archive

    BACKGROUND: The workflow for producing high-throughput sequencing data from nucleic acid samples is complex. A series of protocol steps must be followed in preparing samples for next-generation sequencing. The bias introduced by a number of these steps, namely DNA fractionation, blunting, phosphorylation, adapter ligation and library enrichment, remains to be quantified. RESULTS: We examined the experimental metadata of the public Sequence Read Archive (SRA) repository to ascertain how well important sequencing steps are annotated in submissions to the database. Using SQL relational database queries (against the SRAdb SQLite database generated by the Bioconductor consortium) to search for keywords commonly occurring in key preparatory protocol steps, partitioned over studies, we found that 7.10%, 5.84% and 7.57% of all records had at least one keyword corresponding to fragmentation, ligation and enrichment, respectively. Only 4.06% of all records, partitioned over studies, had keywords for all three protocol steps (5.58% of all SRA records). CONCLUSIONS: The current level of annotation in the SRA inhibits systematic studies of bias due to these protocol steps. Downstream, meta-analyses and comparative studies based on these data will carry a source of bias that cannot at present be quantified.
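The kind of keyword query described above can be sketched against a toy database. The table and column names (`experiment`, `library_construction_protocol`) are assumed from our reading of the SRAdb schema and should be verified against an actual SRAmetadb.sqlite copy; the inserted rows are invented for illustration:

```python
import sqlite3

# toy stand-in for the SRAdb SQLite database
conn = sqlite3.connect(":memory:")
conn.execute("""CREATE TABLE experiment (
                    study_accession TEXT,
                    library_construction_protocol TEXT)""")
conn.executemany(
    "INSERT INTO experiment VALUES (?, ?)",
    [("SRP001", "DNA sheared by sonication, adapter ligation, enrichment"),
     ("SRP002", "standard protocol"),          # no step keywords at all
     ("SRP003", "Covaris fragmentation then ligation")])

# count distinct studies whose protocol text mentions at least one
# fragmentation-related keyword (LIKE is case-insensitive for ASCII)
keywords = ["%fragment%", "%shear%", "%sonicat%"]
where = " OR ".join("library_construction_protocol LIKE ?" for _ in keywords)
n_frag = conn.execute(
    f"SELECT COUNT(DISTINCT study_accession) FROM experiment WHERE {where}",
    keywords).fetchone()[0]
```

Repeating the query with ligation and enrichment keyword lists, then intersecting the study sets, reproduces the all-three-steps statistic.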

    Targeted Genome-Wide Enrichment of Functional Regions

    Only a small fraction of a large genome such as the human genome contains functional regions such as exons, promoters, and polyA sites. A platform technique for selective enrichment of functional genomic regions would enable several next-generation sequencing applications, including the discovery of causal mutations for disease and drug response. Here, we describe a powerful platform technique, termed “functional genomic fingerprinting” (FGF), for the multiplexed genome-wide isolation and analysis of targeted regions such as the exome, promoterome, or exon splice enhancers. The technique employs a uniquely designed Fixed-Randomized primer: the fixed part binds with full sequence complementarity at the multiple sites where the fixed sequence (such as a splice signal) occurs within the genome, while the randomized part contains all possible sequence permutations, so that the primers multiplex-amplify the many regions bounded by the fixed sequences (e.g., exons). Notably, validation of this technique using the cardiac myosin binding protein-C (MYBPC3) gene as an example strongly supports the applicability and efficacy of the method. Further, assisted by genome-wide computational analyses of such sequences, the FGF technique may provide a unique platform for high-throughput sample production and analysis of targeted genomic regions by next-generation sequencing, with powerful applications in discovering disease and drug-response genes.
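The "regions bounded by fixed sequences" idea can be explored in silico with a simple motif scan. This sketch only enumerates candidate fragments on one strand of a toy sequence; real FGF primers also carry the randomized 3' part and bind both strands, and the motif, function name, and size limit here are illustrative assumptions:

```python
import re

def bounded_regions(genome, fixed, max_len=2000):
    """List (start, end) spans bounded by two occurrences of a fixed
    motif -- the fragments a fixed-sequence primer pair could in
    principle co-amplify, up to a practical amplicon length."""
    starts = [m.start() for m in re.finditer(fixed, genome)]
    regions = []
    for i, s in enumerate(starts):
        for e in starts[i + 1:]:
            if e - s > max_len:
                break                       # later sites are even farther
            regions.append((s, e + len(fixed)))  # span includes both motifs
    return regions

# toy sequence with two copies of the splice-acceptor dinucleotide "AG"
genome = "TTTAGCCCCCAGTTT"
regions = bounded_regions(genome, "AG", max_len=10)
```

A genome-wide version of such a scan is what the computational analyses mentioned in the abstract would estimate amplicon counts from.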

    The PhyloPythiaS Web Server for Taxonomic Assignment of Metagenome Sequences

    Metagenome sequencing is becoming common, and there is an increasing need for easily accessible data-analysis tools. An essential step is the taxonomic classification of sequence fragments. We describe a web server for the taxonomic assignment of metagenome sequences with PhyloPythiaS, a fast and accurate sequence composition-based classifier that utilizes the hierarchical relationships between clades. Taxonomic assignments with the web server can be made with a generic model or with sample-specific models that users can specify and create. Several interactive visualization modes and multiple download formats allow quick and convenient analysis and downstream processing of taxonomic assignments. Here, we demonstrate usage of the web server by taxonomic assignment of metagenome samples from an acidophilic biofilm community of an acid mine and from a microbial community of cow rumen.
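"Sequence composition-based" classification typically starts from normalized k-mer frequency vectors. The sketch below shows that feature representation only; PhyloPythiaS's actual feature set, k values, and hierarchical model are described in the paper, and the function name here is illustrative:

```python
from collections import Counter
from itertools import product

def kmer_composition(seq, k=4):
    """Normalized k-mer frequency vector over the fixed ACGT alphabet --
    the kind of composition feature a classifier can be trained on."""
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    alphabet = ["".join(p) for p in product("ACGT", repeat=k)]
    return [counts[km] / total for km in alphabet]

# a 12-bp toy fragment yields a 16-dimensional vector for k=2
vec = kmer_composition("ACGTACGTACGT", k=2)
```

Fragments from the same clade tend to have similar composition vectors, which is what makes taxonomic assignment from composition feasible.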

    Meraculous: De Novo Genome Assembly with Short Paired-End Reads

    We describe a new algorithm, Meraculous, for whole-genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4-megabase genome of the haploid yeast Pichia stipitis. More than 95% of the genome is recovered, with no errors; half the assembled sequence is in contigs longer than 101 kilobases and in scaffolds longer than 269 kilobases. Incorporating fosmid ends recovers entire chromosomes. Meraculous relies on an efficient and conservative traversal of the subgraph of the k-mer (de Bruijn) graph restricted to oligonucleotides with unique high-quality extensions in the dataset, avoiding the explicit error-correction step used in other short-read assemblers. A novel memory-efficient hashing scheme is introduced. The resulting contigs are ordered and oriented using paired reads separated by ∼280 bp or ∼3.2 kbp, and many gaps between contigs can be closed using paired-end placements. Practical issues with the dataset are described, and prospects for assembling larger genomes are discussed.
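The conservative unique-extension traversal can be sketched in a few lines. This toy version extends a seed k-mer to the right only while exactly one extension base is observed, stopping at branches or cycles; it omits the quality filtering, reverse-complement handling, and memory-efficient hashing that the real assembler depends on, and the names are illustrative:

```python
from collections import defaultdict

def unique_extension_contig(reads, k, seed):
    """Grow a contig rightward from a seed k-mer, following only
    k-mers that have exactly one observed extension in the reads."""
    ext = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k):
            ext[read[i:i + k]].add(read[i + k])
    contig, kmer, seen = seed, seed, {seed}
    while len(ext[kmer]) == 1:        # stop at branches and dead ends
        base = next(iter(ext[kmer]))
        kmer = kmer[1:] + base
        if kmer in seen:              # stop on cycles
            break
        seen.add(kmer)
        contig += base
    return contig

reads = ["ACGTAC", "CGTACG", "GTACGT"]
contig = unique_extension_contig(reads, k=3, seed="ACG")
```

Because only unambiguous extensions are followed, sequencing errors (which create rare branch k-mers) terminate rather than corrupt contigs, which is why an explicit error-correction pass can be avoided.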

    Meta-analytic approach to the accurate prediction of secreted virulence effectors in gram-negative bacteria

    BACKGROUND: Many pathogens use a type III secretion system to translocate virulence proteins (called effectors) in order to adapt to the host environment. To date, many prediction tools for effector identification have been developed. However, these tools are insufficiently accurate to produce a list of putative effectors that can be applied directly to labor-intensive experimental verification. This also suggests that important features of effectors have yet to be fully characterized. RESULTS: In this study, we constructed an accurate approach to predicting secreted virulence effectors from Gram-negative bacteria, consisting of a support vector machine (SVM)-based discriminant analysis followed by simple criteria-based filtering. Accuracy was assessed by estimating the average number of true positives in the top-20 ranking of a genome-wide screen. For validation, 10 sets of 20 training and 20 testing examples were randomly selected from 40 known effectors of Salmonella enterica serovar Typhimurium LT2. On average, the SVM portion of our system predicted 9.7 true positives from 20 testing examples in the top-20 of the prediction. Removing the N-terminal instability, codon adaptation index and ProtParam indices decreased the score to 7.6, 8.9 and 7.9, respectively. These discriminative features suggest the following characteristics of effectors: an unstable N-terminus, non-optimal codon usage, hydrophilicity, and a lower aliphatic index. The secondary filtering step, comprising coexpression analysis and domain distribution analysis, further refined the average true-positive count to 12.3. We further confirmed that our system correctly predicts known effectors of P. syringae DC3000, strongly indicating its feasibility. CONCLUSIONS: We have developed an accurate prediction system for screening effectors on a genome-wide scale. We confirmed its accuracy by external validation using known effectors of Salmonella and obtained a list of putative effectors for the organism. The level of accuracy was sufficient to yield candidates for gene-directed experimental verification. Furthermore, new features of effectors were revealed: non-optimal codon usage and instability of the N-terminal region. From these findings, a new working hypothesis is proposed regarding the mechanisms controlling the translocation of virulence effectors and determining the substrate specificity encoded in the secretion system.
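The two-stage design (rank candidates with a learned discriminant, then filter the shortlist by simple criteria) can be sketched as below. A trivial linear scorer stands in for the trained SVM, and the feature names, weights, and cutoff are invented for illustration, not the paper's:

```python
def rank_and_filter(candidates, weights, top_n=3, max_cai=0.7):
    """candidates: {name: feature dict}. Rank by a linear score, keep
    the top_n, then drop any whose codon adaptation index exceeds
    max_cai (effectors tended to show non-optimal codon usage)."""
    score = lambda f: sum(weights[k] * f[k] for k in weights)
    ranked = sorted(candidates, key=lambda n: score(candidates[n]),
                    reverse=True)
    shortlist = ranked[:top_n]                     # SVM-style ranking stage
    return [n for n in shortlist                   # criteria-based filter
            if candidates[n]["cai"] <= max_cai]

weights = {"n_term_instability": 1.0, "hydrophilicity": 0.5, "cai": -1.0}
candidates = {
    "geneA": {"n_term_instability": 0.9, "hydrophilicity": 0.8, "cai": 0.4},
    "geneB": {"n_term_instability": 0.2, "hydrophilicity": 0.1, "cai": 0.9},
    "geneC": {"n_term_instability": 0.8, "hydrophilicity": 0.7, "cai": 0.8},
    "geneD": {"n_term_instability": 0.1, "hydrophilicity": 0.2, "cai": 0.3},
}
hits = rank_and_filter(candidates, weights)
```

geneC ranks highly in the first stage but is removed by the codon-usage filter, mirroring how the paper's secondary filtering raised the true-positive count beyond the SVM alone.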