Search CORE

396 research outputs found

mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications

Author: Alkan C.
Eichler E. E.
Hach F.
Hormozdiari F.
Sahinalp S. C.
Sarrafi I.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2014
Field of study

Cataloged from PDF version of article.High throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for processing and downstream analysis. While tools that report the 'best' mapping location of each read provide a fast way to process HTS data, they are not suitable for many types of downstream analysis such as structural variation detection, where it is important to report multiple mapping loci for each read. For this purpose we introduce mrsFAST-Ultra, a fast, cache oblivious, SNP-aware aligner that can handle the multi-mapping of HTS reads very efficiently. mrsFAST-Ultra improves mrsFAST, our first cache oblivious read aligner capable of handling multi-mapping reads, through new and compact index structures that reduce not only the overall memory usage but also the number of CPU operations per alignment. In fact the size of the index generated by mrsFAST-Ultra is 10 times smaller than that of mrsFAST. As importantly, mrsFAST-Ultra introduces new features such as being able to (i) obtain the best mapping loci for each read, and (ii) return all reads that have at most n mapping loci (within an error threshold), together with these loci, for any user specified n. Furthermore, mrsFAST-Ultra is SNP-aware, i.e. it can map reads to reference genome while discounting the mismatches that occur at common SNP locations provided by db-SNP; this significantly increases the number of reads that can be mapped to the reference genome. Notice that all of the above features are implemented within the index structure and are not simple post-processing steps and thus are performed highly efficiently. Finally, mrsFAST-Ultra utilizes multiple available cores and processors and can be tuned for various memory settings. Our results show that mrsFAST-Ultra is roughly five times faster than its predecessor mrsFAST. In comparison to newly enhanced popular tools such as Bowtie2, it is more sensitive (it can report 10 times or more mappings per read) and much faster (six times or more) in the multi-mapping mode. Furthermore, mrsFAST-Ultra has an index size of 2GB for the entire human reference genome, which is roughly half of that of Bowtie2. mrsFAST-Ultra is open source and it can be accessed at http://mrsfast.sourceforge.net

CiteSeerX

Bilkent University Institutional Repository

PubMed Central

Dissect: detection and characterization of novel structural alterations in transcribed sequences

Author: Brassesco
Brudno
Burge
B secke
C. C. Collins
Caudevilla
D. Yorukoglu
De Braekeleer
F. Hach
Frantz
Gingeras
Hach
Horiuchi
I. Birol
Kidd
L. Swanson
Labrador
Levin
McPherson
Miller
Minoche
Mott
Nacu
S. C. Sahinalp
Sboner
Slater
Takahashi
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Motivation: Computational identification of genomic structural variants via high-throughput sequencing is an important problem for which a number of highly sophisticated solutions have been recently developed. With the advent of high-throughput transcriptome sequencing (RNA-Seq), the problem of identifying structural alterations in the transcriptome is now attracting significant attention

DSpace@MIT

Crossref

PubMed Central

Effect of empagliflozin monotherapy on postprandial glucose and 24-hour glucose variability in Japanese patients with type 2 diabetes mellitus: a randomized, double-blind, placebo-controlled, 4-week study

Author: Afshin Salsali
Kazuki Koiwai
Kohei Inoue
Rimei Nishimura
Søren S Lund
Thomas Hach
Uli C Broedl
Yuko Tanaka
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Crossref

Fast and accurate mapping of Complete Genomics reads

Author: Alkan C.
Hach F.
Hormozdiari F.
Lee D.
Mutlu O.
Xin H.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

Many recent advances in genomics and the expectations of personalized medicine are made possible thanks to power of high throughput sequencing (HTS) in sequencing large collections of human genomes. There are tens of different sequencing technologies currently available, and each HTS platform have different strengths and biases. This diversity both makes it possible to use different technologies to correct for shortcomings; but also requires to develop different algorithms for each platform due to the differences in data types and error models. The first problem to tackle in analyzing HTS data for resequencing applications is the read mapping stage, where many tools have been developed for the most popular HTS methods, but publicly available and open source aligners are still lacking for the Complete Genomics (CG) platform. Unfortunately, Burrows-Wheeler based methods are not practical for CG data due to the gapped nature of the reads generated by this method. Here we provide a sensitive read mapper (sirFAST) for the CG technology based on the seed-and-extend paradigm that can quickly map CG reads to a reference genome. We evaluate the performance and accuracy of sirFAST using both simulated and publicly available real data sets, showing high precision and recall rates. © 2014 Elsevier Inc

Bilkent University Institutional Repository

PubMed Central

eScholarship - University of California

Developing surrogate markers for predicting antibiotic resistance "hot spots" in rivers where limited data are available

Author: Attal M.
Benedini M.
Clinical and Laboratory Standard Institute
Desktop E. A.
Fair G. M.
Hach
Hach
Hach
Hach
Hands C.
Huang Y. F.
Hydromatch
Michaud J. P.
Ministry of Health Malaysia
Nuzzo R.
Revelle W.
Sigma Aldrich
Sivasampu S.
Sobsey M. A.
UNICEF
Weiner R.
WEPA
Wickham H.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 01/06/2021
Field of study

Pinpointing environmental antibiotic resistance (AR) hot spots in low-and middle-income countries (LMICs) is hindered by a lack of available and comparable AR monitoring data relevant to such settings. Addressing this problem, we performed a comprehensive spatial and seasonal assessment of water quality and AR conditions in a Malaysian river catchment to identify potential "simple"surrogates that mirror elevated AR. We screened for resistant coliforms, 22 antibiotics, 287 AR genes and integrons, and routine water quality parameters, covering absolute concentrations and mass loadings. To understand relationships, we introduced standardized "effect sizes"(Cohen's D) for AR monitoring to improve comparability of field studies. Overall, water quality generally declined and environmental AR levels increased as one moved down the catchment without major seasonal variations, except total antibiotic concentrations that were higher in the dry season (Cohen's D > 0.8, P < 0.05). Among simple surrogates, dissolved oxygen (DO) most strongly correlated (inversely) with total AR gene concentrations (Spearman's ρ 0.81, P < 0.05). We suspect this results from minimally treated sewage inputs, which also contain AR bacteria and genes, depleting DO in the most impacted reaches. Thus, although DO is not a measure of AR, lower DO levels reflect wastewater inputs, flagging possible AR hot spots. DO measurement is inexpensive, already monitored in many catchments, and exists in many numerical water quality models (e.g., oxygen sag curves). Therefore, we propose combining DO data and prospective modeling to guide local interventions, especially in LMIC rivers with limited data

Crossref

Universiti Teknologi Malaysia Institutional Repository

Robertson Intelligent States

Author: Abramowitz M
Agarwal G S
Aragone C
Barut A O
Barut A O
Biedenharn L C
Bogdanovic R
Brif C
Brif C
D A Trifonov
Dodonov V V
Dodonov V V
Gantmaher F R
Hach E E III
Hillery M
Holz A
Klauder J R
Klauder J R
Loudon R
Lu E Y C
Ma X
Macfarlane A J
Malkin I A
Malkin I A
Man'ko Olga
Man'ko V I
Naimark M A
Nikolov B A
Oh C H
Provost J
Schrödinger E
Simon R
Sudarshan E C G
Trifonov D A
Trifonov D A
Trifonov D A
Walls D F
Wünsche A
Publication venue: 'IOP Publishing'
Publication date: 01/01/1997
Field of study

Diagonalization of uncertainty matrix and minimization of Robertson inequality for n observables are considered. It is proved that for even n this relation is minimized in states which are eigenstates of n/2 independent complex linear combinations of the observables. In case of canonical observables this eigenvalue condition is also necessary. Such minimizing states are called Robertson intelligent states (RIS). The group related coherent states (CS) with maximal symmetry (for semisimple Lie groups) are particular case of RIS for the quadratures of Weyl generators. Explicit constructions of RIS are considered for operators of su(1,1), su(2), h_N and sp(N,R) algebras. Unlike the group related CS, RIS can exhibit strong squeezing of group generators. Multimode squared amplitude squeezed states are naturally introduced as sp(N,R) RIS. It is shown that the uncertainty matrices for quadratures of q-deformed boson operators a_{q,j} (q > 0) and of any k power of a_j = a_{1,j} are positive definite and can be diagonalized by symplectic linear transformations. PACS numbers: 03.65.Fd, 42.50.DvComment: 23 pages, LaTex. Minor changes in text and references. Accepted in J. Phys.

arXiv.org e-Print Archive

CiteSeerX

Crossref

CERN Document Server

Barut-Girardello coherent states for u(p,q) and sp(N,R) and their macroscopic superpositions

Author: Abramowitz M
Agarwal G S
Arvind
Barut A O
Barut A O
Brif C
Brif C
Brif C
Brif C
Buzek V
D A Trifonov
Hach E E III
Haroche S
Holz A
Johsi J
Klauder J R
Klyshko D N
Loudon R
Ma X
Malkin I A
Mandel L
Moya-Cessa H
Nagel B
Shanta P
Simon R
Szabo S
Todorov I T
Trifonov D A
Trifonov D A
Trifonov D A
Trifonov D A
Trifonov D A
Trifonov D A
Vourdas A
Walls D F
Publication venue: 'IOP Publishing'
Publication date: 27/11/1997
Field of study

The Barut-Girardello coherent states (BG CS) representation is extended to the noncompact algebras u(p,q) and sp(N,R) in (reducible) quadratic boson realizations. The sp(N,R) BG CS take the form of multimode ordinary Schr\"odinger cat states. Macroscopic superpositions of 2^{n-1} sp(N,R) CS (2^n canonical CS, n=1,2,...) are pointed out which are overcomplete in the N-mode Hilbert space and the relation between the canonical CS and the u(p,q) BG-type CS representations is established. The sets of u(p,q) and sp(N,R) BG CS and their discrete superpositions contain many states studied in quantum optics (even and odd N-mode CS, pair CS) and provide an approach to quadrature squeezing, alternative to that of intelligent states. New subsets of weakly and strongly nonclassical states are pointed out and their statistical properties (first- and second-order squeezing, photon number distributions) are discussed. For specific values of the angle parameters and small amplitude of the canonical CS components these states approaches multimode Fock states with one, two or three bosons/photons. It is shown that eigenstates of a squared non-Hermitian operator A^2 (generalized cat states) can exhibit squeezing of the quadratures of A.Comment: 29 pages, LaTex, 5 figures. Improvements in text, corrections in some formulas. To appear in J. Phys. A, v. 3

arXiv.org e-Print Archive

Crossref

CERN Document Server

A single-chain insulin-like growth factor I/insulin hybrid binds with high affinity to the insulin receptor

Author: A S Andersen
C Kristensen
F C Wiberg
L Schäffer
M Hach
T Kjeldsen
Publication venue: 'Portland Press Ltd.'
Publication date
Field of study

Crossref

Structural variation and fusion detection using targeted sequencing data from circulating cell free DNA

Author: Adra Nabil
Asghari Hossein
Collins Colin C.
Gawroński Alexander R.
Hach Faraz
Koçkan Can
LeBihan Stephane
Lin Yen-Yi
McConeghy Brian
Orabi Baraa
Pili Roberto
Sahinalp S. Cenk
Publication venue: 'Oxford University Press (OUP)'
Publication date: 23/04/2019
Field of study

MOTIVATION: Cancer is a complex disease that involves rapidly evolving cells, often forming multiple distinct clones. In order to effectively understand progression of a patient-specific tumor, one needs to comprehensively sample tumor DNA at multiple time points, ideally obtained through inexpensive and minimally invasive techniques. Current sequencing technologies make the 'liquid biopsy' possible, which involves sampling a patient's blood or urine and sequencing the circulating cell free DNA (cfDNA). A certain percentage of this DNA originates from the tumor, known as circulating tumor DNA (ctDNA). The ratio of ctDNA may be extremely low in the sample, and the ctDNA may originate from multiple tumors or clones. These factors present unique challenges for applying existing tools and workflows to the analysis of ctDNA, especially in the detection of structural variations which rely on sufficient read coverage to be detectable. RESULTS: Here we introduce SViCT , a structural variation (SV) detection tool designed to handle the challenges associated with cfDNA analysis. SViCT can detect breakpoints and sequences of various structural variations including deletions, insertions, inversions, duplications and translocations. SViCT extracts discordant read pairs, one-end anchors and soft-clipped/split reads, assembles them into contigs, and re-maps contig intervals to a reference genome using an efficient k-mer indexing approach. The intervals are then joined using a combination of graph and greedy algorithms to identify specific structural variant signatures. We assessed the performance of SViCT and compared it to state-of-the-art tools using simulated cfDNA datasets with properties matching those of real cfDNA samples. The positive predictive value and sensitivity of our tool was superior to all the tested tools and reasonable performance was maintained down to the lowest dilution of 0.01% tumor DNA in simulated datasets. Additionally, SViCT was able to detect all known SVs in two real cfDNA reference datasets (at 0.6-5% ctDNA) and predict a novel structural variant in a prostate cancer cohort

IUPUIScholarWorks