12,252 research outputs found

    Identification and correction of systematic error in high-throughput sequence data

    Get PDF
    A feature common to all DNA sequencing technologies is the presence of base-call errors in the sequenced reads. The implications of such errors are application specific, ranging from minor informatics nuisances to major problems affecting biological inferences. Recently developed “next-gen” sequencing technologies have greatly reduced the cost of sequencing, but have been shown to be more error prone than previous technologies. Both position specific (depending on the location in the read) and sequence specific (depending on the sequence in the read) errors have been identified in Illumina and Life Technology sequencing platforms. We describe a new type of _systematic_ error that manifests as statistically unlikely accumulations of errors at specific genome (or transcriptome) locations. We characterize and describe systematic errors using overlapping paired reads form high-coverage data. We show that such errors occur in approximately 1 in 1000 base pairs, and that quality scores at systematic error sites do not account for the extent of errors. We identify motifs that are frequent at systematic error sites, and describe a classifier that distinguishes heterozygous sites from systematic error. Our classifier is designed to accommodate data from experiments in which the allele frequencies at heterozygous sites are not necessarily 0.5 (such as in the case of RNA-Seq). Systematic errors can easily be mistaken for heterozygous sites in individuals, or for SNPs in population analyses. Systematic errors are particularly problematic in low coverage experiments, or in estimates of allele-specific expression from RNA-Seq data. Our characterization of systematic error has allowed us to develop a program, called SysCall, for identifying and correcting such errors. We conclude that correction of systematic errors is important to consider in the design and interpretation of high-throughput sequencing experiments

    CpG Methylation of a Silent Controlling Element in the Murine Avy Allele Is Incomplete and Unresponsive to Methyl Donor Supplementation

    Get PDF
    Background: The viable yellow allele of agouti (A vy) is remarkable for its unstable and partially heritable epigenetic state, which produces wide variation in phenotypes of isogenic mice. In the A vy allele an inserted intracisternal A particle (IAP) acts as a controlling element which deregulates expression of agouti by transcription from the LTR of the IAP; the phenotypic state has been linked to CpG methylation of the LTR. Phenotypic variation between A vy mice indicates that the epigenetic state of the IAP is unstable in the germline. Principal Findings: We have made a detailed examination of somatic methylation of the IAP using bisulphite allelic sequencing, and find that the promoter is incompletely methylated even when it is transcriptionally silent. In utero exposure to supplementary methyl donors, which alters the spectrum of A vy phenotypes, does not increase the density of CpG methylation in the silent LTR. Conclusions: Our findings suggest that, contrary to previous supposition, methyl donor supplementation acts through an indirect mechanism to silence A vy. The incomplete cytosine methylation we observe at the somatically silent A vy allele ma

    MetMap Enables Genome-Scale Methyltyping for Determining Methylation States in Populations

    Get PDF
    The ability to assay genome-scale methylation patterns using high-throughput sequencing makes it possible to carry out association studies to determine the relationship between epigenetic variation and phenotype. While bisulfite sequencing can determine a methylome at high resolution, cost inhibits its use in comparative and population studies. MethylSeq, based on sequencing of fragment ends produced by a methylation-sensitive restriction enzyme, is a method for methyltyping (survey of methylation states) and is a site-specific and cost-effective alternative to whole-genome bisulfite sequencing. Despite its advantages, the use of MethylSeq has been restricted by biases in MethylSeq data that complicate the determination of methyltypes. Here we introduce a statistical method, MetMap, that produces corrected site-specific methylation states from MethylSeq experiments and annotates unmethylated islands across the genome. MetMap integrates genome sequence information with experimental data, in a statistically sound and cohesive Bayesian Network. It infers the extent of methylation at individual CGs and across regions, and serves as a framework for comparative methylation analysis within and among species. We validated MetMap's inferences with direct bisulfite sequencing, showing that the methylation status of sites and islands is accurately inferred. We used MetMap to analyze MethylSeq data from four human neutrophil samples, identifying novel, highly unmethylated islands that are invisible to sequence-based annotation strategies. The combination of MethylSeq and MetMap is a powerful and cost-effective tool for determining genome-scale methyltypes suitable for comparative and association studies

    Inhibition of Respiration by Nitric Oxide Induces a Mycobacterium tuberculosis Dormancy Program

    Get PDF
    An estimated two billion persons are latently infected with Mycobacterium tuberculosis. The host factors that initiate and maintain this latent state and the mechanisms by which M. tuberculosis survives within latent lesions are compelling but unanswered questions. One such host factor may be nitric oxide (NO), a product of activated macrophages that exhibits antimycobacterial properties. Evidence for the possible significance of NO comes from murine models of tuberculosis showing progressive infection in animals unable to produce the inducible isoform of NO synthase and in animals treated with a NO synthase inhibitor. Here, we show that O2 and low, nontoxic concentrations of NO competitively modulate the expression of a 48-gene regulon, which is expressed in vivo and prepares bacilli for survival during long periods of in vitro dormancy. NO was found to reversibly inhibit aerobic respiration and growth. A heme-containing enzyme, possibly the terminal oxidase in the respiratory pathway, likely senses and integrates NO and O2 levels and signals the regulon. These data lead to a model postulating that, within granulomas, inhibition of respiration by NO production and O2 limitation constrains M. tuberculosis replication rates in persons with latent tuberculosis

    Dynamical Masses for Pre-Main Sequence Stars: A Preliminary Physical Orbit for V773 Tau A

    Get PDF
    We report on interferometric and radial-velocity observations of the double-lined 51-d period binary (A) component of the quadruple pre-main sequence (PMS) system V773 Tau. With these observations we have estimated preliminary visual and physical orbits of the V773 Tau A subsystem. Among other parameters, our orbit model includes an inclination of 66.0 ±\pm 2.4 deg, and allows us to infer the component dynamical masses and system distance. In particular we find component masses of 1.54 ±\pm 0.14 and 1.332 ±\pm 0.097 M_{\sun} for the Aa (primary) and Ab (secondary) components respectively. Our modeling of the subsystem component spectral energy distributions finds temperatures and luminosities consistent with previous studies, and coupled with the component mass estimates allows for comparison with PMS stellar models in the intermediate-mass range. We compare V773 Tau A component properties with several popular solar-composition models for intermediate-mass PMS stars. All models predict masses consistent to within 2-sigma of the dynamically determined values, though some models predict values that are more consistent than others.Comment: ApJ in press; 25 pages, 6 figures; data tables available in journal versio

    Wedge states in string field theory

    Get PDF
    The wedge states form an important subalgebra in the string field theory. We review and further investigate their various properties. We find in particular a novel expression for the wedge states, which allows to understand their star products purely algebraically. The method allows also for treating the matter and ghost sectors separately. It turns out, that wedge states with different matter and ghost parts violate the associativity of the algebra. We introduce and study also wedge states with insertions of local operators and show how they are useful for obtaining exact results about convergence of level truncation calculations. These results help to clarify the issue of anomalies related to the identity and some exterior derivations in the string field algebra.Comment: 40 pages, 9 figures, v3: section 3.3 rewritten, few other corrections, set in JHEP styl

    Low power radiometric partial discharge sensor using composite transistor-reset integrator

    Get PDF
    The measurement of partial discharge provides a means of monitoring insulation health in high-voltage equipment. Traditional partial discharge measurements require separate installation for each item of plant to physically connect sensors with specific items. Wireless measurement methods provide an attractive and scalable alternative. Existing wireless monitoring technologies which use time-difference-of-arrival of a partial discharge signal at multiple, spatially separated, sensors place high demands on power consumption and cost due to a requirement for rapid sampling. A recently proposed partial discharge monitoring system using a wireless sensor network and measuring received signal strength only, has potential cost and scalability advantages. An incoherent wireless sensor incorporating a transistor-reset integrator has been developed that reduces the measurement bandwidth of the PD events and alleviates the need for high-speed sampling. It is based on composite amplifier techniques to reduce the power requirements by a factor of approximately four without compromising precision. The accuracy of the proposed sensor is compared to that obtained using a high-speed digital sampling oscilloscope. Received energies were measured over a 10 m distance in 1 m increments and produced an error within 1 dB beyond 4 m and 3.2 dB at shorter distances, resulting in a measurement accuracy within 1 m

    The WiggleZ Dark Energy Survey: improved distance measurements to z = 1 with reconstruction of the baryonic acoustic feature

    Get PDF
    We present significant improvements in cosmic distance measurements from the WiggleZ Dark Energy Survey, achieved by applying the reconstruction of the baryonic acoustic feature technique. We show using both data and simulations that the reconstruction technique can often be effective despite patchiness of the survey, significant edge effects and shot-noise. We investigate three redshift bins in the redshift range 0.2 < z < 1, and in all three find improvement after reconstruction in the detection of the baryonic acoustic feature and its usage as a standard ruler. We measure model-independent distance measures DV(rsfid/rs) of 1716 ± 83, 2221 ± 101, 2516 ± 86 Mpc (68 per cent CL) at effective redshifts z = 0.44, 0.6, 0.73, respectively, where DV is the volume-averaged distance, and rs is the sound horizon at the end of the baryon drag epoch. These significantly improved 4.8, 4.5 and 3.4 per cent accuracy measurements are equivalent to those expected from surveys with up to 2.5 times the volume of WiggleZ without reconstruction applied. These measurements are fully consistent with cosmologies allowed by the analyses of the Planck Collaboration and the Sloan Digital Sky Survey. We provide the DV(rsfid/rs) posterior probability distributions and their covariances. When combining these measurements with temperature fluctuations measurements of Planck, the polarization of Wilkinson Microwave Anisotropy Probe 9, and the 6dF Galaxy Survey baryonic acoustic feature, we do not detect deviations from a flat Λ cold dark matter (ΛCDM) model. Assuming this model, we constrain the current expansion rate to H₀ = 67.15 ± 0.98 km s⁻¹Mpc⁻¹. Allowing the equation of state of dark energy to vary, we obtain wDE = −1.080 ± 0.135. When assuming a curved ΛCDM model we obtain a curvature value of ΩK = −0.0043 ± 0.0047
    corecore