762 research outputs found
FMLRC: Hybrid long read error correction using an FM-index
Abstract Background Long read sequencing is changing the landscape of genomic research, especially de novo assembly. Despite the high error rate inherent to long read technologies, increased read lengths dramatically improve the continuity and accuracy of genome assemblies. However, the cost and throughput of these technologies limits their application to complex genomes. One solution is to decrease the cost and time to assemble novel genomes by leveraging “hybrid” assemblies that use long reads for scaffolding and short reads for accuracy. Results We describe a novel method leveraging a multi-string Burrows-Wheeler Transform with auxiliary FM-index to correct errors in long read sequences using a set of complementary short reads. We demonstrate that our method efficiently produces significantly more high quality corrected sequence than existing hybrid error-correction methods. We also show that our method produces more contiguous assemblies, in many cases, than existing state-of-the-art hybrid and long-read only de novo assembly methods. Conclusion Our method accurately corrects long read sequence data using complementary short reads. We demonstrate higher total throughput of corrected long reads and a corresponding increase in contiguity of the resulting de novo assemblies. Improved throughput and computational efficiency than existing methods will help better economically utilize emerging long read sequencing technologies
WISE/NEOWISE Observations of Comet 103P/Hartley 2
We report results based on mid-infrared photometry of comet 103P/Hartley 2 taken during 2010 May 4-13 (when the comet was at a heliocentric distance of 2.3 AU, and an observer distance of 2.0 AU) by the Wide-field Infrared Survey Explorer. Photometry of the coma at 22 μm and data from the University of Hawaii 2.2 m telescope obtained on 2010 May 22 provide constraints on the dust particle size distribution, d log n/d log m, yielding power-law slope values of alpha = –0.97 ± 0.10, steeper than that found for the inbound particle fluence during the Stardust encounter of comet 81P/Wild 2. The extracted nucleus signal at 12 μm is consistent with a body of average spherical radius of 0.6 ± 0.2 km (one standard deviation), assuming a beaming parameter of 1.2. The 4.6 μm band signal in excess of dust and nucleus reflected and thermal contributions may be attributed to carbon monoxide or carbon dioxide emission lines and provides limits and estimates of species production. Derived carbon dioxide coma production rates are 3.5(± 0.9) × 10^(24) molecules per second. Analyses of the trail signal present in the stacked image with an effective exposure time of 158.4 s yields optical-depth values near 9 × 10^(–10) at a delta mean anomaly of 0.2 deg trailing the comet nucleus, in both 12 and 22 μm bands. A minimum chi-squared analysis of the dust trail position yields a beta-parameter value of 1.0 × 10^(–4), consistent with a derived mean trail-grain diameter of 1.1/ρ cm for grains of ρ g cm^(–3) density. This leads to a total detected trail mass of at least 4 × 10^(10) ρ kg
Dominant and Recessive Compound Heterozygous Mutations in Epidermolysis Bullosa Simplex Demonstrate the Role of the Stutter Region in Keratin Intermediate Filament Assembly
Keratin intermediate filaments are important cytoskeletal structural proteins involved in maintaining cell shape and function. Mutations in the epidermal keratin genes, keratin 5 or keratin 14 lead to the disruption of keratin filament assembly, resulting in an autosomal dominant inherited blistering skin disease, epidermolysis bullosa simplex (EBS). We investigated a large EBS kindred who exhibited a markedly heterogeneous clinical presentation and detected two distinct keratin 5 mutations in the proband, the most severely affected. One missense mutation (E170K) in the highly conserved helix initiation peptide sequence of the 1A rod domain was found in all the affected family members. In contrast, the other missense mutation (E418K) was found only in the proband. The E418K mutation was located in the stutter region, an interruption in the heptad repeat regularity, whose function as yet remains unclear. We hypothesized that this mutated stutter allele was clinically silent when combined with the wild type allele but aggravates the clinical severity of EBS caused by the E170K mutation on the other allele. To confirm this in vitro, we transfected mutant keratin 5 cDNA into cultured cells. Although only 12.7% of the cells transfected with the E170K mutation alone showed disrupted keratin filament aggregations, significantly more cells (30.0%) cotransfected with both E170K and E418K mutations demonstrated keratin aggregation (p < 0.05). These transfection assay results corresponded to the heterogeneous clinical findings of the EBS patient in this kindred. We have identified the first case of both compound heterozygous dominant (E170K) and recessive (E418K) mutations in any keratin gene and confirmed the significant involvement of the stutter region in the assembly and organization of the keratin intermediate filament network in vitro
Canonical Melnikov theory for diffeomorphisms
We study perturbations of diffeomorphisms that have a saddle connection
between a pair of normally hyperbolic invariant manifolds. We develop a
first-order deformation calculus for invariant manifolds and show that a
generalized Melnikov function or Melnikov displacement can be written in a
canonical way. This function is defined to be a section of the normal bundle of
the saddle connection.
We show how our definition reproduces the classical methods of Poincar\'{e}
and Melnikov and specializes to methods previously used for exact symplectic
and volume-preserving maps. We use the method to detect the transverse
intersection of stable and unstable manifolds and relate this intersection to
the set of zeros of the Melnikov displacement.Comment: laTeX, 31 pages, 3 figure
A Century of Grading Research: Meaning and Value in the Most Common Educational Measure
Grading refers to the symbols assigned to individual pieces of student work or to composite measures of student performance on report cards. This review of over 100 years of research on grading considers five types of studies: (a) early studies of the reliability of grades, (b) quantitative studies of the composition of K-12 report card grades, (c) survey and interview studies of teachers’ perceptions of grades, (d) studies of standards-based grading, and (e) grading in higher education. Early 20th century studies generally condemned teachers’ grades as unreliable. More recent studies of the relationships of grades to tested achievement and survey studies of teachers’ grading practices and beliefs suggest that grades assess a multidimensional construct containing both cognitive and non-cognitive factors reflecting what teachers value in student work. Implications for future research and for grading practices are discussed
Extracting science from surveys of our Galaxy
Our knowledge of the Galaxy is being revolutionised by a series of
photometric, spectroscopic and astrometric surveys. Already an enormous body of
data is available from completed surveys, and data of ever increasing quality
and richness will accrue at least until the end of this decade. To extract
science from these surveys we need a class of models that can give probability
density functions in the space of the observables of a survey -- we should not
attempt to "invert" the data from the space of observables into the physical
space of the Galaxy. Currently just one class of model has the required
capability, so-called "torus models". A pilot application of torus models to
understanding the structure of the Galaxy's thin and thick discs has already
produced two significant results: a major revision of our best estimate of the
Sun's velocity with respect to the Local Standard of Rest, and a successful
prediction of the way in which the vertical velocity dispersion in the disc
varies with distance from the Galactic plane.Comment: 13 pages. Invited review to appear in Pramana - journal of physics
(Indian Academy of Sciences
The Shape of Galaxy Cluster Dark Matter Haloes: Systematics of Its Imprint on Cluster Gas, and Comparison to Observations
(Abridged) We study predictions for galaxy cluster observables that can test
the statistics of dark matter halo shapes expected in a flat LCDM universe. We
present a simple analytical model for the prediction of cluster-scale X-ray
observations, approximating clusters as isothermal systems in hydrostatic
equilibrium, and dark matter haloes as ellipsoids with uniform axial ratios. We
test the model against high-resolution, hydrodynamic cluster simulations to
gauge its reliability. We find that this simple prescription does a good job of
predicting the distribution of cluster X-ray ellipticities compared to the
simulations as long as one focuses on cluster regions that are less sensitive
to recent mergers. Based on this simple model, the distribution of cluster-size
halo shapes expected in the concordance LCDM cosmology implies an X-ray
ellipticity distribution with a mean of 0.32 +- 0.01 and a scatter of 0.14 +-
0.01 for the mass range (1-4)x10^{14} Msun/h. We find it important to include
the mass dependence of halo shape to make comparisons to observational samples
that contain many, very massive clusters. We analyse the systematics of four
observational samples of cluster ellipticities and find that our results are
statistically compatible with observations. In particular, we find remarkably
good agreement between two recent ROSAT samples and LCDM predictions that DO
NOT include gas cooling. We also test how well our analytical model can predict
Sunyaev-Zel'dovich decrement maps and find that it is less successful although
still useful; the model does not perform as well as a function of flux level in
this case because of the changing triaxiality of dark matter haloes as a
function of radial distance. Both this effect and the changing alignment of
isodensity shells of dark matter haloes leave an imprint on cluster gas...Comment: 16 pages, 9 figures; corrected typo (no result affected) submitted to
MNRA
WISE/NEOWISE observations of Active Bodies in the Main Belt
We report results based on mid-infrared photometry of 5 active main belt
objects (AMBOs) detected by the Wide-field Infrared Survey Explorer (WISE)
spacecraft. Four of these bodies, P/2010 R2 (La Sagra), 133P/Elst-Pizarro,
(596) Scheila, and 176P/LINEAR, showed no signs of activity at the time of the
observations, allowing the WISE detections to place firm constraints on their
diameters and albedos. Geometric albedos were in the range of a few percent,
and on the order of other measured comet nuclei. P/2010 A2 was observed on
April 2-3, 2010, three months after its peak activity. Photometry of the coma
at 12 and 22 {\mu}m combined with ground-based visible-wavelength measurements
provides constraints on the dust particle mass distribution (PMD), dlogn/dlogm,
yielding power-law slope values of {\alpha} = -0.5 +/- 0.1. This PMD is
considerably more shallow than that found for other comets, in particular
inbound particle fluence during the Stardust encounter of comet 81P/Wild 2. It
is similar to the PMD seen for 9P/Tempel 1 in the immediate aftermath of the
Deep Impact experiment. Upper limits for CO2 & CO production are also provided
for each AMBO and compared with revised production numbers for WISE
observations of 103P/Hartley 2.Comment: 32 Pages, including 5 Figure
- …