206 research outputs found
Constrained Multistate Sequence Design for Nucleic Acid Reaction Pathway Engineering
We describe a framework for designing the sequences of multiple nucleic acid strands intended to hybridize in solution via a prescribed reaction pathway. Sequence design is formulated as a multistate optimization problem using a set of target test tubes to represent reactant, intermediate, and product states of the system, as well as to model crosstalk between components. Each target test tube contains a set of desired âon-targetâ complexes, each with a target secondary structure and target concentration, and a set of undesired âoff-targetâ complexes, each with vanishing target concentration. Optimization of the equilibrium ensemble properties of the target test tubes implements both a positive design paradigm, explicitly designing for on-pathway elementary steps, and a negative design paradigm, explicitly designing against off-pathway crosstalk. Sequence design is performed subject to diverse user-specified sequence constraints including composition constraints, complementarity constraints, pattern prevention constraints, and biological constraints. Constrained multistate sequence design facilitates nucleic acid reaction pathway engineering for diverse applications in molecular programming and synthetic biology. Design jobs can be run online via the NUPACK web application
breakpointR:an R/Bioconductor package to localize strand state changes in Strand-seq data
MOTIVATION: Strand-seq is a specialized single-cell DNA sequencing technique centered around the directionality of single-stranded DNA. Computational tools for Strand-seq analyses must capture the strand-specific information embedded in these data. RESULTS: Here we introduce breakpointR, an R/Bioconductor package specifically tailored to process and interpret single-cell strand-specific sequencing data obtained from Strand-seq. We developed breakpointR to detect local changes in strand directionality of aligned Strand-seq data, to enable fine-mapping of sister chromatid exchanges, germline inversion and to support global haplotype assembly. Given the broad spectrum of Strand-seq applications we expect breakpointR to be an important addition to currently available tools and extend the accessibility of this novel sequencing technique. AVAILABILITY: R/Bioconductor package https://bioconductor.org/packages/breakpointR
Investigation of Indazole Unbinding Pathways in CYP2E1 by Molecular Dynamics Simulations
Human microsomal cytochrome P450 2E1 (CYP2E1) can oxidize not only low molecular weight xenobiotic compounds such as ethanol, but also many endogenous fatty acids. The crystal structure of CYP2E1 in complex with indazole reveals that the active site is deeply buried into the protein center. Thus, the unbinding pathways and associated unbinding mechanisms remain elusive. In this study, random acceleration molecular dynamics simulations combined with steered molecular dynamics and potential of mean force calculations were performed to identify the possible unbinding pathways in CYP2E1. The results show that channel 2c and 2a are most likely the unbinding channels of CYP2E1. The former channel is located between helices G and I and the B-C loop, and the latter resides between the region formed by the F-G loop, the B-C loop and the ÎČ1 sheet. Phe298 and Phe478 act as the gate keeper during indazole unbinding along channel 2c and 2a, respectively. Previous site-directed mutagenesis experiments also supported these findings
Dense and accurate whole-chromosome haplotyping of individual genomes
The diploid nature of the human genome is neglected in many analyses done today, where a genome is perceived as a set of unphased variants with respect to a reference genome. This lack of haplotype-level analyses can be explained by a lack of methods that can produce dense and accurate chromosome-length haplotypes at reasonable costs. Here we introduce an integrative phasing strategy that combines global, but sparse haplotypes obtained from strand-specific single-cell sequencing (Strand-seq) with dense, yet local, haplotype information available through long-read or linked-read sequencing. We provide comprehensive guidance on the required sequencing depths and reliably assign more than 95% of alleles (NA12878) to their parental haplotypes using as few as 10 Strand-seq libraries in combination with 10-fold coverage PacBio data or, alternatively, 10X Genomics linked-read sequencing data. We conclude that the combination of Strand-seq with different technologies represents an attractive solution to chart the genetic variation of diploid genomes
Improved assembly and variant detection of a haploid human genome using single-molecule, high-fidelity long reads
The sequence and assembly of human genomes using long-read sequencing technologies has revolutionized our understanding of structural variation and genome organization. We compared the accuracy, continuity, and gene annotation of genome assemblies generated from either high-fidelity (HiFi) or continuous long-read (CLR) datasets from the same complete hydatidiform mole human genome. We find that the HiFi sequence data assemble an additional 10% of duplicated regions and more accurately represent the structure of tandem repeats, as validated with orthogonal analyses. As a result, an additional 5 Mbp of pericentromeric sequences are recovered in the HiFi assembly, resulting in a 2.5-fold increase in the NG50 within 1 Mbp of the centromere (HiFi 480.6Â kbp, CLR 191.5Â kbp). Additionally, the HiFi genome assembly was generated in significantly less time with fewer computational resources than the CLR assembly. Although the HiFi assembly has significantly improved continuity and accuracy in many complex regions of the genome, it still falls short of the assembly of centromeric DNA and the largest regions of segmental duplication using existing assemblers. Despite these shortcomings, our results suggest that HiFi may be the most effective standalone technology for de novo assembly of human genomes
Thymic Hyperplasia with Lymphoepithelial Sialadenitis (LESA)-Like Features: Strong Association with Lymphomas and Non-Myasthenic Autoimmune Diseases.
Thymic hyperplasia (TH) with lymphoepithelial sialadenitis (LESA)-like features (LESA-like TH) has been described as a tumor-like, benign proliferation of thymic epithelial cells and lymphoid follicles. We aimed to determine the frequency of lymphoma and autoimmunity in LESA-like TH and performed retrospective analysis of cases with LESA-like TH and/or thymic MALT-lymphoma. Among 36 patients (21 males) with LESA-like TH (age 52 years, 32-80; lesion diameter 7.0 cm, 1-14.5; median, range), five (14%) showed associated lymphomas, including four (11%) thymic MALT lymphomas and one (3%) diffuse large B-cell lymphoma. One additional case showed a clonal B-cell-receptor rearrangement without evidence of lymphoma. Twelve (33%) patients (7 women) suffered from partially overlapping autoimmune diseases: systemic lupus erythematosus (n = 4, 11%), rheumatoid arthritis (n = 3, 8%), myasthenia gravis (n = 2, 6%), asthma (n = 2, 6%), scleroderma, Sjögren syndrome, pure red cell aplasia, Grave's disease and anti-IgLON5 syndrome (each n = 1, 3%). Among 11 primary thymic MALT lymphomas, remnants of LESA-like TH were found in two cases (18%). In summary, LESA-like TH shows a striking association with autoimmunity and predisposes to lymphomas. Thus, a hematologic and rheumatologic workup should become standard in patients diagnosed with LESA-like TH. Radiologists and clinicians should be aware of LESA-like TH as a differential diagnosis for mediastinal mass lesions in patients with autoimmune diseases
Metal-macrofauna interactions determine microbial community structure and function in copper contaminated sediments
Peer reviewedPublisher PD
Recommended from our members
Phased nanopore assembly with Shasta and modular graph phasing with GFAse
Reference-free genome phasing is vital for understanding allele inheritance and the impact of single-molecule DNA variation on phenotypes. To achieve thorough phasing across homozygous or repetitive regions of the genome, long-read sequencing technologies are often used to perform phased de novo assembly. As a step toward reducing the cost and complexity of this type of analysis, we describe new methods for accurately phasing Oxford Nanopore Technologies (ONT) sequence data with the Shasta genome assembler and a modular tool for extending phasing to the chromosome scale called GFAse. We test using new variants of ONT PromethION sequencing, including those using proximity ligation, and show that newer, higher accuracy ONT reads substantially improve assembly quality
- âŠ