7 research outputs found

    Improving genome assemblies using multi-platform sequence data

    Get PDF
    Accurate de novo assembly using short reads generated by next generation sequencing technologies is still an open problem. Although there are several assembly algorithms developed for data generated with different sequencing technologies, and some that can make use of hybrid data, the assemblies are still far from being perfect. There is still a need for computational approaches to improve draft assemblies. Here we propose a new method to correct assembly mistakes when there are multiple types of data generated using different sequencing technologies that have different strengths and biases. We exploit the assembly of highly accurate short reads to correct the contigs obtained from less accurate long reads. We apply our method to Illumina, 454, and Ion Torrent data, and also compare our results with existing hybrid assemblers, Celera and Masurca. © Springer International Publishing Switzerland 2016

    Origin and spread of human mitochondrial DNA haplogroup U7

    Get PDF
    Human mitochondrial DNA haplogroup U is among the initial maternal founders in Southwest Asia and Europe and one that best indicates matrilineal genetic continuity between late Pleistocene hunter-gatherer groups and present-day populations of Europe. While most haplogroup U subclades are older than 30 thousand years, the comparatively recent coalescence time of the extant variation of haplogroup U7 (~16–19 thousand years ago) suggests that its current distribution is the consequence of more recent dispersal events, despite its wide geographical range across Europe, the Near East and South Asia. Here we report 267 new U7 mitogenomes that – analysed alongside 100 published ones – enable us to discern at least two distinct temporal phases of dispersal, both of which most likely emanated from the Near East. The earlier one began prior to the Holocene (~11.5 thousand years ago) towards South Asia, while the later dispersal took place more recently towards Mediterranean Europe during the Neolithic (~8 thousand years ago). These findings imply that the carriers of haplogroup U7 spread to South Asia and Europe before the suggested Bronze Age expansion of Indo-European languages from the Pontic-Caspian Steppe region

    A large patient study confirming that facioscapulohumeral muscular dystrophy (FSHD) disease expression is almost exclusively associated with an FSHD locus located on a 4qA‐defined 4qter subtelomere

    No full text
    Facioscapulohumeral muscular dystrophy (FSHD), an autosomal dominant disorder, represents the third most common human muscular dystrophy. The FSHD disease locus, at chromosome 4q35, is associated with large contractions of the polymorphic repeat sequence array D4Z4. In addition to FSHD disease association with large D4Z4 deletions, a biased interaction with a specific 4qter subtelomeric sequence has been described in patients. Two distinct 4qter subtelomeres, defined as types 4qA and 4qB, have been identified and shown to be equally prevalent in the Caucasian population. In almost all 4q35‐linked patients with FSHD, however, disease expression only occurs when large D4Z4 deletions are located on 4qA‐defined 4qter subtelomeres. Conversely, large D4Z4 repeat contractions situated on 4qB‐defined subtelomeres either are not disease‐causing or exhibit a greatly reduced disease penetrance. This study was initiated to confirm this direct FSHD disease association data by measuring the frequency of type 4qA‐defined and 4qB‐defined subtelomeric sequences in a large cohort of 164 unrelated patients with FSHD from Turkey and the UK, all known to have large D4Z4 deletions. An almost complete association was found between large D4Z4 repeat array deletions located on 4qA‐defined 4qter subtelomeres and disease expression in our large FSHD patient cohort. The observed failure of probes 4qA and 4qB to hybridise to two patient‐derived DNA samples confirms the presence of an additional rare type of 4qter subtelomeric sequence in humans

    Draft genome sequence of Halomonas smyrnensis AAD6<sup>T</sup>

    No full text
    Halomonas smyrnensis AAD6(T) is a Gram-negative, aerobic, exopolysaccharide-producing, and moderately halophilic bacterium that produces levan, a fructose homopolymer with many potential uses in various industries. We report the draft genome sequence of H. smyrnensis AAD6(T), which will accelerate research on the rational design and optimization of microbial levan production

    MEFV gene 3'-UTR Alu repeat polymorphisms in patients with familial Mediterranean fever

    No full text
    Objective. Familial Mediterranean fever (FMF), an autosomal recessively inherited outoinflammatory disorder, is caused by missense mutations in the pyrin-encoding MEFV gene. The MEFV initiations can be detected in the majority of FMF patients. but there is an important proportion of patients with the FMF phenotype who carry a single or no coding region initiation. This study aimed to investigate the promoter region and 3'-UTR polymorphisms of the MEFV gene in a group of FMF patients with no coding region mutations. to identify variations with a possible role in the regulation of MEFV expression

    Origin and spread of human mitochondrial DNA haplogroup U7

    No full text
    Human mitochondrial DNA haplogroup U is among the initial maternal founders in Southwest Asia and Europe and one that best indicates matrilineal genetic continuity between late Pleistocene hunter-gatherer groups and present-day populations of Europe. While most haplogroup U subclades are older than 30 thousand years, the comparatively recent coalescence time of the extant variation of haplogroup U7 (∼16-19 thousand years ago) suggests that its current distribution is the consequence of more recent dispersal events, despite its wide geographical range across Europe, the Near East and South Asia. Here we report 267 new U7 mitogenomes that - analysed alongside 100 published ones - enable us to discern at least two distinct temporal phases of dispersal, both of which most likely emanated from the Near East. The earlier one began prior to the Holocene (∼11.5 thousand years ago) towards South Asia, while the later dispersal took place more recently towards Mediterranean Europe during the Neolithic (∼8 thousand years ago). These findings imply that the carriers of haplogroup U7 spread to South Asia and Europe before the suggested Bronze Age expansion of Indo-European languages from the Pontic-Caspian Steppe region. © The Author(s) 2017
    corecore