7 research outputs found
Recommended from our members
The genome sequence of the European water vole, Arvicola amphibius Linnaeus 1758.
We present a genome assembly from an individual male Arvicola amphibius (the European water vole; Chordata; Mammalia; Rodentia; Cricetidae). The genome sequence is 2.30 gigabases in span. The majority of the assembly is scaffolded into 18 chromosomal pseudomolecules, including the X sex chromosome. Gene annotation of this assembly on Ensembl has identified 21,394 protein coding genes
Recommended from our members
A chromosomal reference genome sequence for the malaria mosquito, Anopheles moucheti, Evans, 1925.
We present a genome assembly from an individual male Anopheles moucheti (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), from a wild population in Cameroon. The genome sequence is 271 megabases in span. The majority of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome was also assembled and is 15.5 kilobases in length
A chromosomal reference genome sequence for the malaria mosquito, Anopheles moucheti, Evans, 1925
International audienceWe present a genome assembly from an individual male Anopheles moucheti (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), from a wild population in Cameroon. The genome sequence is 271 megabases in span. The majority of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome was also assembled and is 15.5 kilobases in length
Recommended from our members
A chromosomal reference genome sequence for the malaria mosquito, <i>Anopheles gambiae</i>, Giles, 1902, Ifakara strain.
We present a genome assembly from an individual female Anopheles gambiae (the malaria mosquito; Arthropoda; Insecta; Diptera; Culicidae), Ifakara strain. The genome sequence is 264 megabases in span. Most of the assembly is scaffolded into three chromosomal pseudomolecules with the X sex chromosome assembled. The complete mitochondrial genome was also assembled and is 15.4 kilobases in length
MitoHiFi: a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads
Abstract
Background
 PacBio high fidelity (HiFi) sequencing reads are both long (15–20 kb) and highly accurate (> Q20). Because of these properties, they have revolutionised genome assembly leading to more accurate and contiguous genomes. In eukaryotes the mitochondrial genome is sequenced alongside the nuclear genome often at very high coverage. A dedicated tool for mitochondrial genome assembly using HiFi reads is still missing.
Results
 MitoHiFi was developed within the Darwin Tree of Life Project to assemble mitochondrial genomes from the HiFi reads generated for target species. The input for MitoHiFi is either the raw reads or the assembled contigs, and the tool outputs a mitochondrial genome sequence fasta file along with annotation of protein and RNA genes. Variants arising from heteroplasmy are assembled independently, and nuclear insertions of mitochondrial sequences are identified and not used in organellar genome assembly. MitoHiFi has been used to assemble 374 mitochondrial genomes (368 Metazoa and 6 Fungi species) for the Darwin Tree of Life Project, the Vertebrate Genomes Project and the Aquatic Symbiosis Genome Project. Inspection of 60 mitochondrial genomes assembled with MitoHiFi for species that already have reference sequences in public databases showed the widespread presence of previously unreported repeats.
Conclusions
 MitoHiFi is able to assemble mitochondrial genomes from a wide phylogenetic range of taxa from Pacbio HiFi data. MitoHiFi is written in python and is freely available on GitHub (https://github.com/marcelauliano/MitoHiFi). MitoHiFi is available with its dependencies as a Docker container on GitHub (ghcr.io/marcelauliano/mitohifi:master).
</jats:sec
Scalable, accessible, and reproducible reference genome assembly and evaluation in Galaxy
Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals)
Advancing genomics through the Global Invertebrate Genomics Alliance (GIGA)
The Global Invertebrate Genomics Alliance (GIGA), a collaborative network of diverse scientists, marked its second anniversary with a workshop in Munich, Germany in 2015, where international attendees focused on discussing current progress, milestones and bioinformatics resources. The community determined the recruitment and training of talented researchers as one of the most pressing future needs and identified opportunities for network funding. GIGA also promotes future research efforts to prioritise taxonomic diversity and create new synergies. Here, we announce the generation of a central and simple data repository portal with a wide coverage of available sequence data, via the compagen platform, in parallel with more focused and specialised organism databases to globally advance invertebrate genomics. This article serves the objectives of GIGA by disseminating current progress and future prospects in the science of invertebrate genomics with the aim of promotion and facilitation of interdisciplinary and international research.SCOPUS: er.jSCOPUS: re.jinfo:eu-repo/semantics/publishe