Search CORE

4 research outputs found

Landscape Genomics of White-Footed Mice (Peromyscus leucopus) along an Urban-to-Rural Gradient in the New York City Metropolitan Area

Author: Abueg Linelle Ann Lacson
Publication venue: Fordham Research Commons
Publication date: 01/01/2019
Field of study

Urbanization can change an area’s habitat in ways that pose novel selection pressures on native species, and previous work has shown evidence for divergent selection in white-footed mice populations in New York City (NYC) parks compared to nearby rural populations. This study aims to 1) identify potential candidate genes exhibiting signatures of selection with increasing levels of urbanization, and 2) compare these results with previous findings that NYC populations of P. leucopus experience directional selection for metabolic processes and immune function. I approached these aims using a SNP dataset derived from exomes of 95 P. leucopus specimens sampled from sites in and around NYC. Outlier detection consisted of methods which rely on measures of population genetics (such as FST) and genotype-environment analyses that incorporate environmental factors (such as degree of urbanization). I ran Gene Ontology enrichment tests on the resulting outliers to see what biological functions are overrepresented among the outliers. I found overrepresentation of genes related to metabolic function as well as ciliary function, particularly with regard to spermatogenesis, which corroborates previous findings in this system. I additionally found multiple unconventional myosins and other proteins that imply possible selection on genes related to hearing function

Fordham University: DigitalResearch@Fordham

MitoHiFi: a python pipeline for mitochondrial genome assembly from PacBio high fidelity reads

Author: Abueg Linelle Ann
Barnes I
Blaxter M
Blaxter Mark
Broad G
Durbin R
Durbin Richard
Formenti Giulio
Gaya E
Hall N
Hart M
Holland P
Hollingsworth P
Kersey P
Krasheninnikova K
Lawniczak M
Lewis O
Martin F
McCarthy Shane
Mieszkowska N
Myers Eugene
Richards T
Rodinho Nunes Ferreira João Gabriel
Torrance James
Twyford A
Uliano da Silva Marcela
Wilson W::0000-0002-3227-663X
Publication venue: England
Publication date: 18/07/2023
Field of study

Abstract Background PacBio high fidelity (HiFi) sequencing reads are both long (15–20 kb) and highly accurate (> Q20). Because of these properties, they have revolutionised genome assembly leading to more accurate and contiguous genomes. In eukaryotes the mitochondrial genome is sequenced alongside the nuclear genome often at very high coverage. A dedicated tool for mitochondrial genome assembly using HiFi reads is still missing. Results MitoHiFi was developed within the Darwin Tree of Life Project to assemble mitochondrial genomes from the HiFi reads generated for target species. The input for MitoHiFi is either the raw reads or the assembled contigs, and the tool outputs a mitochondrial genome sequence fasta file along with annotation of protein and RNA genes. Variants arising from heteroplasmy are assembled independently, and nuclear insertions of mitochondrial sequences are identified and not used in organellar genome assembly. MitoHiFi has been used to assemble 374 mitochondrial genomes (368 Metazoa and 6 Fungi species) for the Darwin Tree of Life Project, the Vertebrate Genomes Project and the Aquatic Symbiosis Genome Project. Inspection of 60 mitochondrial genomes assembled with MitoHiFi for species that already have reference sequences in public databases showed the widespread presence of previously unreported repeats. Conclusions MitoHiFi is able to assemble mitochondrial genomes from a wide phylogenetic range of taxa from Pacbio HiFi data. MitoHiFi is written in python and is freely available on GitHub (https://github.com/marcelauliano/MitoHiFi). MitoHiFi is available with its dependencies as a Docker container on GitHub (ghcr.io/marcelauliano/mitohifi:master). </jats:sec

PEARL (Univ. of Plymouth)

Scalable, accessible, and reproducible reference genome assembly and evaluation in Galaxy

Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonstrate that it delivers high-quality reference genomes at scale across a set of vertebrate species arising over the last ~500 million years. The pipeline is versatile and combines PacBio HiFi long-reads and Hi-C-based haplotype phasing in a new graph-based paradigm. Standardized quality control is performed automatically to troubleshoot assembly issues and assess biological complexities. We make the pipeline freely accessible through Galaxy, accommodating researchers even without local computational resources and enhanced reproducibility by democratizing the training and assembly process. We demonstrate the flexibility and reliability of the pipeline by assembling reference genomes for 51 vertebrate species from major taxonomic groups (fish, amphibians, reptiles, birds, and mammals)

Diposit Digital de Documents de la UAB

The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update

Author: Abueg Linelle Ann L
Afgan Enis
Allart Olivier
Awan Ahmed
Bacon Wendi
Baker Dannon
Bassetti Madeline
Batut Bérénice
Begines José Manuel Domínguez
Beltran Alejandra
Bernt Matthias
Blankenberg Daniel
Bombarely Aureliano
Bras Yvan Le
Bretaudeau Anthony
Bromhead Catherine
Burke Melissa
Capon Patrick
Chavero-Díez María
Chilton John
Collins Tyler
Coppens Frederik
Coraor Nate
Corguillé Gildas Le
Cuccuru Gianmauro
Cumbo Fabio
Davis John
de Geest Paul
de Koning Willem
Demko Martin
Desanto Assunta
Doyle Maria
Droesbeke Bert
Erxleben-Eggenhofer Anika
Formenti Giulio
Fouilloux Anne
Föll Melanie
Gangazhe Rendani
Genthon Tanguy
Goecks Jeremy
Goonasekera Nuwan
Goué Nadia
Griffin Timothy
Grüning Björn
Guerler Aysam
Gundersen Sveinung
Gustafsson Ove Johan Ragnar
Hall Christina
Harrop Thomas
Hecht Helge
Heidari Alireza
Heisner Tillman
Heyl Florian
Hiltemann Saskia
Hotz Hans-Rudolf
Hyde Cameron
Jagtap Pratik
Jakiela Julia
Johnson James
Joshi Jayadev
Jossé Marie
Jum’ah Khaled
Kalaš Matúš
Kamieniecka Katarzyna
Kayikcioglu Tunc
Konkol Markus
Kostrykin Leonid
Kucher Natalie
Kumar Anup
Kuntz Mira
Lariviere Delphine
Lazarus Ross
Lee Justin
Leo Simone
Liborio Leandro
Libouban Romane
Lopez-Delisle Lucille
Los Laila
Mahmoud Alexandru
Makunin Igor
Marin Pierre
Mehta Subina
Mok Winnie
Moreno Pablo
Morier-Genoud François
Mosher Stephen
Müller Teresa
Nasr Engy
Nekrutenko Anton
Nelson Tiffanie
Oba Asime
Ostrovsky Alexander
Polunina Polina
Poterlowicz Krzysztof
Price Elliott
Price Gareth
Rasche Helena
Raubenolt Bryan
Royaux Coline
Sargent Luke
Savage Michelle
Savchenko Denys
Savchenko Volodymyr
Schatz Michael
Seguineau Pauline
Serrano-Solano Beatriz
Soranzo Nicola
Srikakulam Sanjay Kumar
Suderman Keith
Syme Anna
Tabernero David López
Tangaro Marco Antonio
Tedds Jonathan
Tekman Mehmet
Thanki Anil
Uhl Michael
van den Beek Marius
Varshney Deepti
Vessio Jenn
Videm Pavankumar
von Kuster Greg
Watson Gregory
Whitaker-Allen Natalie
Winter Uwe
Wolstencroft Martin
Zambelli Federico
Zierep Paul
Zoabi Rand
Čech Martin
Publication venue: Oxford University Press
Publication date: 05/07/2024
Field of study

International audienceAbstract Galaxy (https://galaxyproject.org) is deployed globally, predominantly through free-to-use services, supporting user-driven research that broadens in scope each year. Users are attracted to public Galaxy services by platform stability, tool and reference dataset diversity, training, support and integration, which enables complex, reproducible, shareable data analysis. Applying the principles of user experience design (UXD), has driven improvements in accessibility, tool discoverability through Galaxy Labs/subdomains, and a redesigned Galaxy ToolShed. Galaxy tool capabilities are progressing in two strategic directions: integrating general purpose graphical processing units (GPGPU) access for cutting-edge methods, and licensed tool support. Engagement with global research consortia is being increased by developing more workflows in Galaxy and by resourcing the public Galaxy services to run them. The Galaxy Training Network (GTN) portfolio has grown in both size, and accessibility, through learning paths and direct integration with Galaxy tools that feature in training courses. Code development continues in line with the Galaxy Project roadmap, with improvements to job scheduling and the user interface. Environmental impact assessment is also helping engage users and developers, reminding them of their role in sustainability, by displaying estimated CO2 emissions generated by each Galaxy job

HAL-CentraleSupelec

HAL Clermont Université

HAL-IRD

HAL-CEA

HAL-Rennes 1