Search CORE

108 research outputs found

Confronting Sources of Systematic Error to Resolve Historically Contentious Relationships: A Case Study Using Gadiform Fishes (Teleostei, Paracanthopterygii, Gadiformes)

Author: (...)
Dikow Rebecca B.
Hilton Eric J.
Roa-Varón Adela
Publication venue: W&M ScholarWorks
Publication date: 01/06/2021
Field of study

Reliable estimation of phylogeny is central to avoid inaccuracy in downstream macroevolutionary inferences. However, limitations exist in the implementation of concatenated and summary coalescent approaches, and Bayesian and full coalescent inference methods may not yet be feasible for computation of phylogeny using complicated models and large data sets. Here, we explored methodological (e.g., optimality criteria, character sampling, model selection) and biological (e.g., heterotachy, branch length heterogeneity) sources of systematic error that can result in biased or incorrect parameter estimates when reconstructing phylogeny by using the gadiform fishes as a model clade. Gadiformes include some of the most economically important fishes in the world (e.g., Cods, Hakes, and Rattails). Despite many attempts, a robust higher-level phylogenetic framework was lacking due to limited character and taxonomic sampling, particularly from several species-poor families that have been recalcitrant to phylogenetic placement. We compiled the first phylogenomic data set, including 14,208 loci (⁠\u3e role= presentation \u3e\u3e2.8 M bp) from 58 species representing all recognized gadiform families, to infer a time-calibrated phylogeny for the group. Data were generated with a gene-capture approach targeting coding DNA sequences from single-copy protein-coding genes. Species-tree and concatenated maximum-likelihood (ML) analyses resolved all family-level relationships within Gadiformes. While there were a few differences between topologies produced by the DNA and the amino acid data sets, most of the historically unresolved relationships among gadiform lineages were consistently well resolved with high support in our analyses regardless of the methodological and biological approaches used. However, at deeper levels, we observed inconsistency in branch support estimates between bootstrap and gene and site coefficient factors (gCF, sCF). Despite numerous short internodes, all relationships received unequivocal bootstrap support while gCF and sCF had very little support, reflecting hidden conflict across loci. Most of the gene-tree and species-tree discordance in our study is a result of short divergence times, and consequent lack of informative characters at deep levels, rather than incomplete lineage sorting. We use this phylogeny to establish a new higher-level classification of Gadiformes as a way of clarifying the evolutionary diversification of the order. We recognize 17 families in five suborders: Bregmacerotoidei, Gadoidei, Ranicipitoidei, Merluccioidei, and Macrouroidei (including two subclades). A time-calibrated analysis using 15 fossil taxa suggests that Gadiformes evolved ∼ role= presentation \u3e∼79.5 Ma in the late Cretaceous, but that most extant lineages diverged after the Cretaceous–Paleogene (K-Pg) mass extinction (66 Ma). Our results reiterate the importance of examining phylogenomic analyses for evidence of systematic error that can emerge as a result of unsuitable modeling of biological factors and/or methodological issues, even when data sets are large and yield high support for phylogenetic relationships. [Branch length heterogeneity; Codfishes; commercial fish species; Cretaceous-Paleogene (K-Pg); heterotachy; systematic error; target enrichment.

College of William & Mary: W&M Publish

A target enrichment method for gathering phylogenetic information from hundreds of loci: An example from the Compositae.

Author: Burke John M
Dikow Rebecca B
Funk Vicki A
Kozik Alex
Mandel Jennifer R
Masalia Rishi R
Michelmore Richard W
Rieseberg Loren H
Staton S Evan
Publication venue: eScholarship, University of California
Publication date: 01/02/2014
Field of study

UnlabelledPremise of the studyThe Compositae (Asteraceae) are a large and diverse family of plants, and the most comprehensive phylogeny to date is a meta-tree based on 10 chloroplast loci that has several major unresolved nodes. We describe the development of an approach that enables the rapid sequencing of large numbers of orthologous nuclear loci to facilitate efficient phylogenomic analyses. •Methods and resultsWe designed a set of sequence capture probes that target conserved orthologous sequences in the Compositae. We also developed a bioinformatic and phylogenetic workflow for processing and analyzing the resulting data. Application of our approach to 15 species from across the Compositae resulted in the production of phylogenetically informative sequence data from 763 loci and the successful reconstruction of known phylogenetic relationships across the family. •ConclusionsThese methods should be of great use to members of the broader Compositae community, and the general approach should also be of use to researchers studying other families

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Confronting sources of systematic error to resolve historically contentious relationships: A case study using gadiform fishes (Teleostei, Paracanthopterygii, Gadiformes)

Author: Adela Roa-Varon
Baldwin Carole C.
Carnevale Giorgio
Chenhong Li
Hilton Eric J.
Luke Tornabene
Rebecca Dikow
Publication venue
Publication date: 01/01/2021
Field of study

Institutional Research Information System University of Turin

Applications of deep convolutional neural networks to digitized natural history collections

Author: Brown Abel
Dikow Rebecca
Dorr Laurence
Frandsen Paul
Funk Vicki
Metallo Adam
Orli Sylvia
Peters Melinda
Schuettpelz Eric
Publication venue: Pensoft Publishers
Publication date: 01/01/2017
Field of study

Natural history collections contain data that are critical for many scientific endeavors. Recent efforts in mass digitization are generating large datasets from these collections that can provide unprecedented insight. Here, we present examples of how deep convolutional neural networks can be applied in analyses of imaged herbarium specimens. We first demonstrate that a convolutional neural network can detect mercury-stained specimens across a collection with 90% accuracy. We then show that such a network can correctly distinguish two morphologically similar plant families 96% of the time. Discarding the most challenging specimen images increases accuracy to 94% and 99%, respectively. These results highlight the importance of mass digitization and deep learning approaches and reveal how they can together deliver powerful new investigative tools

Crossref

ZENODO

Directory of Open Access Journals

ARPHA OAI-PMH Endpoint

ARPHA Preprints

Reciprocal genomic evolution in the ant-fungus agricultural symbiosis

Author: Boomsma Jacobus Jan
Brady Seán G.
Chen Zhensheng
Deng Yuan
Dikow Rebecca B.
Hu Haofu
Li Cai
Ma Chunyu
Nash David Richard
Nygaard Sanne
Rabeling Christian
Schiøtt Morten
Schultz Ted R.
Wcislo William T.
Xie Qiaolin
Yang Zhikai
Zhang Guojie
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

The attine ant–fungus agricultural symbiosis evolved over tens of millions of years, producing complex societies with industrial-scale farming analogous to that of humans. Here we document reciprocal shifts in the genomes and transcriptomes of seven fungus-farming ant species and their fungal cultivars. We show that ant subsistence farming probably originated in the early Tertiary (55–60 MYA), followed by further transitions to the farming of fully domesticated cultivars and leaf-cutting, both arising earlier than previously estimated. Evolutionary modifications in the ants include unprecedented rates of genome-wide structural rearrangement, early loss of arginine biosynthesis and positive selection on chitinase pathways. Modifications of fungal cultivars include loss of a key ligninase domain, changes in chitin synthesis and a reduction in carbohydrate-degrading enzymes as the ants gradually transitioned to functional herbivory. In contrast to human farming, increasing dependence on a single cultivar lineage appears to have been essential to the origin of industrial-scale ant agriculture

Copenhagen University Research Information System

PubMed Central

Whole genome analysis of clouded leopard species reveals an ancient divergence and distinct demographic histories

Author: Aiden Erez Lieberman
Bursell Madeline G.
Dikow Rebecca B.
Dudchenko Olga
Figueiró Henrique V.
Flanagan Joseph P.
Frandsen Paul B.
Goossens Benoit
Johnson Warren E.
Koepfli Klaus-Peter
Nathan Senthilvel K.S.S.
Publication venue: 'Elsevier BV'
Publication date: 09/12/2022
Field of study

Similar to other apex predator species, populations of mainland (Neofelis nebulosa) and Sunda (Neofelis diardi) clouded leopards are declining. Understanding their patterns of genetic variation can provide critical insights on past genetic erosion and a baseline for understanding their long-term conservation needs. As a step toward this goal, we present draft genome assemblies for the two clouded leopard species to quantify their phylogenetic divergence, genome-wide diversity, and historical population trends. We estimate that the two species diverged 5.1 Mya, much earlier than previous estimates of 1.41 Mya and 2.86 Mya, suggesting they separated when Sundaland was becoming increasingly isolated from mainland Southeast Asia. The Sunda clouded leopard displays a distinct and reduced effective population size trajectory, consistent with a lower genome-wide heterozygosity and SNP density, relative to the mainland clouded leopard. Our results provide new insights into the evolutionary history and genetic health of this unique lineage of felids

Online Research @ Cardiff

PubMed Central

Genome-level homology and phylogeny of Shewanella (Gammaproteobacteria: lteromonadales: Shewanellaceae)

Author: A Morandi
A Rokas
A Stamatakis
AE Darling
AE Darling
AE Murray
AJ Drummond
C Ané
CR Woese
CW Saltikov
CY Lee
D Liao
DR Lovley
F Ziemke
F Ziemke
FD Chester
H Gao
HA Derby
I Brettar
I Brettar
J-S Zhao
J-S Zhao
JA Eisen
JA Eisen
JC Makemson
JP Bowman
JP Euzéby
JP Gogarten
JS Sparks
JY D'Aoust
K Katoh
K Venkateswaran
K Venkateswaran
KA Perry
KC Nixon
KT Konstantinidis
L Baumann
L Liu
LL Cavalli-Sforza
LS Haggerty
MCC dePinna
MJ Sanderson
MP Di Bonaventura
MR Leonardo
MT MacDonell
N Saitou
P Goloboff
P Stothard
PA Goloboff
PA Goloboff
PA Goloboff
RB Dikow
Rebecca B Dikow
SJ Hackett
TD Read
U Francke
X Xiao
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The explosion in availability of whole genome data provides the opportunity to build phylogenetic hypotheses based on these data as well as the ability to learn more about the genomes themselves. The biological history of genes and genomes can be investigated based on the taxomonic history provided by the phylogeny. A phylogenetic hypothesis based on complete genome data is presented for the genus <it>Shewanella </it>(Gammaproteobacteria: Alteromonadales: Shewanellaceae). Nineteen taxa from <it>Shewanella </it>(16 species and 3 additional strains of one species) as well as three outgroup species representing the genera <it>Aeromonas </it>(Gammaproteobacteria: Aeromonadales: Aeromonadaceae), <it>Alteromonas </it>(Gammaproteobacteria: Alteromonadales: Alteromonadaceae) and <it>Colwellia </it>(Gammaproteobacteria: Alteromonadales: Colwelliaceae) are included for a total of 22 taxa. Results Putatively homologous regions were found across unannotated genomes and tested with a phylogenetic analysis. Two genome-wide data-sets are considered, one including only those genomic regions for which all taxa are represented, which included 3,361,015 aligned nucleotide base-pairs (bp) and a second that additionally includes those regions present in only subsets of taxa, which totaled 12,456,624 aligned bp. Alignment columns in these large data-sets were then randomly sampled to create smaller data-sets. After the phylogenetic hypothesis was generated, genome annotations were projected onto the DNA sequence alignment to compare the historical hypothesis generated by the phylogeny with the functional hypothesis posited by annotation. Conclusions Individual phylogenetic analyses of the 243 locally co-linear genome regions all failed to recover the genome topology, but the smaller data-sets that were random samplings of the large concatenated alignments all produced the genome topology. It is shown that there is not a single orthologous copy of 16S rRNA across the taxon sampling included in this study and that the relationships among the multiple copies are consistent with 16S rRNA undergoing concerted evolution. Unannotated whole genome data can provide excellent raw material for generating hypotheses of historical homology, which can be tested with phylogenetic analysis and compared with hypotheses of gene function.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Genomic architecture and introgression shape a butterfly radiation

Author: Blaxter Mark
Blumberg Andrew,
Challis Richard
Chouteau Mathieu
Clavijo Bernardo
Counterman Brian,
Dasmahapatra Kanchon,
Davey John
Di Palma Federica
Dikow Rebecca,
Edelman Nathaniel,
Frandsen Paul,
García-Accinelli Gonzalo
Gilson R,
Jaffe David
Jiggins Chris,
Joron Mathieu
Kronforst Marcus
Kumar Sujai
Mallet James
McMillan W. Owen,
Miyagi Michael
Moreira Gilson
Neafsey Daniel,
Papa Riccardo
Patterson Nick
Reed Robert
Salazar Camilo
Van Belleghem Steven,
Wakeley John
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 31/10/2019
Field of study

We use twenty de novo genome assemblies to probe the speciation history and architecture of gene flow in rapidly radiating Heliconius butterflies. Our tests to distinguish incomplete lineage sorting from introgression indicate that gene flow has obscured several ancient phylogenetic relationships in this group over large swathes of the genome. Introgressed loci are underrepresented in low recombination and gene-rich regions, consistent with the purging of foreign alleles more tightly linked to incompatibility loci. We identify a hitherto unknown inversion that traps a color pattern switch locus. We infer that this inversion was transferred between lineages via introgression and is convergent with a similar rearrangement in another part of the genus. These multiple de novo genome sequences enable improved understanding of the importance of introgression and selective processes in adaptive radiation

Crossref

HAL Descartes

ArchiMer - Institutional Archive of Ifremer

UCL Discovery

Edinburgh Research Explorer

HAL-IRD

edocUR

White Rose Research Online

Hal-Diderot

Phyllis Diller Gag File Index Card Transcriptions

Author: Rebecca Dikow (7205195)
Publication venue
Publication date: 16/11/2022
Field of study

Transcriptions of Phyllis Diller's Gag File index cards. Dataset Card available here: https://github.com/Smithsonian/dataset-cards/NMAH-Phyllis-Diller-gag-file.md</p

FigShare

Phyllis Diller Gag File Index Card Images

Author: Rebecca Dikow (7205195)
Publication venue
Publication date: 16/11/2022
Field of study

Images of index cards with Phyllis Diller's jokes along with a Dataset Card available here: https://github.com/Smithsonian/dataset-cards/NMAH-Phyllis-Diller-gag-file.md </p

FigShare