143 research outputs found
Early History of Mammals Is Elucidated with the ENCODE Multiple Species Sequencing Data
Understanding the early evolution of placental mammals is one of the most challenging issues in mammalian phylogeny. Here, we addressed this question by using the sequence data of the ENCODE consortium, which include 1% of mammalian genomes in 18 species belonging to all main mammalian lineages. Phylogenetic reconstructions based on an unprecedented amount of coding sequences taken from 218 genes resulted in a highly supported tree placing the root of Placentalia between Afrotheria and Exafroplacentalia (Afrotheria hypothesis). This topology was validated by the phylogenetic analysis of a new class of genomic phylogenetic markers, the conserved noncoding sequences. Applying the tests of alternative topologies on the coding sequence dataset resulted in the rejection of the Atlantogenata hypothesis (Xenarthra grouping with Afrotheria), while this test rejected the second alternative scenario, the Epitheria hypothesis (Xenarthra at the base), when using the noncoding sequence dataset. Thus, the two datasets support the Afrotheria hypothesis; however, none can reject both of the remaining topological alternatives
Non-alignment comparison of human and high primate genomes
Compositional spectra (CS) analysis based on k-mer scoring of DNA sequences
was employed in this study for dot-plot comparison of human and primate
genomes. The detection of extended conserved synteny regions was based on
continuous fuzzy similarity rather than on chains of discrete anchors (genes or
highly conserved noncoding elements). In addition to the high correspondence
found in the comparisons of whole-genome sequences, a good similarity was also
found after masking gene sequences, indicating that CS analysis manages to
reveal phylogenetic signal in the organization of noncoding part of the genome
sequences, including repetitive DNA and the genome "dark matter". Obviously,
the possibility to reveal parallel ordering depends on the signal of common
ancestor sequence organization varying locally along the corresponding segments
of the compared genomes. We explored two sources contributing to this signal:
sequence composition (GC content) and sequence organization (abundances of
k-mers in the usual A,T,G,C or purine-pyrimidine alphabets). Whole-genome
comparisons based on GC distribution along the analyzed sequences indeed gives
reasonable results, but combining it with k-mer abundances dramatically
improves the ordering quality, indicating that compositional and organizational
heterogeneity comprise complementary sources of information on evolutionary
conserved similarity of genome sequences
A custom capture sequence approach for oculocutaneous albinism identifies structural variant alleles at the OCA2 locus
Oculocutaneous albinism (OCA) is a heritable disorder of pigment production that manifests as hypopigmentation and altered eye development. Exon sequencing of known OCA genes is unsuccessful in producing a complete molecular diagnosis for a significant number of affected individuals. We sequenced the DNA of individuals with OCA using short-read custom capture sequencing that targeted coding, intronic and non-coding regulatory regions of known OCA genes and GWAS-associated pigmentation loci. We identified an OCA2 complex structural variant (CxSV), defined by a 143kb inverted segment reintroduced in intron 1, upstream of the native location. The corresponding CxSV junctions were observed in 11/390 probands screened. The 143kb CxSV presents in one family as a copy number variant (CNV) duplication for the 143kb region. In the remaining 10/11 families, the 143kb CxSV acquired an additional 184kb deletion across the same region, restoring exons 3–19 of OCA2 to a copy-number neutral state. Allele-associated haplotype analysis found rare SNVs rs374519281 and rs139696407 are linked with the 143kb CxSV in both OCA2 alleles. For individuals in which customary molecular evaluation does not reveal a biallelic OCA diagnosis, we recommend preliminary screening for these haplotype-associated rare variants, followed by junction-specific validation for the OCA2 143kb CxSV
The transcription factors Ets1 and Sox10 interact during murine melanocyte development
Melanocytes, the pigment-producing cells, arise from multipotent neural crest (NC) cells during embryogenesis. Many genes required for melanocyte development were identified using mouse pigmentation mutants. The variable spotting mouse pigmentation mutant arose spontaneously at the Jackson Laboratory. We identified a G-to-A nucleotide transition in exon 3 of the Ets1 gene in variable spotting, which results in a missense G102E mutation. Homozygous variable spotting mice exhibit sporadic white spotting. Similarly, mice carrying a targeted deletion of Ets1 exhibit hypopigmentation; nevertheless, the function of Ets1 in melanocyte development is unknown. The transcription factor Ets1 is widely expressed in developing organs and tissues, including the NC. In the chick, Ets1 is required for the expression of Sox10, a transcription factor critical for the development of various NC derivatives, including melanocytes. We show that Ets1 is required early for murine NC cell and melanocyte precursor survival in vivo. Given the importance of Ets1 for Sox10 expression in the chick, we investigated a potential genetic interaction between these genes by comparing the hypopigmentation phenotypes of single and double heterozygous mice. The incidence of hypopigmentation in double heterozygotes was significantly greater than in single heterozygotes. The area of hypopigmentation in double heterozygotes was significantly larger than would be expected from the addition of the areas of hypopigmentation of single heterozygotes, suggesting that Ets1 and Sox10 interact synergistically in melanocyte development. Since Sox10 is also essential for enteric ganglia development, we examined the distal colons of Ets1 null mutants and found a significant decrease in enteric innervation, which was exacerbated by Sox10 heterozygosity. At the molecular level, Ets1 was found to activate an enhancer critical for Sox10 expression in NC-derived structures. Furthermore, enhancer activation was significantly inhibited by the variable spotting mutation. Together, these results suggest that Ets1 and Sox10 interact to promote proper melanocyte and enteric ganglia development from the NC
Revealing mammalian evolutionary relationships by comparative analysis of gene clusters
Many software tools for comparative analysis of genomic sequence data have been released in recent decades. Despite this, it remains challenging to determine evolutionary relationships in gene clusters due to their complex histories involving duplications, deletions, inversions, and conversions. One concept describing these relationships is orthology. Orthologs derive from a common ancestor by speciation, in contrast to paralogs, which derive from duplication. Discriminating orthologs from paralogs is a necessary step in most multispecies sequence analyses, but doing so accurately is impeded by the occurrence of gene conversion events. We propose a refined method of orthology assignment based on two paradigms for interpreting its definition: by genomic context or by sequence content. X-orthology (based on context) traces orthology resulting from speciation and duplication only, while N-orthology (based on content) includes the influence of conversion events
Systems Modeling to Improve the Hydro-Ecological Performance of Diked Wetlands
Water scarcity and invasive vegetation threaten arid-region wetlands and wetland managers seek ways to enhance wetland ecosystem services with limited water, labor, and financial resources. While prior systems modeling efforts have focused on water management to improve flow-based ecosystem and habitat objectives, here we consider water allocation and invasive vegetation management that jointly target the concurrent hydrologic and vegetation habitat needs of priority wetland bird species. We formulate a composite weighted usable area for wetlands (WU) objective function that represents the wetland surface area that provides suitable water level and vegetation cover conditions for priority bird species. Maximizing the WU is subject to constraints such as water balance, hydraulic infrastructure capacity, invasive vegetation growth and control, and a limited financial budget to control vegetation. We apply the model at the Bear River Migratory Bird Refuge on the Great Salt Lake, Utah, compare model-recommended management actions to past Refuge water and vegetation control activities, and find that managers can almost double the area of suitable habitat by more dynamically managing water levels and managing invasive vegetation in August at the beginning of the window for control operations. Scenario and sensitivity analyses show the importance to jointly consider hydrology and vegetation system components rather than only the hydrological component
Developmental Pathway of the MPER-Directed HIV-1-Neutralizing Antibody 10E8
Antibody 10E8 targets the membrane-proximal external region (MPER) of HIV-1 gp41, neutralizes >97% of HIV-1 isolates, and lacks the auto-reactivity often associated with MPER-directed antibodies. The developmental pathway of 10E8 might therefore serve as a promising template for vaccine design, but samples from time-of-infection—often used to infer the B cell record—are unavailable. In this study, we used crystallography, next-generation sequencing (NGS), and functional assessments to infer the 10E8 developmental pathway from a single time point. Mutational analysis indicated somatic hypermutation of the 2nd-heavy chain-complementarity determining region (CDR H2) to be critical for neutralization, and structures of 10E8 variants with V-gene regions reverted to genomic origin for heavy-and-light chains or heavy chain-only showed structural differences >2 Å relative to mature 10E8 in the CDR H2 and H3. To understand these developmental changes, we used bioinformatic sieving, maximum likelihood, and parsimony analyses of immunoglobulin transcripts to identify 10E8-lineage members, to infer the 10E8-unmutated common ancestor (UCA), and to calculate 10E8-developmental intermediates. We were assisted in this analysis by the preservation of a critical D-gene segment, which was unmutated in most 10E8-lineage sequences. UCA and early intermediates weakly bound a 26-residue-MPER peptide, whereas HIV-1 neutralization and epitope recognition in liposomes were only observed with late intermediates. Antibody 10E8 thus develops from a UCA with weak MPER affinity and substantial differences in CDR H2 and H3 from the mature 10E8; only after extensive somatic hypermutation do 10E8-lineage members gain recognition in the context of membrane and HIV-1 neutralization
Light whole genome sequence for SNP discovery across domestic cat breeds
<p>Abstract</p> <p>Background</p> <p>The domestic cat has offered enormous genomic potential in the veterinary description of over 250 hereditary disease models as well as the occurrence of several deadly feline viruses (feline leukemia virus -- FeLV, feline coronavirus -- FECV, feline immunodeficiency virus - FIV) that are homologues to human scourges (cancer, SARS, and AIDS respectively). However, to realize this bio-medical potential, a high density single nucleotide polymorphism (SNP) map is required in order to accomplish disease and phenotype association discovery.</p> <p>Description</p> <p>To remedy this, we generated 3,178,297 paired fosmid-end Sanger sequence reads from seven cats, and combined these data with the publicly available 2X cat whole genome sequence. All sequence reads were assembled together to form a 3X whole genome assembly allowing the discovery of over three million SNPs. To reduce potential false positive SNPs due to the low coverage assembly, a low upper-limit was placed on sequence coverage and a high lower-limit on the quality of the discrepant bases at a potential variant site. In all domestic cats of different breeds: female Abyssinian, female American shorthair, male Cornish Rex, female European Burmese, female Persian, female Siamese, a male Ragdoll and a female African wildcat were sequenced lightly. We report a total of 964 k common SNPs suitable for a domestic cat SNP genotyping array and an additional 900 k SNPs detected between African wildcat and domestic cats breeds. An empirical sampling of 94 discovered SNPs were tested in the sequenced cats resulting in a SNP validation rate of 99%.</p> <p>Conclusions</p> <p>These data provide a large collection of mapped feline SNPs across the cat genome that will allow for the development of SNP genotyping platforms for mapping feline diseases.</p
Initial Sequence and Comparative Analysis of the Cat Genome
The genome sequence (1.9-fold coverage) of an inbred Abyssinian domestic cat was assembled, mapped, and annotated with a comparative approach that involved cross-reference to annotated genome assemblies of six mammals (human, chimpanzee, mouse, rat, dog, and cow). The results resolved chromosomal positions for 663,480 contigs, 20,285 putative feline gene orthologs, and 133,499 conserved sequence blocks (CSBs). Additional annotated features include repetitive elements, endogenous retroviral sequences, nuclear mitochondrial (numt) sequences, micro-RNAs, and evolutionary breakpoints that suggest historic balancing of translocation and inversion incidences in distinct mammalian lineages. Large numbers of single nucleotide polymorphisms (SNPs), deletion insertion polymorphisms (DIPs), and short tandem repeats (STRs), suitable for linkage or association studies were characterized in the context of long stretches of chromosome homozygosity. In spite of the light coverage capturing ∼65% of euchromatin sequence from the cat genome, these comparative insights shed new light on the tempo and mode of gene/genome evolution in mammals, promise several research applications for the cat, and also illustrate that a comparative approach using more deeply covered mammals provides an informative, preliminary annotation of a light (1.9-fold) coverage mammal genome sequence
- …