11 research outputs found
Large-scale discovery of insertion hotspots and preferential integration sites of human transposed elements
Throughout evolution, eukaryotic genomes have been invaded by transposable elements (TEs). Little is known about the factors leading to genomic proliferation of TEs, their preferred integration sites and the molecular mechanisms underlying their insertion. We analyzed hundreds of thousands nested TEs in the human genome, i.e. insertions of TEs into existing ones. We first discovered that most TEs insert within specific ‘hotspots’ along the targeted TE. In particular, retrotransposed Alu elements contain a non-canonical single nucleotide hotspot for insertion of other Alu sequences. We next devised a method for identification of integration sequence motifs of inserted TEs that are conserved within the targeted TEs. This method revealed novel sequences motifs characterizing insertions of various important TE families: Alu, hAT, ERV1 and MaLR. Finally, we performed a global assessment to determine the extent to which young TEs tend to nest within older transposed elements and identified a 4-fold higher tendency of TEs to insert into existing TEs than to insert within non-TE intergenic regions. Our analysis demonstrates that TEs are highly biased to insert within certain TEs, in specific orientations and within specific targeted TE positions. TE nesting events also reveal new characteristics of the molecular mechanisms underlying transposition
Neogenin May Functionally Substitute for Dcc in Chicken
Dcc is the key receptor that mediates attractive responses of axonal growth cones to netrins, a family of axon guidance cues used throughout evolution. However, a Dcc homolog has not yet been identified in the chicken genome, raising the possibility that Dcc is not present in avians. Here we show that the closely related family member neogenin may functionally substitute for Dcc in the developing chicken spinal cord. The expression pattern of chicken neogenin in the developing spinal cord is a composite of the distribution patterns of both rodent Dcc and neogenin. Moreover, whereas the loss of mouse neogenin has no effect on the trajectory of commissural axons, removing chicken neogenin by RNA interference results in a phenotype similar to the functional inactivation of Dcc in mouse. Taken together, these data suggest that the chick neogenin is functionally equivalent to rodent Dcc
Re-Annotation Is an Essential Step in Systems Biology Modeling of Functional Genomics Data
One motivation of systems biology research is to understand gene functions and interactions from functional genomics data such as that derived from microarrays. Up-to-date structural and functional annotations of genes are an essential foundation of systems biology modeling. We propose that the first essential step in any systems biology modeling of functional genomics data, especially for species with recently sequenced genomes, is gene structural and functional re-annotation. To demonstrate the impact of such re-annotation, we structurally and functionally re-annotated a microarray developed, and previously used, as a tool for disease research. We quantified the impact of this re-annotation on the array based on the total numbers of structural- and functional-annotations, the Gene Annotation Quality (GAQ) score, and canonical pathway coverage. We next quantified the impact of re-annotation on systems biology modeling using a previously published experiment that used this microarray. We show that re-annotation improves the quantity and quality of structural- and functional-annotations, allows a more comprehensive Gene Ontology based modeling, and improves pathway coverage for both the whole array and a differentially expressed mRNA subset. Our results also demonstrate that re-annotation can result in a different knowledge outcome derived from previous published research findings. We propose that, because of this, re-annotation should be considered to be an essential first step for deriving value from functional genomics data
A Complex Genomic Rearrangement Involving the Endothelin 3 Locus Causes Dermal Hyperpigmentation in the Chicken
Dermal hyperpigmentation or Fibromelanosis (FM) is one of the few examples of skin pigmentation phenotypes in the chicken, where most other pigmentation variants influence feather color and patterning. The Silkie chicken is the most widespread and well-studied breed displaying this phenotype. The presence of the dominant FM allele results in extensive pigmentation of the dermal layer of skin and the majority of internal connective tissue. Here we identify the causal mutation of FM as an inverted duplication and junction of two genomic regions separated by more than 400 kb in wild-type individuals. One of these duplicated regions contains endothelin 3 (EDN3), a gene with a known role in promoting melanoblast proliferation. We show that EDN3 expression is increased in the developing Silkie embryo during the time in which melanoblasts are migrating, and elevated levels of expression are maintained in the adult skin tissue. We have examined four different chicken breeds from both Asia and Europe displaying dermal hyperpigmentation and conclude that the same structural variant underlies this phenotype in all chicken breeds. This complex genomic rearrangement causing a specific monogenic trait in the chicken illustrates how novel mutations with major phenotypic effects have been reused during breed formation in domestic animals
Integrative mapping analysis of chicken microchromosome 16 organization
<p>Abstract</p> <p>Background</p> <p>The chicken karyotype is composed of 39 chromosome pairs, of which 9 still remain totally absent from the current genome sequence assembly, despite international efforts towards complete coverage. Some others are only very partially sequenced, amongst which microchromosome 16 (GGA16), particularly under-represented, with only 433 kb assembled for a full estimated size of 9 to 11 Mb. Besides the obvious need of full genome coverage with genetic markers for QTL (Quantitative Trait Loci) mapping and major genes identification studies, there is a major interest in the detailed study of this chromosome because it carries the two genetically independent <it>MHC </it>complexes <it>B </it>and <it>Y</it>. In addition, GGA16 carries the ribosomal RNA (<it>rRNA</it>) genes cluster, also known as the <it>NOR </it>(nucleolus organizer region). The purpose of the present study is to construct and present high resolution integrated maps of GGA16 to refine its organization and improve its coverage with genetic markers.</p> <p>Results</p> <p>We developed 79 STS (Sequence Tagged Site) markers to build a physical RH (radiation hybrid) map and 34 genetic markers to extend the genetic map of GGA16. We screened a BAC (Bacterial Artificial Chromosome) library with markers for the <it>MHC-B</it>, <it>MHC-Y </it>and <it>rRNA </it>complexes. Selected clones were used to perform high resolution FISH (Fluorescent <it>In Situ </it>Hybridization) mapping on giant meiotic lampbrush chromosomes, allowing meiotic mapping in addition to the confirmation of the order of the three clusters along the chromosome. A region with high recombination rates and containing PO41 repeated elements separates the two <it>MHC </it>complexes.</p> <p>Conclusions</p> <p>The three complementary mapping strategies used refine greatly our knowledge of chicken microchromosome 16 organisation. The characterisation of the recombination hotspots separating the two <it>MHC </it>complexes demonstrates the presence of PO41 repetitive sequences both in tandem and inverted orientation. However, this region still needs to be studied in more detail.</p
GPCR Genes Are Preferentially Retained after Whole Genome Duplication
One of the most interesting questions in biology is whether certain pathways have been favored during evolution, and if so, what properties could cause such a preference. Due to the lack of experimental evidence, whether select gene families have been preferentially retained over time after duplication in metazoan organisms remains unclear. Here, by syntenic mapping of nonchemosensory G protein-coupled receptor genes (nGPCRs which represent half the receptome for transmembrane signaling) in the vertebrate genomes, we found that, as opposed to the 8–15% retention rate for whole genome duplication (WGD)-derived gene duplicates in the entire genome of pufferfish, greater than 27.8% of WGD-derived nGPCRs which interact with a nonpeptide ligand were retained after WGD in pufferfish Tetraodon nigroviridis. In addition, we show that concurrent duplication of cognate ligand genes by WGD could impose selection of nGPCRs that interact with a polypeptide ligand. Against less than 2.25% probability for parallel retention of a pair of WGD-derived ligands and a pair of cognate receptor duplicates, we found a more than 8.9% retention of WGD-derived ligand-nGPCR pairs–threefold greater than one would surmise. These results demonstrate that gene retention is not uniform after WGD in vertebrates, and suggest a Darwinian selection of GPCR-mediated intercellular communication in metazoan organisms
MicroRNA Genes Derived from Repetitive Elements and Expanded by Segmental Duplication Events in Mammalian Genomes
MicroRNAs (miRNAs) are a class of small noncoding RNAs that regulate gene
expression by targeting mRNAs for translation repression or mRNA degradation.
Many miRNAs are being discovered and studied, but in most cases their origin,
evolution and function remain unclear. Here, we characterized miRNAs derived
from repetitive elements and miRNA families expanded by segmental duplication
events in the human, rhesus and mouse genomes. We applied a comparative genomics
approach combined with identifying miRNA paralogs in segmental duplication pair
data in a genome-wide study to identify new homologs of human miRNAs in the
rhesus and mouse genomes. Interestingly, using segmental duplication pair data,
we provided credible computational evidence that two miRNA genes are located in
the pseudoautosomal region of the human Y chromosome. We characterized all the
miRNAs whether they were derived from repetitive elements or not and identified
significant differences between the repeat-related miRNAs (RrmiRs) and
non-repeat-derived miRNAs in (1) their location in protein-coding and intergenic
regions in genomes, (2) the minimum free energy of their hairpin structures, and
(3) their conservation in vertebrate genomes. We found some lineage-specific
RrmiR families and three lineage-specific expansion families, and provided
evidence indicating that some RrmiR families formed and expanded during
evolutionary segmental duplication events. We also provided computational and
experimental evidence for the functions of the conservative RrmiR families in
the three species. Together, our results indicate that repetitive elements
contribute to the origin of miRNAs, and large segmental duplication events could
prompt the expansion of some miRNA families, including RrmiR families. Our study
is a valuable contribution to the knowledge of evolution and function of
non-coding region in genome
The anti-bacterial iron-restriction defence mechanisms of egg white; the potential role of three lipocalin-like proteins in resistance against Salmonella
Salmonella enterica serovar Enteritidis (SE) is the most frequently-detected Salmonella in foodborne outbreaks in the European Union. Among such outbreaks, egg and egg products were identified as the most common vehicles of infection. Possibly, the major antibacterial property of egg white is iron restriction, which results from the presence of the iron-binding protein, ovotransferrin. To circumvent iron restriction, SE synthesise catecholate siderophores (i.e. enterobactin and salmochelin) that can chelate iron from host iron-binding proteins. Here, we highlight the role of lipocalin-like proteins found in egg white that could enhance egg-white iron restriction through sequestration of certain siderophores, including enterobactin. Indeed, it is now apparent that the egg-white lipocalin, Ex-FABP, can inhibit bacterial growth via its siderophore-binding capacity in vitro. However, it remains unclear whether ex-FABP performs such a function in egg white or during bird infection. Regarding the two other lipocalins of egg white (Cal-γ and α-1-glycoprotein), there is currently no evidence to indicate that they sequester siderophores