920 research outputs found
Enabling comparative modeling of closely related genomes: Example genus Brucella
For many scientific applications, it is highly desirable to be able to compare metabolic models of closely related genomes. In this short report, we attempt to raise awareness to the fact that taking annotated genomes from public repositories and using them for metabolic model reconstructions is far from being trivial due to annotation inconsistencies. We are proposing a protocol for comparative analysis of metabolic models on closely related genomes, using fifteen strains of genus Brucella, which contains pathogens of both humans and livestock. This study lead to the identification and subsequent correction of inconsistent annotations in the SEED database, as well as the identification of 31 biochemical reactions that are common to Brucella, which are not originally identified by automated metabolic reconstructions. We are currently implementing this protocol for improving automated annotations within the SEED database and these improvements have been propagated into PATRIC, Model-SEED, KBase and RAST. This method is an enabling step for the future creation of consistent annotation systems and high-quality model reconstructions that will support in predicting accurate phenotypes such as pathogenicity, media requirements or type of respiration.We thank Jean Jacques Letesson, Maite Iriarte, Stephan Kohler and David O'Callaghan for their input on improving specific annotations. This project has been funded by the United States National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract No. HHSN272200900040C, awarded to BW Sobral, and from the United States National Science Foundation under Grant MCB-1153357, awarded to CS Henry. J.P.F. acknowledges funding from [FRH/BD/70824/2010] of the FCT (Portuguese Foundation for Science and Technology) Ph.D. scholarship
Complete chloroplast genome sequence of Holoparasite Cistanche Deserticola (Orobanchaceae) reveals gene loss and horizontal gene transfer from Its host Haloxylon Ammodendron (Chenopodiaceae)
The central function of chloroplasts is to carry out photosynthesis, and its gene content and structure are highly conserved across land plants. Parasitic plants, which have reduced photosynthetic ability, suffer gene losses from the chloroplast (cp) genome accompanied by the relaxation of selective constraints. Compared with the rapid rise in the number of cp genome sequences of photosynthetic organisms, there are limited data sets from parasitic plants. The authors report the complete sequence of the cp genome of Cistanche deserticola, a holoparasitic desert species belonging to the family Orobanchaceae
Chromosomal-level assembly of the Asian Seabass genome using long sequence reads and multi-layered scaffolding
We report here the ~670 Mb genome assembly of the Asian seabass (Lates calcarifer), a tropical marine teleost. We used long-read sequencing augmented by transcriptomics, optical and genetic mapping along with shared synteny from closely related fish species to derive a chromosome-level assembly with a contig N50 size over 1 Mb and scaffold N50 size over 25 Mb that span ~90% of the genome. The population structure of L. calcarifer species complex was analyzed by re-sequencing 61 individuals representing various regions across the species' native range. SNP analyses identified high levels of genetic diversity and confirmed earlier indications of a population stratification comprising three clades with signs of admixture apparent in the South-East Asian population. The quality of the Asian seabass genome assembly far exceeds that of any other fish species, and will serve as a new standard for fish genomics
Discovery and characterization of chromatin states for systematic annotation of the human genome
A plethora of epigenetic modifications have been described in the human genome and shown to play diverse roles in gene regulation, cellular differentiation and the onset of disease. Although individual modifications have been linked to the activity levels of various genetic functional elements, their combinatorial patterns are still unresolved and their potential for systematic de novo genome annotation remains untapped. Here, we use a multivariate Hidden Markov Model to reveal 'chromatin states' in human T cells, based on recurrent and spatially coherent combinations of chromatin marks. We define 51 distinct chromatin states, including promoter-associated, transcription-associated, active intergenic, large-scale repressed and repeat-associated states. Each chromatin state shows specific enrichments in functional annotations, sequence motifs and specific experimentally observed characteristics, suggesting distinct biological roles. This approach provides a complementary functional annotation of the human genome that reveals the genome-wide locations of diverse classes of epigenetic function.National Science Foundation (U.S.). (Award 0905968)National Human Genome Research Institute (U.S.) (Award U54-HG004570)National Human Genome Research Institute (U.S.) (Award RC1-HG005334
Ceruloplasmin is a novel adipokine which is overexpressed in adipose tissue of obese subjects and in obesity-associated cancer cells
Obesity confers an increased risk of developing specific cancer forms. Although the mechanisms are unclear, increased fat cell secretion of specific proteins (adipokines) may promote/facilitate development of malignant tumors in obesity via cross-talk between adipose tissue(s) and the tissues prone to develop cancer among obese. We searched for novel adipokines that were overexpressed in adipose tissue of obese subjects as well as in tumor cells derived from cancers commonly associated with obesity. For this purpose expression data from human adipose tissue of obese and non-obese as well as from a large panel of human cancer cell lines and corresponding primary cells and tissues were explored. We found expression of ceruloplasmin to be the most enriched in obesity-associated cancer cells. This gene was also significantly up-regulated in adipose tissue of obese subjects. Ceruloplasmin is the body's main copper carrier and is involved in angiogenesis. We demonstrate that ceruloplasmin is a novel adipokine, which is produced and secreted at increased rates in obesity. In the obese state, adipose tissue contributed markedly (up to 22%) to the total circulating protein level. In summary, we have through bioinformatic screening identified ceruloplasmin as a novel adipokine with increased expression in adipose tissue of obese subjects as well as in cells from obesity-associated cancers. Whether there is a causal relationship between adipose overexpression of ceruloplasmin and cancer development in obesity cannot be answered by these cross-sectional comparisons
An internal ribosome entry site in the 5′ untranslated region of epidermal growth factor receptor allows hypoxic expression
The expression of epidermal growth factor receptor (EGFR/ERBB1/HER1) is implicated in the progress of numerous cancers, a feature that has been exploited in the development of EGFR antibodies and EGFR tyrosine kinase inhibitors as anti-cancer drugs. However, EGFR also has important normal cellular functions, leading to serious side effects when EGFR is inhibited. One damaging characteristic of many oncogenes is the ability to be expressed in the hypoxic conditions associated with the tumour interior. It has previously been demonstrated that expression of EGFR is maintained in hypoxic conditions via an unknown mechanism of translational control, despite global translation rates generally being attenuated under hypoxic conditions. In this report, we demonstrate that the human EGFR 5′ untranslated region (UTR) sequence can initiate the expression of a downstream open reading frame via an internal ribosome entry site (IRES). We show that this effect is not due to either cryptic promoter activity or splicing events. We have investigated the requirement of the EGFR IRES for eukaryotic initiation factor 4A (eIF4A), which is an RNA helicase responsible for processing RNA secondary structure as part of translation initiation. Treatment with hippuristanol (a potent inhibitor of eIF4A) caused a decrease in EGFR 5′ UTR-driven reporter activity and also a reduction in EGFR protein level. Importantly, we show that expression of a reporter gene under the control of the EGFR IRES is maintained under hypoxic conditions despite a fall in global translation rates
High sample throughput genotyping for estimating C-lineage introgression in the dark honeybee: an accurate and cost-effective SNP-based tool
The natural distribution of the honeybee (Apis mellifera L.) has been changed by humans in recent
decades to such an extent that the formerly widest-spread European subspecies, Apis mellifera
mellifera, is threatened by extinction through introgression from highly divergent commercial strains
in large tracts of its range. Conservation efforts for A. m. mellifera are underway in multiple European
countries requiring reliable and cost-efficient molecular tools to identify purebred colonies. Here, we
developed four ancestry-informative SNP assays for high sample throughput genotyping using the
iPLEX Mass Array system. Our customized assays were tested on DNA from individual and pooled,
haploid and diploid honeybee samples extracted from different tissues using a diverse range of
protocols. The assays had a high genotyping success rate and yielded accurate genotypes. Performance
assessed against whole-genome data showed that individual assays behaved well, although the
most accurate introgression estimates were obtained for the four assays combined (117 SNPs).
The best compromise between accuracy and genotyping costs was achieved when combining two
assays (62 SNPs). We provide a ready-to-use cost-effective tool for accurate molecular identification
and estimation oinfo:eu-repo/semantics/publishedVersio
The landscape of Neandertal ancestry in present-day humans
Analyses of Neandertal genomes have revealed that Neandertals have contributed genetic variants to modern humans1–2. The antiquity of Neandertal gene flow into modern humans means that regions that derive from Neandertals in any one human today are usually less than a hundred kilobases in size. However, Neandertal haplotypes are also distinctive enough that several studies have been able to detect Neandertal ancestry at specific loci1,3–8. Here, we have systematically inferred Neandertal haplotypes in the genomes of 1,004 present-day humans12. Regions that harbor a high frequency of Neandertal alleles in modern humans are enriched for genes affecting keratin filaments suggesting that Neandertal alleles may have helped modern humans adapt to non-African environments. Neandertal alleles also continue to shape human biology, as we identify multiple Neandertal-derived alleles that confer risk for disease. We also identify regions of millions of base pairs that are nearly devoid of Neandertal ancestry and enriched in genes, implying selection to remove genetic material derived from Neandertals. Neandertal ancestry is significantly reduced in genes specifically expressed in testis, and there is an approximately 5-fold reduction of Neandertal ancestry on chromosome X, which is known to harbor a disproportionate fraction of male hybrid sterility genes20–22. These results suggest that part of the reduction in Neandertal ancestry near genes is due to Neandertal alleles that reduced fertility in males when moved to a modern human genetic background
Characterization of globulin storage proteins of a low prolamin cereal species in relation to celiac disease
Brachypodium distachyon, a small annual grass with seed storage globulins as primary protein reserves was used in our study to analyse the toxic nature of non-prolamin seed storage proteins related to celiac disease. The main storage proteins of B. distachyon are the 7S globulin type proteins and the 11S, 12S seed storage globulins similar to oat and rice. Immunoblot analyses using serum samples from celiac disease patients were carried out followed by the identification of immune-responsive proteins using mass spectrometry. Serum samples from celiac patients on a gluten-free diet, from patients with Crohn's disease and healthy subjects, were used as controls. The identified proteins with intense serum-IgA reactivity belong to the 7S and 11-12S seed globulin family. Structure prediction and epitope predictions analyses confirmed the presence of celiac disease-related linear B cell epitope homologs and the presence of peptide regions with strong HLA-DQ8 and DQ2 binding capabilities. These results highlight that both MHC-II presentation and B cell response may be developed not only to prolamins but also to seed storage globulins. This is the first study of the non-prolamin type seed storage proteins of Brachypodium from the aspect of the celiac disease
Metabolic labeling of RNA uncovers principles of RNA production and degradation dynamics in mammalian cells
available in PMC 2011 November 01.Cellular RNA levels are determined by the interplay of RNA production, processing and degradation. However, because most studies of RNA regulation do not distinguish the separate contributions of these processes, little is known about how they are temporally integrated. Here we combine metabolic labeling of RNA at high temporal resolution with advanced RNA quantification and computational modeling to estimate RNA transcription and degradation rates during the response of mouse dendritic cells to lipopolysaccharide. We find that changes in transcription rates determine the majority of temporal changes in RNA levels, but that changes in degradation rates are important for shaping sharp 'peaked' responses. We used sequencing of the newly transcribed RNA population to estimate temporally constant RNA processing and degradation rates genome wide. Degradation rates vary significantly between genes and contribute to the observed differences in the dynamic response. Certain transcripts, including those encoding cytokines and transcription factors, mature faster. Our study provides a quantitative approach to study the integrative process of RNA regulation.Human Frontier Science Program (Strasbourg, France)Howard Hughes Medical InstituteBurroughs Wellcome Fund (Career Award at the Scientific Interface
- …
