27 research outputs found

    OMSim : a simulator for optical map data

    Get PDF
    Motivation: The Bionano Genomics platform allows for the optical detection of short sequence patterns in very long DNA molecules (up to 2.5 Mbp). Molecules with overlapping patterns can be assembled to generate a consensus optical map of the entire genome. In turn, these optical maps can be used to validate or improve de novo genome assembly projects or to detect large-scale structural variation in genomes. Simulated optical map data can assist in the development and benchmarking of tools that operate on those data, such as alignment and assembly software. Additionally, it can help to optimize the experimental setup for a genome of interest. Such a simulator is currently not available. Results: We have developed a simulator, OMSim, that produces synthetic optical map data that mimics real Bionano Genomics data. These simulated data have been tested for compatibility with the Bionano Genomics Irys software system and the Irys-scaffolding scripts. OMSim is capable of handling very large genomes (over 30 Gbp) with high throughput and low memory requirements

    The grapevine gene nomenclature system

    Get PDF
    [Background] Grapevine (Vitis vinifera L.) is one of the most important fruit crops in the world and serves as a valuable model for fruit development in woody species. A major breakthrough in grapevine genomics was achieved in 2007 with the sequencing of the Vitis vinifera cv. PN40024 genome. Subsequently, data on structural and functional characterization of grape genes accumulated exponentially. To better exploit the results obtained by the international community, we think that a coordinated nomenclature for gene naming in species with sequenced genomes is essential. It will pave the way for the accumulation of functional data that will enable effective scientific discussion and discovery. The exploitation of data that were generated independently of the genome release is hampered by their heterogeneous nature and by often incompatible and decentralized storage. Classically, large amounts of data describing gene functions are only available in printed articles and therefore remain hardly accessible for automatic text mining. On the other hand, high throughput >Omics> data are typically stored in public repositories, but should be arranged in compendia to better contribute to the annotation and functional characterization of the genes.[Results] With the objective of providing a high quality and highly accessible annotation of grapevine genes, the International Grapevine Genome Project (IGGP) commissioned an international Super-Nomenclature Committee for Grape Gene Annotation (sNCGGa) to coordinate the effort of experts to annotate the grapevine genes. The goal of the committee is to provide a standard nomenclature for locus identifiers and to define conventions for a gene naming system in this paper.[Conclusions] Learning from similar initiatives in other plant species such as Arabidopsis, rice and tomato, a versatile nomenclature system has been developed in anticipation of future genomic developments and annotation issues. The sNCGGa's first outreach to the grape community has been focused on implementing recommended guidelines for the expert annotators by: (i) providing a common annotation platform that enables community-based gene curation, (ii) developing a gene nomenclature scheme reflecting the biological features of gene products that is consistent with that used in other organisms in order to facilitate comparative analyses. © 2014 Grimplet et al.Authors would like to thank the Grape Research Coordination Network (NSF grant DBI 0741876) for financial support, the International Grape Genome Program and the COST action FA1106 “Quality fruit”. J.G. was supported by the Ramon y Cajal program (RYC-2011-07791). We acknowledge support of the publication fee by the CSIC Open Access Publication Support Initiative through its Unit of Information Resources for Research (URICI).Peer Reviewe

    GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts

    Get PDF
    Genomic projects heavily depend on genome annotations and are limited by the current deficiencies in the published predictions of gene structure and function. It follows that, improved annotation will allow better data mining of genomes, and more secure planning and design of experiments. The purpose of the GeneFarm project is to obtain homogeneous, reliable, documented and traceable annotations for Arabidopsis nuclear genes and gene products, and to enter them into an added-value database. This re-annotation project is being performed exhaustively on every member of each gene family. Performing a family-wide annotation makes the task easier and more efficient than a gene-by-gene approach since many features obtained for one gene can be extrapolated to some or all the other genes of a family. A complete annotation procedure based on the most efficient prediction tools available is being used by 16 partner laboratories, each contributing annotated families from its field of expertise. A database, named GeneFarm, and an associated user-friendly interface to query the annotations have been developed. More than 3000 genes distributed over 300 families have been annotated and are available at http://genoplante-info.infobiogen.fr/Genefarm/. Furthermore, collaboration with the Swiss Institute of Bioinformatics is underway to integrate the GeneFarm data into the protein knowledgebase Swiss-Pro

    GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts

    Get PDF
    Genomic projects heavily depend on genome annotations and are limited by the current deficiencies in the published predictions of gene structure and function. It follows that, improved annotation will allow better data mining of genomes, and more secure planning and design of experiments. The purpose of the GeneFarm project is to obtain homogeneous, reliable, documented and traceable annotations for Arabidopsis nuclear genes and gene products, and to enter them into an added-value database. This re-annotation project is being performed exhaustively on every member of each gene family. Performing a family-wide annotation makes the task easier and more efficient than a gene-by-gene approach since many features obtained for one gene can be extrapolated to some or all the other genes of a family. A complete annotation procedure based on the most efficient prediction tools available is being used by 16 partner laboratories, each contributing annotated families from its field of expertise. A database, named GeneFarm, and an associated user-friendly interface to query the annotations have been developed. More than 3000 genes distributed over 300 families have been annotated and are available at http://genoplante-info.infobiogen.fr/Genefarm/. Furthermore, collaboration with the Swiss Institute of Bioinformatics is underway to integrate the GeneFarm data into the protein knowledgebase Swiss-Prot

    Autoantibodies against type I IFNs in patients with critical influenza pneumonia

    Full text link
    In an international cohort of 279 patients with hypoxemic influenza pneumonia, we identified 13 patients (4.6%) with autoantibodies neutralizing IFN-alpha and/or -omega, which were previously reported to underlie 15% cases of life-threatening COVID-19 pneumonia and one third of severe adverse reactions to live-attenuated yellow fever vaccine. Autoantibodies neutralizing type I interferons (IFNs) can underlie critical COVID-19 pneumonia and yellow fever vaccine disease. We report here on 13 patients harboring autoantibodies neutralizing IFN-alpha 2 alone (five patients) or with IFN-omega (eight patients) from a cohort of 279 patients (4.7%) aged 6-73 yr with critical influenza pneumonia. Nine and four patients had antibodies neutralizing high and low concentrations, respectively, of IFN-alpha 2, and six and two patients had antibodies neutralizing high and low concentrations, respectively, of IFN-omega. The patients' autoantibodies increased influenza A virus replication in both A549 cells and reconstituted human airway epithelia. The prevalence of these antibodies was significantly higher than that in the general population for patients 70 yr of age (3.1 vs. 4.4%, P = 0.68). The risk of critical influenza was highest in patients with antibodies neutralizing high concentrations of both IFN-alpha 2 and IFN-omega (OR = 11.7, P = 1.3 x 10(-5)), especially those <70 yr old (OR = 139.9, P = 3.1 x 10(-10)). We also identified 10 patients in additional influenza patient cohorts. Autoantibodies neutralizing type I IFNs account for similar to 5% of cases of life-threatening influenza pneumonia in patients <70 yr old

    In silico cloning of genes encoding neuropeptides, neurohormones and their putative G-protein coupled receptors in a spider mite

    No full text
    The genome of the spider mite was prospected for the presence of genes coding neuropeptides, neurohormones and their putative G-protein coupled receptors. Fifty one candidate genes were found to encode neuropeptides or neurohormones. These include all known insect neuropeptides and neurohormones, with the exception of sulfakinin, corazonin, neuroparsin and PTTH. True orthologs of adipokinetic hormone (AKH) were neither found, but there are three genes encoding peptides similar in structure to both AKH and the AKH-corazonin-related peptide. We were also unable to identify the precursors for pigment dispersing factor (PDF) or the recently discovered trissin. However, the spider mite probably does have such genes, as we found their putative receptors. A novel arthropod neuropeptide gene was identified that shows similarity to previously described molluscan neuropeptide genes and was called EFLamide. A total of 65 putative neuropeptide GPCR genes were also identifieid, of these 58 belong to the A-family and 7 to the B-family. Phylogenetic analysis showed that 50 of them are closely related to insect GPCRs, which allowed the identification of their putative ligand in 39 cases with varying degrees of certainty. Other spider mite GPCRs however have no identifiable orthologs in the genomes of the four holometabolous insect species best analyzed. Whereas some of the latter have orthologs in hemimetabolous insect species, crustaceans or ticks, for others such arthropod homologs are currently unknown. © 2011 Elsevier Ltd.Additional funds for transcriptome studies that supported the annotation were from a University of Utah SEED grant and equipment funds from the US National Science Foundation (0820985; to Richard M. Clark).Peer Reviewe

    Toward the Improvement of the Gene Centered Information in Grapevine Genomics

    Get PDF
    Trabajo presentado en la Plant and Animal Genome XXII Conference, celebrada en San Diego del 11 al 15 de enero de 2014.A major breakthrough in grapevine genomics was achieved in 2007 with the sequencing of the Vitis vinifera cv. PN40024 genome. Subsequently, invaluable data related to the structural and functional characterization of the genes accumulated exponentially. Unfortunately, part of this data is episodic by nature or was acquired prior to the genome release. Furthermore, because of heterogeneous sources these data were greatly under-exploited with issues like incompatibility of data but also decentralized storage making retrieval more difficult. Classically, a large amount of useful data describing gene functions only appears in printed articles that describe the experiments carried out to answer the initial question of the publishing research team without being deposited in a common database, then remaining inaccessible for automatic data mining. High throughput “omics” data are typically stored in public repositories, but generally remain confined to their initial aim. With the objective of providing a high quality and highly accessible annotation of the grapevine genes, the International Grapevine Genome Project (IGGP) commissioned an international Supernomenclature Committee for Grape Gene Annotation to coordinate the expert-annotation effort of the grapevine genes. The goal of the committee is to provide a standard nomenclature for locus identifiers and to define the convention for the gene naming system. This system will be implemented on a new version of grapevine annotation that is being developed on an improved version of the grapevine genome chromosome assembly.Acknowledgements: GRCN NSF program, Grant Plant-KBBE-2008-GrapeReSeq, EU COST FA 1106.Peer Reviewe

    Genome-Wide Analysis of Hydrogen Peroxide-Regulated Gene Expression in Arabidopsis Reveals a High Light-Induced Transcriptional Cluster Involved in Anthocyanin Biosynthesis

    Get PDF
    In plants, reactive oxygen species and, more particularly, hydrogen peroxide (H(2)O(2)) play a dual role as toxic by-products of normal cell metabolism and as regulatory molecules in stress perception and signal transduction. Peroxisomal catalases are an important sink for photorespiratory H(2)O(2). Using ATH1 Affymetrix microarrays, expression profiles were compared between control and catalase-deficient Arabidopsis (Arabidopsis thaliana) plants. Reduced catalase levels already provoked differences in nuclear gene expression under ambient growth conditions, and these effects were amplified by high light exposure in a sun simulator for 3 and 8 h. This genome-wide expression analysis allowed us to reveal the expression characteristics of complete pathways and functional categories during H(2)O(2) stress. In total, 349 transcripts were significantly up-regulated by high light in catalase-deficient plants and 88 were down-regulated. From this data set, H(2)O(2) was inferred to play a key role in the transcriptional up-regulation of small heat shock proteins during high light stress. In addition, several transcription factors and candidate regulatory genes involved in H(2)O(2) transcriptional gene networks were identified. Comparisons with other publicly available transcriptome data sets of abiotically stressed Arabidopsis revealed an important intersection with H(2)O(2)-deregulated genes, positioning elevated H(2)O(2) levels as an important signal within abiotic stress-induced gene expression. Finally, analysis of transcriptional changes in a combination of a genetic (catalase deficiency) and an environmental (high light) perturbation identified a transcriptional cluster that was strongly and rapidly induced by high light in control plants, but impaired in catalase-deficient plants. This cluster comprises the complete known anthocyanin regulatory and biosynthetic pathway, together with genes encoding unknown proteins
    corecore