Search CORE

Relationship between operon preference and functional properties of persistent genes in bacterial genomes

Author: Bratlie Marit S
Drabløs Finn
Johansen Jostein
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Genes in bacteria may be organised into operons, leading to strict co-expression of the genes that participate in the same operon. However, comparisons between different bacterial genomes have shown that much of the operon structure is dynamic on an evolutionary time scale. This indicates that there are opposing effects influencing the tendency for operon formation, and these effects may be reflected in properties like evolutionary rate, complex formation, metabolic pathways and gene fusion. Results We have used multi-species protein-protein comparisons to generate a high-quality set of genes that are persistent in bacterial genomes (i.e. they have close to universal distribution). We have analysed these genes with respect to operon participation and important functional properties, including evolutionary rate and protein-protein interactions. Conclusions Genes for ribosomal proteins show a very slow rate of evolution. This is consistent with a strong tendency for the genes to participate in operons and for their proteins to be involved in essential and well defined complexes. Persistent genes for non-ribosomal proteins can be separated into two classes according to tendency to participate in operons. Those with a strong tendency for operon participation make proteins with fewer interaction partners that seem to participate in relatively static complexes and possibly linear pathways. Genes with a weak tendency for operon participation tend to produce proteins with more interaction partners, but possibly in more dynamic complexes and convergent pathways. Genes that are not regulated through operons are therefore more evolutionary constrained than the corresponding operon-associated genes and will on average evolve more slowly.</p

Springer - Publisher Connector

Directory of Open Access Journals

Springer - Publisher Connector

Design of Selective Inhibitors of Tyrosine Kinase 2

Author: Finn Drabløs
Kristin Tøndel
Publication venue
Publication date: 01/01/2005
Field of study

Abstract: Selective inhibitors of tyrosine kinase 2 (Tyk2) were searched for using database screening, de novo ligand design and computational docking in Tyk2 and seven other protein kinases. None of the structures in the National Cancer Institute database seem to inhibit Tyk2 selectively, but five of the designed structures seem promising

CiteSeerX

Improved benchmarks for computational motif discovery

Author: Abul Osman
Drabløs Finn
Sandve Geir Kjetil
Walseng Vegard
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background An important step in annotation of sequenced genomes is the identification of transcription factor binding sites. More than a hundred different computational methods have been proposed, and it is difficult to make an informed choice. Therefore, robust assessment of motif discovery methods becomes important, both for validation of existing tools and for identification of promising directions for future research. Results We use a machine learning perspective to analyze collections of transcription factors with known binding sites. Algorithms are presented for finding position weight matrices (PWMs), IUPAC-type motifs and mismatch motifs with optimal discrimination of binding sites from remaining sequence. We show that for many data sets in a recently proposed benchmark suite for motif discovery, none of the common motif models can accurately discriminate the binding sites from remaining sequence. This may obscure the distinction between the potential performance of the motif discovery tool itself versus the intrinsic complexity of the problem we are trying to solve. Synthetic data sets may avoid this problem, but we show on some previously proposed benchmarks that there may be a strong bias towards a presupposed motif model. We also propose a new approach to benchmark data set construction. This approach is based on collections of binding site fragments that are ranked according to the optimal level of discrimination achieved with our algorithms. This allows us to select subsets with specific properties. We present one benchmark suite with data sets that allow good discrimination between positive and negative instances with the common motif models. These data sets are suitable for evaluating algorithms for motif discovery that rely on these models. We present another benchmark suite where PWM, IUPAC and mismatch motif models are not able to discriminate reliably between positive and negative instances. This suite could be used for evaluating more powerful motif models. Conclusion Our improved benchmark suites have been designed to differentiate between the performance of motif discovery algorithms and the power of motif models. We provide a web server where users can download our benchmark suites, submit predictions and visualize scores on the benchmarks

Safety in numbers: multiple occurrences of highly similar homologs among Azotobacter vinelandii carbohydrate metabolism proteins probably confer adaptive benefits

Author: Finn Drabløs
Helga Ertesvåg
Jostein Johansen
Mali Mærk
Svein Valla
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Springer - Publisher Connector

Gene duplications in prokaryotes can be associated with environmental adaptation

Author: Bratlie Marit S
Drabløs Finn
Huang Da Wei
Johansen Jostein
Lempicki Richard A
Sherman Brad T
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Gene duplication is a normal evolutionary process. If there is no selective advantage in keeping the duplicated gene, it is usually reduced to a pseudogene and disappears from the genome. However, some paralogs are retained. These gene products are likely to be beneficial to the organism, e.g. in adaptation to new environmental conditions. The aim of our analysis is to investigate the properties of paralog-forming genes in prokaryotes, and to analyse the role of these retained paralogs by relating gene properties to life style of the corresponding prokaryotes. Results Paralogs were identified in a number of prokaryotes, and these paralogs were compared to singletons of persistent orthologs based on functional classification. This showed that the paralogs were associated with for example energy production, cell motility, ion transport, and defence mechanisms. A statistical overrepresentation analysis of gene and protein annotations was based on paralogs of the 200 prokaryotes with the highest fraction of paralog-forming genes. Biclustering of overrepresented gene ontology terms versus species was used to identify clusters of properties associated with clusters of species. The clusters were classified using similarity scores on properties and species to identify interesting clusters, and a subset of clusters were analysed by comparison to literature data. This analysis showed that paralogs often are associated with properties that are important for survival and proliferation of the specific organisms. This includes processes like ion transport, locomotion, chemotaxis and photosynthesis. However, the analysis also showed that the gene ontology terms sometimes were too general, imprecise or even misleading for automatic analysis. Conclusions Properties described by gene ontology terms identified in the overrepresentation analysis are often consistent with individual prokaryote lifestyles and are likely to give a competitive advantage to the organism. Paralogs and singletons dominate different categories of functional classification, where paralogs in particular seem to be associated with processes involving interaction with the environment.</p

Directory of Open Access Journals

Resolving the zinc binding capacity of honey bee vitellogenin and locating its putative binding sites

Author: Amdam Gro Vang
Dobrovolska Olena
Drabløs Finn
Enger Øyvind
Halskau Øyvind
Leipart Vilde
Turcu Diana Cornelia
Publication venue: 'Wiley'
Publication date: 01/01/2022
Field of study

publishedVersio

Brage NMBU

University of Bergen

EpiFactors : a comprehensive database of human epigenetic factors and complexes

Author: Andreas Lennartsson
Finn Drabløs
Grigory Khimulya
Ilya E. Vorontsov
Ivan V. Kulakovskiy
Jaenisch
Pouda Panahandeh
Rezvan Ehsani
Takeya Kasukawa
Yulia A. Medvedeva
Zhu
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

Altres ajuts: Russian Fund For Basic Research(RFFI)grant 14-04-0018 i grant 15-34-20423, Ake Olsson's foundation, Swedish Cancer foundation, Swedish Childhood cancer foundation, Dynasty Foundation Fellowship, RIKEN Omics Science Center, RIKEN Preventive Medicine and Diagnosis Innovation Program i RIKEN Center for Life Science Technologies.Abstract: Epigenetics refers to stable and long-term alterations of cellular traits that are not caused by changes in the DNA sequence per se. Rather, covalent modifications of DNA and histones affect gene expression and genome stability via proteins that recognize and act upon such modifications. Many enzymes that catalyse epigenetic modifications or are critical for enzymatic complexes have been discovered, and this is encouraging investigators to study the role of these proteins in diverse normal and pathological processes. Rapidly growing knowledge in the area has resulted in the need for a resource that compiles, organizes and presents curated information to the researchers in an easily accessible and user-friendly form. Here we present EpiFactors, a manually curated database providing information about epigenetic regulators, their complexes, targets and products. EpiFactors contains information on 815 proteins, including 95 histones and protamines. For 789 of these genes, we include expressions values across several samples, in particular a collection of 458 human primary cell samples (for approximately 200 cell types, in many cases from three individual donors), covering most mammalian cell steady states, 255 different cancer cell lines (representing approximately 150 cancer subtypes) and 134 human postmortem tissues. Expression values were obtained by the FANTOM5 consortium using Cap Analysis of Gene Expression technique. EpiFactors also contains information on 69 protein complexes that are involved in epigenetic regulation. The resource is practical for a wide range of users, including biologists, pharmacologists and clinicians

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Diposit Digital de Documents de la UAB

A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs

Author: Bailey
Barski
Barski
Boeva
Chen
Fejes
Finn Drabløs
Johnson
Jothi
Kharchenko
Laajala
Matys
Mikkelsen
Morten Beck Rye
Park
Pepke
Pål Sætrom
Robertson
Rozowsky
Schones
Segal
Tanay
Tuteja
Valouev
Wallerman
Wilbanks
Zhang
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

Chromatin immunoprecipitation (ChIP) followed by high throughput sequencing (ChIP-seq) is rapidly becoming the method of choice for discovering cell-specific transcription factor binding locations genome wide. By aligning sequenced tags to the genome, binding locations appear as peaks in the tag profile. Several programs have been designed to identify such peaks, but program evaluation has been difficult due to the lack of benchmark data sets. We have created benchmark data sets for three transcription factors by manually evaluating a selection of potential binding regions that cover typical variation in peak size and appearance. Performance of five programs on this benchmark showed, first, that external control or background data was essential to limit the number of false positive peaks from the programs. However, >80% of these peaks could be manually filtered out by visual inspection alone, without using additional background data, showing that peak shape information is not fully exploited in the evaluated programs. Second, none of the programs returned peak-regions that corresponded to the actual resolution in ChIP-seq data. Our results showed that ChIP-seq peaks should be narrowed down to 100–400 bp, which is sufficient to identify unique peaks and binding sites. Based on these results, we propose a meta-approach that gives improved peak definitions