Search CORE

HAL: Hyper Article en Ligne

HAL Descartes

HAL-MINES ParisTech

Soft skills: An important asset acquired from organizing regional student group activities

Author: Abeel T.
Abeel T.
de Ridder J.
de Ridder J.
Meysman P.
Meysman P.
Oluwagbemi O.
Oluwagbemi O.
Publication venue: Public Library of Science
Publication date: 01/01/2014
Field of study

Contributing to a student organization, such as the International Society for Computational Biology Student Council (ISCB-SC) and its Regional Student Group (RSG) program, takes time and energy. Both are scarce commodities, especially when you are trying to find your place in the world of computational biology as a graduate student. It comes as no surprise that organizing ISCB-SC-related activities sometimes interferes with day-to-day research and shakes up your priority list. However, we unanimously agree that the rewards, both in the short as well as the long term, make the time spent on these extracurricular activities more than worth it. In this article, we will explain what makes this so worthwhile: soft skills

Middlesex University Research Repository

Features of mammalian microRNA promoters emerge from polymerase II chromatin immunoprecipitation data

Author: A Bird
A Marson
A Rodriguez
A Sandelin
A Sandelin
AP Bird
Arindam Bhattacharjee
Ben Gordon
CD Schmid
Christopher K. Patil
D Karolchik
David L. Corcoran
DL Corcoran
DP Bartel
DS Prestridge
DS Prestridge
E Wingender
F Ozsolak
GD Stormo
GG Loots
GM Borchert
H Wakaguri
HJ Bussemaker
HK Saini
I Rigoutsos
IP Ioshikhes
J Taylor
J van Helden
K Woods
KD Taganov
Kusum V. Pandit
M Gardiner-Garden
M Megraw
MJ Buck
MP Brown
N Liu
Naftali Kaminski
NJ Martinez
O Chapelle
P Carninci
P Jin
Panayiotis V. Benos
R Gangal
R Shalgi
RM Kuhn
S Baskerville
S Fujita
S Mahony
S Mahony
SJ Cooper
T Abeel
T Thum
T Wang
TA Down
U Ohler
U Ohler
WJ Kent
X Zhao
X Zhou
Y Lee
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/04/2009
Field of study

Background: MicroRNAs (miRNAs) are short, non-coding RNA regulators of protein coding genes. miRNAs play a very important role in diverse biological processes and various diseases. Many algorithms are able to predict miRNA genes and their targets, but their transcription regulation is still under investigation. It is generally believed that intragenic miRNAs (located in introns or exons of protein coding genes) are co-transcribed with their host genes and most intergenic miRNAs transcribed from their own RNA polymerase II (Pol II) promoter. However, the length of the primary transcripts and promoter organization is currently unknown. Methodology: We performed Pol II chromatin immunoprecipitation (ChIP)-chip using a custom array surrounding regions of known miRNA genes. To identify the true core transcription start sites of the miRNA genes we developed a new tool (CPPP). We showed that miRNA genes can be transcribed from promoters located several kilobases away and that their promoters share the same general features as those of protein coding genes. Finally, we found evidence that as many as 26% of the intragenic miRNAs may be transcribed from their own unique promoters. Conclusion: miRNA promoters have similar features to those of protein coding genes, but miRNA transcript organization is more complex. © 2009 Corcoran et al

Public Library of Science (PLOS)

D-Scholarship@Pitt

Discriminative and informative features for biomolecular text mining with ensemble feature selection

Author: Reverter
S. Van Landeghem
T. Abeel
Y. Saeys
Y. Van de Peer
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

Motivation: In the field of biomolecular text mining, black box behavior of machine learning systems currently limits understanding of the true nature of the predictions. However, feature selection (FS) is capable of identifying the most relevant features in any supervised learning setting, providing insight into the specific properties of the classification algorithm. This allows us to build more accurate classifiers while at the same time bridging the gap between the black box behavior and the end-user who has to interpret the results

CiteSeerX

Ghent University Academic Bibliography

TSpace (University of Toronto)

Highlights from the 6th International Society for Computational Biology Student Council Symposium at the 18th Annual International Conference on Intelligent Systems for Molecular Biology

Author: Christiaan Klijn
F Xin
G Macintyre
H Hettling
J Behr
J Larson
M McDowall
Magali Michaut
MP Magariños
P Surendran
P Vanhee
S Banton
S Carmona
S Shah
T Abeel
Thomas Abeel
XF Li
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

This meeting report gives an overview of the keynote lectures and a selection of the student oral and poster presentations at the 6th International Society for Computational Biology Student Council Symposium that was held as a precursor event to the annual international conference on Intelligent Systems for Molecular Biology (ISMB). The symposium was held in Boston, MA, USA on July 9th, 2010

Springer - Publisher Connector

TU Delft Repository

Brage Nord Open Research Archive

The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea

Author: A D’Hont
A Mitchell
AJ Drummond
Alexander Jueterbock
Amy Mraz
Anna Kersting
AW De Cock
B Lallemand
B Li
BC Meyers
Bram Verhelst
BW Touchette
C Addo-Quaye
C den Hartog
C Hoede
C Trapnell
Carlos M. Duarte
Chiara Lauritano
Christoffer Boström
CM Duarte
DB Jaffe
DH Les
Emanuela Dattolo
Emanuele De Paoli
Erich Bornberg-Bauer
F Maumus
Florian Maumus
G Michel
G Ostlund
Gabriele Procaccini
Gareth A. Pearson
Gurvan Michel
H Jiang
H Quesneville
Hope Tice
J Collen
J Fostier
J Gouzy
J Heled
J Kuo
JA Berry
JA Doyle
Jane Grimwood
Janina Brakel
Jeanine L. Olsen
Jeremy Schmutz
Jerry W. Jenkins
JJ Doyle
JL Olsen
Jonas Collen
JT Clarke
JW Fourqurean
K Vanneste
K Vanneste
K Vanneste
Kevin Vanneste
L Li
L Nauheimer
L Sterck
L Zhang
LJ Pillitteri
M Gandolfo
M Stanke
M Van Bel
M Waycott
MA Beilstein
Mansi Chovatia
Mats Töpel
MG Grabherr
Mojgan Amirebrahimi
N Goldman
Pamela J. Green
PI Macreadie
Pierre Rouzé
R Costanza
RA Chavez Montes
RJ Orth
Rolf Lohaus
RS Aquino
S Degroeve
S Falcon
S Foissac
S Guindon
S Mazzuca
S Proost
S Proost
SA Smith
Simon Dittami
SR Hanson
SU Franssen
SW Burge
T Abeel
T Flutre
T Janssen
TBH Reusch
Thierry Tonon
Thorsten B. H. Reusch
Till Bayer
TM Lowe
V Ter-Hovhannisyan
W Crepet
W Li
W Wang
WJD Iles
Wytze T. Stam
Y Wang
Yao-Cheng Lin
Yves Van de Peer
Z Xi
Z Yang
ZA Popper
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Seagrasses colonized the sea(1) on at least three independent occasions to form the basis of one of the most productive and widespread coastal ecosystems on the planet(2). Here we report the genome of Zostera marina (L.), the first, to our knowledge, marine angiosperm to be fully sequenced. This reveals unique insights into the genomic losses and gains involved in achieving the structural and physiological adaptations required for its marine lifestyle, arguably the most severe habitat shift ever accomplished by flowering plants. Key angiosperm innovations that were lost include the entire repertoire of stomatal genes(3), genes involved in the synthesis of terpenoids and ethylene signalling, and genes for ultraviolet protection and phytochromes for far-red sensing. Seagrasses have also regained functions enabling them to adjust to full salinity. Their cell walls contain all of the polysaccharides typical of land plants, but also contain polyanionic, low-methylated pectins and sulfated galactans, a feature shared with the cell walls of all macroalgae(4) and that is important for ion homoeostasis, nutrient uptake and O-2/CO2 exchange through leaf epidermal cells. The Z. marina genome resource will markedly advance a wide range of functional ecological studies from adaptation of marine ecosystems under climate warming(5,6), to unravelling the mechanisms of osmoregulation under high salinities that may further inform our understanding of the evolution of salt tolerance in crop plants(7)

OceanRep

NILU Brage

Archivio istituzionale della ricerca - Università degli Studi di Udine

University of Groningen

NORA - Norwegian Open Research Archives

Proceedings - University of Groningen

ARTS repository - University of Groningen

Ghent University Academic Bibliography

eScholarship - University of California

Archivsystem Ask23

Sapientia (Univ. do Algarve)

HAL: Hyper Article en Ligne

Swepub

UPSpace at the University of Pretoria

Dissertations of the University of Groningen

Comparative and Functional Genomics of Rhodococcus opacus PD630 for Biofuels Development

Author: A Arakaki
A Argyrou
A Marchler-Bauer
A Pohlmann
A Stamatakis
AF Alvarez
AI Saeed
AJ Enright
AK Pandey
AL Delcher
AL Delcher
Alex L. B. Leach
AM Waterhouse
Anthony C. DeBono
Anthony J. Sinskey
AR Horswill
Brian Desany
Bruce W. Birren
C Kaddor
Chinnappa D. Kodira
Christine Dancel
Christopher A. Desjardins
D Jendrossek
D Portevin
D Post
DE Vance
Dirk Gevers
DL Rainwater
DL Rainwater
DP MacEachran
E Puglisi
E Schwartz
E Schweizer
E Severi
E Severi
E Vimr
ER Goncalves
F Abascal
F David
F-F Hsu
G Timmins
HM Alvarez
HM Alvarez
HM Alvarez
I Letunic
I Matsunaga
IB Lomakin
IC Sutcliffe
Ion Ghiviriga
J Hughes
J Rengarajan
Jason P. Affourtit
Jason W. Holder
Jeremy Zucker
Jil C. Ulrich
JM Mathieu
K Isono
K Katoh
K Kurosawa
K Lagesen
K Raman
KC Yam
KR Robrock
L Diacovich
L Li
M Brudno
M Green
M Hernandez
M Seto
M Wu
MA Larkin
MJ de Hoon
MP Mansour
MP McLeod
O Lenz
O Zimhony
OP Peoples
PA Lessard
Paul A. Godfrey
Paul M. Richardson
PD Karp
PR Romero
Qiandong Zeng
R Edgar
R Gande
R Gande
R Kalscheuer
R Van der Geize
RD Finn
RL Hunter
S Griffiths-Jones
S Guindon
S Kikuchi
S Rajakumari
SC Slater
SK Parker
T Chopra
T Lee
T Sirakova
TD Sirakova
TD Sirakova
Thomas Abeel
TM Lowe
U Grafe
X Yang
Y Hu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011
Field of study

The Actinomycetales bacteria Rhodococcus opacus PD630 and Rhodococcus jostii RHA1 bioconvert a diverse range of organic substrates through lipid biosynthesis into large quantities of energy-rich triacylglycerols (TAGs). To describe the genetic basis of the Rhodococcus oleaginous metabolism, we sequenced and performed comparative analysis of the 9.27 Mb R. opacus PD630 genome. Metabolic-reconstruction assigned 2017 enzymatic reactions to the 8632 R. opacus PD630 genes we identified. Of these, 261 genes were implicated in the R. opacus PD630 TAGs cycle by metabolic reconstruction and gene family analysis. Rhodococcus synthesizes uncommon straight-chain odd-carbon fatty acids in high abundance and stores them as TAGs. We have identified these to be pentadecanoic, heptadecanoic, and cis-heptadecenoic acids. To identify bioconversion pathways, we screened R. opacus PD630, R. jostii RHA1, Ralstonia eutropha H16, and C. glutamicum 13032 for growth on 190 compounds. The results of the catabolic screen, phylogenetic analysis of the TAGs cycle enzymes, and metabolic product characterizations were integrated into a working model of prokaryotic oleaginy.Cambridge-MIT InstituteMassachusetts Institute of Technology. (Seed Grant program)Shell Oil CompanyNational Institute of Allergy and Infectious Diseases (U.S.)United States. National Institutes of HealthNational Institutes of Health. Department of Health and Human Services (Contract No. HHSN272200900006C

CiteSeerX

Public Library of Science (PLOS)

DSpace@MIT

The Francis Crick Institute

The impact of sequence length and number of sequences on promoter prediction performance

Author: C Cortes
D Dineen
J Han
J Zeng
JR Landis
K Florquin
L Breiman
Luiz H de C Merschmann
M Kuhn
N Japkowicz
P Baldi
P Meysman
R Yamashita
Renata Guerra-Sá
S Carvalho
Sávio G Carvalho
T Abeel
T Abeel
T Abeel
TM Cover
U Ohler
V Grishkevich
Y Gan
Y Gan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

BACKGROUND: The advent of rapid evolution on sequencing capacity of new genomes has evidenced the need for data analysis automation aiming at speeding up the genomic annotation process and reducing its cost. Given that one important step for functional genomic annotation is the promoter identification, several studies have been taken in order to propose computational approaches to predict promoters. Different classifiers and characteristics of the promoter sequences have been used to deal with this prediction problem. However, several works in literature have addressed the promoter prediction problem using datasets containing sequences of 250 nucleotides or more. As the sequence length defines the amount of dataset attributes, even considering a limited number of properties to characterize the sequences, datasets with a high number of attributes are generated for training classifiers. Once high-dimensional datasets can degrade the classifiers predictive performance or even require an infeasible processing time, predicting promoters by training classifiers from datasets with a reduced number of attributes, it is essential to obtain good predictive performance with low computational cost. To the best of our knowledge, there is no work in literature that verified in a systematic way the relation between the sequences length and the predictive performance of classifiers. Thus, in this work, we have evaluated the impact of sequence length variation and training dataset size (number of sequences) on the predictive performance of classifiers. RESULTS: We have built sixteen datasets composed of different sized sequences (ranging in length from 12 to 301 nucleotides) and evaluated them using the SVM, Random Forest and k-NN classifiers. The best predictive performances reached by SVM and Random Forest remained relatively stable for datasets composed of sequences varying in length from 301 to 41 nucleotides, while k-NN achieved its best performance for the dataset composed of 101 nucleotides. We have also analyzed, using sequences composed of only 41 nucleotides, the impact of increasing the number of sequences in a dataset on the predictive performance of the same three classifiers. Datasets containing 14,000, 80,000, 100,000 and 120,000 sequences were built and evaluated. All classifiers achieved better predictive performance for datasets containing 80,000 sequences or more. CONCLUSION: The experimental results show that several datasets composed of shorter sequences achieved better predictive performance when compared with datasets composed of longer sequences, and also consumed a significantly shorter processing time. Furthermore, increasing the number of sequences in a dataset proved to be beneficial to the predictive power of classifiers

RIUFOP (Univ. Federal de Ouro Preto)

ProSOM: core promoter prediction based on unsupervised clustering of DNA physical profiles

Author: Aerts
Bajic
Bajic
Bajic
Baldi
Brent
Carninci
Chen
Choi
Davuluri
Deng
Down
Fickett
Florquin
Goni
Gross
Kanhere
Kawaji
Knudsen
Liolios
P. Rouze
Pedersen
Ponger
Prestridge
Reese
Sandelin
Scherf
Sonnenburg
T. Abeel
Wang
Wang
Won
Y. Saeys
Y. Van de Peer
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

Motivation: More and more genomes are being sequenced, and to keep up with the pace of sequencing projects, automated annotation techniques are required. One of the most challenging problems in genome annotation is the identification of the core promoter. Because the identification of the transcription initiation region is such a challenging problem, it is not yet a common practice to integrate transcription start site prediction in genome annotation projects. Nevertheless, better core promoter prediction can improve genome annotation and can be used to guide experimental work

Ghent University Academic Bibliography

Comparative analysis of mycobacterium and related actinomycetes yields insight into the evolution of mycobacterium tuberculosis pathogenesis

Author: Abeel Thomas
Dolganov Gregory
Galagan James
Iacobelli-Martinez Milena
Kidd Matthew J
Koehrsen Mike
Maer Andreia M
McGuire Abigail Manson
Park Sang Tae
Peterson Matthew
Raman Sahadevan
Regev Aviv
Riley Robert
Schoolnik Gary K
Sisk Peter
Stolte Christian
Wapinski Ilan
Weiner Brian
White Jared
Yamamoto Robert T
Zucker Jeremy
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background The sequence of the pathogen <it>Mycobacterium tuberculosis </it>(<it>Mtb</it>) strain <it>H37Rv </it>has been available for over a decade, but the biology of the pathogen remains poorly understood. Genome sequences from other <it>Mtb </it>strains and closely related bacteria present an opportunity to apply the power of comparative genomics to understand the evolution of <it>Mtb </it>pathogenesis. We conducted a comparative analysis using 31 genomes from the Tuberculosis Database (TBDB.org), including 8 strains of <it>Mtb </it>and <it>M. bovis</it>, 11 additional Mycobacteria, 4 Corynebacteria, 2 Streptomyces, <it>Rhodococcus jostii RHA1, Nocardia farcinia, Acidothermus cellulolyticus, Rhodobacter sphaeroides, Propionibacterium acnes</it>, and <it>Bifidobacterium longum</it>. Results Our results highlight the functional importance of lipid metabolism and its regulation, and reveal variation between the evolutionary profiles of genes implicated in saturated and unsaturated fatty acid metabolism. It also suggests that DNA repair and molybdopterin cofactors are important in pathogenic Mycobacteria. By analyzing sequence conservation and gene expression data, we identify nearly 400 conserved noncoding regions. These include 37 predicted promoter regulatory motifs, of which 14 correspond to previously validated motifs, as well as 50 potential noncoding RNAs, of which we experimentally confirm the expression of four. Conclusions Our analysis of protein evolution highlights gene families that are associated with the adaptation of environmental Mycobacteria to obligate pathogenesis. These families include fatty acid metabolism, DNA repair, and molybdopterin biosynthesis. Our analysis reinforces recent findings suggesting that small noncoding RNAs are more common in Mycobacteria than previously expected. Our data provide a foundation for understanding the genome and biology of <it>Mtb </it>in a comparative context, and are available online and through TBDB.org.</p

DSpace@MIT

Harvard University - DASH

Springer - Publisher Connector