Search CORE

132 research outputs found

eHive: An Artificial Intelligence workflow system for genomic analysis

Author: Beal Kathryn
Fitzgerald Stephen
Flicek Paul
Gordon Leo
Herrero Javier
Schuster Michael
Severin Jessica
Ureta-Vidal Abel
Vilella Albert J
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The Ensembl project produces updates to its comparative genomics resources with each of its several releases per year. During each release cycle approximately two weeks are allocated to generate all the genomic alignments and the protein homology predictions. The number of calculations required for this task grows approximately quadratically with the number of species. We currently support 50 species in Ensembl and we expect the number to continue to grow in the future. Results We present eHive, a new fault tolerant distributed processing system initially designed to support comparative genomic analysis, based on blackboard systems, network distributed autonomous agents, dataflow graphs and block-branch diagrams. In the eHive system a MySQL database serves as the central blackboard and the autonomous agent, a Perl script, queries the system and runs jobs as required. The system allows us to define dataflow and branching rules to suit all our production pipelines. We describe the implementation of three pipelines: (1) pairwise whole genome alignments, (2) multiple whole genome alignments and (3) gene trees with protein homology inference. Finally, we show the efficiency of the system in real case scenarios. Conclusions eHive allows us to produce computationally demanding results in a reliable and efficient way with minimal supervision and high throughput. Further documentation is available at: <url>http://www.ensembl.org/info/docs/eHive/</url>.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

UCL Discovery

Mother-to-child Transmission of Human T-cell Lymphotropic Virus Type 1

Author: Ando
Edmond
Furnia
Hideaki Masuzaki
Hino
Hiroshi Doi
Hiroyuki Moriuchi
Iwanaga
Katamine
Katsuya
Li
Maloney
Orland
Shigeru Katamine
Takezaki
Ureta-Vidal
Vine
Wiktor
Yamamoto
Yoshida
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 01/02/2013
Field of study

Crossref

Nagasaki University's Academic Output SITE: NAOSITE

Institutional Repositories DataBase (IRDB)

Nagasaki university's Academic Output SITE

EMMA—mouse mutant resources for the international scientific community

Author: A. Pickard
A. Ureta-Vidal
A.-M. Mallon
C.-K. Chen
Collins
D. Smedley
Davisson
E. Birney
Eppig
F. Scavizzi
G. P. Tocchini-Valentini
G. Soulat
J. Leyritz
J. Sengerova
K. Pickford
Knight
L. Montoliu
M. Fray
M. H. de Angelis
M. Hagn
M. Massimi
M. Raspa
Marschall
P. Wilkinson
R. H. Butler
R. Matteoni
S. Brown
S. Fessele
S. Marschall
Smedley
V. Larrigaldie
Y. Herault
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

The laboratory mouse is the premier animal model for studying human disease and thousands of mutants have been identified or produced, most recently through gene-specific mutagenesis approaches. High throughput strategies by the International Knockout Mouse Consortium (IKMC) are producing mutants for all protein coding genes. Generating a knock-out line involves huge monetary and time costs so capture of both the data describing each mutant alongside archiving of the line for distribution to future researchers is critical. The European Mouse Mutant Archive (EMMA) is a leading international network infrastructure for archiving and worldwide provision of mouse mutant strains. It operates in collaboration with the other members of the Federation of International Mouse Resources (FIMRe), EMMA being the European component. Additionally EMMA is one of four repositories involved in the IKMC, and therefore the current figure of 1700 archived lines will rise markedly. The EMMA database gathers and curates extensive data on each line and presents it through a user-friendly website. A BioMart interface allows advanced searching including integrated querying with other resources e.g. Ensembl. Other resources are able to display EMMA data by accessing our Distributed Annotation System server. EMMA database access is publicly available at http://www.emmanet.org

TreeFam: 2008 Update

Author: A. Coghlan
A. J. Vilella
A. Moses
A. Ureta-Vidal
Brown
Chen
Dehal
Edgar
Fitch
Guindon
H. Li
Haas
Haas
Hertz-Fowler
Huerta-Cepas
J. Qin
J. Ruan
J. Wang
J.-K. Heriche
K. Kristiansen
Koonin
Krishnamurthy
L. Bolund
L. J. M. Coin
Li
Meinel
O'Brien
Povey
R. Durbin
R. Li
S. Vang
T. Liu
Tatusov
Wu
Y. Guo
Y. Hu
Yu
Z. Chen
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

TreeFam (http://www.treefam.org) was developed to provide curated phylogenetic trees for all animal gene families, as well as orthologue and paralogue assignments. Release 4.0 of TreeFam contains curated trees for 1314 families and automatically generated trees for another 14 351 families. We have expanded TreeFam to include 25 fully sequenced animal genomes, as well as four genomes from plant and fungal outgroup species. We have also introduced more accurate approaches for automatically grouping genes into families, for building phylogenetic trees, and for inferring orthologues and paralogues. The user interface for viewing phylogenetic trees and family information has been improved. Furthermore, a new perl API lets users easily extract data from the TreeFam mysql database

Crossref

PubMed Central

University of Southern Denmark Research Output

University of Melbourne Institutional Repository

University of Queensland eSpace

Ensembl 2005

Author: Andrews D
Birney E
Caccamo M
Cameron G
Chen Y
Clamp M
Clarke L
Coates G
Cox T
Cunningham F
Curwen V
Cutts T
Down T
Durbin R
Fernandez-Suarez X M
Gilbert J
Hammond M
Herrero J
Hotz H
Howe K
Hubbard T
Hubbard Tim
Iyer V
Jekosch K
Kahari A
Kasprzyk A
Keefe D
Keenan S
Kokocinsci F
London D
Longden I
McVicker G
Meidl P
Melsopp C
Potter S
Proctor G
Rae M
Rios D
Schuster M
Searle S
Severin J
Slater G
Smedley D
Smith J
Spooner W
Stabenau A
Stalker J
Storey R
Trevanion S
Ureta-Vidal A
Vogel J
White S
Woodwark C
Publication venue: Oxford University Press
Publication date: 17/12/2004
Field of study

The Ensembl (http://www.ensembl.org/) project provides a comprehensive and integrated source of annotation of large genome sequences. Over the last year the number of genomes available from the Ensembl site has increased by 7 to 16, with the addition of the six vertebrate genomes of chimpanzee, dog, cow, chicken, tetraodon and frog and the insect genome of honeybee. The majority have been annotated automatically using the Ensembl gene build system, showing its flexibility to reliably annotate a wide variety of genomes. With the increased number of vertebrate genomes, the comparative analysis provided to users has been greatly improved, with new website interfaces allowing annotation of different genomes to be directly compared. The Ensembl software system is being increasingly widely reused in different projects showing the benefits of a completely open approach to software development and distribution

CiteSeerX

Crossref

PubMed Central

King's Research Portal

Local Gene Regulation Details a Recognition Code within the LacI Transcriptional Factor Family

Author: A Glasfeld
A Sandelin
A Sarai
A Ureta-Vidal
AE Kazakov
AV Morozov
BM Hall
BW Matthews
C Francke
CE Bell
CG Kalodimos
CI Jørgensen
CO Pabo
CO Pabo
EJ Alm
Eric J. Alm
FM Camas
Francisco M. Camas
G Kolesov
G Paillard
Gary D. Stormo
GP Smith
J Boch
J Castresana
J Nardelli
J Sartorius
J Schultz
JL Betz
JO Korbel
JR Desjarlais
Juan F. Poyatos
L Milk
M Lewis
M Lewis
M Lewis
M Perros
M Suzuki
MA Schumacher
MA Schumacher
MJ Moscou
MJ Weickert
MM Gromiha
NC Seeman
NM Luscombe
P Baldi
PB Warren
PV Benos
R Hershberg
RC Edgar
RK Salinas
S Mahony
S Mahony
SA Wolfe
SJ Maerlk
T Sera
TA Desai
V Espinosa Angarica
W Thompson
WW Wasserman
Y Choo
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2010
Field of study

The specific binding of regulatory proteins to DNA sequences exhibits no clear patterns of association between amino acids (AAs) and nucleotides (NTs). This complexity of protein-DNA interactions raises the question of whether a simple set of wide-coverage recognition rules can ever be identified. Here, we analyzed this issue using the extensive LacI family of transcriptional factors (TFs). We searched for recognition patterns by introducing a new approach to phylogenetic footprinting, based on the pervasive presence of local regulation in prokaryotic transcriptional networks. We identified a set of specificity correlations –determined by two AAs of the TFs and two NTs in the binding sites– that is conserved throughout a dominant subgroup within the family regardless of the evolutionary distance, and that act as a relatively consistent recognition code. The proposed rules are confirmed with data of previous experimental studies and by events of convergent evolution in the phylogenetic tree. The presence of a code emphasizes the stable structural context of the LacI family, while defining a precise blueprint to reprogram TF specificity with many practical applications.Ministerio de Ciencia e Innovación, Spain (Formación de Profesorado Universitario fellowship)Ministerio de Ciencia e Innovación, Spain (grant BFU2008-03632/BMC)Madrid (Spain : Region) (grant CCG08-CSIC/SAL-3651

CiteSeerX

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

Digital.CSIC

Ensembl 2008.

Author: Aken B.
Ballester Benoit
Beal K.
Birney E.
Caccamo M.
Chen Y.
Clarke L.
Coates G.
Cox T.
Cunningham F.
Curwen V.
Cutts T.
Down T.
Durbin R.
Dyer S.
Eyre T.
Fernandez-Banet J.
Fernandez-Suarez X. M.
Fitzgerald S.
Flicek P.
Graf S.
Gräf S.
Haider S.
Hammond M.
Herrero J.
Holland R.
Howe K.
Hubbard J.
Jenkinson A.
Johnson N.
Kahari A.
Kasprzyk A.
Keefe D.
Kokocinski F.
Kulesha E.
Kähäri A.
Lawson D.
Longden I.
Megy K.
Meidl P.
Overduin B.
Parker A.
Pritchard B.
Prlic A.
Proctor G.
Rice S.
Rios D.
Schuster M.
Sealy I.
Searle S.
Slater G.
Smedley D.
Smith J.
Spudich G.
Trevanion S.
Ureta-Vidal A.
Vilella A.
Vogel J.
White S.
Wood M.
Publication venue: Nucleic Acids Res
Publication date: 13/11/2007
Field of study

The Ensembl project (http://www.ensembl.org) is a comprehensive genome information system featuring an integrated set of genome annotation, databases and other information for chordate and selected model organism and disease vector genomes. As of release 47 (October 2007), Ensembl fully supports 35 species, with preliminary support for six additional species. New species in the past year include platypus and horse. Major additions and improvements to Ensembl since our previous report include extensive support for functional genomics data in the form of a specialized functional genomics database, genome-wide maps of protein-DNA interactions and the Ensembl regulatory build; support for customization of the Ensembl web interface through the addition of user accounts and user groups; and increased support for genome resequencing. We have also introduced new comparative genomics-based data mining options and report on the continued development of our software infrastructure

Hal - Université Grenoble Alpes

King's Research Portal

Ensembl 2007

Author: Aken B. L.
Ballester B.
Beal K.
Birney E.
Caccamo M.
Chen Y.
Clarke L.
Coates G.
Cox T.
Cunningham F.
Curwen V.
Cutts T.
Down T.
Durbin R.
Dyer S. C.
Fernandez-Banet J.
Fernandez-Suarez X. M.
Fitzgerald S.
Flicek P.
Graf S.
Haider S.
Hammond M.
Herrero J.
Holland R.
Howe K.
Howe K.
Hubbard T. J. P.
Johnson N.
Kahari A.
Kasprzyk A.
Keefe D.
Kokocinski F.
Kulesha E.
Lawson D.
Longden I.
Megy K.
Meidl P.
Melsopp C.
Ouverdin B.
Parker A.
Prlic A.
Proctor G.
Rice S.
Rios D.
Schuster M.
Sealy I.
Searle S.
Severin J.
Slater G.
Smedley D.
Smith J.
Spudich G.
Trevanion S.
Ureta-Vidal A.
Vilella A.
Vogel J.
White S.
Wood M.
Publication venue: Oxford University Press
Publication date: 01/12/2006
Field of study

The Ensembl () project provides a comprehensive and integrated source of annotation of chordate genome sequences. Over the past year the number of genomes available from Ensembl has increased from 15 to 33, with the addition of sites for the mammalian genomes of elephant, rabbit, armadillo, tenrec, platypus, pig, cat, bush baby, common shrew, microbat and european hedgehog; the fish genomes of stickleback and medaka and the second example of the genomes of the sea squirt (Ciona savignyi) and the mosquito (Aedes aegypti). Some of the major features added during the year include the first complete gene sets for genomes with low-sequence coverage, the introduction of new strain variation data and the introduction of new orthology/paralog annotations based on gene trees

HAL AMU

PubMed Central

UCL Discovery

King's Research Portal

No evidence of vertical transmission of HTLV-I in bottle-fed children

Author: Achiléa L BITTENCOURT
BARBOSA H.S.
BITTENCOURT A.L.
BITTENCOURT A.L.
Celia PEDROSO
DULIEGE A.M.
Ester C. SABINO
HINO S.
HIRATA M.
IKEDA K.
KIND C.
KUSUHARA K.
LA GRENADE L.
Licia MOREIRA
LIN H.
MAGUIRE A.
MANDELBROT L.
Maria Cecília COSTA
MONPLAISIR N.
OKI T.
SABINO E.C.
SUGIYAMA H.
TAKAHASHI K.
URETA-VIDAL A.
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Crossref