4 research outputs found

Evolving DNA motifs to predict GeneChip probe performance

Author: AP Harrison
BJ Ross
DJ Montana
F Naef
GJ Upton
HG Beyer
JR Koza
M Brameier
M O'Neill
MA Stalteri
ML Wong
NJ Radcliff
PA Whigham
RI McKay
T Barrett
T Bäck
T Handstad
WB Langdon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

Background: Affymetrix High Density Oligonucleotide Arrays (HDONA) simultaneously measure expression of thousands of genes using millions of probes. We use correlations between measurements for the same gene across 6685 human tissue samples from NCBI's GEO database to indicate the quality of individual HG-U133A probes. Low correlation indicates a poor probe. Results: Regular expressions can be automatically created from a Backus-Naur form (BNF) context-free grammar using strongly typed genetic programming. Conclusion: The automatically produced motif is better at predicting poor DNA sequences than an existing human-generated RE, suggesting runs of Cytosine and Guanine and mixtures should all be avoided. © 2009 Langdon and Harrison; licensee BioMed Central Ltd
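The abstract's quality signal (low correlation with other measurements of the same gene indicates a poor probe) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the exact scoring, so the use of Pearson correlation averaged over the other probes, the function names, and the toy data are all assumptions made for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def probe_quality(probeset):
    """Score each probe by its mean correlation with the other probes.

    `probeset` maps a hypothetical probe id to that probe's measurements
    across samples; all probes are assumed to target the same gene.
    """
    scores = {}
    for pid, values in probeset.items():
        others = [v for other, v in probeset.items() if other != pid]
        scores[pid] = sum(pearson(values, v) for v in others) / len(others)
    return scores


# Toy data invented for illustration: three probes, five samples.
toy = {
    "probe_a": [1.0, 2.0, 3.0, 4.0, 5.0],
    "probe_b": [1.1, 2.1, 2.9, 4.2, 5.0],
    "probe_c": [3.0, 1.0, 4.0, 1.0, 5.0],
}
scores = probe_quality(toy)
worst = min(scores, key=scores.get)  # lowest mean correlation
print(worst, round(scores[worst], 2))
```

In this toy set, probe_c tracks the other two probes least closely, so it receives the lowest score and would be the one flagged as poor under this assumed scoring.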

University of Essex Research Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

UCL Discovery

PubMed Central

A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models

Author: A Andreeva
A Ben-Hur
A Karwath
A Shah
Alessandra Carbone
B Liu
B Qian
B Webb-Robertson
C Ferreira
C Leslie
D Higgins
F Wilcoxon
G Yona
Gerson Zaverucha
H Rangwala
H Saigo
J Bernardes
J Davis
J Gough
J Quinlan
J Soeding
J Weston
Juliana S Bernardes
L De Raedt
L Dehaspe
L Liao
N Shan-Hwei
Q Dong
Q Su
R Agrawal
R Hughey
R King
R Kuang
R Sadreyev
S Altschul
S Brenner
S Eddy
S Kawashima
S Lee
T Handstad
T Jaakkola
T Lingner
U Syed
V Alexandrov
V Atalay
Y Hou
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Remote homology detection is a hard computational problem. Most approaches have trained computational models by using either full protein sequences or multiple sequence alignments (MSA), including all positions. However, when we deal with proteins in the "twilight zone" we can observe that only some segments of sequences (motifs) are conserved. We introduce a novel logical representation that allows us to represent physico-chemical properties of sequences, conserved amino acid positions and conserved physico-chemical positions in the MSA. From this, Inductive Logic Programming (ILP) finds the most frequent patterns (motifs) and uses them to train propositional models, such as decision trees and support vector machines (SVM). Results: We use the SCOP database to perform our experiments by evaluating protein recognition within the same superfamily. Our results show that our methodology, when using SVM, performs significantly better than some of the state-of-the-art methods and comparably to others. In addition, our method provides a comprehensible set of logical rules that can help to understand what determines a protein function. Conclusions: The strategy of selecting only the most frequent patterns is effective for remote homology detection. This is possible through a suitable first-order logical representation of homologous properties, and through a set of frequent patterns, found by an ILP system, that summarizes essential features of protein functions.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

HAL-Inserm

PubMed Central

MixChIP: a probabilistic method for cell type specific protein-DNA binding analysis

Author: AM Newman
B Langmead
C Anghel
CA Meyer
CS Ross-Innes
D Venet
DA Liebner
H Lähdesmäki
Harri Lähdesmäki
J Clarke
J Gertz
JT Leek
K Liang
PF Peddi
PV Kharchenko
R Byrd
R Gaujoux
S Anders
SG Landt
Sini Rautio
SS Shen-Orr
T Bailey
T Erkkilä
T Handstad
The ENCODE Project Consortium
X Zheng
Y Li
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Genome-wide target analysis of NEUROD2 provides new insights into regulation of cortical projection neuron migration and differentiation

Author: A Longo
AE Ayoub
Alkan Kabakcioglu
AP Fong
AS Nord
B Chen
B Cubelos
B Langmead
C Schuurmans
CL Araya
D Karolchik
DP Leone
E Forster
E Leyva-Diaz
EA Alcamo
Efil Bayam
EP Consortium
F Guillemot
F Spitz
G Ince-Dunn
G Wilkinson
Gizem Guzelsoy
Gokhan Guner
Gulayse Ince-Dunn
Gulcan Semra Sahin
I Bormuth
J Ko
J Renaud
JA Cooper
JM Olson
K Tanabe
KR Rosenbloom
KY Kwan
L Roybon
L Yuan
LC Greig
M Ashburner
M Nieto
MB McCormick
MK Sakharkar
MM Andzelm
O Britanova
O Halperin-Barlev
P Machanick
P Mattar
R Kraut
RF Hevner
S Heinz
SA Wilke
T Handstad
T Hu
U Beffert
W Huang da
W Sikora-Wohlfeld
WL McKenna
Y Konishi
Y Zhang
Z Huang
Publication venue: 'Springer Science and Business Media LLC'

Crossref