Search CORE

188 research outputs found

Automated DNA Motif Discovery

Author: Graillet Olivia Sanchez
Harrison A. P.
Langdon W. B.
Publication venue
Publication date: 01/01/2010
Field of study

Ensembl's human non-coding and protein coding genes are used to automatically find DNA pattern motifs. The Backus-Naur form (BNF) grammar for regular expressions (RE) is used by genetic programming to ensure the generated strings are legal. The evolved motif suggests the presence of Thymine followed by one or more Adenines etc. early in transcripts indicate a non-protein coding gene. Keywords: pseudogene, short and microRNAs, non-coding transcripts, systems biology, machine learning, Bioinformatics, motif, regular expression, strongly typed genetic programming, context-free grammar.Comment: 12 pages, 2 figure

arXiv.org e-Print Archive

UCL Discovery

Publications at Bielefeld University

Using surveys of Affymetrix GeneChips to study antisense expression.

Author: Harrison AP
Rowsell J
Sanchez-Graillet O
Stalteri MA
Upton GJG
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2010
Field of study

We have used large surveys of Affymetrix GeneChip data in the public domain to conduct a study of antisense expression across diverse conditions. We derive correlations between groups of probes which map uniquely to the same exon in the antisense direction. When there are no probes assigned to an exon in the sense direction we find that many of the antisense groups fail to detect a coherent block of transcription. We find that only a minority of these groups contain coherent blocks of antisense expression suggesting transcription. We also derive correlations between groups of probes which map uniquely to the same exon in both sense and antisense direction. In some of these cases the locations of sense probes overlap with the antisense probes, and the sense and antisense probe intensities are correlated with each other. This configuration suggests the existence of a Natural Antisense Transcript (NAT) pair. We find the majority of such NAT pairs detected by GeneChips are formed by a transcript of an established gene and either an EST or an mRNA. In order to determine the exact antisense regulatory mechanism indicated by the correlation of sense probes with antisense probes, a further investigation is necessary for every particular case of interest. However, the analysis of microarray data has proved to be a good method to reconfirm known NATs, discover new ones, as well as to notice possible problems in the annotation of antisense transcripts

University of Essex Research Repository

CiteSeerX

Crossref

Directory of Open Access Journals

Publications at Bielefeld University

Identifying the impact of G-quadruplexes on Affymetrix 3' arrays using cloud computing.

Author: Harrison AP
Memon FN
Owen AM
Sanchez-Graillet O
Upton GJG
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2010
Field of study

A tetramer quadruplex structure is formed by four parallel strands of DNA/ RNA containing runs of guanine. These quadruplexes are able to form because guanine can Hoogsteen hydrogen bond to other guanines, and a tetrad of guanines can form a stable arrangement. Recently we have discovered that probes on Affymetrix GeneChips that contain runs of guanine do not measure gene expression reliably. We associate this finding with the likelihood that quadruplexes are forming on the surface of GeneChips. In order to cope with the rapidly expanding size of GeneChip array datasets in the public domain, we are exploring the use of cloud computing to replicate our experiments on 3' arrays to look at the effect of the location of G-spots (runs of guanines). Cloud computing is a recently introduced high-performance solution that takes advantage of the computational infrastructure of large organisations such as Amazon and Google. We expect that cloud computing will become widely adopted because it enables bioinformaticians to avoid capital expenditure on expensive computing resources and to only pay a cloud computing provider for what is used. Moreover, as well as financial efficiency, cloud computing is an ecologically-friendly technology, it enables efficient data-sharing and we expect it to be faster for development purposes. Here we propose the advantageous use of cloud computing to perform a large data-mining analysis of public domain 3' arrays

University of Essex Research Repository

Crossref

Directory of Open Access Journals

Publications at Bielefeld University

C-TrO: an ontology for summarization and aggregation of the level of evidence in clinical trials

Author: Cimiano Philipp
Ell Basil
Sanchez Graillet Olivia
Witte Christian
Publication venue
Publication date: 01/01/2019
Field of study

Sanchez Graillet O, Cimiano P, Witte C, Ell B. C-TrO: an ontology for summarization and aggregation of the level of evidence in clinical trials. In: Proceedings of the Workshop Ontologies and Data in Life Sciences (ODLS 2019) in the Joint Ontology Workshops' (JOWO 2019). 2019

Publications at Bielefeld University

A Novel Homozygous Mutation Affecting Integrin α6 in a Case of Junctional Epidermolysis Bullosa with Pyloric Atresia Detected In Utero by Ultrasound Examination

Author: Bardon Claudine Blanchet
Descamps Philippe
Gache Yannick
Lépinard Catherine
Meneguzzi Guerrino
Ortonne Jean Paul
Romero-Graillet Christine
Spadafora Anne
Publication venue: The Society for Investigative Dermatology, Inc. Published by Elsevier Inc.
Publication date: 30/11/1998
Field of study

Elsevier - Publisher Connector

Negation of protein-protein interactions: analysis and extraction

Author: Alfarano
Apweiler
Chapman
Chapman
Debusmann
Friedman
Givon
Järvinen
Kim
Knight
Leroy
Leroy
Massimo Poesio
Mutalik
Olivia Sanchez-Graillet
Settles
Temkin
Tottie
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2007
Field of study

Sanchez Graillet O, Poesio M. Negation of protein-protein interactions: analysis and extraction. Bioinformatics. 2007;23(13):i424--i432.**Motivation**: Negative information about protein–protein interactions—from uncertainty about the occurrence of an interaction to knowledge that it did not occur—is often of great use to biologists and could lead to important discoveries. Yet, to our knowledge, no proposals focusing on extracting such information have been proposed in the text mining literature. **Results**: In this work, we present an analysis of the types of negative information that is reported, and a heuristic-based system using a full dependency parser to extract such information. We performed a preliminary evaluation study that shows encouraging results of our system. Finally, we have obtained an initial corpus of negative protein–protein interactions as basis for the construction of larger ones. **Availability**:The corpus is available by request from the authors

Crossref

Publications at Bielefeld University

Melanocytes: interface of cell biology and pathobiology with a focus on nitric oxide and cGMP signaling

Author: C Romero-Graillet
K Ivanova
K Ivanova
K Ivanova
K Ivanova
Krassimira Ivanova
Pranab Kumar Das
Rupert Gerzer
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The PPI affix dictionary (PPIAD) and BioMethod Lexicon: importance of affixes and tags for recognition of entity mentions and experimental protein interactions

Author: Alfonso Valencia
Andrew Chatr-aryamontri
Ashish V Tendulkar
Florian Leitner
J Hakenberg
L Smith
M Krallinger
M Narayanaswamy
Martin Krallinger
O Sanchez-Graillet
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

PIE: an online prediction system for protein–protein interactions from text

Author: B.-T. Zhang
Chen
Cohen
Hoffmann
I.-H. Lee
Jang
Jensen
Kim
Krallinger
R. Sriram
S. Kim
S.-J. Kim
S.-Y. Shin
Sanchez-Graillet
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

Protein–protein interaction (PPI) extraction has been an important research topic in bio-text mining area, since the PPI information is critical for understanding biological processes. However, there are very few open systems available on the Web and most of the systems focus on keyword searching based on predefined PPIs. PIE (Protein Interaction information Extraction system) is a configurable Web service to extract PPIs from literature, including user-provided papers as well as PubMed articles. After providing abstracts or papers, the prediction results are displayed in an easily readable form with essential, yet compact features. The PIE interface supports more features such as PDF file extraction, PubMed search tool and network communication, which are useful for biologists and bio-system developers. The PIE system utilizes natural language processing techniques and machine learning methodologies to predict PPI sentences, which results in high precision performance for Web users. PIE is freely available at http://bi.snu.ac.kr/pie/

CiteSeerX

Crossref

PubMed Central