
EDP Sciences OAI-PMH repository (1.2.0)

Relationship between gene co-expression and probe localization on microarray slides

Author: AB Khodursky
BA Cohen
G Orphanides
G Zhu
J Courcelle
J Qian
JL DeRisi
L Florens
LA Manuelidis
MB Eisen
PJ Roy
PO Brown
PT Spellman
PT Spellman
RJ Cho
T Cremer
TR Hughes
YH Yang
Publication venue: BioMed Central
Publication date: 01/01/2003

BACKGROUND: Microarray technology allows simultaneous measurement of the expression of thousands of genes in a single experiment. This makes it a potentially useful tool for evaluating co-expression of genes and for extracting functional and chromosomal structural information about genes. RESULTS: In this work we studied the association between the co-expression of genes, their location on the chromosome, and their location on the microarray slides by analyzing a number of eukaryotic expression datasets derived from S. cerevisiae, C. elegans, and D. melanogaster. We find that in several different yeast microarray experiments the distribution of the number of gene pairs with correlated expression profiles, as a function of chromosomal spacing, is peaked at short separations and has two superimposed periodicities. The longer periodicity has a spacing of 22 genes (~42 Kb), and the shorter periodicity has a spacing of 2 genes (~4 Kb). CONCLUSION: The relative positioning of DNA probes on microarray slides and source plates introduces subtle but significant correlations between pairs of genes. Careful consideration of this spatial artifact is important for the analysis of microarray expression data. It is particularly relevant to recent microarray analyses that suggest that co-expressed genes cluster along chromosomes or are spaced by multiples of a fixed number of genes along the chromosome.

Springer - Publisher Connector

Integrated analysis of breast cancer cell lines reveals unique signaling pathways

Author: Barbara L Weber
Carolyn L Talcott
Jeffrey R Jackson
Joe W Gray
Keith R Laderoute
Laura M Heiser
Merrill Knapp
Nicholas J Wang
Paul T Spellman
Richard F Wooster
Safiyyah Ziyad
Sylvie Laquerre
Wen-Lin Kuo
Yinghui Guan
Zhi Hu
Publication venue: BioMed Central
Publication date: 01/01/2009

Mapping of sub-networks in the EGFR-MAPK pathway in different breast cancer cell lines reveals that PAK1 may be a marker for sensitivity to MEK inhibitors

CiteSeerX

Springer - Publisher Connector

Public Library of Science (PLOS)

UNT Digital Library

SMART: Unique splitting-while-merging framework for gene clustering

Author: A Thalamuthu
AD Lanterman
AE Teschendorff
AK Jain
Asoke K. Nandi
B Abu-Jamous
B Fritzke
B Fritzke
CR Lin
CS Wallace
D Dembele
D Jiang
David J. Roberts
G Celeux
H Akaike
J Qin
J Rissanen
KY Yeung
L Hubert
L Mavridis
L Zhao
MAT Figueiredo
P Tamayo
PT Spellman
R Xu
R Xu
RJ Cho
Rui Fa
S Bandyopadhyay
S Monti
S Wu
Sergio Gómez
T Kohonen
T Pramila
TR Golub
WM Rand
YJ Zhang
Publication venue: Public Library of Science (PLoS)
Publication date: 08/04/2014

Copyright © 2014 Fa et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Successful clustering algorithms are highly dependent on parameter settings. Clustering performance degrades significantly unless the parameters are properly set, and yet it is difficult to set these parameters a priori. To address this issue, we propose a unique splitting-while-merging clustering framework, named “splitting merging awareness tactics” (SMART), which does not require any a priori knowledge of either the number of clusters or even the possible range of this number. Unlike existing self-splitting algorithms, which over-cluster the dataset into a large number of clusters and then merge some similar clusters, our framework can split and merge clusters automatically during the process and produces the most reliable clustering results by intrinsically integrating many clustering techniques and tasks. The SMART framework is implemented with two distinct clustering paradigms in two algorithms: competitive learning and finite mixture model. Nevertheless, within the proposed SMART framework, many other algorithms can be derived for different clustering paradigms. The minimum message length algorithm is integrated into the framework as the clustering selection criterion. The usefulness of the SMART framework and its algorithms is tested on demonstration datasets and simulated gene expression datasets. Moreover, two real microarray gene expression datasets are studied using this approach. Across many performance metrics, all numerical results show that SMART is superior to the existing self-splitting algorithms and traditional algorithms it was compared with.
Three main properties of the proposed SMART framework are summarized as: (1) needing no parameters dependent on the respective dataset or a priori knowledge about the datasets, (2) extendible to many different applications, (3) offering superior performance compared with counterpart algorithms. National Institute for Health Researc

Brunel University Research Archive

Genotype List String: a grammar for describing HLA and KIR genotyping results in a text string

Author: Bochtler W
Cooley S
Gragert L
Guethlein LA
Heuer ML
Hollenbach JA
Mack SJ
Maiers M
Marsh SGE
Milius RP
Mueller CR
Pollack J
Robinson J
Spellman S
Trachtenberg EA
Publication venue: WILEY-BLACKWELL
Publication date: 12/07/2013

Knowledge of an individual's human leukocyte antigen (HLA) genotype is essential for modern medical genetics, and is crucial for hematopoietic stem cell and solid-organ transplantation. However, the high levels of polymorphism known for the HLA genes make it difficult to generate an HLA genotype that unambiguously identifies the alleles that are present at a given HLA locus in an individual. For the last 20 years, the histocompatibility and immunogenetics community has recorded this HLA genotyping ambiguity using allele codes developed by the National Marrow Donor Program (NMDP). While these allele codes may have been effective for recording an HLA genotyping result when initially developed, their use today results in increased ambiguity in an HLA genotype, and they are no longer suitable in the era of rapid allele discovery and ultra-high allele polymorphism. Here, we present a text string format capable of fully representing HLA genotyping results. This Genotype List (GL) String format is an extension of a proposed standard for reporting killer-cell immunoglobulin-like receptor (KIR) genotype data that can be applied to any genetic data that use a standard nomenclature for identifying variants. The GL String format uses a hierarchical set of operators to describe the relationships between alleles, lists of possible alleles, phased alleles, genotypes, lists of possible genotypes, and multilocus unphased genotypes, without losing typing information or increasing typing ambiguity. When used in concert with appropriate tools to create, exchange, and parse these strings, we anticipate that GL Strings will replace NMDP allele codes for reporting HLA genotypes
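The abstract above mentions a hierarchical set of operators but does not list them. The following is a minimal sketch of how such a string could be split into a nested structure. The delimiter characters and their precedence order (`^`, `|`, `+`, `~`, `/`, loosest to tightest) are an assumption recalled from the published GL String format rather than something stated in this listing, and `parse_gl` is a hypothetical helper name, not part of any official tool.

```python
# Assumed GL String delimiters, from loosest-binding to tightest-binding:
#   ^  separates loci            |  separates possible genotypes
#   +  separates gene copies     ~  joins phased alleles
#   /  separates possible alleles
GL_DELIMITERS = ["^", "|", "+", "~", "/"]


def parse_gl(gl, level=0):
    """Recursively split a GL String into nested {delimiter: [parts]} dicts.

    A level whose delimiter does not occur is skipped, so a single allele
    name comes back as a plain string.
    """
    if level == len(GL_DELIMITERS):
        return gl
    delim = GL_DELIMITERS[level]
    parts = [parse_gl(part, level + 1) for part in gl.split(delim)]
    return parts[0] if len(parts) == 1 else {delim: parts}


example = parse_gl("HLA-A*01:01/HLA-A*01:02+HLA-A*24:02")
```

Here `example` describes one gene copy that is either of two possible alleles, paired with a second, unambiguous copy. A real parser would also validate allele names against the nomenclature in use, which this sketch does not attempt.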

UCL Discovery

eScholarship - University of California

Beyond element-wise interactions: identifying complex interactions in biological processes

Author: A Kahvejian
AJ Tate
B Gourévitch
C Granger
C Zou
Christophe Ladroue
CJ Needham
CWJ Granger
H Parkinson
HW Mewes
J Geweke
J Pearl
J Peirce
J Shendure
J Wu
J Yu
JF Geweke
Jianfeng Feng
K Friston
K Sachs
Keith Kendrick
L Royer
M Ding
M Eichler
M Fletcher
MC Teixeira
N Wiener
O David
PT Spellman
R Aebersold
RA Horn
RS Wang
S Guo
S Klamt
S Mukherjee
Shuixia Guo
SM Kosslyn
T Barrett
Vladimir Brezina
Y Chen
Publication venue: Public Library of Science (PLoS)
Publication date: 22/09/2009

Background: Biological processes typically involve the interactions of a number of elements (genes, cells) acting on each other. Such processes are often modelled as networks whose nodes are the elements in question and whose edges are pairwise relations between them (transcription, inhibition). But more often than not, elements actually work cooperatively or competitively to achieve a task, or an element acts on the interaction between two others, as in the case of an enzyme controlling a reaction rate. We call these types of interaction “complex” and propose ways to identify them from time-series observations. Methodology: We use Granger causality, a measure of the interaction between two signals, to characterize the influence of an enzyme on a reaction rate. We extend its traditional formulation to the case of multi-dimensional signals in order to capture group interactions, and not only element interactions. Our method is extensively tested on simulated data and applied to three biological datasets: microarray data of the yeast Saccharomyces cerevisiae, local field potential recordings of two brain areas, and a metabolic reaction. Conclusions: Our results demonstrate that complex Granger causality can reveal new types of relation between signals and is particularly suited to biological data. Our approach raises some fundamental issues for the systems biology approach, since finding all complex causalities (interactions) is an NP-hard problem.

Public Library of Science (PLOS)

Warwick Research Archives Portal Repository

Carbohydrate structures of the human-immunodeficiency-virus (HIV) recombinant envelope glycoprotein gp120 produced in Chinese-hamster ovary cells

Author: J Solomon
L J Basa
M Larkin
M W Spellman
T Feizi
T Mizuochi
Publication venue: Portland Press Ltd.

arXiv.org e-Print Archive

The Iterative Signature Algorithm for the analysis of large scale gene expression data

Author: A. Brazma
A. Schulze
C.M. Perou
D.D. Lee
E. Lander
G. Getz
G. Sherlock
J. Ihmels
J.E. Staunton
J.L. DeRisi
Jan Ihmels
L. Lazzeroni
M. Bittner
M. Bittner
M. Schena
M.B. Eisen
N.S. Holter
Naama Barkai
O. Alter
P. Tamayo
P.T. Spellman
R.B. Altman
S. Tavazoie
Sven Bergmann
T. Hastie
T.G. Kolda
U. Alon
U. Scherf
Y. Cheng
Publication venue: American Physical Society (APS)
Publication date: 08/10/2002

We present a new approach for the analysis of genome-wide expression data. Our method is designed to overcome the limitations of traditional techniques when applied to large-scale data. Rather than allotting each gene to a single cluster, we assign both genes and conditions to context-dependent and potentially overlapping transcription modules. We provide a rigorous definition of a transcription module as the object to be retrieved from the expression data. We establish an efficient algorithm that searches for the modules encoded in the data by iteratively refining sets of genes and conditions until they match this definition. Each iteration involves a linear map, induced by the normalized expression matrix, followed by the application of a threshold function. We argue that our method is in fact a generalization of Singular Value Decomposition, which corresponds to the special case where no threshold is applied. We show analytically that for noisy expression data our approach leads to better classification due to the implementation of the threshold. This result is confirmed by numerical analyses based on in-silico expression data. We briefly discuss results obtained by applying our algorithm to expression data from the yeast S. cerevisiae. Comment: LaTeX, 36 pages, 8 figure
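The iteration described in this abstract (a linear map followed by a threshold, repeated until the gene and condition sets stop changing) can be illustrated with a deliberately simplified sketch. It is not the authors' implementation. It assumes binary set membership, uses a plain average over the current set as the linear map, and applies z-score thresholds to one unnormalized matrix. The function name `isa_sketch` and the threshold defaults are hypothetical.

```python
import math


def _zscores(values):
    """Standardize a list of scores; a constant list maps to all zeros."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    if sd == 0.0:
        return [0.0] * n
    return [(v - mean) / sd for v in values]


def isa_sketch(expr, seed_genes, t_gene=0.5, t_cond=0.5, max_iter=50):
    """Refine a gene set and a condition set until the gene set is stable.

    expr is a list of rows (genes), each a list of values (conditions).
    Returns (genes, conds) as sorted index lists, or ([], []) if a set empties.
    """
    n_genes, n_conds = len(expr), len(expr[0])
    genes, conds = sorted(set(seed_genes)), []
    for _ in range(max_iter):
        if not genes:
            return [], []
        # Linear map: average expression of the current genes per condition.
        cond_scores = [sum(expr[g][c] for g in genes) / len(genes)
                       for c in range(n_conds)]
        # Threshold: keep conditions scoring well above the mean.
        conds = [c for c, z in enumerate(_zscores(cond_scores)) if z > t_cond]
        if not conds:
            return [], []
        # Linear map back: average expression of each gene over kept conditions.
        gene_scores = [sum(expr[g][c] for c in conds) / len(conds)
                       for g in range(n_genes)]
        new_genes = [g for g, z in enumerate(_zscores(gene_scores)) if z > t_gene]
        if new_genes == genes:
            break  # fixed point reached
        genes = new_genes
    return genes, conds


# Toy data: genes 0-2 are co-expressed in conditions 0-1; genes 3-5 are flat.
toy = [[5, 5, 0, 0, 0]] * 3 + [[0, 0, 0, 0, 0]] * 3
module = isa_sketch(toy, seed_genes=[0])
```

Seeding with a single gene from the planted block recovers the whole block on this toy matrix. On real data the choice of normalization and thresholds would matter far more than this sketch suggests.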

A simple spreadsheet-based, MIAME-supportive format for microarray data: MAGE-TAB

Author: A Brazma
AI Saeed
Alvis Brazma
Anna Farne
AR Jones
B Dysvik
BR Zeeberg
CA Ball
Catherine A Ball
Christian J Stoeckert
Donald S Maier
E Manduchi
Ele Holloway
Farrell Wymore
Gavin Sherlock
Helen C Causton
Helen Parkinson
J White
John Quackenbush
Joseph White
Junmin Liu
Kjell Petersen
M Navarange
Michael Miller
MT Vass
P Spellman
Patricia L Whetzel
Paul T Spellman
Philippe Rocca-Serra
PL Whetzel
PT Spellman
R Anbazhagan
Rafael A Irizarry
Tim F Rayner
Ugis Sarkans
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Sharing of microarray data within the research community has been greatly facilitated by the development of the disclosure and communication standards MIAME and MAGE-ML by the MGED Society. However, the complexity of the MAGE-ML format has made its use impractical for laboratories lacking dedicated bioinformatics support. RESULTS: We propose a simple tab-delimited, spreadsheet-based format, MAGE-TAB, which will become a part of the MAGE microarray data standard and can be used for annotating and communicating microarray data in a MIAME-compliant fashion. CONCLUSION: MAGE-TAB will enable laboratories without bioinformatics experience or support to manage, exchange, and submit well-annotated microarray data in a standard format using a spreadsheet. The MAGE-TAB format is self-contained and does not require an understanding of MAGE-ML or XML.

University of Bergen

Springer - Publisher Connector