Search CORE

arXiv.org e-Print Archive

Similarity-Detection and Localization

Author: H. Kinzelbach
L. Balents
L.-H. Tang
M. Kardar
M. Q. Zhang
M. S. Waterman
M. S. Waterman
M. Schöniger
Michael Lässig
S. B. Needleman
S. F. Altschul
T. F. Smith
T. Hwa
T. Nattermann
Terence Hwa
W. S. Fitch
Publication venue: 'American Physical Society (APS)'
Publication date: 14/11/1995
Field of study

The detection of similarities between long DNA and protein sequences is studied using concepts of statistical physics. It is shown that mutual similarities can be detected by sequence alignment methods only if their amount exceeds a threshold value. The onset of detection is a continuous phase transition which can be viewed as a localization-delocalization transition. The ``fidelity'' of the alignment is the order parameter of that transition; it leads to criteria for the selection of optimal alignment parameters.Comment: 4 pages including 4 figures (308kb post-script file

Scholar Commons - Institutional Repository of the University of South Carolina

MPG.PuRe

Sequence Alignment with Matched Sections

Author: Griggs Jerrold R
Hanlon Philip J
Waterman Michael S
Publication venue: Scholar Commons
Publication date: 01/10/1986
Field of study

In molecular biology, two finite sequences are compared by displaying one sequence written over another in an alignment. The number of alignments of two sequences is related to the Stanton-Cowan numbers. This paper gives asymptotics for the number of alignments of two sequences of length n with matching sections of size at least b

Integrative missing value estimation for microarray data

Author: Hu Jianjun
Li Haifeng
Waterman Michael S
Zhou Xianghong Jasmine
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. RESULTS: We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests. CONCLUSION: We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets

Scholar Commons - Institutional Repository of the University of South Carolina

An integrative modular approach to systematically predict gene-phenotype associations

Author: Dai Chao
Mehan Michael R
Nunez-Iglesias Juan
Waterman Michael S
Zhou Xianghong Jasmine
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms. Results We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules. Conclusion We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p

University of Melbourne Institutional Repository

Sequence information signal processor

Author: Chow Edward T.
Hunkapillar Timothy J.
Peterson John C.
Waterman Michael S.
Publication venue
Publication date: 12/10/1999
Field of study

An electronic circuit is used to compare two sequences, such as genetic sequences, to determine which alignment of the sequences produces the greatest similarity. The circuit includes a linear array of series-connected processors, each of which stores a single element from one of the sequences and compares that element with each successive element in the other sequence. For each comparison, the processor generates a scoring parameter that indicates which segment ending at those two elements produces the greatest degree of similarity between the sequences. The processor uses the scoring parameter to generate a similar scoring parameter for a comparison between the stored element and the next successive element from the other sequence. The processor also delivers the scoring parameter to the next processor in the array for use in generating a similar scoring parameter for another pair of elements. The electronic circuit determines which processor and alignment of the sequences produce the scoring parameter with the highest value

NASA Technical Reports Server

Sequence information signal processor for local and global string comparisons

Author: Chow Edward T.
Hunkapillar Timothy J.
Peterson John C.
Waterman Michael S.
Publication venue
Publication date: 20/05/1997
Field of study

A sequence information signal processing integrated circuit chip designed to perform high speed calculation of a dynamic programming algorithm based upon the algorithm defined by Waterman and Smith. The signal processing chip of the present invention is designed to be a building block of a linear systolic array, the performance of which can be increased by connecting additional sequence information signal processing chips to the array. The chip provides a high speed, low cost linear array processor that can locate highly similar global sequences or segments thereof such as contiguous subsequences from two different DNA or protein sequences. The chip is implemented in a preferred embodiment using CMOS VLSI technology to provide the equivalent of about 400,000 transistors or 100,000 gates. Each chip provides 16 processing elements, and is designed to provide 16 bit, two's compliment operation for maximum score precision of between -32,768 and +32,767. It is designed to provide a comparison between sequences as long as 4,194,304 elements without external software and between sequences of unlimited numbers of elements with the aid of external software. Each sequence can be assigned different deletion and insertion weight functions. Each processor is provided with a similarity measure device which is independently variable. Thus, each processor can contribute to maximum value score calculation using a different similarity measure

NASA Technical Reports Server

Gene Aging Nexus: a web database and data mining platform for microarray data on aging

Author: Chiu Chi-Hsien
Finch Caleb E.
Kamath Kiran
Mehan Michael R.
Nunez-Iglesias Juan
Pan Fei
Pulapura Sudip
Waterman Michael S.
Zhang Kangyu
Zhou Xianghong Jasmine
Publication venue: Oxford University Press
Publication date: 07/11/2006
Field of study

The recent development of microarray technology provided unprecedented opportunities to understand the genetic basis of aging. So far, many microarray studies have addressed aging-related expression patterns in multiple organisms and under different conditions. The number of relevant studies continues to increase rapidly. However, efficient exploitation of these vast data is frustrated by the lack of an integrated data mining platform or other unifying bioinformatic resource to enable convenient cross-laboratory searches of array signals. To facilitate the integrative analysis of microarray data on aging, we developed a web database and analysis platform ‘Gene Aging Nexus’ (GAN) that is freely accessible to the research community to query/analyze/visualize cross-platform and cross-species microarray data on aging. By providing the possibility of integrative microarray analysis, GAN should be useful in building the systems-biology understanding of aging. GAN is accessible at

University of Melbourne Institutional Repository

Feasibility of trial procedures for a randomised controlled trial of a community based group exercise intervention for falls prevention for visually impaired older people: the VIOLET study

Author: A Dhital
A Kumar
AJ Campbell
American Geriatrics Society
AS Zigmund
B Klein
B Steinman
C Brundle
Cathy Bailey
D Kendrick
D Podsiadlo
DA Skelton
DA Skelton
Dawn A. Skelton
Denise Howel
Dorothy Coe
DP Gill
E Lamoreux
E Lamoureux
F Bunn
G Lancaster
GA Zijlstra
GA Zijlstra
GI Kempen
H Peyre
H Waterman
Heather Waterman
ID Cameron
J Davis
J Evans
J Painter
JC Davis
JC Mundt
Jennifer Wilkinson
Joanne Gray
L Yardley
Lex D. de Jong
M Boer de
M Herdman
Michael Clarke
NICE 21 Falls Guideline
Nicola Adams
Panel on Prevention of Falls in Older Persons
Rosy Lampitt
S Campbell
S Heinrich
S Iliffe
S Lamb
S Lord
S Nandy
S Parry
S Petrou
Sheena Gawler
Steve W Parry
T Boyce
T Hadjistavropoulos
T Keely
Tony Fouweather
V Power
Vincent Deary
Y Tian
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Background Visually impaired older people (VIOP) have a higher risk of falling than their sighted peers, and are likely to avoid physical activity. The aim was to adapt the existing Falls Management Exercise (FaME) programme for VIOP, delivered in the community, and to investigate the feasibility of conducting a definitive randomised controlled trial (RCT) of this adapted intervention. Methods Two-centre randomised mixed methods pilot trial and economic evaluation of the adapted group-based FaME programme for VIOP versus usual care. A one hour exercise programme ran weekly over 12 weeks at the study sites (Newcastle and Glasgow), delivered by third sector (voluntary and community) organisations. Participants were advised to exercise at home for an additional two hours over the week. Those randomised to the usual activities group received no intervention. Outcome measures were completed at baseline, 12 and 24 weeks. The potential primary outcome was the Short Form Falls Efficacy Scale – International (SFES-I). Participants’ adherence was assessed by reviewing attendance records and self-reported compliance to the home exercises. Adherence with the course content (fidelity) by instructors was assessed by a researcher. Adverse events were collected in a weekly phone call. Results Eighteen participants, drawn from community-living VIOP were screened; 68 met the inclusion criteria; 64 participants were randomised with 33 allocated to the intervention and 31 to the usual activities arm. 94% of participants provided data at the 12 week visit and 92% at 24 weeks. Adherence was high. The intervention was found to be safe with 76% attending nine or more classes. Median time for home exercise was 50 min per week. There was little or no evidence that fear of falling, balance and falls risk, physical activity, emotional, attitudinal or quality of life outcomes differed between trial arms at follow-up. Conclusions The intervention, FaME, was implemented successfully for VIOP and all progression criteria for a main trial were met. The lack of difference between groups on fear of falling was unsurprising given it was a pilot study but there may have been other contributory factors including suboptimal exercise dose and apparent low risk of falls in participants. These issues need addressing for a future trial

Northumbria Research Link

Online Research @ Cardiff

ResearchOnline@GCU

espace@Curtin

Integrating the promotion of physical activity within a smoking cessation programme: Findings from collaborative action research in UK Stop Smoking Services

Author: A McEwen
Adrian H Taylor
AH Taylor
AH Taylor
AH Taylor
AH Taylor
BH Marcus
C Tudor-Locke
C Tudor-Locke
D Gilbourne
DJ Hyman
DM Bravata
Emma S Everson-Hock
ES Everson
ES Everson-Hock
ES Everson-Hock
H Waterman
J McNiff
J Zoellner
JJ Prochaska
JR Hughes
L Al-Chalabi
M Ussher
M Ussher
M Ussher
M Ussher
ME Jung
Medical Research Council (MRC)
Michael Ussher
NW Burton
PJ Gardner
R Maddison
R West
R West
RE Thayer
RE Thayer
Smoking in England
SN Blair
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Background: Within the framework of collaborative action research, the aim was to explore the feasibility of developing and embedding physical activity promotion as a smoking cessation aid within UK 6/7-week National Health Service (NHS) Stop Smoking Services. Methods: In Phase 1 three initial cycles of collaborative action research (observation, reflection, planning, implementation and re-evaluation), in an urban Stop Smoking Service, led to the development of an integrated intervention in which physical activity was promoted as a cessation aid, with the support of a theoretically based self-help guide, and self monitoring using pedometers. In Phase 2 advisors underwent training and offered the intervention, and changes in physical activity promoting behaviour and beliefs were monitored. Also, changes in clients’ stage of readiness to use physical activity as a cessation aid, physical activity beliefs and behaviour and physical activity levels were assessed, among those who attended the clinic at 4-week post-quit. Qualitative data were collected, in the form of clinic observation, informal interviews with advisors and field notes. Results: The integrated intervention emerged through cycles of collaboration as something quite different to previous practice. Based on field notes, there were many positive elements associated with the integrated intervention in Phase 2. Self-reported advisors’ physical activity promoting behaviour increased as a result of training and adapting to the intervention. There was a significant advancement in clients’ stage of readiness to use physical activity as a smoking cessation aid. Conclusions: Collaboration with advisors was key in ensuring that a feasible intervention was developed as an aid to smoking cessation. There is scope to further develop tailored support to increasing physical activity and smoking cessation, mediated through changes in perceptions about the benefits of, and confidence to do physical activity

Stirling Online Research Repository (RIOXX)